Оптимизация работы MPI-программ с учётом особенностей топологии кластеров, использующих коммуникационную сеть Ангара

М. Р. Халилов; А. В. Тимофеев

Publications

?

Оптимизация работы MPI-программ с учётом особенностей топологии кластеров, использующих коммуникационную сеть Ангара

С. 641–649.

Khalilov M., Timofeev A.

Language: Russian

Full text

Text on another site

Keywords: MPI technology параллельное программирование Angara MPI Ангара топология суперкомпьютера

In book

Суперкомпьютерные дни в России: Труды международной конференции (25-26 сентября 2017 г., г. Москва)

М.: Издательство МГУ, 2017.

Алгоритм матричного произведения на графических ускорителях для платформ с неравномерными каналами передачи данных

Choi Y. R., Мальковский С. И., Stegailov V., В кн.: "Информационные технологии и высокопроизводительные вычисления": Материалы VIII Международной научно-практической конференции, Хабаровск, 15-17 сентября 2025 г.: Хабаровск: Хабаровский Федеральный исследовательский центр, 2025. Гл. 81 С. 317–320.

Работа посвящена разработке и экспериментальному исследованию параллельных алгоритмов матричного умножения и матричной экспоненты с асинхронным обменом данными, использующих принцип наложения вычислений и коммуникаций для максимизации производительности, для систем с несколькими графическими ускорителями и неоднородной топологией. Также представлены теоретические модели оптимизации размера блоков для повышения эффективности расчетов. Алгоритм матричной экспоненты реализован с поддержкой комплексных матриц через ...

Added: October 15, 2025

«Cтройка» – компьютерная игра для знакомства с параллельным программированием

Воронова К. Д., Plaksin M. A., В кн.: Актуальные проблемы математики, механики и информатики 2022: Сборник статей по материалам студенческой конференции (г. Пермь, ПГНИУ, 25 мая – 10 июня 2022 г.).: Пермь: ПГНИУ, 2022. С. 25–29.

The rapid development of parallel computing technologies makes it urgent to include propaedeutics of parallel computing in the school computer science course. Since this topic is not yet included in the school curriculum, it can be done through extracurricular activities, in particular, through Internet contests. Since 2013, parallel computing tasks have become a compulsory part ...

Added: February 29, 2024

Graph based routing algorithm for torus topology and its evaluation for the Angara interconnect

Mukosey A., Semenov A., Tretiakov A., Journal of Parallel and Distributed Computing 2024 Vol. 183 Article 104765

Several approaches and techniques exist to resolve load balancing problem in general and torus topology networks. Graph methods are natural ways to perform balancing of routing paths. A routing balancing algorithm must operate within the constraints of the underlying network architecture that limits several parameters, such as the number of logical paths in the network. In this paper, we consider a ...

Added: November 25, 2023

ЗНАКОМСТВО С ПАРАЛЛЕЛЬНЫМИ ВЫЧИСЛЕНИЯМИ В РАМКАХ ДИСТАНЦИОННОГО КОНКУРСА «ТРИЗФОРМАШКА-2022»

Воронова К. Д., Plaksin M. A., В кн.: Дистанционное обучение – образовательная среда XXI века : материалы XII Междунар. науч.-метод. конф. (Республика Беларусь, Минск, 26 мая 2022 года).: Мн.: БГУИР, 2022. С. 163–163.

It is proposed to introduce schoolchildren and students to the basics of parallel computing using the distance competition "TRIZformashka". For the competition "TRIZformashka-2022" the computer game "Builder" was specially developed for teaching how to build parallel algorithms. A description of the game and a download link are given. ...

Added: October 31, 2022

Simulation of Utilization and Energy Saving of the Angara Interconnect

Mukosey A., Semenov A., Lobachevskii Journal of Mathematics 2022 Vol. 43 P. 873–881

In this paper we address the problem of node allocation for high performance computer systems based on the Angara interconnect with the torus topology. Most allocation strategies for the torus topologies assume redundancy, i.e. for a user job it is possible to allocate more nodes than required. We propose the new node allocation algorithm for ...

Added: October 26, 2022

Algorithm for Adaptive Mesh Redistribution in Lattice Boltzmann Simulations

Ziganurova L., Shchur L., Lobachevskii Journal of Mathematics 2022 Vol. 43 No. 2 P. 513–518

The Lattice Boltzmann method (LBM) is the alternative approach for hydrodynamic equation solving. Two factors make it a favorite approach nowadays. Firstly, the attractive feature of LBM is that it is intrinsic for parallel simulations due to the linear structure of the algorithm. Secondly, what makes LBM special for the research, it is well applicable to the simulations ...

Added: May 25, 2022

Workshop on Exascale MPI at Supercomputing Conference (ExaMPI)

IEEE, 2021.

2021 Workshop on Exascale MPI (ExaMPI) DOI: 10.1109/ExaMPI54564.2021 14-14 Nov. 2021 ...

Added: January 20, 2022

Оптимизация графов потока управления в промежуточных представлениях языка функционально-потокового параллельного программирования

Васильев В. С., Легалов А. И., Научный вестник Новосибирского государственного технического университета 2020 № 4 С. 37–46

Functional dataflow programming languages are intended for the development of architecture-independent parallel programs and support the control of computations based on data availability. Due to the fact that at present parallel computing systems are very widespread, and their programming in imperative languages is associated with portability problems, the development of architecturally independent parallel programming tools ...

Added: August 26, 2021

Особенности семантики статически типизированного языка функционально-потокового параллельного программирования

Легалов А. И., Легалов И. А., Матковский И. В., В кн.: Научный сервис в сети Интернет: труды XXI Всероссийской научной конференции.: Институт прикладной математики им. М.В. Келдыша РАН, 2019. С. 489–500.

The features of the dataflow functional parallel programming language using static data typing are considered. The previously developed language Pifagor supports only dynamic typing, which does not provide an effective transformation of written programs into programs for modern parallel computing systems. The analysis of changes in the dataflow functional model of calculations and programming language ...

Added: November 1, 2020

Статически типизированная версия языка функционально-потокового параллельного программирования

Легалов А. И., Легалов И. А., Матковский И. В., В кн.: Параллельные вычислительные технологии – XIV международная конференция, ПаВТ’2020.: Челябинск: Издательский центр ЮУрГУ, 2020. Гл. 19 С. 185–192.

Предлагается расширение языка функционально потокового параллельного программирования включением в него статической системы типов. Это ведет к изменению функционально-потоковой модели параллельных вычислений и, как следствие, методов трансформации, а также подходов к использованию функционально-потоковых параллельных программ. Проводится обзор модели вычислений и операторов языка программирования, сформированных в результате проведенных изменений. Отмечается, как эти изменения влияют на синтаксис и ...

Added: November 1, 2020

Добавление статической типизации в язык функционально-потокового параллельного программирования.

Легалов А. И., Легалов И. А., Матковский И. В., Электронные библиотеки 2020 Т. 23 № 4 С. 788–807

It is proposed to add a static system of types to the dataflow functional model of parallel computing and the dataflow functional parallel programming language developed on its basis. The use of static typing increases the possibility of transforming dataflow functional parallel programs into programs running on modern parallel computing systems. Language constructions are proposed. ...

Added: November 1, 2020

Algorithm for the replica redistribution in the implementation of parallel annealing method on the hybrid supercomputer architecture

Russkov A., Roman Chulkevich, Shchur L., / Series arXiv "math". 2020. No. 2006.00561.

The parallel annealing method is one of the promising approaches for large scale simulations as potentially scalable on any parallel architecture. We present an implementation of the algorithm on the hybrid program architecture combining CUDA and MPI. The problem is to keep all general-purpose graphics processing unit devices as busy as possible redistributing replicas and ...

Added: June 2, 2020

Производительность современных вычислительных платформ при обработке данных расчетов молекулярной динамики мембранных и белок-мембранных систем

Krylov N., Nolde D., Телегин П. Н. et al., Труды НИИСИ РАН 2018 Т. 8 № 6 С. 74–78

We studied the performance of two algorithms for processing results of molecular dynamics (MD) simulation on modern computing platforms: calculations of radial distribution function (RDF) and energies. We found that both algorithms effectively parallelize both on systems with shared memory and on clusters with distributed memory. For processing the results of medium-sized MD systems, the parallelization efficiency of ...

Added: February 10, 2020

Исследование масштабируемости FlowVision на кластере с интерконнектом Ангара

Акимов В. С., Силаев Д. П., Симонов А. С. et al., Вычислительные методы и программирование: новые вычислительные технологии 2017 Т. 18 С. 406–415

The scalability of computations in FlowVision CFD software on the Angara-C1 cluster equipped with Angara interconnect is studied. Several test problems with 260 thousand, 5.5 million and 26.8 million computational cells are considered. Computations in FlowVision are performed using a new solver of linear systems based on the algebraic multigrid (AMG) method. It is shown ...

Added: October 30, 2019

Optimization of MPI-Process Mapping for Clusters with Angara Interconnect

Khalilov M., Timofeev A., Lobachevskii Journal of Mathematics 2018 Vol. 39 No. 9 P. 1188–1198

An algorithm of MPI processes mapping optimization is adapted for supercomputers with interconnect Angara. The mapping algorithm is based on partitioning of parallel program communication pattern. It is performed in such a way that the processes between which the most intensive exchanges take place are tied to the nodes/processors with the highest bandwidth. The algorithm ...

Added: March 10, 2019

Пропедевтика параллельных вычислений в школьной информатике: компьютерная игра «Пожарные танки»

Plaksin M. A., Щелкунов А. А., Современные информационные технологии и ИТ-образование 2018 Т. 14 № 4 С. 1000–1011

The article contains the methodological materials for inclusion of the topic “Parallel Computing” in the school informatics. The computer games “Tank crew”, “Swarm of robots”, “Firefighting vehicles” are considered. The goal of the first game is to program joint actions of tank crew members. The plot of the second game is the putting on foot ...

Added: January 7, 2019

Оптимизация утилизации при выделении ресурсов для высокопроизводительных вычислительных систем с сетью Ангара

Мукосей А. В., Semenov A., В кн.: Суперкомпьютерные дни в России: Труды международной конференции (24-25 сентября 2018 г., г. Москва).: М.: МГУ, 2018. С. 831–840.

Работа посвящена оптимизации утилиции при выделении вычислительных узлов для заданий в суперкомпьютере с сетью Ангара, имеющей топологию «многомерный тор». В работе показано, что перестановка пользовательских заданий в очереди одновременно с использованием метода выделения ресурсов, сокращающего фрагментацию системы, в среднем дает прирост утилизации ресурсов на 7% и на 36,6% сокращает значение время ожидания задания в очереди ...

Added: November 14, 2018