Offline and Online Autotuning of Parallel Applications
Disciplines
Computer Sciences (100%)
Keywords
Autotuning, MPI, Benchmarking, Reproducibility, Performance Models
Many scientific applications, such as weather forecasting or earthquake simulation, need to be executed on large, parallel machines to speed up the computation. These parallel machines consist of hundreds or thousands of compute nodes, where each compute node is similar to a common desktop machine. Such parallel applications are most often built on top of the Message Passing Interface (MPI), a standard for data communication. As a result, the run-time of these applications depends on the efficiency of the underlying MPI implementation, and it is therefore of utmost importance to provide the best possible MPI implementation for a given system. Much research has been devoted to developing scalable, efficient implementations of specific MPI functions. For this reason, MPI libraries offer a large set of algorithms and provide many run-time parameters for the purpose of adapting (tuning) themselves to a given parallel machine. In our project, we tackle the problem of optimizing the run-time parameters of MPI libraries in an automated fashion. The challenge is that current MPI libraries expose several hundred tunable parameters, which results in a tremendously large search space. A brute-force approach of testing every combination of parameters would therefore take far too long and is thus impractical. Statistical methods can help us successively reduce the number of parameters that need to be considered. To select the best possible algorithm for specific use cases, we apply modern machine learning techniques. Overall, we will devise and develop a software prototype that can automatically tune MPI libraries to a given parallel machine.
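At its core, autotuning of this kind is measurement-driven selection: run candidate implementations of an operation on the target machine and keep the fastest. Below is a minimal sketch of that idea, assuming only two MPI_Allreduce variants (the library's native implementation versus a composition of MPI_Reduce and MPI_Bcast) and fixed, illustrative message sizes and repetition counts; a real autotuner searches over many more algorithms and parameters.

```c
/* Minimal sketch: selecting the faster of two Allreduce variants by
 * direct measurement. The variants, sizes, and repetition counts are
 * illustrative only. Compile with: mpicc -O2 select_allreduce.c */
#include <mpi.h>
#include <stdio.h>

#define N 4096
#define REPS 50

/* Variant A: the library's native MPI_Allreduce. */
static double bench_native(double *in, double *out, MPI_Comm comm) {
    MPI_Barrier(comm);
    double t0 = MPI_Wtime();
    for (int r = 0; r < REPS; r++)
        MPI_Allreduce(in, out, N, MPI_DOUBLE, MPI_SUM, comm);
    return (MPI_Wtime() - t0) / REPS;
}

/* Variant B: Allreduce composed of Reduce followed by Bcast. */
static double bench_reduce_bcast(double *in, double *out, MPI_Comm comm) {
    MPI_Barrier(comm);
    double t0 = MPI_Wtime();
    for (int r = 0; r < REPS; r++) {
        MPI_Reduce(in, out, N, MPI_DOUBLE, MPI_SUM, 0, comm);
        MPI_Bcast(out, N, MPI_DOUBLE, 0, comm);
    }
    return (MPI_Wtime() - t0) / REPS;
}

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);
    int rank;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    static double in[N], out[N];
    for (int i = 0; i < N; i++) in[i] = (double)i;

    double ta = bench_native(in, out, MPI_COMM_WORLD);
    double tb = bench_reduce_bcast(in, out, MPI_COMM_WORLD);

    /* Take the maximum time across ranks as the cost of the collective. */
    double ga, gb;
    MPI_Reduce(&ta, &ga, 1, MPI_DOUBLE, MPI_MAX, 0, MPI_COMM_WORLD);
    MPI_Reduce(&tb, &gb, 1, MPI_DOUBLE, MPI_MAX, 0, MPI_COMM_WORLD);
    if (rank == 0)
        printf("native: %.2f us, reduce+bcast: %.2f us -> pick %s\n",
               ga * 1e6, gb * 1e6, ga <= gb ? "native" : "reduce+bcast");

    MPI_Finalize();
    return 0;
}
```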
Scientific applications running on supercomputers are almost exclusively based on the Message Passing Interface (MPI), which defines a set of functions that allow processes to communicate with each other. One type of communication is collective communication, where a group of processes works together to perform a task. For example, the broadcast operation allows one process to send data to all other processes in the group. The Autotune project focuses on strategies for automatically tuning MPI collective communication operations. Tuning a collective operation means selecting the best algorithm and parameters for that operation. We have developed a tuning prototype that monitors the performance of different algorithms and parameters for collective operations and selects the best one based on the current workload and hardware characteristics. Two major problems had to be solved: (1) How do process arrival patterns impact the performance of collective operations? (2) How does synchronizing processes with a broadcast during benchmarking affect the benchmark results?

Benchmarking using Arrival Patterns: We address the challenge of optimizing MPI collective communication by considering process arrival patterns. Arrival imbalances, which are common in real-world applications, significantly impact the performance of collective algorithms. Through simulations and micro-benchmarking, we demonstrate that rooted collectives like MPI_Reduce handle process skew better than non-rooted ones like MPI_Allreduce. We propose a methodology that enhances algorithm selection by profiling arrival patterns and applying the best-performing algorithm; a profiling sketch is given after this summary. Using the FT application from the NAS Parallel Benchmarks, we show that taking arrival patterns into account improves performance.

Benchmarking using Synchronized Clocks: We propose MPIX_Harmonize, an extension to the MPI standard that synchronizes processes in both space and time, minimizing artificial arrival patterns during benchmarking. This approach achieves a synchronization accuracy of around one microsecond, a significant improvement over MPI_Barrier. By eliminating arrival-pattern artifacts, MPIX_Harmonize enables more reliable benchmarking of MPI collective operations. Our analysis demonstrates its effectiveness in producing accurate and consistent performance measurements, encouraging its adoption in high-performance computing environments; the underlying synchronization principle is sketched below.

Tuning HPC Applications at Runtime: We developed an online tuning strategy for MPI collective operations that dynamically selects algorithms based on performance data gathered during real application runs. This approach eliminates the need for prior offline benchmarking, making it more adaptable to changing workloads and hardware configurations. A key component of this strategy is a global performance model that is iteratively updated at runtime. The model tracks the performance of different algorithms and adjusts their selection probabilities to optimize efficiency: for example, if a particular algorithm consistently performs well under certain conditions, its probability of being selected increases (a minimal sketch of such a selection loop closes this section). To validate the approach, we used miniAMR, a benchmark application for adaptive mesh refinement. Our experiments demonstrated significant performance gains for MPI_Allreduce operations.
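To make the arrival-pattern profiling concrete, the sketch below shows one way to record arrivals at a collective call: each rank timestamps its arrival with MPI_Wtime just before the operation, and the root gathers the timestamps to compute the skew. The artificial imbalance loop and all constants are illustrative, and clock offsets between nodes are ignored for brevity, so the result is only meaningful with reasonably synchronized clocks.

```c
/* Sketch: profiling the arrival pattern at a collective call. */
#include <mpi.h>
#include <stdio.h>
#include <stdlib.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);
    int rank, size;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    /* Simulate imbalance: some ranks compute longer than others. */
    volatile double x = 0.0;
    for (long i = 0; i < 1000000L * (rank % 4); i++) x += 1.0;

    double buf = (double)rank, res;
    double arrival = MPI_Wtime();              /* local arrival time */
    MPI_Allreduce(&buf, &res, 1, MPI_DOUBLE, MPI_SUM, MPI_COMM_WORLD);

    /* Root gathers all arrival timestamps and reports the spread. */
    double *arrivals = NULL;
    if (rank == 0) arrivals = malloc(size * sizeof(double));
    MPI_Gather(&arrival, 1, MPI_DOUBLE, arrivals, 1, MPI_DOUBLE, 0,
               MPI_COMM_WORLD);

    if (rank == 0) {
        double min = arrivals[0], max = arrivals[0];
        for (int i = 1; i < size; i++) {
            if (arrivals[i] < min) min = arrivals[i];
            if (arrivals[i] > max) max = arrivals[i];
        }
        printf("arrival spread: %.2f us\n", (max - min) * 1e6);
        free(arrivals);
    }
    MPI_Finalize();
    return 0;
}
```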
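The next sketch illustrates the principle behind synchronized-clock benchmarking: estimate each rank's clock offset to rank 0 via ping-pong round trips, then have all ranks start the measured operation at the same global instant by busy-waiting. This is only an illustration of the idea under simplifying assumptions (midpoint offset estimation, a fixed 10 ms start window), not the MPIX_Harmonize API itself, whose interface and offset estimation are more refined.

```c
/* Sketch: synchronized start via clock-offset estimation to rank 0. */
#include <mpi.h>
#include <stdio.h>

#define PINGPONGS 100

/* Estimate this rank's offset to rank 0 (root_time ~ local_time + offset),
 * keeping the measurement with the smallest round-trip time. */
static double estimate_offset(int rank, MPI_Comm comm) {
    double offset = 0.0;
    if (rank == 0) {
        int size; MPI_Comm_size(comm, &size);
        for (int p = 1; p < size; p++)
            for (int i = 0; i < PINGPONGS; i++) {
                double dummy, t;
                MPI_Recv(&dummy, 1, MPI_DOUBLE, p, 0, comm, MPI_STATUS_IGNORE);
                t = MPI_Wtime();
                MPI_Send(&t, 1, MPI_DOUBLE, p, 0, comm);
            }
    } else {
        double best_rtt = 1e9;
        for (int i = 0; i < PINGPONGS; i++) {
            double t0 = MPI_Wtime(), troot;
            MPI_Send(&t0, 1, MPI_DOUBLE, 0, 0, comm);
            MPI_Recv(&troot, 1, MPI_DOUBLE, 0, 0, comm, MPI_STATUS_IGNORE);
            double rtt = MPI_Wtime() - t0;
            if (rtt < best_rtt) {   /* keep the least-noisy measurement */
                best_rtt = rtt;
                offset = troot - (t0 + rtt / 2.0);
            }
        }
    }
    return offset;
}

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);
    int rank;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    double offset = estimate_offset(rank, MPI_COMM_WORLD);

    /* Root picks a start time far enough in the future and broadcasts it. */
    double start = 0.0;
    if (rank == 0) start = MPI_Wtime() + 0.01;   /* 10 ms ahead */
    MPI_Bcast(&start, 1, MPI_DOUBLE, 0, MPI_COMM_WORLD);

    /* Busy-wait until the local clock, mapped to root time, reaches start. */
    while (MPI_Wtime() + offset < start) ;       /* spin */

    double buf = 1.0, res, t0 = MPI_Wtime();
    MPI_Allreduce(&buf, &res, 1, MPI_DOUBLE, MPI_SUM, MPI_COMM_WORLD);
    printf("rank %d: allreduce took %.2f us\n", rank,
           (MPI_Wtime() - t0) * 1e6);

    MPI_Finalize();
    return 0;
}
```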
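Finally, a minimal sketch of the online selection loop, using the standard PMPI profiling interface to intercept MPI_Allreduce so that every application call doubles as a measurement. The two variants, the epsilon-greedy rule, and the running-mean model are simplifying assumptions; the project's prototype maintains a more elaborate global performance model and updates selection probabilities incrementally. Since all ranks must execute the same variant to avoid deadlock, rank 0 decides and broadcasts its choice; MPI_IN_PLACE handling is omitted for brevity.

```c
/* Sketch: online variant selection inside a PMPI interception layer.
 * Compile into a library and link it before the MPI library. */
#include <mpi.h>
#include <stdlib.h>

#define NUM_VARIANTS 2
#define EPSILON 0.1                     /* exploration rate (illustrative) */

static double avg_time[NUM_VARIANTS];   /* running mean runtime per variant */
static long   num_obs[NUM_VARIANTS];

static int choose_variant(void) {
    /* Explore until both variants have been observed, then mostly exploit. */
    if ((double)rand() / RAND_MAX < EPSILON ||
        num_obs[0] == 0 || num_obs[1] == 0)
        return rand() % NUM_VARIANTS;             /* explore */
    return avg_time[0] <= avg_time[1] ? 0 : 1;    /* exploit */
}

int MPI_Allreduce(const void *sendbuf, void *recvbuf, int n,
                  MPI_Datatype dt, MPI_Op op, MPI_Comm comm) {
    int rank, v = 0, err;
    PMPI_Comm_rank(comm, &rank);
    if (rank == 0) v = choose_variant();
    /* All ranks must agree on the variant, so rank 0 decides. */
    PMPI_Bcast(&v, 1, MPI_INT, 0, comm);

    double t0 = MPI_Wtime();
    if (v == 0) {                       /* variant 0: native allreduce */
        err = PMPI_Allreduce(sendbuf, recvbuf, n, dt, op, comm);
    } else {                            /* variant 1: reduce + bcast */
        err = PMPI_Reduce(sendbuf, recvbuf, n, dt, op, 0, comm);
        if (err == MPI_SUCCESS)
            err = PMPI_Bcast(recvbuf, n, dt, 0, comm);
    }
    double t = MPI_Wtime() - t0;

    num_obs[v]++;
    avg_time[v] += (t - avg_time[v]) / num_obs[v];  /* incremental mean */
    return err;
}
```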
- Technische Universität Wien - 50%
- Universität Wien - 50%
- Siegfried Benkner, Universität Wien, associated research partner
- Balazs Gerofi, Research Center for Computational Science - Japan
- George Bosilca, University of Tennessee - USA
Research Output
- 29 Citations
- 16 Publications
- 1 Dataset & Model
- 1 Software
- 1 Scientific Award
Publications
- 2025, Journal Article: Vardas I, "Mpisee: Communicator-Centric Profiling of MPI Applications", Concurrency and Computation: Practice and Experience, DOI 10.1002/cpe.70158
- 2024, Conference Proceeding: Laso R, "Exploring Scalability in C++ Parallel STL Implementations", pp. 284-293, DOI 10.1145/3673038.3673065
- 2024, Conference Proceeding: Beni M, "MPI Collective Algorithm Selection in the Presence of Process Arrival Patterns", pp. 108-119, DOI 10.1109/cluster59578.2024.00017
- 2024, Conference Proceeding: Träff J, "Modes, Persistence and Orthogonality: Blowing MPI Up", pp. 404-413, DOI 10.1109/scw63240.2024.00061
- 2024, Book Chapter: Vardas I, "Exploring Mapping Strategies for Co-allocated HPC Applications", Springer Nature, pp. 271-276, DOI 10.1007/978-3-031-48803-0_31
- 2024, Journal Article: Salimi Beni M, "Analysis and prediction of performance variability in large-scale computing systems", The Journal of Supercomputing, pp. 14978-15005, DOI 10.1007/s11227-024-06040-w
- 2024, Conference Proceeding: Vardas I, "Improved Parallel Application Performance and Makespan by Colocation and Topology-aware Process Mapping", pp. 119-124, DOI 10.1109/ccgrid59990.2024.00023
- 2023, Conference Proceeding: Alves J, "A Novel Triangular Space-Filling Curve for Cache-Oblivious In-Place Transposition of Square Matrices", pp. 368-378, DOI 10.1109/ipdps54959.2023.00045
- 2023, Conference Proceeding: Schuchart J, "Synchronizing MPI Processes in Space and Time", pp. 1-11, DOI 10.1145/3615318.3615325
- 2023, Conference Proceeding: Hunold S, "Verifying Performance Guidelines for MPI Collectives at Scale", pp. 1264-1268, DOI 10.1145/3624062.3625532
- 2023, Book Chapter: Hunold S, "A Quantitative Analysis of OpenMP Task Runtime Systems", Springer Nature, pp. 3-18, DOI 10.1007/978-3-031-31180-2_1
- 2022, Conference Proceeding: Hunold S, "OMPICollTune: Autotuning MPI Collectives by Incremental Online Learning", pp. 123-128, DOI 10.1109/pmbs56514.2022.00016
- 2022, Conference Proceeding: Hunold S, "An Overhead Analysis of MPI Profiling and Tracing Tools", pp. 5-13, DOI 10.1145/3526063.3535353
- 2022, Conference Proceeding: Vardas I, "mpisee: MPI Profiling for Communication and Communicator Structure", pp. 520-529, DOI 10.1109/ipdpsw55747.2022.00092
- 2022, Journal Article: Alves J, "Cache-oblivious Hilbert Curve-based Blocking Scheme for Matrix Transposition", ACM Transactions on Mathematical Software, pp. 1-28, DOI 10.1145/3555353
- 2021, Conference Proceeding: Hunold S, "MicroBench Maker: Reproduce, Reuse, Improve", pp. 69-74, DOI 10.1109/pmbs54543.2021.00013
Datasets & Software
- 2022, Dataset: "An Overhead Analysis of MPI Profiling and Tracing Tools", DOI 10.5281/zenodo.6535636 (public access)
- 2024, Artifact: "Exploring Scalability in C++ Parallel STL Implementations - ICPP 2024 Artifact", DOI 10.5281/zenodo.12187770
Scientific Awards
- 2022: Best Short Paper Award (research prize, continental/international level of recognition)