Offline and Online Autotuning of Parallel Applications
Offline and Online Autotuning of Parallel Applications
Disciplines
Computer Sciences (100%)
Keywords
-
Autotuning,
MPI,
Benchmarking,
Reproducibility,
Performance Models
Many scientific applications, such as weather forecast or earthquake simulations, need to be execu- ted on large, parallel machines to speed up the computation. These parallel machines are comprised of hundreds or thousands of compute nodes, where each compute node is similar to a common desktop machine. These parallel applications are most often built on top of the Message Passing Interface (MPI), which is a standard for data communication. As a result, the run-time of these applications depends on the efficiency of the underlying MPI implementation. It is therefore of utmost importance to provide the best possible MPI implementation for a given system. Much research has been done to develop scalable, efficient implementations of specific MPI functions. For this reason, MPI libraries offer a large set of algorithms and provide many run-time parameters for the purpose of adapting (tuning) themselves to a given parallel machine. In our project, we will tackle the problem of optimizing the run-time parameters of MPI libraries in an automated fashion. The problem is that current MPI libraries provide several hundreds of tunable parameters, which results in a tremendously large search space. Therefore, a brute-force approach of testing every combination of parameters would take far too long and is thus impractical. Statistical methods can help us to successively reduce the number of parameters that need to be considered. In order to select the best possible algorithm for specific use cases, we apply modern machine learning techniques. Overall, we will devise and develop a software prototype that can automatically tune MPI libraries to a given parallel machine.
- Technische Universität Wien - 50%
- Universität Wien - 50%
- Siegfried Benkner, Universität Wien , associated research partner
- Balazs Gerofi, Reserach Center for Computational Science - Japan
- George Bosilca, University of Tennessee - USA
Research Output
- 11 Citations
- 5 Publications
-
2024
Title Exploring Scalability in C++ Parallel STL Implementations DOI 10.1145/3673038.3673065 Type Conference Proceeding Abstract Author Laso R Pages 284-293 Link Publication -
2022
Title OMPICollTune: Autotuning MPI Collectives by Incremental Online Learning DOI 10.1109/pmbs56514.2022.00016 Type Conference Proceeding Abstract Author Hunold S Pages 123-128 -
2022
Title Cache-oblivious Hilbert Curve-based Blocking Scheme for Matrix Transposition DOI 10.1145/3555353 Type Journal Article Author Alves J Journal ACM Transactions on Mathematical Software Pages 1-28 Link Publication -
2021
Title MicroBench Maker: Reproduce, Reuse, Improve DOI 10.1109/pmbs54543.2021.00013 Type Conference Proceeding Abstract Author Hunold S Pages 69-74 -
2023
Title A Novel Triangular Space-Filling Curve for Cache-Oblivious In-Place Transposition of Square Matrices DOI 10.1109/ipdps54959.2023.00045 Type Conference Proceeding Abstract Author Alves J Pages 368-378