Premium
Optimized all‐to‐all communication on multicore architectures applied to FFTs with pencil decomposition
Author(s) -
Jocksch Andreas,
Kraushaar Matthias,
Daverio David
Publication year - 2018
Publication title -
concurrency and computation: practice and experience
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.309
H-Index - 67
eISSN - 1532-0634
pISSN - 1532-0626
DOI - 10.1002/cpe.4964
Subject(s) - computer science , parallel computing , parallelism (grammar) , message passing interface , message passing , exploit , shared memory , multi core processor , distributed memory , decomposition , pencil (optics) , interface (matter) , data parallelism , mechanical engineering , ecology , computer security , bubble , maximum bubble pressure method , biology , engineering
Summary All‐to‐all communication is a basic functionality of parallel communication libraries such as the Message Passing Interface (MPI). Typically, there are multiple different underlying algorithms, which are chosen according to message size. We propose a communication algorithm, which exploits the fact that modern supercomputers combine shared memory parallelism and distributed memory parallelism. The application example of our algorithm is FFTs with pencil decomposition. Furthermore, we propose an extension of the MPI standard in order to accommodate this and other algorithms in an efficient way.