Premium
Collective communication: theory, practice, and experience
Author(s) -
Chan Ernie,
Heimlich Marcel,
Purkayastha Avi,
van de Geijn Robert
Publication year - 2007
Publication title -
concurrency and computation: practice and experience
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.309
H-Index - 67
eISSN - 1532-0634
pISSN - 1532-0626
DOI - 10.1002/cpe.1206
Subject(s) - pentium , implementation , computer science , xeon , xeon phi , computer architecture , parallel computing , supercomputer , operating system , domain (mathematical analysis) , computer cluster , software engineering , mathematical analysis , mathematics
We discuss the design and high‐performance implementation of collective communications operations on distributed‐memory computer architectures. Using a combination of known techniques (many of which were first proposed in the 1980s and early 1990s) along with careful exploitation of communication modes supported by MPI, we have developed implementations that have improved performance in most situations compared to those currently supported by public domain implementations of MPI such as MPICH. Performance results from a large Intel Xeon/Pentium 4 (R) processor cluster are included. Copyright © 2007 John Wiley & Sons, Ltd.