Topology-Aware Data Aggregation for Intensive I/O on Large-Scale Supercomputers.
Author(s) -
Francois Tessier,
Preeti Malakar,
Venkatram Vishwanath,
Emmanuel Jeannot,
Florin Isaila
Publication year - 2016
Publication title -
2016 first international workshop on communication optimizations in hpc (comhpc)
Language(s) - English
DOI - 10.1109/com-hpc.2016.13
Reading and writing data efficiently from storage systems is critical for high performance data-centric applications. These I/O systems are being increasingly characterized by complex topologies and deeper memory hierarchies. Effective parallel I/O solutions are needed to scale applications on current and future supercomputers. Data aggregation is an efficient approach consisting of electing some processes in charge of aggregating data from a set of neighbors and writing the aggregated data into storage. Thus, the bandwidth use can be optimized while the contention is reduced. In this work, we take into account the network topology for mapping aggregators and we propose an optimized buffering system in order to reduce the aggregation cost. We validate our approach using micro-benchmarks and the I/O kernel of a large-scale cosmology simulation. We show improvements up to 15X faster for I/O operations compared to a standard implementation of MPI I/O.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom