Novel views of performance data to analyze large-scale adaptive applications
Author(s) -
Abhinav Bhatele,
Todd Gamblin,
Katherine E. Isaacs,
Brian T. N. Gunney,
Martin Schulz,
Peer-Timo Bremer,
Bernd Hamann
Publication year - 2013
Publication title -
2012 international conference for high performance computing, networking, storage and analysis
Language(s) - English
Resource type - Conference proceedings
SCImago Journal Rank - 0.363
H-Index - 56
eISSN - 2167-4337
pISSN - 2167-4329
ISBN - 978-1-4673-0806-9
DOI - 10.1109/sc.2012.80
Subject(s) - computing and processing
Performance analysis of parallel scientific codes is becoming increasingly difficult due to the rapidly growing complexity of applications and architectures. Existing tools fall short in providing intuitive views that facilitate the process of performance debugging and tuning. In this paper, we extend recent ideas of projecting and visualizing performance data for faster, more intuitive analysis of applications. We collect detailed per-level and per-phase measurements for a dynamically load-balanced, structured AMR library and project per-core data collected in the hardware domain on to the application's communication topology. We show how our projections and visualizations lead to a rapid diagnosis of and mitigation strategy for a previously elusive scaling bottleneck in the library that is hard to detect using conventional tools. Our new insights have resulted in a 22% performance improvement for a 65,536-core run of the AMR library on an IBM Blue Gene/P system.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom