Making Large-Scale Systems Observable - Another Inescapable Step Towards Exascale
Author(s) - Dmitry Nikitenko, Sergey Zhumatiy, Pavel Shvets
Publication year - 2016
Publication title - Supercomputing Frontiers and Innovations
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.375
H-Index - 16
eISSN - 2409-6008
pISSN - 2313-8734
DOI - 10.14529/jsfi160205
Subject(s) - computer science, control reconfiguration, scalability, supercomputer, visualization, exascale computing, distributed computing, computer architecture, embedded system, operating system, data mining
Effectively mastering an extremely parallel HPC system is impossible without a deep, detailed understanding of the internal processes and behavior of its full range of components: computing processors and nodes, memory usage, interconnect, storage, the whole software stack, cooling, and so forth. There are numerous visualization tools that provide information on particular components and on the system as a whole, but most of them have severe issues that limit their applicability in real life, making them unacceptable at future system scales. Predefined monitoring systems and data sources, a lack of dynamic on-the-fly reconfiguration, and inflexible visualization and screening options are among the most common issues. The proposed approach to monitoring data processing resolves the majority of known problems, providing a scalable and flexible solution based on any available monitoring systems and other data sources. An implementation of the approach is used successfully in the everyday practice of the Moscow State University supercomputer center, the largest in Russia.