Automated job monitoring in a high performance computing environment
Author(s) -
Robert F. Cromp,
Gilad Suberri
Publication year - 2004
Publication title -
international conference on autonomic computing, 2004. proceedings.
Language(s) - English
DOI - 10.1109/icac.2004.12
We are developing software that monitors high performance computing assets while users' batch jobs execute, and actively performs site-established corrective actions to handle routine system/queuing issues normally performed by Unix administrators. The automated job monitor is independent of both platform and queueing system, and is customizable for numerous domains.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom