FAULT TOLERANCE FOR HPC BY USING LOCAL CHECKPOINTS
Author(s) -
Алексей Алексеевич Бондаренко,
Михаил Владимирович Якобовский
Publication year - 2014
Publication title -
Bulletin of the South Ural State University. Series: Computational Mathematics and Software Engineering
Language(s) - English
Resource type - Journals
eISSN - 2410-7034
pISSN - 2305-9052
DOI - 10.14529/cmse140302
Subject(s) - dependability, computer science, fault tolerance, snapshot (computer storage), distributed computing, overhead (engineering), protocol (science), rollback, computation, node (physics), parallel computing, database transaction, operating system, algorithm, database, engineering, medicine, alternative medicine, software engineering, structural engineering, pathology