
FAULT TOLERANCE FOR HPC BY USING LOCAL CHECKPOINTS
Publication year - 2014
Publication title -
vestnik ûžno-uralʹskogo gosudarstvennogo universiteta. seriâ vyčislitelʹnaâ matematika i informatika
Language(s) - English
Resource type - Journals
eISSN - 2410-7034
pISSN - 2305-9052
DOI - 10.14529/cmse140302
Subject(s) - dependability , computer science , fault tolerance , snapshot (computer storage) , distributed computing , overhead (engineering) , protocol (science) , rollback , computation , node (physics) , parallel computing , database transaction , operating system , algorithm , database , engineering , medicine , alternative medicine , software engineering , structural engineering , pathology