Combining replication and checkpointing redundancies for reducing resiliency overhead

Open Access

Author(s) - Motallebi Hassan

Publication year - 2020

Publication title - ETRI Journal

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.295

H-Index - 46

eISSN - 2233-7326

pISSN - 1225-6463

DOI - 10.4218/etrij.2018-0684

Subject(s) - computer science, distributed computing, redundancy (engineering), fault tolerance, replication (statistics), workflow, overhead (engineering), parallel computing, scheduling (production processes), computation, algorithm, mathematical optimization, operating system, statistics, mathematics, database

We herein propose a heuristic redundancy selection algorithm that combines resubmission, replication, and checkpointing redundancies to reduce the resiliency overhead in fault‐tolerant workflow scheduling. The appropriate combination of these redundancies for workflow tasks is obtained in two consecutive phases. First, to compute the replication vector (number of task replicas), we apportion the set of provisioned resources among concurrently executing tasks according to their needs. Subsequently, we obtain the optimal checkpointing interval for each task as a function of the number of replicas and characteristics of tasks and computational environment. We formulate the problem of obtaining the optimal checkpointing interval for replicated tasks in situations where checkpoint files can be exchanged among computational resources. The results of our simulation experiments, on both randomly generated workflow graphs and real‐world applications, demonstrated that both the proposed replication vector computation algorithm and the proposed checkpointing scheme reduced the resiliency overhead.
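
The two phases described in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm, whose details are not given here. It assumes, purely for illustration, that replicas are apportioned in proportion to a per-task "need" score using the largest-remainder method, and that the checkpointing interval follows Young's first-order approximation sqrt(2 * C * M), where the group's mean time between interruptions M is taken to be the expected time until all replicas have failed under independent exponential failures. The names apportion_replicas, checkpoint_interval, needs, checkpoint_cost, and mtbf are hypothetical.

```python
import math


def apportion_replicas(needs, total_resources):
    """Split total_resources among concurrent tasks in proportion to needs.

    Hypothetical helper (not the paper's method): every task gets one
    resource, and the spare ones are shared by the largest-remainder method.
    """
    n = len(needs)
    if total_resources < n:
        raise ValueError("need at least one resource per task")
    if any(need <= 0 for need in needs):
        raise ValueError("needs must be positive")

    spare = total_resources - n
    total_need = sum(needs)
    quotas = [spare * need / total_need for need in needs]
    replicas = [1 + math.floor(q) for q in quotas]

    # Hand the remaining resources to the largest fractional remainders.
    leftover = total_resources - sum(replicas)
    by_remainder = sorted(
        range(n), key=lambda i: quotas[i] - math.floor(quotas[i]), reverse=True
    )
    for i in by_remainder[:leftover]:
        replicas[i] += 1
    return replicas


def checkpoint_interval(checkpoint_cost, mtbf, replicas):
    """Young's approximation sqrt(2 * C * M) with an assumed group MTBF.

    Assumption: a replicated task is interrupted only when all replicas
    have failed. For independent exponential failures the expected time
    until that happens is mtbf * (1 + 1/2 + ... + 1/replicas).
    """
    harmonic = sum(1.0 / k for k in range(1, replicas + 1))
    return math.sqrt(2.0 * checkpoint_cost * mtbf * harmonic)


if __name__ == "__main__":
    vector = apportion_replicas([1, 2, 3], 9)
    for task, r in enumerate(vector):
        tau = checkpoint_interval(checkpoint_cost=2.0, mtbf=100.0, replicas=r)
        print(f"task {task}: {r} replicas, checkpoint every {tau:.1f} time units")
```

Under these assumptions, a task with more replicas is interrupted less often and can therefore checkpoint less frequently, which is the kind of coupling between the replication vector and the checkpointing interval that the abstract describes.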
