Premium
Fault‐tolerant execution of large parameter sweep applications across multiple VOs with storage constraints
Author(s) -
Ayyub Shahaan,
Abramson David,
Enticott Colin,
Garic Slavisa,
Tan Jefferson
Publication year - 2009
Publication title -
concurrency and computation: practice and experience
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.309
H-Index - 67
eISSN - 1532-0634
pISSN - 1532-0626
DOI - 10.1002/cpe.1353
Subject(s) - computer science , fault tolerance , delegation , task (project management) , distributed computing , garbage collection , fault (geology) , garbage , database , programming language , engineering , systems engineering , political science , law , seismology , geology
Abstract Applications that span multiple virtual organizations (VOs) are of great interest to the e‐science community. However, our recent attempts to execute large‐scale parameter sweep applications (PSAs) for real‐world climate studies with the Nimrod/G tool have exposed problems in the areas of fault tolerance, data storage and trust management. In response, we have implemented a task‐splitting approach that facilitates breaking up large PSAs into a sequence of dependent subtasks, improving fault tolerance; provides a garbage collection technique that deletes unnecessary data; and employs a trust delegation technique that facilitates flexible third party data transfers across different VOs. Copyright © 2008 John Wiley & Sons, Ltd.