z-logo
open-access-imgOpen Access
Uma Estratégia para Versionamento dos Dados de Workflows Científicos Executados em Nuvem
Author(s) -
Fabrício G. Nogueira,
Kary Ocaña,
Vítor Silva,
Vanessa Braganholo,
Daniel de Oliveira
Publication year - 2017
Language(s) - English
Resource type - Conference proceedings
DOI - 10.5753/bresci.2017.9920
Subject(s) - overhead (engineering) , computer science , workflow , cloud computing , task (project management) , distributed computing , real time computing , database , operating system , engineering , systems engineering
Scientific experiments usually run hundreds or thousands of times, generating a huge amount of data that requires to be managed. Analizing and comparing the results of such experiments is na extremely complex task. This becomes even more complex for workflows running in the cloud because the data is scattered across multiple virtual machines. In order to alleviate this proble, previous work proposed the use of a version control system to manage the data consumed and generated by scientific experiments. However, they add considerable overhead to the experiment, increasing the processing time and the use of disk space. In this article, we propose an alternative strategy to reduce time and space. Our initial experiments show that the time overhead of our approach is still high, but disk overhead was 5 times smaller than the approaches in the literature.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here