Enabling cloud bursting for life sciences within Galaxy
Author(s) - Afgan Enis, Coraor Nate, Chilton John, Baker Dan, Taylor James
Publication year - 2015
Publication title - Concurrency and Computation: Practice and Experience
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.309
H-Index - 67
eISSN - 1532-0634
pISSN - 1532-0626
DOI - 10.1002/cpe.3536
Subject(s) - cloud computing, software deployment, scope (computer science), computer science, field (mathematics), process (computing), data science, world wide web, systems engineering, software engineering, engineering, operating system, mathematics, pure mathematics, programming language
Summary - Fueled by the radically increased capacity to generate data over the past decade, the field of biomedical research has been constrained by the ability to analyze data. Galaxy, a Web‐based, open‐source data integration and analysis platform for life science research, has been democratizing access to data analysis tools. However, the scale of data and the scope of tools required have proven to be a significant challenge for any monolithic deployment of the Galaxy application. We have found that a distributed and federated approach to utilizing compute and storage resources is necessary. This paper describes the ongoing efforts in creating a ubiquitous platform capable of simultaneously utilizing dedicated as well as on‐demand cloud resources. Specifically, the requirements, process, and an implementation of a cloud‐bursting system are detailed. Copyright © 2010 John Wiley & Sons, Ltd.