z-logo
open-access-imgOpen Access
Flexible and Accessible Workflows for Improved Proteogenomic Analysis Using the Galaxy Framework
Author(s) -
Pratik Jagtap,
James E. Johnson,
Getiria Onsongo,
Fredrik Sadler,
Kevin Murray,
Yuanbo Wang,
Gloria M. Shenykman,
Sricharan Bandhakavi,
Lloyd M. Smith,
Timothy J. Griffin
Publication year - 2014
Publication title -
journal of proteome research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.644
H-Index - 161
eISSN - 1535-3907
pISSN - 1535-3893
DOI - 10.1021/pr500812t
Subject(s) - proteogenomics , computer science , workflow , software , personalization , flexibility (engineering) , computational biology , genomics , biology , genome , database , world wide web , biochemistry , statistics , mathematics , gene , programming language
Proteogenomics combines large-scale genomic and transcriptomic data with mass-spectrometry-based proteomic data to discover novel protein sequence variants and improve genome annotation. In contrast with conventional proteomic applications, proteogenomic analysis requires a number of additional data processing steps. Ideally, these required steps would be integrated and automated via a single software platform offering accessibility for wet-bench researchers as well as flexibility for user-specific customization and integration of new software tools as they emerge. Toward this end, we have extended the Galaxy bioinformatics framework to facilitate proteogenomic analysis. Using analysis of whole human saliva as an example, we demonstrate Galaxy's flexibility through the creation of a modular workflow incorporating both established and customized software tools that improve depth and quality of proteogenomic results. Our customized Galaxy-based software includes automated, batch-mode BLASTP searching and a Peptide Sequence Match Evaluator tool, both useful for evaluating the veracity of putative novel peptide identifications. Our complex workflow (approximately 140 steps) can be easily shared using built-in Galaxy functions, enabling their use and customization by others. Our results provide a blueprint for the establishment of the Galaxy framework as an ideal solution for the emerging field of proteogenomics.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom