z-logo
Premium
Integrated platform for manual and high‐throughput statistical validation of tandem mass spectra
Author(s) -
Yu Kebing,
Sabelli Anthony,
DeKeukelaere Lisa,
Park Richard,
Sindi Suzanne,
Gatsonis Constantine A.,
Salomon Arthur
Publication year - 2009
Publication title -
proteomics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.26
H-Index - 167
eISSN - 1615-9861
pISSN - 1615-9853
DOI - 10.1002/pmic.200800899
Subject(s) - computer science , pipeline (software) , workflow , software , data mining , false discovery rate , database , programming language , chemistry , biochemistry , gene
Abstract As proteomic data sets increase in size and complexity, the necessity for database‐centric software systems able to organize, compare, and visualize all the proteomic experiments in a lab grows. We recently developed an integrated platform called high‐throughput autonomous proteomic pipeline (HTAPP) for the automated acquisition and processing of quantitative proteomic data, and integration of proteomic results with existing external protein information resources within a lab‐based relational database called PeptideDepot. Here, we introduce the peptide validation software component of this system, which combines relational database‐integrated electronic manual spectral annotation in Java with a new software tool in the R programming language for the generation of logistic regression spectral models from user‐supplied validated data sets and flexible application of these user‐generated models in automated proteomic workflows. This logistic regression spectral model uses both variables computed directly from SEQUEST output in addition to deterministic variables based on expert manual validation criteria of spectral quality. In the case of linear quadrupole ion trap (LTQ) or LTQ‐FTICR LC/MS data, our logistic spectral model outperformed both XCorr (242% more peptides identified on average) and the X!Tandem E ‐value (87% more peptides identified on average) at a 1% false discovery rate estimated by decoy database approach.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here