z-logo
Premium
Maximizing the sensitivity and reliability of peptide identification in large‐scale proteomic experiments by harnessing multiple search engines
Author(s) -
Yu Wen,
Taylor J. Alex,
Davis Michael T.,
Bonilla Leo E.,
Lee Kimberly A.,
Auger Paul L.,
Farnsworth Chris C.,
Welcher Andrew A.,
Patterson Scott D.
Publication year - 2010
Publication title -
proteomics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.26
H-Index - 167
eISSN - 1615-9861
pISSN - 1615-9853
DOI - 10.1002/pmic.200900074
Subject(s) - false discovery rate , false positive rate , normalization (sociology) , computer science , data mining , database search engine , sensitivity (control systems) , benchmarking , search engine , artificial intelligence , chemistry , information retrieval , engineering , biochemistry , marketing , sociology , electronic engineering , anthropology , business , gene
Despite recent advances in qualitative proteomics, the automatic identification of peptides with optimal sensitivity and accuracy remains a difficult goal. To address this deficiency, a novel algorithm, Multiple Search Engines, Normalization and Consensus is described. The method employs six search engines and a re‐scoring engine to search MS/MS spectra against protein and decoy sequences. After the peptide hits from each engine are normalized to error rates estimated from the decoy hits, peptide assignments are then deduced using a minimum consensus model. These assignments are produced in a series of progressively relaxed false‐discovery rates, thus enabling a comprehensive interpretation of the data set. Additionally, the estimated false‐discovery rate was found to have good concordance with the observed false‐positive rate calculated from known identities. Benchmarking against standard proteins data sets (ISBv1, sPRG2006) and their published analysis, demonstrated that the Multiple Search Engines, Normalization and Consensus algorithm consistently achieved significantly higher sensitivity in peptide identifications, which led to increased or more robust protein identifications in all data sets compared with prior methods. The sensitivity and the false‐positive rate of peptide identification exhibit an inverse‐proportional and linear relationship with the number of participating search engines.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here