Tandem Mass Spectrum Identification via Cascaded Search | Zendy

Attila KertészFarkas | Zendy; Uri Keich | Zendy; William Stafford Noble | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Tandem Mass Spectrum Identification via Cascaded Search

Author(s) -

Attila KertészFarkas,

Uri Keich,

William Stafford Noble

Publication year - 2015

Publication title -

journal of proteome research

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.644

H-Index - 161

eISSN - 1535-3907

pISSN - 1535-3893

DOI - 10.1021/pr501173s

Subject(s) - cascade , database search engine , computer science , false discovery rate , a priori and a posteriori , computational biology , data mining , search engine , chemistry , biology , information retrieval , genetics , chromatography , philosophy , epistemology , gene

Accurate assignment of peptide sequences to observed fragmentation spectra is hindered by the large number of hypotheses that must be considered for each observed spectrum. A high score assigned to a particular peptide-spectrum match (PSM) may not end up being statistically significant after multiple testing correction. Researchers can mitigate this problem by controlling the hypothesis space in various ways: considering only peptides resulting from enzymatic cleavages, ignoring possible post-translational modifications or single nucleotide variants, etc. However, these strategies sacrifice identifications of spectra generated by rarer types of peptides. In this work, we introduce a statistical testing framework, cascade search, that directly addresses this problem. The method requires that the user specify a priori a statistical confidence threshold as well as a series of peptide databases. For instance, such a cascade of databases could include fully tryptic, semitryptic, and nonenzymatic peptides or peptides with increasing numbers of modifications. Cascaded search then gradually expands the list of candidate peptides from more likely peptides toward rare peptides, sequestering at each stage any spectrum that is identified with a specified statistical confidence. We compare cascade search to a standard procedure that lumps all of the peptides into a single database, as well as to a previously described group FDR procedure that computes the FDR separately within each database. We demonstrate, using simulated and real data, that cascade search identifies more spectra at a fixed FDR threshold than with either the ungrouped or grouped approach. Cascade search thus provides a general method for maximizing the number of identified spectra in a statistically rigorous fashion.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research