z-logo
open-access-imgOpen Access
Submodular Generalized Matching for Peptide Identification in Tandem Mass Spectrometry
Author(s) -
Wenruo Bai,
Jeffrey A. Bilmes,
William Stafford Noble
Publication year - 2018
Publication title -
ieee/acm transactions on computational biology and bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.745
H-Index - 71
eISSN - 1557-9964
pISSN - 1545-5963
DOI - 10.1109/tcbb.2018.2822280
Subject(s) - matching (statistics) , matroid , submodular set function , function (biology) , computer science , greedy algorithm , mathematics , algorithm , combinatorics , statistics , biology , evolutionary biology
Identification of spectra produced by a shotgun proteomics mass spectrometry experiment is commonly performed by searching the observed spectra against a peptide database. The heart of this search procedure is a score function that evaluates the quality of a hypothesized match between an observed spectrum and a theoretical spectrum corresponding to a particular peptide sequence. Accordingly, the success of a spectrum analysis pipeline depends critically upon this peptide-spectrum score function. We develop peptide-spectrum score functions that compute the maximum value of a submodular function under $m$ m matroid constraints. We call this procedure a submodular generalized matching (SGM) since it generalizes bipartite matching. We use a greedy algorithm to compute maximization, which can achieve a solution whose objective is guaranteed to be at least $\frac{1}{1+m}$ 1 1 + m of the true optimum. The advantage of the SGM framework is that known long-range properties of experimental spectra can be modeled by designing suitable submodular functions and matroid constraints. Experiments on four data sets from various organisms and mass spectrometry platforms show that the SGM approach leads to significantly improved performance compared to several state-of-the-art methods. Supplementary information, C++ source code, and data sets can be found at https://melodi-lab.github.io/SGM.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom