MASCOT HTML and XML parser: An implementation of a novel object model for protein identification data | Zendy

Yang Chunguang G. | Zendy; Granite Stephen J. | Zendy; Van Eyk Jennifer E. | Zendy; Winslow Raimond L. | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

MASCOT HTML and XML parser: An implementation of a novel object model for protein identification data

Author(s) -

Yang Chunguang G.,

Granite Stephen J.,

Van Eyk Jennifer E.,

Winslow Raimond L.

Publication year - 2006

Publication title -

proteomics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.26

H-Index - 167

eISSN - 1615-9861

pISSN - 1615-9853

DOI - 10.1002/pmic.200600157

Subject(s) - mascot , computer science , parsing , xml , java , identification (biology) , programming language , automatic summarization , simple api for xml , information retrieval , database , xml signature , efficient xml interchange , world wide web , botany , political science , law , biology

Protein identification using MS is an important technique in proteomics as well as a major generator of proteomics data. We have designed the protein identification data object model (PDOM) and developed a parser based on this model to facilitate the analysis and storage of these data. The parser works with HTML or XML files saved or exported from MASCOT MS/MS ions search in peptide summary report or MASCOT PMF search in protein summary report. The program creates PDOM objects, eliminates redundancy in the input file, and has the capability to output any PDOM object to a relational database. This program facilitates additional analysis of MASCOT search results and aids the storage of protein identification information. The implementation is extensible and can serve as a template to develop parsers for other search engines. The parser can be used as a stand-alone application or can be driven by other Java programs. It is currently being used as the front end for a system that loads HTML and XML result files of MASCOT searches into a relational database. The source code is freely available at http://www.ccbm.jhu.edu and the program uses only free and open-source Java libraries.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research