MS1, MS2, and SQT—three unified, compact, and easily parsed file formats for the storage of shotgun proteomic spectra and identifications | Zendy

McDonald W. Hayes | Zendy; Tabb David L. | Zendy; Sadygov Rovshan G. | Zendy; MacCoss Michael J. | Zendy; Venable John | Zendy; Graumann Johannes | Zendy; Johnson Jeff R. | Zendy; Cociorva Daniel | Zendy; Yates John R. | Zendy

Premium

MS1, MS2, and SQT—three unified, compact, and easily parsed file formats for the storage of shotgun proteomic spectra and identifications

Author(s) -

McDonald W. Hayes,

Tabb David L.,

Sadygov Rovshan G.,

MacCoss Michael J.,

Venable John,

Graumann Johannes,

Johnson Jeff R.,

Cociorva Daniel,

Yates John R.

Publication year - 2004

Publication title -

rapid communications in mass spectrometry

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.528

H-Index - 136

eISSN - 1097-0231

pISSN - 0951-4198

DOI - 10.1002/rcm.1603

Subject(s) - shotgun , parsing , computer science , database , relational database , data mining , information retrieval , chemistry , artificial intelligence , biochemistry , gene

As the speed with which proteomic labs generate data increases along with the scale of projects they are undertaking, the resulting data storage and data processing problems will continue to challenge computational resources. This is especially true for shotgun proteomic techniques that can generate tens of thousands of spectra per instrument each day. One design factor leading to many of these problems is caused by storing spectra and the database identifications for a given spectrum as individual files. While these problems can be addressed by storing all of the spectra and search results in large relational databases, the infrastructure to implement such a strategy can be beyond the means of academic labs. We report here a series of unified text file formats for storing spectral data (MS1 and MS2) and search results (SQT) that are compact, easily parsed by both machine and humans, and yet flexible enough to be coupled with new algorithms and data‐mining strategies. Copyright © 2004 John Wiley & Sons, Ltd.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research