Open Access
Design and Initial Characterization of the SC-200 Proteomics Standard Mixture
Author(s) -
Andrew T. Bauman,
Roger Higdon,
Sean Rapson,
Brenton Loiue,
Jason M. Hogan,
Robin Stacy,
Alberto J. Napuli,
Wenjin Guo,
Wesley Van Voorhis,
Jared C. Roach,
Vincent Lu,
Elizabeth V. Landorf,
Elizabeth Stewart,
Natali Kolker,
F. Collart,
Peter J. Myler,
Gerald van Belle,
Eugene Kolker
Publication year - 2011
Publication title -
omics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.811
H-Index - 58
eISSN - 1557-8100
pISSN - 1536-2310
DOI - 10.1089/omi.2010.0118
Subject(s) - proteomics , orbitrap , computer science , workflow , quantitative proteomics , computational biology , modular design , protein sequencing , false discovery rate , identification (biology) , mass spectrometry , chemistry , biology , peptide sequence , chromatography , biochemistry , database , gene , operating system , botany
High-throughput (HTP) proteomics studies generate large amounts of data. Interpretation of these data requires effective approaches to distinguish noise from biological signal, particularly as instrument and computational capacity increase and studies become more complex. Resolving this issue requires validated and reproducible methods and models, which in turn requires complex experimental and computational standards. The absence of appropriate standards and data sets for validating experimental and computational workflows hinders the development of HTP proteomics methods. Most protein standards are simple mixtures of proteins or peptides, or undercharacterized reference standards in which the identity and concentration of the constituent proteins is unknown. The Seattle Children's 200 (SC-200) proposed proteomics standard mixture is the next step toward developing realistic, fully characterized HTP proteomics standards. The SC-200 exhibits a unique modular design to extend its functionality, and consists of 200 proteins of known identities and molar concentrations from 6 microbial genomes, distributed into 10 molar concentration tiers spanning a 1,000-fold range. We describe the SC-200's design, potential uses, and initial characterization. We identified 84% of SC-200 proteins with an LTQ-Orbitrap and 65% with an LTQ-Velos (false discovery rate = 1% for both). There were obvious trends in success rate, sequence coverage, and spectral counts with protein concentration; however, protein identification, sequence coverage, and spectral counts vary greatly within concentration levels.