z-logo
open-access-imgOpen Access
Probing the Virtual Proteome to Identify Novel Disease Biomarkers
Author(s) -
Jonathan D. Mosley,
Mark D. Benson,
J. G. Smith,
Olle Melander,
Debby Ngo,
Christian M. Shaffer,
Jane F. Ferguson,
Matthew Herzig,
Catherine A. McCarty,
Christopher G. Chute,
Gail P. Jarvik,
Allan Gordon,
Melody R. Palmer,
David R. Crosslin,
Eric B. Larson,
David Carrell,
Iftikhar J. Kullo,
Jennifer A. Pacheco,
Peggy Peissig,
Murray H. Brilliant,
Terrie Kitchner,
James G. Linneman,
Bahram Namjou,
Marc S. Williams,
Marylyn D. Ritchie,
Kenneth M. Borthwick,
Krzysztof Kiryluk,
Frank Mentch,
Patrick Sleiman,
Elizabeth W. Karlson,
Shefali S. Verma,
Yineng Zhu,
Ramachandran S. Vasan,
Qiong Yang,
Joshua C. Denny,
Dan M. Roden,
Robert E. Gerszten,
Thomas J. Wang
Publication year - 2018
Publication title -
circulation
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 7.795
H-Index - 607
eISSN - 1524-4539
pISSN - 0009-7322
DOI - 10.1161/circulationaha.118.036063
Subject(s) - medicine , biobank , biomarker , genome wide association study , bioinformatics , apolipoprotein e , biomarker discovery , disease , cohort , computational biology , proteomics , genetics , genotype , biology , single nucleotide polymorphism , gene
Background: Proteomic approaches allow measurement of thousands of proteins in a single specimen, which can accelerate biomarker discovery. However, applying these technologies to massive biobanks is not currently feasible because of the practical barriers and costs of implementing such assays at scale. To overcome these challenges, we used a “virtual proteomic” approach, linking genetically predicted protein levels to clinical diagnoses in >40 000 individuals. Methods: We used genome-wide association data from the Framingham Heart Study (n=759) to construct genetic predictors for 1129 plasma protein levels. We validated the genetic predictors for 268 proteins and used them to compute predicted protein levels in 41 288 genotyped individuals in the Electronic Medical Records and Genomics (eMERGE) cohort. We tested associations for each predicted protein with 1128 clinical phenotypes. Lead associations were validated with directly measured protein levels and either low-density lipoprotein cholesterol or subclinical atherosclerosis in the MDCS (Malmö Diet and Cancer Study; n=651). Results: In the virtual proteomic analysis in eMERGE, 55 proteins were associated with 89 distinct diagnoses at a false discovery rate q<0.1. Among these, 13 associations involved lipid (n=7) or atherosclerosis (n=6) phenotypes. We tested each association for validation in MDCS using directly measured protein levels. At Bonferroni-adjusted significance thresholds, levels of apolipoprotein E isoforms were associated with hyperlipidemia, and circulating C-type lectin domain family 1 member B and platelet-derived growth factor receptor-β predicted subclinical atherosclerosis. Odds ratios for carotid atherosclerosis were 1.31 (95% CI, 1.08–1.58;P =0.006) per 1-SD increment in C-type lectin domain family 1 member B and 0.79 (0.66–0.94;P =0.008) per 1-SD increment in platelet-derived growth factor receptor-β.Conclusions: We demonstrate a biomarker discovery paradigm to identify candidate biomarkers of cardiovascular and other diseases.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom