
Defining Phenotypes from Clinical Data to Drive Genomic Research
Author(s) -
Jamie R. Robinson,
WeiQi Wei,
Dan M. Roden,
Joshua C. Denny
Publication year - 2018
Publication title -
annual review of biomedical data science
Language(s) - English
Resource type - Journals
ISSN - 2574-3414
DOI - 10.1146/annurev-biodatasci-080917-013335
Subject(s) - phenome , repurposing , biobank , data science , drug repositioning , genomics , computational biology , health records , computer science , personalized medicine , drug discovery , drug response , phenotype , bioinformatics , genome , biology , drug , genetics , health care , pharmacology , ecology , gene , economics , economic growth
The rise in available longitudinal patient information in electronic health records (EHRs) and their coupling to DNA biobanks has resulted in a dramatic increase in genomic research using EHR data for phenotypic information. EHRs have the benefit of providing a deep and broad data source of health-related phenotypes, including drug response traits, expanding the phenome available to researchers for discovery. The earliest efforts at repurposing EHR data for research involved manual chart review of limited numbers of patients but now typically involve applications of rule-based and machine learning algorithms operating on sometimes huge corpora for both genome-wide and phenome-wide approaches. We highlight here the current methods, impact, challenges, and opportunities for repurposing clinical data to define patient phenotypes for genomics discovery. Use of EHR data has proven a powerful method for elucidation of genomic influences on diseases, traits, and drug-response phenotypes and will continue to have increasing applications in large cohort studies.