z-logo
open-access-imgOpen Access
Leveraging functional annotation to identify genes associated with complex diseases
Author(s) -
Wei Liu,
Mo Li,
Wenfeng Zhang,
Geyu Zhou,
Weijie Xing,
Jiawei Wang,
Qiongshi Lu,
Hongyu Zhao
Publication year - 2020
Publication title -
plos computational biology/plos computational biology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.628
H-Index - 182
eISSN - 1553-7358
pISSN - 1553-734X
DOI - 10.1371/journal.pcbi.1008315
Subject(s) - expression quantitative trait loci , biology , linkage disequilibrium , epigenetics , genetics , computational biology , single nucleotide polymorphism , genome wide association study , genetic association , gene , quantitative trait locus , phenotype , identification (biology) , transcriptome , trait , gene expression , genotype , botany , computer science , programming language
To increase statistical power to identify genes associated with complex traits, a number of transcriptome-wide association study (TWAS) methods have been proposed using gene expression as a mediating trait linking genetic variations and diseases. These methods first predict expression levels based on inferred expression quantitative trait loci (eQTLs) and then identify expression-mediated genetic effects on diseases by associating phenotypes with predicted expression levels. The success of these methods critically depends on the identification of eQTLs, which may not be functional in the corresponding tissue, due to linkage disequilibrium (LD) and the correlation of gene expression between tissues. Here, we introduce a new method called T-GEN ( T ranscriptome-mediated identification of disease-associated G enes with E pigenetic a N notation) to identify disease-associated genes leveraging epigenetic information. Through prioritizing SNPs with tissue-specific epigenetic annotation, T-GEN can better identify SNPs that are both statistically predictive and biologically functional. We found that a significantly higher percentage (an increase of 18.7% to 47.2%) of eQTLs identified by T-GEN are inferred to be functional by ChromHMM and more are deleterious based on their Combined Annotation Dependent Depletion (CADD) scores. Applying T-GEN to 207 complex traits, we were able to identify more trait-associated genes (ranging from 7.7% to 102%) than those from existing methods. Among the identified genes associated with these traits, T-GEN can better identify genes with high (>0.99) pLI scores compared to other methods. When T-GEN was applied to late-onset Alzheimer’s disease, we identified 96 genes located at 15 loci, including two novel loci not implicated in previous GWAS. We further replicated 50 genes in an independent GWAS, including one of the two novel loci.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here