ProGeM: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci
Author(s) -
David Stacey,
Eric B. Fauman,
Daniel Ziemek,
Benjamin B. Sun,
Eric L. Harshfield,
Angela Wood,
Adam S. Butterworth,
Karsten Suhre,
Dirk S. Paul
Publication year - 2018
Publication title -
nucleic acids research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 9.008
H-Index - 537
eISSN - 1362-4954
pISSN - 0305-1048
DOI - 10.1093/nar/gky837
Subject(s) - quantitative trait locus , biology , candidate gene , genetics , computational biology , gene , family based qtl mapping , expression quantitative trait loci , genomics , genome , locus (genetics) , epistasis , gene annotation , annotation , gene mapping , genotype , single nucleotide polymorphism , chromosome
Quantitative trait locus (QTL) mapping of molecular phenotypes such as metabolites, lipids and proteins through genome-wide association studies represents a powerful means of highlighting molecular mechanisms relevant to human diseases. However, a major challenge of this approach is to identify the causal gene(s) at the observed QTLs. Here, we present a framework for the ‘ Pr ioritization o f candidate causal Ge nes at M olecular QTLs’ (ProGeM), which incorporates biological domain-specific annotation data alongside genome annotation data from multiple repositories. We assessed the performance of ProGeM using a reference set of 227 previously reported and extensively curated metabolite QTLs. For 98% of these loci, the expert-curated gene was one of the candidate causal genes prioritized by ProGeM. Benchmarking analyses revealed that 69% of the causal candidates were nearest to the sentinel variant at the investigated molecular QTLs, indicating that genomic proximity is the most reliable indicator of ‘true positive’ causal genes. In contrast, cis -gene expression QTL data led to three false positive candidate causal gene assignments for every one true positive assignment. We provide evidence that these conclusions also apply to other molecular phenotypes, suggesting that ProGeM is a powerful and versatile tool for annotating molecular QTLs. ProGeM is freely available via GitHub.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom