Inclusion of biological knowledge in a Bayesian shrinkage model for joint estimation of SNP effects




Author(s) - Pereira Miguel, Thompson John R., Weichenberger Christian X., Thomas Duncan C., Minelli Cosetta

Publication year - 2017

Publication title - Genetic Epidemiology

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.301

H-Index - 98

eISSN - 1098-2272

pISSN - 0741-0395

DOI - 10.1002/gepi.22038

Subject(s) - single nucleotide polymorphism, Bayesian probability, statistics, SNP, computer science, ranking (information retrieval), prior probability, mathematics, biology, artificial intelligence, genetics, gene, genotype

With the aim of improving detection of novel single-nucleotide polymorphisms (SNPs) in genetic association studies, we propose a method of including prior biological information in a Bayesian shrinkage model that jointly estimates SNP effects. We assume that the SNP effects follow a normal distribution centered at zero, with variance controlled by a shrinkage hyperparameter. We use biological information to define the amount of shrinkage applied to the SNP effects distribution, so that the effects of SNPs with more biological support are shrunk less toward zero and are therefore more likely to be detected. The performance of the method was tested in a simulation study (1,000 datasets, each with 500 subjects and ~200 SNPs in 10 linkage disequilibrium (LD) blocks) using a continuous and a binary outcome. It was further tested in an empirical example on body mass index (continuous) and overweight (binary) in a dataset of 1,829 subjects and 2,614 SNPs from 30 blocks. Biological knowledge was retrieved using the bioinformatics tool Dintor, which queried various databases. The joint Bayesian model with inclusion of prior information outperformed the standard analysis of individual SNPs: in the simulation study, the mean ranking of the true LD block was 2.8 for the Bayesian model versus 3.6 for the standard analysis; in the empirical example, the mean ranking of the six true blocks was 8.5 versus 9.3. These results suggest that our method is more powerful than the standard analysis. We expect its performance to improve further as more biological information about SNPs becomes available.
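The shrinkage idea in the abstract can be illustrated with a minimal sketch. It is not the authors' model: it assumes a linear model for a continuous outcome, a known noise variance, and fixed per-SNP prior variances, so the posterior mean has the closed form of a generalized ridge estimate. The function names, the 0-to-1 biological score, and the linear score-to-variance mapping are all hypothetical, and independent standard-normal columns stand in for real genotypes in LD blocks.

```python
import numpy as np


def prior_variance_from_score(score, low=0.01, high=1.0):
    """Hypothetical mapping: biological score in [0, 1] -> prior variance.

    A higher score gives a larger prior variance, i.e. less shrinkage.
    """
    score = np.clip(np.asarray(score, dtype=float), 0.0, 1.0)
    return low + (high - low) * score


def posterior_mean_effects(X, y, prior_var, noise_var=1.0):
    """Posterior mean of beta under y = X beta + e,
    beta_j ~ N(0, prior_var[j]), e ~ N(0, noise_var * I).

    All SNP effects are estimated jointly by solving
    (X'X + noise_var * diag(1 / prior_var)) beta = X'y.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    prior_var = np.asarray(prior_var, dtype=float)
    # Small prior variance -> large penalty -> effect pulled harder toward zero.
    penalty = noise_var / prior_var
    A = X.T @ X + np.diag(penalty)
    return np.linalg.solve(A, X.T @ y)


# Simulated stand-in data (assumption: not real genotypes, no LD structure).
rng = np.random.default_rng(0)
n_subjects, n_snps = 500, 20
X = rng.standard_normal((n_subjects, n_snps))
beta_true = np.zeros(n_snps)
beta_true[3] = 0.3  # one causal SNP
y = X @ beta_true + rng.standard_normal(n_subjects)

# Biological support only for the causal SNP (hypothetical scores).
score = np.zeros(n_snps)
score[3] = 1.0

informed = posterior_mean_effects(X, y, prior_variance_from_score(score))
uniform = posterior_mean_effects(X, y, np.full(n_snps, 0.01))

print("causal SNP, informed prior:", informed[3])
print("causal SNP, uniform prior: ", uniform[3])
```

In this sketch the causal SNP keeps more of its estimated effect under the informed prior than under uniform strong shrinkage, which is the mechanism the abstract describes. A full Bayesian treatment would also place priors on the variance parameters and, for a binary outcome, would use a logistic likelihood rather than this closed form.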
