z-logo
Premium
Multivariate Bayesian variable selection exploiting dependence structure among outcomes: Application to air pollution effects on DNA methylation
Author(s) -
Lee Kyu Ha,
Tadesse Mahlet G.,
Baccarelli Andrea A.,
Schwartz Joel,
Coull Brent A.
Publication year - 2017
Publication title -
biometrics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.298
H-Index - 130
eISSN - 1541-0420
pISSN - 0006-341X
DOI - 10.1111/biom.12557
Subject(s) - multivariate statistics , selection (genetic algorithm) , bayesian probability , multivariate analysis , dna methylation , variable (mathematics) , feature selection , computational biology , computer science , statistics , econometrics , environmental science , biology , mathematics , genetics , artificial intelligence , gene , mathematical analysis , gene expression
Summary The analysis of multiple outcomes is becoming increasingly common in modern biomedical studies. It is well‐known that joint statistical models for multiple outcomes are more flexible and more powerful than fitting a separate model for each outcome; they yield more powerful tests of exposure or treatment effects by taking into account the dependence among outcomes and pooling evidence across outcomes. It is, however, unlikely that all outcomes are related to the same subset of covariates. Therefore, there is interest in identifying exposures or treatments associated with particular outcomes, which we term outcome‐specific variable selection. In this work, we propose a variable selection approach for multivariate normal responses that incorporates not only information on the mean model, but also information on the variance–covariance structure of the outcomes. The approach effectively leverages evidence from all correlated outcomes to estimate the effect of a particular covariate on a given outcome. To implement this strategy, we develop a Bayesian method that builds a multivariate prior for the variable selection indicators based on the variance–covariance of the outcomes. We show via simulation that the proposed variable selection strategy can boost power to detect subtle effects without increasing the probability of false discoveries. We apply the approach to the Normative Aging Study (NAS) epigenetic data and identify a subset of five genes in the asthma pathway for which gene‐specific DNA methylations are associated with exposures to either black carbon, a marker of traffic pollution, or sulfate, a marker of particles generated by power plants.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here