Variable selection for high dimensional Bayesian density estimation: application to human exposure simulation
Author(s) - Brian J. Reich, Eric Kalendra, Curtis B. Storlie, Howard D. Bondell, Montserrat Fuentes
Publication year - 2012
Publication title - Journal of the Royal Statistical Society: Series C (Applied Statistics)
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.205
H-Index - 72
eISSN - 1467-9876
pISSN - 0035-9254
DOI - 10.1111/j.1467-9876.2011.00772.x
Subject(s) - estimation, Bayesian probability, environmental epidemiology, variable (mathematics), computer science, human health, conditional probability distribution, statistical model, statistics, air pollution, exposure assessment, econometrics, machine learning, environmental health, artificial intelligence, mathematics, engineering, medicine, ecology, biology, mathematical analysis, systems engineering
Summary.  Numerous studies have linked ambient air pollution and adverse health outcomes. Many studies of this nature relate outdoor pollution levels measured at a few monitoring stations with health outcomes. Recently, computational methods have been developed to model the distribution of personal exposures, rather than ambient concentration, and then relate the exposure distribution to the health outcome. Although these methods show great promise, they are limited by the computational demands of the exposure model. We propose a method to alleviate these computational burdens with the eventual goal of implementing a national study of the health effects of air pollution exposure. Our approach is to develop a statistical emulator for the exposure model, i.e. we use Bayesian density estimation to predict the conditional exposure distribution as a function of several variables, such as temperature, human activity and physical characteristics of the pollutant. This poses a challenging statistical problem because there are many predictors of the exposure distribution and density estimation is notoriously difficult in high dimensions. To overcome this challenge, we use stochastic search variable selection to identify a subset of the variables that have more than just additive effects on the mean of the exposure distribution. We apply our method to emulate an ozone exposure model in Philadelphia.
