z-logo
Premium
A flexible hierarchical framework for improving inference in area‐referenced environmental health studies
Author(s) -
Pirani Monica,
Mason Alexina J.,
Hansell Anna L.,
Richardson Sylvia,
Blangiardo Marta
Publication year - 2020
Publication title -
biometrical journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.108
H-Index - 63
eISSN - 1521-4036
pISSN - 0323-3847
DOI - 10.1002/bimj.201900241
Subject(s) - confounding , causal inference , environmental epidemiology , propensity score matching , inference , statistics , covariate , computer science , bayesian probability , hierarchical database model , bayesian inference , econometrics , sample size determination , data mining , mathematics , medicine , artificial intelligence , environmental health
Study designs where data have been aggregated by geographical areas are popular in environmental epidemiology. These studies are commonly based on administrative databases and, providing a complete spatial coverage, are particularly appealing to make inference on the entire population. However, the resulting estimates are often biased and difficult to interpret due to unmeasured confounders, which typically are not available from routinely collected data. We propose a framework to improve inference drawn from such studies exploiting information derived from individual‐level survey data. The latter are summarized in an area‐level scalar score by mimicking at ecological level the well‐known propensity score methodology. The literature on propensity score for confounding adjustment is mainly based on individual‐level studies and assumes a binary exposure variable. Here, we generalize its use to cope with area‐referenced studies characterized by a continuous exposure. Our approach is based upon Bayesian hierarchical structures specified into a two‐stage design: (i) geolocated individual‐level data from survey samples are up‐scaled at ecological level, then the latter are used to estimate a generalized ecological propensity score (EPS) in the in‐sample areas; (ii) the generalized EPS is imputed in the out‐of‐sample areas under different assumptions about the missingness mechanisms, then it is included into the ecological regression, linking the exposure of interest to the health outcome. This delivers area‐level risk estimates, which allow a fuller adjustment for confounding than traditional areal studies. The methodology is illustrated by using simulations and a case study investigating the risk of lung cancer mortality associated with nitrogen dioxide in England (UK).

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here