Premium
High‐dimensional propensity score algorithm in comparative effectiveness research with time‐varying interventions
Author(s) -
Neugebauer Romain,
Schmittdiel Julie A.,
Zhu Zheng,
Rassen Jeremy A.,
Seeger John D.,
Schneeweiss Sebastian
Publication year - 2015
Publication title -
statistics in medicine
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.996
H-Index - 183
eISSN - 1097-0258
pISSN - 0277-6715
DOI - 10.1002/sim.6377
Subject(s) - covariate , propensity score matching , inverse probability weighting , confounding , observational study , marginal structural model , computer science , causal inference , matching (statistics) , weighting , selection bias , comparative effectiveness research , psychological intervention , set (abstract data type) , statistics , machine learning , health care , medicine , mathematics , radiology , psychiatry , programming language , economics , economic growth
The high‐dimensional propensity score (hdPS) algorithm was proposed for automation of confounding adjustment in problems involving large healthcare databases. It has been evaluated in comparative effectiveness research (CER) with point treatments to handle baseline confounding through matching or covariance adjustment on the hdPS. In observational studies with time‐varying interventions, such hdPS approaches are often inadequate to handle time‐dependent confounding and selection bias. Inverse probability weighting (IPW) estimation to fit marginal structural models can adequately handle these biases under the fundamental assumption of no unmeasured confounders. Upholding of this assumption relies on the selection of an adequate set of covariates for bias adjustment. We describe the application and performance of the hdPS algorithm to improve covariate selection in CER with time‐varying interventions based on IPW estimation and explore stabilization of the resulting estimates using Super Learning. The evaluation is based on both the analysis of electronic health records data in a real‐world CER study of adults with type 2 diabetes and a simulation study. This report (i) establishes the feasibility of IPW estimation with the hdPS algorithm based on large electronic health records databases, (ii) demonstrates little impact on inferences when supplementing the set of expert‐selected covariates using the hdPS algorithm in a setting with extensive background knowledge, (iii) supports the application of the hdPS algorithm in discovery settings with little background knowledge or limited data availability, and (iv) motivates the application of Super Learning to stabilize effect estimates based on the hdPS algorithm. Copyright © 2014 John Wiley & Sons, Ltd.