Premium
Informing power and sample size calculations when using inverse probability of treatment weighting using the propensity score
Author(s) -
Austin Peter C.
Publication year - 2021
Publication title -
statistics in medicine
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.996
H-Index - 183
eISSN - 1097-0258
pISSN - 0277-6715
DOI - 10.1002/sim.9176
Subject(s) - propensity score matching , inverse probability weighting , sample size determination , weighting , statistics , econometrics , inverse , predictive power , sample (material) , statistical power , power (physics) , average treatment effect , inverse probability , mathematics , computer science , medicine , posterior probability , bayesian probability , chemistry , chromatography , physics , radiology , geometry , quantum mechanics
Propensity score weighting is increasingly being used in observational studies to estimate the effects of treatments. The use of such weights induces a within‐person homogeneity in outcomes that must be accounted for when estimating the variance of the estimated treatment effect. Knowledge of the variance inflation factor (VIF), which describes the extent to which the effective sample size has been reduced by weighting, allows for conducting sample size and power calculations for observational studies that use propensity score weighting. However, estimation of the VIF requires knowledge of the weights, which are only known once the study has been conducted. We describe methods to estimate the VIF based on two characteristics of the observational study: the anticipated prevalence of treatment and the anticipated c‐statistic of the propensity score model. We considered five different sets of weights: those for estimating the average treatment effect (ATE), the average treated effect in the treated (ATT), and three recently described sets of weights: overlap weights, matching weights, and entropy weights. The VIF was substantially smaller for the latter three sets of weights than for the first two sets of weights. Once the VIF has been estimated during the design phase of the study, sample size and power calculations can be done using calculations appropriate for a randomized controlled trial with similar prevalence of treatment and similar outcome variable, and then multiplying the requisite sample size by the estimated VIF. Implementation of these methods allows for improving the design and reporting of observational studies that use propensity score weighting.