
A Note on Proposed Estimation Procedures for Claims-Based Frailty Indexes
Author(s) -
Dane R. Van Domelen,
Karen BandeenRoche
Publication year - 2019
Publication title -
american journal of epidemiology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.33
H-Index - 256
eISSN - 1476-6256
pISSN - 0002-9262
DOI - 10.1093/aje/kwz247
Subject(s) - imputation (statistics) , confounding , inference , covariate , statistics , medicine , missing data , point estimation , causal inference , econometrics , computer science , mathematics , artificial intelligence
Two groups (Segal et al. Med Care. 2017;55(7):716-722; Segal et al. Am J Epidemiol. 2017;186(6):745-747; and Kim et al. J Gerontol A Biol Sci Med Sci. 2018;73(7):980-987) recently proposed methods for modeling frailty in studies where a reference standard frailty measure is not directly observed, but Medicare claims data are available. The groups use competing frailty measures, but the premise is similar: In a validation data set, model the frailty measure versus claims variables; in the primary data set, impute frailty status from claims variables, and conduct inference with those imputed values in place of the unobserved frailty measure. Potential use cases include risk prediction, confounding control, and prevalence estimation. In this commentary, we describe validity issues underlying these approaches, focusing mainly on risk prediction. Our main concern is that these approaches do not permit valid estimation of associations between the reference standard frailty measure (i.e., "frailty") and health outcomes. We argue that Segal's approach is akin to multiple imputation but with the outcome variable omitted from the imputation model, while Kim's is akin to regression calibration but with many variables improperly treated as surrogates. We discuss alternatives for risk prediction, including a secondary approach previously considered by Kim et al., and briefly comment on other use cases.