Premium
Comparing multiple imputation methods for systematically missing subject‐level data
Author(s) -
Kline David,
Andridge Rebecca,
Kaizar Eloise
Publication year - 2017
Publication title -
research synthesis methods
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.376
H-Index - 35
eISSN - 1759-2887
pISSN - 1759-2879
DOI - 10.1002/jrsm.1192
Subject(s) - imputation (statistics) , missing data , computer science , subject (documents) , data collection , statistics , data mining , machine learning , mathematics , library science
When conducting research synthesis, the collection of studies that will be combined often do not measure the same set of variables, which creates missing data. When the studies to combine are longitudinal, missing data can occur on the observation‐level (time‐varying) or the subject‐level (non‐time‐varying). Traditionally, the focus of missing data methods for longitudinal data has been on missing observation‐level variables. In this paper, we focus on missing subject‐level variables and compare two multiple imputation approaches: a joint modeling approach and a sequential conditional modeling approach. We find the joint modeling approach to be preferable to the sequential conditional approach, except when the covariance structure of the repeated outcome for each individual has homogenous variance and exchangeable correlation. Specifically, the regression coefficient estimates from an analysis incorporating imputed values based on the sequential conditional method are attenuated and less efficient than those from the joint method. Remarkably, the estimates from the sequential conditional method are often less efficient than a complete case analysis, which, in the context of research synthesis, implies that we lose efficiency by combining studies. Copyright © 2015 John Wiley & Sons, Ltd.