Missing Data in Clinical Studies: Issues and Methods
Author(s) -
Joseph G. Ibrahim,
Haitao Chu,
MingHui Chen
Publication year - 2012
Publication title -
journal of clinical oncology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 10.482
H-Index - 548
eISSN - 1527-7755
pISSN - 0732-183X
DOI - 10.1200/jco.2011.38.7589
Subject(s) - missing data , covariate , medicine , proportional hazards model , clinical trial , statistics , data mining , computer science , mathematics
Missing data are a prevailing problem in any type of data analyses. A participant variable is considered missing if the value of the variable (outcome or covariate) for the participant is not observed. In this article, various issues in analyzing studies with missing data are discussed. Particularly, we focus on missing response and/or covariate data for studies with discrete, continuous, or time-to-event end points in which generalized linear models, models for longitudinal data such as generalized linear mixed effects models, or Cox regression models are used. We discuss various classifications of missing data that may arise in a study and demonstrate in several situations that the commonly used method of throwing out all participants with any missing data may lead to incorrect results and conclusions. The methods described are applied to data from an Eastern Cooperative Oncology Group phase II clinical trial of liver cancer and a phase III clinical trial of advanced non–small-cell lung cancer. Although the main area of application discussed here is cancer, the issues and methods we discuss apply to any type of study.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom