Using the Fraction of Missing Information to Identify Auxiliary Variables for Imputation Procedures via Proxy Pattern‐mixture Models

Author(s) - Andridge Rebecca, Thompson Katherine Jenny

Publication year - 2015

Publication title - International Statistical Review

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.051

H-Index - 54

eISSN - 1751-5823

pISSN - 0306-7734

DOI - 10.1111/insr.12091

Subject(s) - imputation (statistics), covariate, missing data, statistics, proxy (statistics), computer science, regression, econometrics, regression analysis, data mining, mathematics

Summary: In many surveys, imputation procedures are used to account for non‐response bias induced by either unit non‐response or item non‐response. Such procedures are optimised (in terms of reducing non‐response bias) when the models include covariates that are highly predictive of both the response and the outcome variables. To achieve this, we propose a method for selecting sets of covariates, either for use in regression imputation models or for determining imputation cells, for one or more outcome variables. The key metric is the fraction of missing information (FMI) as obtained via a proxy pattern‐mixture (PPM) model. In our variable selection approach, we use the PPM model to obtain a maximum likelihood estimate of the FMI for each of several sets of candidate imputation models, and we look for the point at which changes in the FMI level off and further auxiliary variables do not improve the imputation model. We illustrate our proposed approach using empirical data from the Ohio Medicaid Assessment Survey and from the Service Annual Survey.
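
The "level off" step in the summary can be illustrated with a minimal sketch. It assumes the FMI estimates for a nested sequence of candidate imputation models have already been computed elsewhere (the PPM estimation itself is not shown), and it uses a hypothetical tolerance `tol` to decide when a further drop in FMI is negligible. The summary does not state a numeric cut-off, so the function name `first_plateau`, the tolerance, and the example values below are all assumptions for illustration, not the authors' rule or data.

```python
from typing import Sequence


def first_plateau(fmi: Sequence[float], tol: float = 0.01) -> int:
    """Return the index of the first candidate model after which adding the
    next set of auxiliary variables lowers the FMI by at most `tol`.

    `fmi[i]` is the (already estimated) FMI for the i-th candidate model,
    ordered from fewest to most auxiliary variables. `tol` is a hypothetical
    threshold; the source does not prescribe one.
    """
    if len(fmi) == 0:
        raise ValueError("need at least one FMI estimate")
    for i in range(len(fmi) - 1):
        # Improvement gained by moving from model i to model i + 1.
        if fmi[i] - fmi[i + 1] <= tol:
            return i
    # FMI kept dropping by more than tol; the largest model is selected.
    return len(fmi) - 1


# Hypothetical FMI values for five nested candidate models (not from the paper).
example_fmi = [0.62, 0.48, 0.41, 0.405, 0.404]
chosen = first_plateau(example_fmi, tol=0.01)
print(f"Select candidate model {chosen} (FMI = {example_fmi[chosen]})")
```

In practice the plateau would more likely be judged by inspecting the FMI sequence (for example in a plot) rather than by a fixed threshold; the function only makes the stopping idea concrete.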
