z-logo
open-access-imgOpen Access
Semiparametric Estimation with Data Missing Not at Random Using an Instrumental Variable
Author(s) -
Baoluo Sun,
Lan Liu,
Wang Miao,
Kathleen E. Wirth,
James M. Robins,
Eric J. Tchetgen Tchetgen
Publication year - 2017
Publication title -
statistica sinica
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.24
H-Index - 77
eISSN - 1996-8507
pISSN - 1017-0405
DOI - 10.5705/ss.202016.0324
Subject(s) - missing data , instrumental variable , inverse probability weighting , estimator , econometrics , outcome (game theory) , estimation , statistics , average treatment effect , identification (biology) , nonparametric statistics , causal inference , population , inference , conditional expectation , selection bias , conditional probability distribution , computer science , mathematics , economics , artificial intelligence , medicine , management , botany , mathematical economics , environmental health , biology
Missing data occur frequently in empirical studies in health and social sciences, often compromising our ability to make accurate inferences. An outcome is said to be missing not at random (MNAR) if, conditional on the observed variables, the missing data mechanism still depends on the unobserved outcome. In such settings, identification is generally not possible without imposing additional assumptions. Identification is sometimes possible, however, if an instrumental variable (IV) is observed for all subjects which satisfies the exclusion restriction that the IV affects the missingness process without directly influencing the outcome. In this paper, we provide necessary and sufficient conditions for nonparametric identification of the full data distribution under MNAR with the aid of an IV. In addition, we give sufficient identification conditions that are more straightforward to verify in practice. For inference, we focus on estimation of a population outcome mean, for which we develop a suite of semiparametric estimators that extend methods previously developed for data missing at random. Specifically, we propose inverse probability weighted estimation, outcome regression-based estimation and doubly robust estimation of the mean of an outcome subject to MNAR. For illustration, the methods are used to account for selection bias induced by HIV testing refusal in the evaluation of HIV seroprevalence in Mochudi, Botswana, using interviewer characteristics such as gender, age and years of experience as IVs.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom