Causal mediation analysis with double machine learning
Author(s) -
Helmut Farbmacher,
Martin Huber,
Lukáš Lafférs,
Henrika Langen,
Martin Spindler
Publication year - 2022
Publication title -
econometrics journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.861
H-Index - 36
eISSN - 1368-423X
pISSN - 1368-4221
DOI - 10.1093/ectj/utac003
Subject(s) - overfitting , mediation , estimator , lasso (programming language) , confounding , computer science , artificial intelligence , machine learning , outcome (game theory) , blocking (statistics) , econometrics , psychology , mathematics , statistics , mathematical economics , political science , law , computer network , world wide web , artificial neural network
Summary This paper combines causal mediation analysis with double machine learning for a data-driven control of observed confounders in a high-dimensional setting. The average indirect effect of a binary treatment and the unmediated direct effect are estimated based on efficient score functions, which are robust with respect to misspecifications of the outcome, mediator, and treatment models. This property is key for selecting these models by double machine learning, which is combined with data splitting to prevent overfitting. We demonstrate that the effect estimators are asymptotically normal and $n^{-1/2}$-consistent under specific regularity conditions and investigate the finite sample properties of the suggested methods in a simulation study when considering lasso as machine learner. We also provide an empirical application to the US National Longitudinal Survey of Youth, assessing the indirect effect of health insurance coverage on general health operating via routine checkups as mediator, as well as the direct effect.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom