Premium
Imputing Top‐Coded Income Data in Longitudinal Surveys *
Author(s) -
Tan Li
Publication year - 2021
Publication title -
oxford bulletin of economics and statistics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.131
H-Index - 73
eISSN - 1468-0084
pISSN - 0305-9049
DOI - 10.1111/obes.12400
Subject(s) - imputation (statistics) , bayes' theorem , econometrics , statistics , nonparametric statistics , survey data collection , missing data , computer science , mathematics , bayesian probability
Abstract The incomes of top earners are typically top‐coded in survey data. I show that the accuracy of imputed income values for top earners in longitudinal surveys can be improved significantly by incorporating information from multiple time periods into the imputation process in a simple way. Moreover, I introduce an innovative, nonparametric empirical Bayes imputation method that further improves imputation quality. I show that the empirical Bayes imputation method reduces the RMSE of imputed income values by 19–51% relative to standard approaches in the literature. I also illustrate the benefits of the empirical Bayes method for investigating multi‐year income inequality.