Premium
Enriching Surveys with Supplementary Data and its Application to Studying Wage Regression
Author(s) -
Leung Denis Heng Yan,
Yamada Ken,
Zhang Biao
Publication year - 2015
Publication title -
scandinavian journal of statistics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.359
H-Index - 65
eISSN - 1467-9469
pISSN - 0303-6898
DOI - 10.1111/sjos.12100
Subject(s) - survey data collection , survey sampling , econometrics , sample (material) , statistics , data set , missing data , regression , regression analysis , population , parametric statistics , mathematics , computer science , data mining , chemistry , demography , chromatography , sociology
We consider the problem of supplementing survey data with additional information from a population. The framework we use is very general; examples are missing data problems, measurement error models and combining data from multiple surveys. We do not require the survey data to be a simple random sample of the population of interest. The key assumption we make is that there exists a set of common variables between the survey and the supplementary data. Thus, the supplementary data serve the dual role of providing adjustments to the survey data for model consistencies and also enriching the survey data for improved efficiency. We propose a semi‐parametric approach using empirical likelihood to combine data from the two sources. The method possesses favourable large and moderate sample properties. We use the method to investigate wage regression using data from the National Longitudinal Survey of Youth Study.