Premium
Unbiased regression estimation under correlated linkage errors
Author(s) -
Kim Gunky,
Chambers Raymond
Publication year - 2015
Publication title -
stat
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.61
H-Index - 18
ISSN - 2049-1573
DOI - 10.1002/sta4.76
Subject(s) - linkage (software) , record linkage , regression , computer science , regression analysis , population , statistics , data mining , mathematics , genetics , biology , demography , sociology , gene
Linkage errors can occur when probability‐based methods are used to link records from two or more distinct data sets corresponding to the same target population. Recent research on allowing for these errors when carrying out regression analysis based on linked data assumes that the linkage errors are independent when more than two data sets are used to generate these data. In this paper, we extend these results to accommodate the more realistic scenario of dependent linkage errors. Our simulation results show that an incorrect assumption of independent linkage errors can lead to insufficient linkage error bias correction, while an approach that allows for correlated linkage errors appears to overcome this problem. Copyright © 2015 John Wiley & Sons, Ltd.