Premium
Identifying Patients With High Data Completeness to Improve Validity of Comparative Effectiveness Research in Electronic Health Records Data
Author(s) -
Lin Kueiyu Joshua,
Singer Daniel E.,
Glynn Robert J.,
Murphy Shawn N.,
Lii Joyce,
Schneeweiss Sebastian
Publication year - 2018
Publication title -
clinical pharmacology and therapeutics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.941
H-Index - 188
eISSN - 1532-6535
pISSN - 0009-9236
DOI - 10.1002/cpt.861
Subject(s) - generalizability theory , electronic health record , medicine , comparative effectiveness research , medical record , health records , data mining , statistics , computer science , health care , alternative medicine , mathematics , pathology , economics , economic growth
Electronic health record (EHR)‐discontinuity, i.e., having medical information recorded outside of the study EHR system, is associated with substantial information bias in EHR‐based comparative effectiveness research (CER). We aimed to develop and validate a prediction model identifying patients with high EHR‐continuity to reduce this bias. Based on 183,739 patients aged ≥65 in EHRs from two US provider networks linked with Medicare claims data from 2007–2014, we quantified EHR‐continuity by mean proportion of encounters captured (MPEC) by the EHR system. We built a prediction model for MPEC using one EHR system as training and the other as the validation set. Patients with top 20% predicted EHR‐continuity had 3.5–5.8‐fold smaller misclassification of 40 CER‐relevant variables, compared to the remaining study population. The comorbidity profiles did not differ substantially by predicted EHR‐continuity. These findings suggest that restriction of CER to patients with high predicted EHR‐continuity may confer a favorable validity to generalizability trade‐off.