Premium
Identifying Patients With High Data Completeness to Improve Validity of Comparative Effectiveness Research in Electronic Health Records Data
Author(s) -
Lin Kueiyu Joshua,
Singer Daniel E.,
Glynn Robert J.,
Murphy Shawn N.,
Lii Joyce,
Schneeweiss Sebastian
Publication year - 2018
Publication title -
clinical pharmacology and therapeutics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.941
H-Index - 188
eISSN - 1532-6535
pISSN - 0009-9236
DOI - 10.1002/cpt.861
Subject(s) - comparative effectiveness research , health records , completeness (order theory) , data science , computer science , medicine , psychology , data mining , alternative medicine , health care , mathematics , political science , pathology , mathematical analysis , law
Electronic health record (EHR)‐discontinuity, i.e., having medical information recorded outside of the study EHR system, is associated with substantial information bias in EHR‐based comparative effectiveness research (CER). We aimed to develop and validate a prediction model identifying patients with high EHR‐continuity to reduce this bias. Based on 183,739 patients aged ≥65 in EHRs from two US provider networks linked with Medicare claims data from 2007–2014, we quantified EHR‐continuity by mean proportion of encounters captured (MPEC) by the EHR system. We built a prediction model for MPEC using one EHR system as training and the other as the validation set. Patients with top 20% predicted EHR‐continuity had 3.5–5.8‐fold smaller misclassification of 40 CER‐relevant variables, compared to the remaining study population. The comorbidity profiles did not differ substantially by predicted EHR‐continuity. These findings suggest that restriction of CER to patients with high predicted EHR‐continuity may confer a favorable validity to generalizability trade‐off.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom