Finding That Needle in the Haystack: Computable Phenotypes
Author(s) -
Dorey A. Glenn,
Keisha Gibson
Publication year - 2019
Publication title -
journal of the american society of nephrology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 4.451
H-Index - 279
eISSN - 1533-3450
pISSN - 1046-6673
DOI - 10.1681/asn.2019101037
Subject(s) - haystack , clinical trial , isolation (microbiology) , medicine , health care , patient recruitment , internet privacy , family medicine , pathology , bioinformatics , computer science , political science , law , world wide web , biology
In a 1532 publication, Sir Thomas More—an English lawyer and social philosopher—likened one’s ability to find a line in St. Austin’s writing to finding a needle in a meadow—later interpreted as a “needle in the haystack.”1 The isolation experienced by patients with rare diseases can often make them feel like lost needles in the meadow of our large complex health care systems. As of 2018, there were 327.2 million people with the potential to have data stored across 5534 registered hospitals in the United States alone along with over 1100 different electronic health record vendors.2 The frustration experienced by those eager to connect individuals with rare diseases to clinical care and research opportunities is magnified by the fact that our electronic medical record systems at times fall short of “meaningful use” and do many things to the exception of connecting to other systems and providing efficient communication. The effect that this complicated meadow has on clinical care has been well described, but the barriers that this poses to patient recruitment into clinical and other research trials are less addressed in published literature. Recruitment of patients for clinical trials is an area ready for innovation. At a time when there is measurable interest and support from industry sponsoring clinical trials in the glomerular disease space, connecting the right patients to the right trials as efficiently as possible is of great import. Harnessing data from electronic health records (EHRs) to streamline and personalize clinical trial recruitment using computable phenotypes has the potential to decrease the time and money associated with clinical trial setup. Indeed, global spending on clinical trial technology and services is expected to reach $69 billion per year by 2025.3 EHR-powered analytic tools are already commercialized for the purposes of protocol feasibility, cohort analysis, and study site identification. Computable phenotypes represent the building blocks with which we can interact with “big” health care data for the purposes of advancing patient care and clinical outcomes. Comparative effectiveness research, quality improvement initiatives, clinical trial recruitment, or even the possibility of conducting high-quality, multicenter, clinical trials within an EHR also exist. Operationalizing these activities using EHR data depends on having validated tools to define patient characteristics, diseases, or clinical events of interest. Numerous computable phenotypes have been developed, validated, and made available publicly, including for CKD, treatment resistant hypertension, type 2 diabetes, steroidinduced osteonecrosis, and many others.4 Most recently, Norton et al.5 published an article in the September 2019 issue of CJASN detailing their development of a pragmatic computable phenotype to identify patients likely to have CKD. In this issue of JASN, Denberg et al.6 report their newly developed computable phenotype for glomerular disease using a rule-based algorithm derived from 561 outpatient nephrology clinic encounters at the Children’s Hospital of Philadelphia. Multiple data elements were evaluated for use in the computable phenotype, including demographic information, diagnostic codes, laboratory data (urine protein to creatinine ratio), medication prescriptions (i.e., angiotensin-converting enzyme inhibitor), and kidney biopsy reports. The authors show that a phenotype using a combination of health care encounters, diagnosis codes, and kidney biopsy procedure codes could be used to identify patients with glomerular disease who have a positive predictive value of 92% and a negative predictive value of 100%. The computable phenotype was then validated using structured chart reviews at eight additional institutions within the PEDSnet collaborative (a large, national community of hospitals and healthcare organizations, researchers and clinicians, and patients and families), totally 800 patients with glomerular disease and 798 patients with nonglomerular disease.6 Several strengths of the study should be noted, including its rigorous development and validation across multiple collaborating children’s hospitals. The authors report that the majority of urine protein assessments were on the basis of urine dipstick measurements, and they were excluded due to concerns that dipsticks might not be able to discriminate glomerular from nonglomerular proteinuria. Published online ahead of print. Publication date available at www.jasn.org.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom