z-logo
open-access-imgOpen Access
sureLDA: A multidisease automated phenotyping method for the electronic health record
Author(s) -
Yuri Ahuja,
Doudou Zhou,
Zhe He,
Jiehuan Sun,
Víctor M. Castro,
Vivian S. Gainer,
Shawn N. Murphy,
Chuan Hong,
Tianxi Cai
Publication year - 2020
Publication title -
journal of the american medical informatics association
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.614
H-Index - 150
eISSN - 1527-974X
pISSN - 1067-5027
DOI - 10.1093/jamia/ocaa079
Subject(s) - computer science , bottleneck , phenotype , feature (linguistics) , artificial intelligence , feature selection , selection (genetic algorithm) , data mining , machine learning , latent dirichlet allocation , cluster analysis , topic model , biology , genetics , gene , embedded system , linguistics , philosophy
A major bottleneck hindering utilization of electronic health record data for translational research is the lack of precise phenotype labels. Chart review as well as rule-based and supervised phenotyping approaches require laborious expert input, hampering applicability to studies that require many phenotypes to be defined and labeled de novo. Though International Classification of Diseases codes are often used as surrogates for true labels in this setting, these sometimes suffer from poor specificity. We propose a fully automated topic modeling algorithm to simultaneously annotate multiple phenotypes.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom