z-logo
open-access-imgOpen Access
A Poisson binomial-based statistical testing framework for comorbidity discovery across electronic health record datasets
Author(s) -
Gordon Lemmon,
Sergiusz Wesołowski,
A. Michael Henrie,
Martin Tristani-Firouzi,
Mark Yandell
Publication year - 2021
Publication title -
nature computational science
Language(s) - English
Resource type - Journals
ISSN - 2662-8457
DOI - 10.1038/s43588-021-00141-9
Subject(s) - comorbidity , confounding , poisson distribution , computer science , data mining , false discovery rate , statistics , medicine , machine learning , mathematics , biology , biochemistry , gene
Discovering the concomitant occurrence of distinct medical conditions in a patient, also known as comorbidities, is a prerequisite for creating patient outcome prediction tools. Current comorbidity discovery applications are designed for small datasets and use stratification to control for confounding variables such as age, sex or ancestry. Stratification lowers false positive rates, but reduces power, as the size of the study cohort is decreased. Here we describe a Poisson binomial-based approach to comorbidity discovery (PBC) designed for big-data applications that circumvents the need for stratification. PBC adjusts for confounding demographic variables on a per-patient basis and models temporal relationships. We benchmark PBC using two datasets to compute comorbidity statistics on 4,623,841 pairs of potentially comorbid medical terms. The results of this computation are provided as a searchable web resource. Compared with current methods, the PBC approach reduces false positive associations while retaining statistical power to discover true comorbidities.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here