
Machine learning in predicting respiratory failure in patients with COVID-19 pneumonia—Challenges, strengths, and opportunities in a global health emergency
Author(s) -
Davide Ferrari,
Jovana Milić,
Roberto Tonelli,
Francesco Ghinelli,
Marianna Meschiari,
Sara Volpi,
Matteo Faltoni,
Giacomo Franceschi,
Vittorio Iadisernia,
Dina Yaacoub,
Giacomo Ciusa,
Erica Bacca,
Carlotta Rogati,
Marco Tutone,
Giulia Burastero,
Alessandro Raimondi,
Marianna Menozzi,
Erica Franceschini,
Gianluca Cuomo,
Luca Corradi,
Gabriella Orlando,
Antonella Santoro,
Margherita Digaetano,
Cinzia Puzzolante,
Federica Carli,
Vanni Borghi,
Andrea Bedini,
Riccardo Fantini,
Luca Tabbì,
Ivana Castaniere,
Stefano Busani,
Enrico Clini,
Massimo Girardis,
Mario Sarti,
Andrea Cossarizza,
Cristina Mussini,
Federica Mandreoli,
Paolo Missier,
Giovanni Guaraldi
Publication year - 2020
Publication title -
plos one
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.99
H-Index - 332
ISSN - 1932-6203
DOI - 10.1371/journal.pone.0239172
Subject(s) - medicine , pneumonia , emergency department , mechanical ventilation , emergency medicine , observational study , intensive care medicine , respiratory failure , machine learning , computer science , psychiatry
Aims The aim of this study was to estimate a 48 hour prediction of moderate to severe respiratory failure, requiring mechanical ventilation, in hospitalized patients with COVID-19 pneumonia. Methods This was an observational prospective study that comprised consecutive patients with COVID-19 pneumonia admitted to hospital from 21 February to 6 April 2020. The patients’ medical history, demographic, epidemiologic and clinical data were collected in an electronic patient chart. The dataset was used to train predictive models using an established machine learning framework leveraging a hybrid approach where clinical expertise is applied alongside a data-driven analysis. The study outcome was the onset of moderate to severe respiratory failure defined as PaO 2 /FiO 2 ratio <150 mmHg in at least one of two consecutive arterial blood gas analyses in the following 48 hours. Shapley Additive exPlanations values were used to quantify the positive or negative impact of each variable included in each model on the predicted outcome. Results A total of 198 patients contributed to generate 1068 usable observations which allowed to build 3 predictive models based respectively on 31-variables signs and symptoms, 39-variables laboratory biomarkers and 91-variables as a composition of the two. A fourth “boosted mixed model” included 20 variables was selected from the model 3, achieved the best predictive performance (AUC = 0.84) without worsening the FN rate. Its clinical performance was applied in a narrative case report as an example. Conclusion This study developed a machine model with 84% prediction accuracy, which is able to assist clinicians in decision making process and contribute to develop new analytics to improve care at high technology readiness levels.