Accurately Differentiating Between Patients With COVID-19, Patients With Other Viral Infections, and Healthy Individuals: Multimodal Late Fusion Learning Approach | Zendy

Ming Xu | Zendy; Liu Ouyang | Zendy; Lei Han | Zendy; Kai Sun | Zendy; Tingting Yu | Zendy; Qian Li | Zendy; Hua Tian | Zendy; Lida Safarnejad | Zendy; Hengdong Zhang | Zendy; Yue Gao | Zendy; Forrest Sheng Bao | Zendy; Yuanfang Chen | Zendy; Patrick Robinson | Zendy; Yaorong Ge | Zendy; Baoli Zhu | Zendy; Jie Liu | Zendy; Shi Chen | Zendy

Open Access

Accurately Differentiating Between Patients With COVID-19, Patients With Other Viral Infections, and Healthy Individuals: Multimodal Late Fusion Learning Approach

Author(s) -

Ming Xu,

Liu Ouyang,

Lei Han,

Kai Sun,

Tingting Yu,

Qian Li,

Hua Tian,

Lida Safarnejad,

Hengdong Zhang,

Yue Gao,

Forrest Sheng Bao,

Yuanfang Chen,

Patrick Robinson,

Yaorong Ge,

Baoli Zhu,

Jie Liu,

Shi Chen

Publication year - 2020

Publication title -

journal of medical internet research

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.446

H-Index - 142

eISSN - 1439-4456

pISSN - 1438-8871

DOI - 10.2196/25535

Subject(s) - covid-19 , artificial intelligence , random forest , modalities , machine learning , medicine , pneumonia , feature (linguistics) , modality (human–computer interaction) , computer science , disease , social science , linguistics , philosophy , sociology , infectious disease (medical specialty)

Background Effectively identifying patients with COVID-19 using nonpolymerase chain reaction biomedical data is critical for achieving optimal clinical outcomes. Currently, there is a lack of comprehensive understanding in various biomedical features and appropriate analytical approaches for enabling the early detection and effective diagnosis of patients with COVID-19. Objective We aimed to combine low-dimensional clinical and lab testing data, as well as high-dimensional computed tomography (CT) imaging data, to accurately differentiate between healthy individuals, patients with COVID-19, and patients with non-COVID viral pneumonia, especially at the early stage of infection. Methods In this study, we recruited 214 patients with nonsevere COVID-19, 148 patients with severe COVID-19, 198 noninfected healthy participants, and 129 patients with non-COVID viral pneumonia. The participants’ clinical information (ie, 23 features), lab testing results (ie, 10 features), and CT scans upon admission were acquired and used as 3 input feature modalities. To enable the late fusion of multimodal features, we constructed a deep learning model to extract a 10-feature high-level representation of CT scans. We then developed 3 machine learning models (ie, k-nearest neighbor, random forest, and support vector machine models) based on the combined 43 features from all 3 modalities to differentiate between the following 4 classes: nonsevere, severe, healthy, and viral pneumonia. Results Multimodal features provided substantial performance gain from the use of any single feature modality. All 3 machine learning models had high overall prediction accuracy (95.4%-97.7%) and high class-specific prediction accuracy (90.6%-99.9%). Conclusions Compared to the existing binary classification benchmarks that are often focused on single-feature modality, this study’s hybrid deep learning-machine learning framework provided a novel and effective breakthrough for clinical applications. Our findings, which come from a relatively large sample size, and analytical workflow will supplement and assist with clinical decision support for current COVID-19 diagnostic methods and other clinical applications with high-dimensional multimodal biomedical features.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research