z-logo
open-access-imgOpen Access
Identifying COVID-19-Specific Transcriptomic Biomarkers with Machine Learning Methods
Author(s) -
Lei Chen,
Zhandong Li,
Tao Zeng,
Yuhang Zhang,
KaiYan Feng,
Tao Huang,
YuDong Cai
Publication year - 2021
Publication title -
biomed research international
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.772
H-Index - 126
eISSN - 2314-6141
pISSN - 2314-6133
DOI - 10.1155/2021/9939134
Subject(s) - coronavirus , covid-19 , disease , asymptomatic , feature selection , transcriptome , medicine , machine learning , infectious disease (medical specialty) , biology , computer science , gene , gene expression , biochemistry
COVID-19, a severe respiratory disease caused by a new type of coronavirus SARS-CoV-2, has been spreading all over the world. Patients infected with SARS-CoV-2 may have no pathogenic symptoms, i.e., presymptomatic patients and asymptomatic patients. Both patients could further spread the virus to other susceptible people, thereby making the control of COVID-19 difficult. The two major challenges for COVID-19 diagnosis at present are as follows: (1) patients could share similar symptoms with other respiratory infections, and (2) patients may not have any symptoms but could still spread the virus. Therefore, new biomarkers at different omics levels are required for the large-scale screening and diagnosis of COVID-19. Although some initial analyses could identify a group of candidate gene biomarkers for COVID-19, the previous work still could not identify biomarkers capable for clinical use in COVID-19, which requires disease-specific diagnosis compared with other multiple infectious diseases. As an extension of the previous study, optimized machine learning models were applied in the present study to identify some specific qualitative host biomarkers associated with COVID-19 infection on the basis of a publicly released transcriptomic dataset, which included healthy controls and patients with bacterial infection, influenza, COVID-19, and other kinds of coronavirus. This dataset was first analysed by Boruta, Max-Relevance and Min-Redundancy feature selection methods one by one, resulting in a feature list. This list was fed into the incremental feature selection method, incorporating one of the classification algorithms to extract essential biomarkers and build efficient classifiers and classification rules. The capacity of these findings to distinguish COVID-19 with other similar respiratory infectious diseases at the transcriptomic level was also validated, which may improve the efficacy and accuracy of COVID-19 diagnosis.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom