
Characterization of SARS-CoV-2 cases in Mexico using data mining
Author(s) -
Enrique Luna-Ramírez,
Jorge Soria-Cruz,
Apolinar Velarde-Martínez,
Edgar Aurelio Taya-Acosta
Publication year - 2020
Publication title -
Revista de Cómputo Aplicado
Language(s) - English
Resource type - Journals
ISSN - 2531-2952
DOI - 10.35429/jca.2020.15.4.19.25
Subject(s) - copd, panorama, disease, government (linguistics), test (biology), covid-19, medicine, diabetes mellitus, asthma, obesity, pulmonary disease, environmental health, computer science, infectious disease (medical specialty), artificial intelligence, pathology, biology, paleontology, linguistics, philosophy, endocrinology
This paper analyzes the data published by the Federal Government of Mexico on cases tested for the presence of the SARS-CoV-2 virus, which causes the COVID-19 disease. More than a million cases were analyzed, most of which tested positive. The study considered twenty-one significant variables, including the test result and whether the patient died, together with factors that complicate a person's health such as diabetes, chronic obstructive pulmonary disease (COPD), asthma, hypertension, obesity and smoking, among others. The data were first prepared so that they could be processed with data mining techniques, following the CRISP-DM methodology for knowledge extraction. These techniques were then used to generate data models that characterize the development of the COVID-19 disease at the national and local (state) level. As an important part of the models, various rules or correlations were observed among the variables. These could be used to predict, in part, the future development of the COVID-19 disease in Mexico and, consequently, to establish best practices aimed at reducing its social impact.
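The abstract does not say which algorithm produced the rules among variables. As a minimal sketch, assuming the rules are association rules of the form "risk factor implies outcome", the following Python computes the three usual rule metrics (support, confidence and lift) over boolean case records. The field names (`diabetes`, `obesity`, `death`), the function `rule_metrics` and the eight records are hypothetical and purely illustrative; they are not taken from the Mexican government dataset or from the paper's results.

```python
from typing import Dict, List, Sequence, Tuple

Record = Dict[str, bool]

# Synthetic toy records (NOT real data): one dict per tested case.
TOY_RECORDS: List[Record] = [
    {"diabetes": True,  "obesity": True,  "death": True},
    {"diabetes": True,  "obesity": False, "death": True},
    {"diabetes": True,  "obesity": True,  "death": False},
    {"diabetes": False, "obesity": True,  "death": False},
    {"diabetes": False, "obesity": False, "death": False},
    {"diabetes": False, "obesity": False, "death": False},
    {"diabetes": True,  "obesity": False, "death": False},
    {"diabetes": False, "obesity": False, "death": False},
]


def rule_metrics(
    records: Sequence[Record],
    antecedent: Sequence[str],
    consequent: str,
) -> Tuple[float, float, float]:
    """Return (support, confidence, lift) for the rule antecedent -> consequent.

    support    = P(antecedent and consequent)
    confidence = P(consequent | antecedent)
    lift       = confidence / P(consequent)
    """
    n = len(records)
    if n == 0:
        raise ValueError("records must not be empty")

    # Count cases matching the antecedent, the consequent, and both.
    n_ante = sum(1 for r in records if all(r[a] for a in antecedent))
    n_cons = sum(1 for r in records if r[consequent])
    n_both = sum(
        1 for r in records if all(r[a] for a in antecedent) and r[consequent]
    )

    support = n_both / n
    # Guard against division by zero when a condition never occurs.
    confidence = n_both / n_ante if n_ante else 0.0
    lift = confidence / (n_cons / n) if n_cons else 0.0
    return support, confidence, lift


if __name__ == "__main__":
    s, c, l = rule_metrics(TOY_RECORDS, ["diabetes"], "death")
    print(f"diabetes -> death: support={s:.2f}, confidence={c:.2f}, lift={l:.2f}")
```

A lift above 1 indicates that the outcome is more frequent among cases matching the antecedent than in the data overall. On real data, such a measure describes co-occurrence only and does not by itself establish causation.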