Classification Accuracy of Hepatitis C Virus Infection Outcome: Data Mining Approach
Author(s) -
Mario Frías,
Jose M. Moyano,
Antonio RiveroJuárez,
José María Luna,
Ángela Camacho,
Habib M. Fardoun,
Isabel Machuca,
Mohamed Al-Twijri,
Antonio Rivero,
Sebastián Ventura
Publication year - 2020
Publication title -
journal of medical internet research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.446
H-Index - 142
eISSN - 1439-4456
pISSN - 1438-8871
DOI - 10.2196/18766
Subject(s) - medicine , hepatitis c virus , computer science , virology , data mining , virus
Background The dataset from genes used to predict hepatitis C virus outcome was evaluated in a previous study using a conventional statistical methodology. Objective The aim of this study was to reanalyze this same dataset using the data mining approach in order to find models that improve the classification accuracy of the genes studied. Methods We built predictive models using different subsets of factors, selected according to their importance in predicting patient classification. We then evaluated each independent model and also a combination of them, leading to a better predictive model. Results Our data mining approach identified genetic patterns that escaped detection using conventional statistics. More specifically, the partial decision trees and ensemble models increased the classification accuracy of hepatitis C virus outcome compared with conventional methods. Conclusions Data mining can be used more extensively in biomedicine, facilitating knowledge building and management of human diseases.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom