
Predicting heart ailment in patients with varying number of features using data mining techniques
Author(s) -
T R Stella Mary,
Shoney Sebastian
Publication year - 2019
Publication title -
international journal of informatics and communication technology/international journal of informatics and communication technology (ij-ict)
Language(s) - English
Resource type - Journals
eISSN - 2722-2616
pISSN - 2252-8776
DOI - 10.11591/ijict.v8i1.pp56-62
Subject(s) - naive bayes classifier , random forest , computer science , data mining , artificial intelligence , c4.5 algorithm , bayes classifier , classifier (uml) , machine learning , pattern recognition (psychology) , support vector machine
Data mining can be defined as a process of extracting unknown, verifiable and possibly helpful data from information. Among the various ailments, heart ailment is one of the primary reason behind death of individuals around the globe, hence in order to curb this, a detailed analysis is done using Data Mining. Many a times we limit ourselves with minimal attributes that are required to predict a patient with heart disease. By doing so we are missing on a lot of important attributes that are main causes for heart diseases. Hence, this research aims at considering almost all the important features affecting heart disease and performs the analysis step by step with minimal to maximum set of attributes using Data Mining techniques to predict heart ailments. The various classification methods used are Naïve Bayes classifier, Random Forest and Random Tree which are applied on three datasets with different number of attributes but with a common class label. From the analysis performed, it shows that there is a gradual increase in prediction accuracies with the increase in the attributes irrespective of the classifiers used and Naïve Bayes and Random Forest algorithms comparatively outperforms with these sets of data.