z-logo
open-access-imgOpen Access
Oversampling Method on Classifying Hypertension Using Naive Bayes, Decision Tree, and Artificial Neural Network
Author(s) -
Nurul Chamidah,
Mayanda Mega Santoni,
Nurhafifah Matondang
Publication year - 2020
Publication title -
jurnal resti (rekayasa sistem dan teknologi informasi)
Language(s) - English
Resource type - Journals
ISSN - 2580-0760
DOI - 10.29207/resti.v4i4.2015
Subject(s) - oversampling , naive bayes classifier , decision tree , artificial intelligence , artificial neural network , computer science , machine learning , class (philosophy) , tree (set theory) , data mining , pattern recognition (psychology) , mathematics , support vector machine , computer network , mathematical analysis , bandwidth (computing)
Oversampling is a technique to balance the number of data records for each class by generating data with a small number of records in a class, so that the amount is balanced with data with a class with a large number of records. Oversampling in this study is applied to hypertension dataset where hypertensive class has a small number of records when compared to the number of records for non-hypertensive classes. This study aims to evaluate the effect of oversampling on the classification of hypertension dataset consisting of hypertensive and non-hypertensive classes by utilizing the Naïve Bayes, Decision Tree, and Artificial Neural Network (ANN) as well as finding the best model of the three algorithms. Evaluation of the use of oversampling on hypertension dataset is done by processing the data by imputing missing values, oversampling, and transforming data into the same range, then using the Naïve Bayes, Decision Tree, and ANN to build classification models. By dividing 80% of data as training data to build models and 20% as validation data for testing models, we had an increase in classification performance in the form of accuracy, precision, and recall of the oversampled data when compared without oversampling. The best performance in this study resulted in the highest accuracy using ANN with 0.91, precision 0.86 and recall 0.99.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here