
Analisis perbandingan algoritma Naïve Bayes, k-Nearest Neighbor dan Neural Network untuk permasalahan class-imbalanced data pada kasus credit card fraud dataset
Author(s) -
Mita Yuanika Sahroni,
Niken Ayu Setifani,
Devinta Nurul Fitriana
Publication year - 2021
Publication title -
teknologi
Language(s) - English
Resource type - Journals
eISSN - 2527-3671
pISSN - 2087-8893
DOI - 10.26594/teknologi.v11i2.2393
Subject(s) - undersampling , computer science , artificial neural network , k nearest neighbors algorithm , naive bayes classifier , artificial intelligence , credit card fraud , credit card , class (philosophy) , bayes' theorem , machine learning , pattern recognition (psychology) , data mining , algorithm , support vector machine , bayesian probability , world wide web , payment
The high public interest in transactions using credit cards in the banking sector has the potential for higher credit card fraud. This study uses a credit card fraud dataset that consisting of 284,807 data obtained from Kaggle. The dataset in this study is class-imbalanced data with a comparison between the major class of 99.8% and the minor class of 0.2%. This class-imbalanced data problem will be solved by applying undersampling. In order to determine the performance of the classification algorithm that is most suitable for solving class-imbalanced data problems, a comparison of the Naïve Bayes, k-Nearest Neighbor (kNN) and Neural Network algorithms will be carried out. The t-test in this study was conducted to determine the significance of differences between algorithms. Algorithm performance evaluation uses accuracy and AUC (area under the curve) values. The test results in this study is Neural Network has better performance than other algorithms because it has the highest accuracy value of 93.59% and AUC value of 0.977. Based on the t-test results, the Neural Network with k-NN has a significant difference, in contrast to the Neural Network with Naïve Bayes there is no significant difference.