Prediction of Credit-Card Defaulters: A Comparative Study on Performance of Classifiers
Author(s) -
Apoorva D. Ajay,
Ajay Venkatesh,
S.V. Juno Bella Gracia
Publication year - 2016
Publication title -
international journal of computer applications
Language(s) - English
Resource type - Journals
ISSN - 0975-8887
DOI - 10.5120/ijca2016910702
Subject(s) - computer science , credit card , machine learning , artificial intelligence , credit card fraud , world wide web , payment
Data mining is an emerging area of research that aims at extracting meaningful patterns from available data. This paper highlights the significance of classification in predicting new trends from voluminous data. Performance analysis of various data mining algorithm viz. BayesNet, Meta-Stacking, Naïve Bayes, Random Forest, SMO and ZeroR in predicting creditcard defaulters is discussed in this paper. Dataset from the UCI machine learning repository comprising of 25 attributes and 30000 instances have been employed to analyze the performance of algorithms. Moreover, the effect of feature selection has also been identified with respect to each classification algorithm. It has been concluded from the experimental results that both Correlation Feature Subset and Information Gain feature selection methods yield the most useful features for prediction and the accuracy of Random Forest Ensemble method is highest in predicting credit card defaulters. General Terms Data Mining, Credit card defaulters dataset.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom