z-logo
open-access-imgOpen Access
Investigating the Performance of Smote for Class Imbalanced Learning: A Case Study of Credit Scoring Datasets
Author(s) -
Maira Anis,
Mohsin Ali
Publication year - 2017
Publication title -
european scientific journal
Language(s) - English
Resource type - Journals
eISSN - 1857-7881
pISSN - 1857-7431
DOI - 10.19044/esj.2017.v13n33p340
Subject(s) - oversampling , class (philosophy) , resampling , computer science , machine learning , artificial intelligence , sampling (signal processing) , data mining , bandwidth (computing) , computer network , filter (signal processing) , computer vision
Classification of datasets is one of the major issues encountered by the data mining community. This problem heightens when the real world datasets is also imbalanced in nature. A dataset happens to be imbalanced when the numbers of observations belonging to rare class are greatly outnumbered by the observations of another class. Class with greater number of observation is called the majority or the negative class, while the other with rare observations is referred to as the minority or the positive class. Literature represents number of resampling techniques that address the problem of class imbalance. One of the most important strategies is to resample the datasets that aim to balance the number of minority or majority observations by over-sampling or under-sampling respectively. This paper aims to investigates and analyze the performance of most widely used oversampling procedure Synthetic Minority Oversampling Technique (SMOTE) for different thresholds of oversampling using four classifiers for three credit scoring datasets.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here