
Study of Bagging Application in the Safe-Level Smote Method in Handling Unbalanced Classification
Author(s) -
Qorry Meidianingsih,
Debby Agustine
Publication year - 2021
Publication title -
indonesian journal of statistics and applications
Language(s) - English
Resource type - Journals
ISSN - 2599-0802
DOI - 10.29244/ijsa.v5i1p105-116
Subject(s) - support vector machine , artificial intelligence , classifier (uml) , machine learning , computer science , pattern recognition (psychology) , data mining , binary classification
The problems of imbalanced class classification have been found in many real applications. It has potential to make the minority class instances tend to be classified into the majority class. This study examined the performance of bagging method’s application in safe-level SMOTE based on Support Vector Machine classifier. The data used consisted of three types based on the proportion of observations in the majority and minority classes. Each type of data has three variables, two independent variables and one variable dependent. The observations of independent variables were generated based on multivariate normal distribution, while dependent variables are binary. The results showed that the classifier has a high accuracy and sensitivity for all types of data for both in the imbalanced class and the balanced class (obtained by safe-level SMOTE and safe-level SMOTEBagging). Nevertheless, specificity was the main measure in assessing the performance of the classifier because it provides accuracy in classifying the minority class observations. The specificity increased when the number of observations between the two classes were approximately balance due to the implementation of safe-level SMOTE. The best performance of the Support Vector Machine in predicting minority class observations was achieved when bagging were applied in safe-level SMOTE. The specificity rate for all types of data were 77.93 percent, 78.46 percent, and 85.69 percent, respectively.