
A novel imbalanced data classification approach using both under and over sampling
Author(s) -
Seyyed-Mohammad Javadi-Moghaddam,
Asadollah Noroozi
Publication year - 2021
Publication title -
bulletin of electrical engineering and informatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.251
H-Index - 12
ISSN - 2302-9285
DOI - 10.11591/eei.v10i5.2785
Subject(s) - oversampling , class (philosophy) , computer science , sampling (signal processing) , artificial intelligence , data mining , machine learning , pattern recognition (psychology) , bandwidth (computing) , computer network , filter (signal processing) , computer vision
The performance of the data classification has encountered a problem when the data distribution is imbalanced. This fact results in the classifiers tend to the majority class which has the most of the instances. One of the popular approaches is to balance the dataset using over and under sampling methods. This paper presents a novel pre-processing technique that performs both over and under sampling algorithms for an imbalanced dataset. The proposed method uses the SMOTE algorithm to increase the minority class. Moreover, a cluster-based approach is performed to decrease the majority class which takes into consideration the new size of the minority class. The experimental results on 10 imbalanced datasets show the suggested algorithm has better performance in comparison to previous approaches.