A novel approach for solving skewed classification problem using cluster based ensemble method
Author(s) - Gillala Rekha, V. Krishna Reddy, Amit Kumar Tyagi
Publication year - 2020
Publication title - Mathematical Foundations of Computing
Language(s) - English
Resource type - Journals
ISSN - 2577-8838
DOI - 10.3934/mfc.2020001
Subject(s) - oversampling , boosting (machine learning) , adaboost , ensemble learning , computer science , artificial intelligence , machine learning , classifier (uml) , class (philosophy) , cluster (spacecraft) , pattern recognition (psychology) , data mining , computer network , bandwidth (computing) , programming language
The class imbalance problem is prevalent in numerous real-world applications. When training samples of one class vastly outnumber those of the other classes, traditional machine learning algorithms become biased towards the majority class (the class with more samples), which leads to a significant loss of model performance. Several techniques have been proposed to handle class imbalance, including data sampling and boosting. In this paper, we present a cluster-based oversampling with boosting algorithm (Cluster+Boost) for learning from imbalanced data. We compare the performance of the proposed approach with state-of-the-art ensemble learning methods such as AdaBoost, RUSBoost and SMOTEBoost. We conducted experiments on 22 data sets with various imbalance ratios. The experimental results are promising, and the approach provides an alternative for improving classifier performance when learning from highly imbalanced data sets.
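The abstract does not spell out the Cluster+Boost procedure, so the following is only a minimal sketch of the general idea of cluster-based oversampling, not the authors' algorithm. It assumes a plain k-means grouping of the minority class and synthetic points made by interpolating between two members of the same cluster. The names `imbalance_ratio`, `kmeans` and `cluster_oversample` are hypothetical, and the boosting stage (for example AdaBoost trained on the balanced set) is left out.

```python
import random
from collections import Counter


def imbalance_ratio(labels):
    """Majority class count divided by minority class count."""
    counts = Counter(labels)
    return max(counts.values()) / min(counts.values())


def kmeans(points, k, rng, iters=20):
    """Tiny k-means on tuples; returns a cluster index for every point."""
    centers = rng.sample(points, k)
    assign = [0] * len(points)
    for _ in range(iters):
        # Assign each point to its nearest center (squared Euclidean distance).
        assign = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
            for p in points
        ]
        # Move each non-empty cluster's center to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:
                centers[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return assign


def cluster_oversample(X, y, minority, k=2, seed=0):
    """Oversample `minority` until it matches the largest class (assumed scheme)."""
    rng = random.Random(seed)
    minority_pts = [x for x, lab in zip(X, y) if lab == minority]
    needed = max(Counter(y).values()) - len(minority_pts)
    if not minority_pts or needed <= 0:
        return list(X), list(y)
    assign = kmeans(minority_pts, min(k, len(minority_pts)), rng)
    clusters = {}
    for p, a in zip(minority_pts, assign):
        clusters.setdefault(a, []).append(p)
    groups = list(clusters.values())
    synthetic = []
    for i in range(needed):
        group = groups[i % len(groups)]  # cycle through the clusters
        a, b = rng.choice(group), rng.choice(group)
        t = rng.random()
        # A new point on the segment between two same-cluster members.
        synthetic.append(tuple(u + t * (v - u) for u, v in zip(a, b)))
    return list(X) + synthetic, list(y) + [minority] * len(synthetic)


# Toy data: 12 majority samples (label 0) and 4 minority samples (label 1).
X = [(float(i), 10.0) for i in range(12)] + [(0.0, 0.0), (0.1, 0.2), (5.0, 5.0), (5.1, 4.9)]
y = [0] * 12 + [1] * 4
X_bal, y_bal = cluster_oversample(X, y, minority=1)
print(imbalance_ratio(y), imbalance_ratio(y_bal))  # 3.0 1.0
```

Interpolating only within a cluster keeps synthetic samples close to existing minority samples, which is one common motivation for clustering before oversampling. Whether the paper uses this particular scheme is an assumption.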
