z-logo
open-access-imgOpen Access
Noise Avoidance SMOTE in Ensemble Learning for Imbalanced Data
Author(s) -
Kyoungok Kim
Publication year - 2021
Publication title -
ieee access
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.587
H-Index - 127
ISSN - 2169-3536
DOI - 10.1109/access.2021.3120738
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Class imbalance is a common problem in many real-world applications. To deal with class imbalance, several techniques, including resampling and ensemble approaches, have been proposed and resampling and ensemble methods have been proven effective for imbalanced data. Moreover, hybrid methods that combine resampling and ensemble have been verified to be highly effective in dealing with imbalance problems. this study proposes new hybrid sampling/ensemble algorithms based on a modification of SMOTE, called NASBoost and NASBagging, which avoids selecting noise samples in the minority class while maintaining diversity among training sets. The proposed sampling method introduces new measures to identify samples that may generate noisy synthetic samples during sampling in SMOTE. Experimental results on 16 imbalanced datasets show that the hybrid of the proposed sampling procedure and ensemble algorithms improves the classification performance by preventing the generation of noise while allowing samples in the minority class to be evenly chosen.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here