Open Access

Imbalanced data classification based on hybrid resampling and twin support vector machine

Author(s) - Lu Cao, Hong Shen

Publication year - 2017

Publication title - Computer Science and Information Systems

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.244

H-Index - 24

eISSN - 2406-1018

pISSN - 1820-0214

DOI - 10.2298/csis161221017l

Subject(s) - oversampling, computer science, support vector machine, resampling, artificial intelligence, machine learning, pattern recognition (psychology), data sampling, focus (optics), structured support vector machine, data mining, sampling (signal processing), training set, computer network, physics, bandwidth (computing), filter (signal processing), optics, computer vision

Imbalanced datasets are common in real-world applications, and identifying the minority class is usually the main goal when classifying them. The twin support vector machine (TWSVM), an enhanced variant of the support vector machine (SVM), provides an effective technique for data classification. TWSVM assumes a relatively balanced training set and class distribution in order to improve classification accuracy over the whole dataset; as a result, it is not effective on imbalanced data classification problems. In this paper, we propose combining TWSVM with a re-sampling technique that uses both oversampling and under-sampling to balance the training data. Experimental results show that our proposed approach outperforms other state-of-the-art methods.
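The hybrid resampling idea can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not specify the resampling rule, so the sketch assumes plain random oversampling of the minority class and random under-sampling of the majority class, with both classes brought to the midpoint of the two class sizes. The function name `hybrid_resample` and the toy data are hypothetical. Training a classifier such as TWSVM on the balanced output is left out.

```python
import random
from collections import Counter


def hybrid_resample(samples, labels, seed=0):
    """Balance a two-class dataset by random over- and under-sampling.

    Assumption: both classes are resampled to the midpoint of the two
    class sizes. The paper's actual resampling rule is not given in the
    abstract, so this target is purely illustrative.
    """
    rng = random.Random(seed)
    counts = Counter(labels)
    if len(counts) != 2:
        raise ValueError("expected exactly two classes")
    (majority, n_major), (minority, n_minor) = counts.most_common(2)
    target = (n_major + n_minor) // 2

    major_idx = [i for i, lab in enumerate(labels) if lab == majority]
    minor_idx = [i for i, lab in enumerate(labels) if lab == minority]

    # Under-sample: keep a random subset of the majority class.
    kept_major = rng.sample(major_idx, target)
    # Over-sample: keep every minority sample, then duplicate random ones.
    extra_minor = [rng.choice(minor_idx) for _ in range(target - n_minor)]

    chosen = kept_major + minor_idx + extra_minor
    rng.shuffle(chosen)
    return [samples[i] for i in chosen], [labels[i] for i in chosen]


# Toy data: 90 majority samples (label 0) and 10 minority samples (label 1).
X = [[float(i)] for i in range(100)]
y = [0] * 90 + [1] * 10
X_bal, y_bal = hybrid_resample(X, y)
print(Counter(y_bal))
```

Random duplication and random removal are the simplest choices; a more careful scheme might synthesize new minority samples or remove majority samples selectively, but nothing in the abstract indicates which variant the paper uses.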
