z-logo
open-access-imgOpen Access
Imbalanced dataset classification algorithm based on NDSVM
Author(s) -
Liu Yueting
Publication year - 2021
Publication title -
journal of physics. conference series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/1871/1/012153
Subject(s) - support vector machine , artificial intelligence , decision boundary , pattern recognition (psychology) , classifier (uml) , computer science , residual , class (philosophy) , algorithm , statistical classification , data mining , machine learning , mathematics
Because of uneven distribution and indistinct boundary in imbalanced dataset, imbalanced dataset classification algorithm based on neighbors density support vector machine (NDSVM)is proposed. In this algorithm, the neighbor range density of each sample in the majority class is calculated firstly. According to the density value, the data which on the majority class border or close to the border is equal to the minority samples in quantity, which are selected, then the minority class complete SVM initial classification. Then the resulting support vector machine and residual data in the majority class optimize the initial classifier. The simulation results of experiments on the manual and UCI dataset show that compared with WSVM、 ALSMOTE-SVM and SVM, NDSVM has better classification performance, which effectively improve the classification performance of SVM algorithm on the uneven distribution and indistinct boundary in imbalanced dataset.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here