z-logo
Premium
High‐dimensional data classification model based on random projection and Bagging‐ support vector machine
Author(s) -
Sun Yujia,
Platoš Jan
Publication year - 2020
Publication title -
concurrency and computation: practice and experience
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.309
H-Index - 67
eISSN - 1532-0634
pISSN - 1532-0626
DOI - 10.1002/cpe.6095
Subject(s) - support vector machine , projection (relational algebra) , random projection , computer science , artificial intelligence , construct (python library) , random forest , pattern recognition (psychology) , test data , machine learning , sample (material) , process (computing) , data mining , algorithm , chemistry , chromatography , programming language , operating system
Aiming at the long training time when classifying high‐dimensional data, a parallel classification model is proposed based on random projection and Bagging‐support vector machine (SVM) to process high‐dimensional data. The model first uses random projection to project the input data into the low‐dimensional space. Then, we used the Bagging method to construct multiple training data subsets and used SVM to train the training subset in parallel and generate several subclassifiers. Finally, various classifiers vote to determine the category of the test sample. The model has been verified using two standard datasets. The experimental results show that the model can significantly improve the training speed and classification performance of high‐dimensional data with little accuracy loss.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here