Feature selection for high-dimensional classification using a competitive swarm optimizer
Author(s) -
Shenkai Gu,
Ran Cheng,
Yaochu Jin
Publication year - 2016
Publication title -
soft computing
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.626
H-Index - 81
eISSN - 1433-7479
pISSN - 1432-7643
DOI - 10.1007/s00500-016-2385-6
Subject(s) - feature selection , particle swarm optimization , computer science , benchmark (surveying) , feature (linguistics) , selection (genetic algorithm) , artificial intelligence , metaheuristic , optimization problem , machine learning , evolutionary computation , swarm behaviour , evolutionary algorithm , algorithm , linguistics , philosophy , geodesy , geography
When solving many machine learning problems such as classification, there exists a large number of input features. However, not all features are relevant for solving the problem, and sometimes, including irrelevant features may deteriorate the learning performance.Please check the edit made in the article title Therefore, it is essential to select the most relevant features, which is known as feature selection. Many feature selection algorithms have been developed, including evolutionary algorithms or particle swarm optimization (PSO) algorithms, to find a subset of the most important features for accomplishing a particular machine learning task. However, the traditional PSO does not perform well for large-scale optimization problems, which degrades the effectiveness of PSO for feature selection when the number of features dramatically increases. In this paper, we propose to use a very recent PSO variant, known as competitive swarm optimizer (CSO) that was dedicated to large-scale optimization, for solving high-dimensional feature selection problems. In addition, the CSO, which was originally developed for continuous optimization, is adapted to perform feature selection that can be considered as a combinatorial optimization problem. An archive technique is also introduced to reduce computational cost. Experiments on six benchmark datasets demonstrate that compared to the canonical PSO-based and a state-of-the-art PSO variant for feature selection, the proposed CSO-based feature selection algorithm not only selects a much smaller number of features, but result in better classification performance as well
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom