Premium
Evolutionary computation for feature selection in classification problems
Author(s) -
de la Iglesia Beatriz
Publication year - 2013
Publication title -
wiley interdisciplinary reviews: data mining and knowledge discovery
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.506
H-Index - 47
eISSN - 1942-4795
pISSN - 1942-4787
DOI - 10.1002/widm.1106
Subject(s) - computer science , feature selection , genetic programming , heuristic , selection (genetic algorithm) , evolutionary computation , machine learning , artificial intelligence , data pre processing , feature (linguistics) , data mining , particle swarm optimization , genetic algorithm , preprocessor , evolutionary algorithm , philosophy , linguistics
Feature subset selection ( FSS ) has received a great deal of attention in statistics, machine learning, and data mining. Real world data analyzed by data mining algorithms can involve a large number of redundant or irrelevant features or simply too many features for a learning algorithm to handle them efficiently. Feature selection is becoming essential as databases grow in size and complexity. The selection process is expected to bring benefits in terms of better performing models, computational efficiency, and simpler more understandable models. Evolutionary computation ( EC ) encompasses a number of naturally inspired techniques such as genetic algorithms, genetic programming, ant colony optimization, or particle swarm optimization algorithms. Such techniques are well suited to feature selection because the representation of a feature subset is straightforward and the evaluation can also be easily accomplished through the use of wrapper or filter algorithms. Furthermore, the capability of such heuristic algorithms to efficiently search large search spaces is of great advantage to the feature selection problem. Here, we review the use of different EC paradigms for feature selection in classification problems. We discuss details of each implementation including representation, evaluation, and validation. The review enables us to uncover the best EC algorithms for FSS and to point at future research directions. WIREs Data Mining Knowl Discov 2013, 3:381–407. doi: 10.1002/widm.1106 This article is categorized under: Technologies > Classification Technologies > Computational Intelligence Technologies > Data Preprocessing