Premium
Feature selection in single and ensemble learning‐based bankruptcy prediction models
Author(s) -
Lin WeiChao,
Lu YuHsin,
Tsai ChihFong
Publication year - 2019
Publication title -
expert systems
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.365
H-Index - 38
eISSN - 1468-0394
pISSN - 0266-4720
DOI - 10.1111/exsy.12335
Subject(s) - feature selection , computer science , artificial intelligence , boosting (machine learning) , machine learning , naive bayes classifier , support vector machine , data pre processing , preprocessor , ensemble learning , pattern recognition (psychology) , predictive modelling , data mining , selection (genetic algorithm) , feature (linguistics) , linguistics , philosophy
Feature selection is an important data preprocessing step for the construction of an effective bankruptcy prediction model. The prediction performance can be affected by the employed feature selection and classification techniques. However, there have been very few studies of bankruptcy prediction that identify the best combination of feature selection and classification techniques. In this study, two types of feature selection methods, including filter‐ and wrapper‐based methods, are considered, and two types of classification techniques, including statistical and machine learning techniques, are employed in the development of the prediction methods. In addition, bagging and boosting ensemble classifiers are also constructed for comparison. The experimental results based on three related datasets that contain different numbers of input features show that the genetic algorithm as the wrapper‐based feature selection method performs better than the filter‐based one by information gain. It is also shown that the lowest prediction error rates for the three datasets are provided by combining the genetic algorithm with the naïve Bayes and support vector machine classifiers without bagging and boosting.