z-logo
open-access-imgOpen Access
The Efficiency of Aggregation Methods in Ensemble Filter Feature Selection Models
Author(s) -
Noureldien Noureldien,
Saffa Mohmoud
Publication year - 2021
Publication title -
transactions on machine learning and artificial intelligence
Language(s) - English
Resource type - Journals
ISSN - 2054-7390
DOI - 10.14738/tmlai.94.10101
Subject(s) - computer science , feature selection , feature (linguistics) , selection (genetic algorithm) , filter (signal processing) , pattern recognition (psychology) , aggregate (composite) , data aggregator , weighted arithmetic mean , artificial intelligence , data mining , machine learning , computer network , philosophy , linguistics , materials science , wireless sensor network , composite material , computer vision
Ensemble feature selection is recommended as it proves to produce a more stable subset of features and a better classication accuracy when compared to the individual feature selection methods. In this approach, the output of feature selection methods, called base selectors, are combined using some aggregation methods. For filter feature selection methods, a list aggregation method is needed to aggregate output ranked lists into a single list, and since many list aggregation methods have been proposed the decision on which method to use to build the optimum ensemble model is a de facto question. In this paper, we investigate the efficiency of four aggregation methods, namely; Min, Median, Arithmetic Mean, and Geometric Mean. The performance of aggregation methods is evaluated using five datasets from different scientific fields with a variant number of instances and features. Besides, the classifies used in the evaluation are selected from three different classes, Trees, Rules, and Bayes. The experimental results show that 11 out of the 15 best performance results are corresponding to ensemble models. And out of the 11 best performance ensemble models, the most efficient aggregation methods are Median (5/11), followed by Arithmetic Mean (3/11) and Min (3/11). Also, results show that as the number of features increased, the efficient aggregation method changes from Min to Median to Arithmetic Mean. This may suggest that for a very high number of features the efficient aggregation method is the Arithmetic Mean. And generally, there is no aggregation method that is the best for all cases.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here