Feature Selection with Attributes Clustering by Maximal Information Coefficient
Author(s) -
Xi Zhao,
Wei Deng,
Yong Shi
Publication year - 2013
Publication title -
procedia computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2013.05.011
Subject(s) - computer science , feature selection , cluster analysis , feature (linguistics) , data mining , artificial intelligence , selection (genetic algorithm) , reuse , pattern recognition (psychology) , machine learning , philosophy , linguistics , ecology , biology
Feature selection is usually a separate procedure which can not benefit from result of the data exploration. In this paper, we propose a unsupervised feature selection method which could reuse a specific data exploration result. Furthermore, our algorithm follows the idea of clustering attributes and combines two state-of-the-art data analyzing methods, that's maximal information coefficient and affinity propagation. Classification problems with different classifiers were tested to validation our method and others. Data experiments result exhibits our unsupervised algorithm is comparable with classical feature selection methods and even outperforms some supervised learning algorithms. Data simulation with one credit dataset of our own from a bank of China shows the capability of our method for real world application
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom