z-logo
open-access-imgOpen Access
Information preserving multi-objective feature selection for unsupervised learning
Author(s) -
Ingo Mierswa,
Michael Wurst
Publication year - 2006
Publication title -
citeseer x (the pennsylvania state university)
Language(s) - English
Resource type - Conference proceedings
ISBN - 1-59593-186-4
DOI - 10.1145/1143997.1144248
Subject(s) - computer science , artificial intelligence , feature selection , cluster analysis , unsupervised learning , machine learning , feature (linguistics) , selection (genetic algorithm) , pareto principle , pattern recognition (psychology) , point (geometry) , feature learning , supervised learning , data mining , artificial neural network , mathematics , mathematical optimization , philosophy , linguistics , geometry
In this work we propose a novel, sound framework for evolutionary feature selection in unsupervised machine learning problems. We show that unsupervised feature selection is inherently multi-objective and behaves differently from supervised feature selection in that the number of features must be maximized instead of being minimized. Although this might sound surprising from a supervised learning point of view, we exemplify this relationship on the problem of data clustering and show that existing approaches do not pose the optimization problem in an appropriate way. Another important consequence of this paradigm change is a method which segments the Pareto sets produced by our approach. Inspecting only prototypical points from these segments drastically reduces the amount of work for selecting a final solution. We compare our methods against existing approaches on eight data sets.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom