On the selection of dimension reduction techniques for scientific applications
Author(s) -
Yi Fan,
Chandrika Kamath
Publication year - 2012
Publication title -
osti oai (u.s. department of energy office of scientific and technical information)
Language(s) - English
Resource type - Reports
DOI - 10.2172/1036865
Subject(s) - dimensionality reduction , dimension (graph theory) , intrinsic dimension , computer science , selection (genetic algorithm) , curse of dimensionality , data mining , reduction (mathematics) , artificial intelligence , space (punctuation) , multidimensional data , pattern recognition (psychology) , machine learning , mathematics , operating system , geometry , pure mathematics
Many dimension reduction methods have been proposed to discover the intrinsic, lower dimensional structure of a high-dimensional dataset. However, determining critical features in datasets that consist of a large number of features is still a challenge. In this paper, through a series of carefully designed experiments on real-world datasets, we investigate the performance of different dimension reduction techniques, ranging from feature subset selection to methods that transform the features into a lower dimensional space. We also discuss methods that calculate the intrinsic dimensionality of a dataset in order to understand the reduced dimension. Using several evaluation strategies, we show how these different methods can provide useful insights into the data. These comparisons enable us to provide guidance to a user on the selection of a technique for their dataset
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom