Feature Selection: a Useful Preprocessing Step | Zendy

Isabelle Moulinier | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Feature Selection: a Useful Preprocessing Step

Author(s) -

Isabelle Moulinier

Publication year - 1997

Publication title -

electronic workshops in computing

Language(s) - English

Resource type - Conference proceedings

ISSN - 1477-9358

DOI - 10.14236/ewic/ir1997.6

Subject(s) - feature selection , computer science , categorization , preprocessor , artificial intelligence , feature (linguistics) , selection (genetic algorithm) , machine learning , curse of dimensionality , data pre processing , text categorization , data mining , feature extraction , task (project management) , pattern recognition (psychology) , engineering , philosophy , linguistics , systems engineering

Statistical classification techniques and machine learning methods have been applied to some Information Retrieval (IR) problems: routing, filtering and categorization. Most of these methods are usually awkward and sometimes intractable in highly dimensional feature spaces. In order to reduce dimensionality, feature selection has been introduced as a pre-processing step. In this paper, we assess to what extent feature selection can be used without causing a loss in effectiveness. This problem can be tackled since a couple of recent learners do not require a preprocessing step. On a text categorization task, using the Reuters-22,173 collection, we give empirical evidence that feature selection is useful: first, the size of the collection index can be drastically reduced without causing a significant loss in categorization effectiveness. Then, we show that feature selection speeds up the time required to automatically build the categorization system.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research