z-logo
Premium
Coselection of features and instances for unsupervised rare category analysis
Author(s) -
He Jingrui,
Carbonell Jaime
Publication year - 2010
Publication title -
statistical analysis and data mining: the asa data science journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.381
H-Index - 33
eISSN - 1932-1872
pISSN - 1932-1864
DOI - 10.1002/sam.10091
Subject(s) - computer science , selection (genetic algorithm) , feature selection , artificial intelligence , class (philosophy) , machine learning , feature (linguistics) , data mining , pattern recognition (psychology) , philosophy , linguistics
Rare category analysis is of key importance both in theory and in practice. Previous research work focuses on supervised rare category analysis, such as rare category detection and rare category classification. In this paper, for the first time, we address the challenge of unsupervised rare category analysis, including feature selection and rare category selection. We propose to jointly deal with the two correlated tasks, so that they can benefit from each other. To this end, we design an optimization framework which is able to coselect the relevant features and the examples from the rare category (a.k.a. the minority class). It is well justified theoretically. Furthermore, we develop the Partial Augmented Lagrangian Method (PALM) to solve the optimization problem. Experimental results on both synthetic and real data sets show the effectiveness of the proposed method. Copyright © 2010 Wiley Periodicals, Inc. Statistical Analysis and Data Mining 3: 417‐430, 2010

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here