Premium
Correlation pursuit: forward stepwise variable selection for index models
Author(s) -
Zhong Wenxuan,
Zhang Tingting,
Zhu Yu,
Liu Jun S.
Publication year - 2012
Publication title -
journal of the royal statistical society: series b (statistical methodology)
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 6.523
H-Index - 137
eISSN - 1467-9868
pISSN - 1369-7412
DOI - 10.1111/j.1467-9868.2011.01026.x
Subject(s) - feature selection , mathematics , stepwise regression , linear regression , correlation , variable (mathematics) , statistics , selection (genetic algorithm) , variables , dimension (graph theory) , function (biology) , computer science , artificial intelligence , combinatorics , mathematical analysis , geometry , evolutionary biology , biology
Summary. A stepwise procedure, correlation pursuit (COP), is developed for variable selection under the sufficient dimension reduction framework, in which the response variable Y is influenced by the predictors through an unknown function of a few linear combinations of them. Unlike linear stepwise regression, COP does not impose a special form of relationship (such as linear) between the response variable and the predictor variables. The COP procedure selects variables that attain the maximum correlation between the transformed response and the linear combination of the variables. Various asymptotic properties of the COP procedure are established and, in particular, its variable selection performance under a diverging number of predictors and sample size is investigated. The excellent empirical performance of the COP procedure in comparison with existing methods is demonstrated by both extensive simulation studies and a real example in functional genomics.