z-logo
Premium
Flexible variable selection for recovering sparsity in nonadditive nonparametric models
Author(s) -
Fang Zaili,
Kim Inyoung,
Schaumont Patrick
Publication year - 2016
Publication title -
biometrics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.298
H-Index - 130
eISSN - 1541-0420
pISSN - 0006-341X
DOI - 10.1111/biom.12518
Subject(s) - nonparametric statistics , selection (genetic algorithm) , feature selection , variable (mathematics) , computer science , model selection , econometrics , statistics , mathematics , artificial intelligence , mathematical analysis
Summary Variable selection for recovering sparsity in nonadditive and nonparametric models with high‐dimensional variables has been challenging. This problem becomes even more difficult due to complications in modeling unknown interaction terms among high‐dimensional variables. There is currently no variable selection method to overcome these limitations. Hence, in this article we propose a variable selection approach that is developed by connecting a kernel machine with the nonparametric regression model. The advantages of our approach are that it can: (i) recover the sparsity; (ii) automatically model unknown and complicated interactions; (iii) connect with several existing approaches including linear nonnegative garrote and multiple kernel learning; and (iv) provide flexibility for both additive and nonadditive nonparametric models. Our approach can be viewed as a nonlinear version of a nonnegative garrote method. We model the smoothing function by a Least Squares Kernel Machine (LSKM) and construct the nonnegative garrote objective function as the function of the sparse scale parameters of kernel machine to recover sparsity of input variables whose relevances to the response are measured by the scale parameters. We also provide the asymptotic properties of our approach. We show that sparsistency is satisfied with consistent initial kernel function coefficients under certain conditions. An efficient coordinate descent/backfitting algorithm is developed. A resampling procedure for our variable selection methodology is also proposed to improve the power.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here