z-logo
Premium
How to Make Model‐free Feature Screening Approaches for Full Data Applicable to the Case of Missing Response?
Author(s) -
Wang Qihua,
Li Yongjin
Publication year - 2018
Publication title -
scandinavian journal of statistics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.359
H-Index - 65
eISSN - 1467-9469
pISSN - 0303-6898
DOI - 10.1111/sjos.12290
Subject(s) - missing data , imputation (statistics) , covariate , property (philosophy) , feature (linguistics) , data mining , computer science , mathematics , statistics , machine learning , philosophy , linguistics , epistemology
It is quite a challenge to develop model‐free feature screening approaches for missing response problems because the existing standard missing data analysis methods cannot be applied directly to high dimensional case. This paper develops some novel methods by borrowing information of missingness indicators such that any feature screening procedures for ultrahigh‐dimensional covariates with full data can be applied to missing response case. The first method is the so‐called missing indicator imputation screening, which is developed by proving that the set of the active predictors of interest for the response is a subset of the active predictors for the product of the response and missingness indicator under some mild conditions. As an alternative, another method called Venn diagram‐based approach is also developed. The sure screening property is proven for both methods. It is shown that the complete case analysis can also keep the sure screening property of any feature screening approach with sure screening property.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here