STatistical Inference Relief (STIR) feature selection

Trang T. Le, Ryan J. Urbanowicz, Jason H. Moore, Brett A. McKinney

Open Access

Publication year - 2018

Publication title - Bioinformatics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 3.599

H-Index - 390

eISSN - 1367-4811

pISSN - 1367-4803

DOI - 10.1093/bioinformatics/bty788

Subject(s) - inference, computer science, feature (linguistics), feature selection, statistical inference, selection (genetic algorithm), artificial intelligence, machine learning, data mining, pattern recognition (psychology), statistics, mathematics, philosophy, linguistics

Relief is a family of machine learning algorithms that uses nearest neighbors to select features whose association with an outcome may be due to epistasis or statistical interactions with other features in high-dimensional data. Relief-based estimators are non-parametric in the statistical sense: they do not have a parameterized model with an underlying probability distribution for the estimator, which makes it difficult to determine the statistical significance of Relief-based attribute estimates. A statistical inferential formalism is therefore needed to avoid imposing arbitrary thresholds when selecting the most important features. We reconceptualize the Relief-based feature selection algorithm to create a new family of STatistical Inference Relief (STIR) estimators that retains the ability to identify interactions while incorporating the sample variance of the nearest-neighbor distances into the attribute importance estimation. This variance permits the calculation of statistical significance of features and adjustment for multiple testing of Relief-based scores. Specifically, we develop a pseudo t-test version of Relief-based algorithms for case-control data.
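The abstract does not give the estimator's formula, so the following is only a minimal sketch of the idea it describes, not the authors' implementation. It assumes a standard pooled two-sample t-statistic comparing each attribute's differences to nearest misses (other class) against its differences to nearest hits (same class). The names `stir_t_scores` and `make_toy_data`, the Manhattan distance, the range scaling, and the normal approximation to the p-value are all assumptions made for illustration. Neighbor differences are not independent samples, which is one reason such a statistic would be called a pseudo t-test.

```python
import math
import random
from statistics import mean, variance


def stir_t_scores(X, y, k=1):
    """Return (t, one-sided p) per attribute from hit/miss neighbor differences.

    Hypothetical sketch: X is a list of numeric rows, y a list of 0/1 labels.
    """
    n, p = len(X), len(X[0])
    # Attribute ranges scale each difference into [0, 1]; constant columns get 1.0.
    ranges = [max(r[a] for r in X) - min(r[a] for r in X) or 1.0 for a in range(p)]

    def diff(a, i, j):
        return abs(X[i][a] - X[j][a]) / ranges[a]

    def dist(i, j):
        # Manhattan distance over scaled attributes (an assumed metric).
        return sum(diff(a, i, j) for a in range(p))

    hits = [[] for _ in range(p)]    # differences to same-class neighbors
    misses = [[] for _ in range(p)]  # differences to other-class neighbors
    for i in range(n):
        others = sorted((j for j in range(n) if j != i), key=lambda j: dist(i, j))
        near_hits = [j for j in others if y[j] == y[i]][:k]
        near_misses = [j for j in others if y[j] != y[i]][:k]
        for a in range(p):
            hits[a].extend(diff(a, i, j) for j in near_hits)
            misses[a].extend(diff(a, i, j) for j in near_misses)

    scores = []
    for a in range(p):
        nh, nm = len(hits[a]), len(misses[a])
        # Pooled sample variance of hit and miss differences.
        pooled = ((nh - 1) * variance(hits[a]) + (nm - 1) * variance(misses[a])) / (nh + nm - 2)
        se = math.sqrt(pooled * (1 / nh + 1 / nm))
        t = (mean(misses[a]) - mean(hits[a])) / se if se > 0 else 0.0
        # Normal approximation to the upper-tail p-value (assumes many neighbor pairs).
        p_value = 0.5 * math.erfc(t / math.sqrt(2))
        scores.append((t, p_value))
    return scores


def make_toy_data(n=60, seed=0):
    """Toy case-control data: attribute 0 tracks the label, attributes 1 and 2 are noise."""
    rng = random.Random(seed)
    y = [i % 2 for i in range(n)]
    X = [[2.0 * label + rng.gauss(0, 0.3), rng.gauss(0, 1), rng.gauss(0, 1)] for label in y]
    return X, y
```

Calling `stir_t_scores(*make_toy_data())` yields one `(t, p)` pair per attribute; the p-values could then be passed to a multiple-testing correction rather than ranked against an arbitrary score threshold. The toy data contains only a main effect, so it illustrates the statistic but not the interaction-detection ability the abstract emphasizes.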
