Sequential feature selection and inference using multi-variate random forests
Author(s) -
Joshua Mayer,
Raziur Rahman,
Souparno Ghosh,
Ranadip Pal
Publication year - 2017
Publication title -
Bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.599
H-Index - 390
eISSN - 1367-4811
pISSN - 1367-4803
DOI - 10.1093/bioinformatics/btx784
Subject(s) - interpretability , feature selection , computer science , random forest , inference , machine learning , artificial intelligence , ranking (information retrieval) , feature (linguistics) , data mining , selection (genetic algorithm) , tree (set theory) , set (abstract data type) , flexibility (engineering) , mathematics , statistics , mathematical analysis , philosophy , linguistics , programming language
Random forest (RF) has become a widely popular prediction-generating mechanism. Its strength lies in its flexibility, interpretability and ability to handle a large number of features, typically larger than the sample size. However, this methodology is of limited use if one wishes to identify statistically significant features. Several ranking schemes are available that provide information on the relative importance of the features, but there is a paucity of general inferential mechanisms, particularly in a multivariate setup. We use the conditional inference tree framework to generate an RF in which features are deleted sequentially based on explicit hypothesis testing. The resulting sequential algorithm offers an inferentially justifiable, but model-free, variable selection procedure. The significant features are then used to generate a predictive RF. An added advantage of our methodology is that both variable selection and prediction are based on the conditional inference framework and hence are coherent.
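The abstract outlines the procedure at a high level: fit a forest, test each feature for significance, delete the least significant feature, and repeat until every survivor passes the test; the surviving features then feed a predictive RF. The paper's method is built on the conditional inference tree framework (as implemented in R's party/partykit); the sketch below is only an illustration of the sequential-deletion idea, substituting an ordinary scikit-learn forest and a permutation test for feature relevance. All function names, the `alpha` threshold and the test itself are assumptions for illustration, not the authors' implementation.

```python
# Illustrative sketch of sequential, hypothesis-test-driven feature
# deletion. NOTE: the paper uses conditional inference trees; here a
# standard RF plus a column-permutation test stands in for that framework.
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split

def perm_pvalue(rf, X_test, y_test, j, n_perm, rng):
    """Permutation p-value for H0: feature j is irrelevant.

    Permuting an irrelevant column should leave the test score
    unchanged, so under H0 the permuted score matches the baseline.
    """
    base = rf.score(X_test, y_test)
    hits = 0
    for _ in range(n_perm):
        X_perm = X_test.copy()
        rng.shuffle(X_perm[:, j])          # break the feature-response link
        if rf.score(X_perm, y_test) >= base:
            hits += 1
    return (hits + 1) / (n_perm + 1)       # add-one smoothing

def sequential_select(X, y, alpha=0.05, n_perm=100, seed=0):
    """Delete features one at a time until all survivors are significant."""
    rng = np.random.default_rng(seed)
    active = list(range(X.shape[1]))       # indices of surviving features
    while active:
        X_tr, X_te, y_tr, y_te = train_test_split(
            X[:, active], y, random_state=seed)
        rf = RandomForestRegressor(
            n_estimators=200, random_state=seed).fit(X_tr, y_tr)
        pvals = [perm_pvalue(rf, X_te, y_te, j, n_perm, rng)
                 for j in range(len(active))]
        worst = int(np.argmax(pvals))
        if pvals[worst] <= alpha:          # every survivor rejects H0: stop
            break
        del active[worst]                  # drop the least significant feature
    return active
```

On simulated data where only the first two of five features drive the response, this sketch keeps those two and discards the noise columns, after which a final RF would be refit on the selected features for prediction.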