z-logo
Premium
A comparative review of variable selection techniques for covariate dependent Dirichlet process mixture models
Author(s) -
Barcella William,
Iorio Maria,
Baio Gianluca
Publication year - 2017
Publication title -
canadian journal of statistics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.804
H-Index - 51
eISSN - 1708-945X
pISSN - 0319-5724
DOI - 10.1002/cjs.11323
Subject(s) - covariate , latent variable , mathematics , dirichlet distribution , dirichlet process , feature selection , statistics , econometrics , computer science , artificial intelligence , mathematical analysis , bayesian probability , boundary value problem
Dirichlet Process Mixture (DPM) models have been increasingly employed to specify random partition models that take into account possible patterns within covariates. Furthermore with large numbers of covariates methods for selecting the most important covariates have been proposed. Commonly the covariates are chosen either for their importance in determining the clustering of the observations or for their effect on the level of a response variable (when a regression model is specified). Typically both strategies involve the specification of latent indicators that regulate the inclusion of the covariates in the model. Common examples involve the use of spike and slab prior distributions. In this work we review the most relevant DPM models that include covariate information in the induced partition of the observations and we focus on available variable selection techniques for these models. We highlight the main features of each model and demonstrate them using simulations and a real data application. The Canadian Journal of Statistics 45: 254–273; 2017 © 2017 Statistical Society of Canada

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here