clustvarsel: A Package Implementing Variable Selection for Gaussian Model-Based Clustering in R | Zendy

Luca Scrucca | Zendy; Adrian E. Raftery | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

clustvarsel: A Package Implementing Variable Selection for Gaussian Model-Based Clustering in R

Author(s) -

Luca Scrucca,

Adrian E. Raftery

Publication year - 2018

Publication title -

journal of statistical software

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 7.636

H-Index - 145

ISSN - 1548-7660

DOI - 10.18637/jss.v084.i01

Subject(s) - cluster analysis , computer science , feature selection , selection (genetic algorithm) , r package , data mining , mixture model , gaussian , variable (mathematics) , model selection , greedy algorithm , artificial intelligence , algorithm , mathematics , mathematical analysis , physics , computational science , quantum mechanics

Finite mixture modeling provides a framework for cluster analysis based on parsimonious Gaussian mixture models. Variable or feature selection is of particular importance in situations where only a subset of the available variables provide clustering information. This enables the selection of a more parsimonious model, yielding more efficient estimates, a clearer interpretation and, often, improved clustering partitions. This paper describes the R package clustvarsel which performs subset selection for model-based clustering. An improved version of the Raftery and Dean (2006) methodology is implemented in the new release of the package to find the (locally) optimal subset of variables with group/cluster information in a dataset. Search over the solution space is performed using either a step-wise greedy search or a headlong algorithm. Adjustments for speeding up these algorithms are discussed, as well as a parallel implementation of the stepwise search. Usage of the package is presented through the discussion of several data examples.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research