z-logo
Premium
Model‐based cluster analysis
Author(s) -
Stahl Daniel,
Sallis Hannah
Publication year - 2012
Publication title -
wiley interdisciplinary reviews: computational statistics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.693
H-Index - 38
eISSN - 1939-0068
pISSN - 1939-5108
DOI - 10.1002/wics.1204
Subject(s) - cluster analysis , computer science , exploratory data analysis , heuristic , data mining , mixture model , population , model selection , cluster (spacecraft) , consensus clustering , machine learning , artificial intelligence , correlation clustering , cure data clustering algorithm , demography , sociology , programming language
Cluster analysis seeks to identify homogeneous subgroups of cases in a population. This article provides an introduction to model‐based clustering using finite mixture models and extensions. Finite mixtures have been successfully used for more than a hundred years for clustering and classification, but have become increasingly popular in the last decade due to recent advances in computer technology and software availability. Unlike traditional methods of cluster analysis, which are based on heuristic or distance‐based procedures, finite mixture modeling provides a formal statistical framework on which to base the clustering procedure. Finite mixture models assume that the population is made up of several distinct subsets (or clusters), each following a different multivariate probability density distribution. Model‐based cluster analysis can deal with a mix of nominal, ordinal, count, or continuous variables, any of which may contain missing values. We will demonstrate how the problems of determining the number of clusters and choosing an appropriate clustering method reduce to a model selection problem, for which objective procedures exist. We briefly discuss how model‐based cluster analysis can be used to analyze complex and structured (e.g., longitudinal) datasets. WIREs Comput Stat 2012 doi: 10.1002/wics.1204 This article is categorized under: Statistical Learning and Exploratory Methods of the Data Sciences > Clustering and Classification Statistical Learning and Exploratory Methods of the Data Sciences > Modeling Methods

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here