Model‐based cluster analysis

Author(s) - Stahl Daniel, Sallis Hannah

Publication year - 2012

Publication title - Wiley Interdisciplinary Reviews: Computational Statistics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.693

H-Index - 38

eISSN - 1939-0068

pISSN - 1939-5108

DOI - 10.1002/wics.1204

Subject(s) - cluster analysis, computer science, exploratory data analysis, heuristic, data mining, mixture model, population, model selection, cluster (spacecraft), consensus clustering, machine learning, artificial intelligence, correlation clustering, cure data clustering algorithm, demography, sociology, programming language

Cluster analysis seeks to identify homogeneous subgroups of cases in a population. This article provides an introduction to model‐based clustering using finite mixture models and their extensions. Finite mixtures have been used for clustering and classification for more than a hundred years, but have become increasingly popular in the last decade owing to advances in computer technology and software availability. Unlike traditional methods of cluster analysis, which rely on heuristic or distance‐based procedures, finite mixture modeling provides a formal statistical framework on which to base the clustering procedure. Finite mixture models assume that the population is made up of several distinct subsets (or clusters), each following a different multivariate probability distribution. Model‐based cluster analysis can handle a mix of nominal, ordinal, count, and continuous variables, any of which may contain missing values. We demonstrate how the problems of determining the number of clusters and choosing an appropriate clustering method reduce to a model selection problem, for which objective procedures exist. We briefly discuss how model‐based cluster analysis can be used to analyze complex and structured (e.g., longitudinal) datasets.

WIREs Comput Stat 2012. doi: 10.1002/wics.1204

This article is categorized under:
Statistical Learning and Exploratory Methods of the Data Sciences > Clustering and Classification
Statistical Learning and Exploratory Methods of the Data Sciences > Modeling Methods
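
As a rough illustration of the abstract's point that choosing the number of clusters reduces to a model selection problem, the sketch below fits finite mixtures with 1 to 4 components and keeps the one with the lowest Bayesian information criterion (BIC). Everything specific here is an assumption made for illustration and is not taken from the article: the components are univariate Gaussians, fitting uses a plain EM algorithm, BIC stands in for the "objective procedures" the abstract mentions, the data are simulated, and the function names (fit_mixture, bic, select_k) are hypothetical. A finite mixture density has the form f(x) = sum_k pi_k f_k(x | theta_k), with mixing weights pi_k that sum to one.

```python
import math
import random


def normal_pdf(x, mean, var):
    """Density of a univariate normal distribution."""
    return math.exp(-((x - mean) ** 2) / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def mixture_pdf(x, weights, means, variances):
    """Mixture density f(x) = sum_k pi_k * N(x | mean_k, var_k)."""
    return sum(w * normal_pdf(x, m, v) for w, m, v in zip(weights, means, variances))


def fit_mixture(data, k, n_iter=200, min_var=1e-3):
    """Fit a k-component univariate Gaussian mixture by EM.

    Returns (weights, means, variances, log_likelihood).
    The variance floor min_var is an illustrative guard against
    components collapsing onto a single case.
    """
    n = len(data)
    ordered = sorted(data)
    # Deterministic start: means at evenly spaced quantiles, shared variance.
    means = [ordered[int((j + 0.5) * n / k)] for j in range(k)]
    overall_mean = sum(data) / n
    overall_var = sum((x - overall_mean) ** 2 for x in data) / n
    variances = [max(overall_var, min_var)] * k
    weights = [1.0 / k] * k

    for _ in range(n_iter):
        # E-step: posterior probability that each case belongs to each cluster.
        resp = []
        for x in data:
            dens = [w * normal_pdf(x, m, v) for w, m, v in zip(weights, means, variances)]
            total = sum(dens) or 1e-300
            resp.append([d / total for d in dens])
        # M-step: re-estimate weights, means, and variances from weighted cases.
        for j in range(k):
            nj = sum(r[j] for r in resp) or 1e-300
            weights[j] = nj / n
            means[j] = sum(r[j] * x for r, x in zip(resp, data)) / nj
            var = sum(r[j] * (x - means[j]) ** 2 for r, x in zip(resp, data)) / nj
            variances[j] = max(var, min_var)

    log_lik = sum(math.log(mixture_pdf(x, weights, means, variances) or 1e-300) for x in data)
    return weights, means, variances, log_lik


def bic(log_lik, k, n):
    """BIC = -2 log L + p log n, with p = 3k - 1 free parameters
    (k means, k variances, k - 1 independent mixing weights)."""
    n_params = 3 * k - 1
    return -2.0 * log_lik + n_params * math.log(n)


def select_k(data, candidates=(1, 2, 3, 4)):
    """Return (best_k, {k: BIC}), choosing the k with the lowest BIC."""
    scores = {k: bic(fit_mixture(data, k)[3], k, len(data)) for k in candidates}
    return min(scores, key=scores.get), scores


# Simulated population made up of two subgroups (illustrative data only).
rng = random.Random(0)
sample = [rng.gauss(0.0, 1.0) for _ in range(150)] + [rng.gauss(6.0, 1.0) for _ in range(150)]
best_k, bic_scores = select_k(sample)
print("selected number of clusters:", best_k)
for k, score in sorted(bic_scores.items()):
    print(k, round(score, 1))
```

In practice one would more likely use an established package than hand-written EM, and the mixed nominal, ordinal, and count variables mentioned in the abstract would need component distributions other than the Gaussian used in this sketch.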
