Premium
Clustering mixed data
Author(s) -
Hunt Lynette,
Jorgensen Murray
Publication year - 2011
Publication title -
wiley interdisciplinary reviews: data mining and knowledge discovery
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.506
H-Index - 47
eISSN - 1942-4795
pISSN - 1942-4787
DOI - 10.1002/widm.33
Subject(s) - categorical variable , cluster analysis , multivariate statistics , data mining , computer science , mixture model , artificial intelligence , machine learning
Abstract Mixture model clustering proceeds by fitting a finite mixture of multivariate distributions to data, the fitted mixture density then being used to allocate the data to one of the components. Common model formulations assume that either all the attributes are continuous or all the attributes are categorical. In this paper, we consider options for model formulation in the more practical case of mixed data : multivariate data sets that contain both continuous and categorical attributes. © 2011 John Wiley & Sons, Inc. WIREs Data Mining Knowl Discov 2011 1 352–361 DOI: 10.1002/widm.33 This article is categorized under: Algorithmic Development > Structure Discovery Technologies > Structure Discovery and Clustering