z-logo
open-access-imgOpen Access
Model‐based clustering using S‐PLUS
Author(s) -
Leese Morven,
Landau Sabine
Publication year - 2006
Publication title -
international journal of methods in psychiatric research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.275
H-Index - 73
eISSN - 1557-0657
pISSN - 1049-8931
DOI - 10.1002/mpr.191
Subject(s) - cluster analysis , ordinal data , heuristic , computer science , data mining , ordinal scale , multivariate statistics , statistics , artificial intelligence , machine learning , mathematics
Cluster analysis can be used to identify homogenous subgroups in many fields, including psychology and psychiatry. However, most clustering methods implemented in general‐purpose statistical packages are heuristic and can be criticized in principle for their lack of an underlying statistical model. Furthermore correlations between variables are generally ignored by standard methods. The question addressed here is whether currently available commercial software (S‐PLUS), which provides model‐based methods for clustering correlated continuous data, should be used for clustering data derived from questionnaires. Such data may be either continuous or ordinal in nature and typically exhibit correlations. Performance is assessed in this study on simulated data sets containing distinct multivariate normal subpopulations, both before and after mapping the simulated data onto an ordinal scale. A practical example showing how correlated data can be cluster‐analysed using these methods is given. The conclusion is that model‐based methods are certainly worthwhile for continuous data. However, their benefit, in particular their ability to deal with correlated data, is not marked for ordinal data. Simpler methods such as Ward's method may be almost as effective in this situation. Copyright © 2006 John Wiley & Sons, Ltd.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here