z-logo
Premium
K‐means clustering: A half‐century synthesis
Author(s) -
Steinley Douglas.
Publication year - 2006
Publication title -
british journal of mathematical and statistical psychology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.157
H-Index - 51
eISSN - 2044-8317
pISSN - 0007-1102
DOI - 10.1348/000711005x48266
Subject(s) - initialization , cluster analysis , preprocessor , computer science , variance (accounting) , class (philosophy) , reduction (mathematics) , function (biology) , variable (mathematics) , algorithm , data mining , mathematics , machine learning , artificial intelligence , mathematical analysis , geometry , accounting , evolutionary biology , business , biology , programming language
This paper synthesizes the results, methodology, and research conducted concerning the K ‐means clustering method over the last fifty years. The K ‐means method is first introduced, various formulations of the minimum variance loss function and alternative loss functions within the same class are outlined, and different methods of choosing the number of clusters and initialization, variable preprocessing, and data reduction schemes are discussed. Theoretic statistical results are provided and various extensions of K ‐means using different metrics or modifications of the original algorithm are given, leading to a unifying treatment of K ‐means and some of its extensions. Finally, several future studies are outlined that could enhance the understanding of numerous subtleties affecting the performance of the K ‐means method.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here