Spherical k-Means Clustering

Open Access

Author(s) - Kurt Hornik, Ingo Feinerer, Martin Kober, Christian Buchta

Publication year - 2012

Publication title - Journal of Statistical Software

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 7.636

H-Index - 145

ISSN - 1548-7660

DOI - 10.18637/jss.v050.i10

Subject(s) - cluster analysis, benchmark (surveying), algorithm, computer science, machine learning, mathematics, geodesy, geography

Clustering text documents is a fundamental task in modern data analysis, requiring approaches that perform well in terms of both solution quality and computational efficiency. Spherical k-means clustering is one approach that addresses both issues, employing cosine dissimilarities to perform prototype-based partitioning of term weight representations of the documents. This paper presents the theory underlying the standard spherical k-means problem and suitable extensions, and introduces the R extension package skmeans, which provides a computational environment for spherical k-means clustering featuring several solvers: a fixed-point algorithm, a genetic algorithm, and interfaces to two external solvers (CLUTO and Gmeans). The performance of these solvers is investigated by means of a large-scale benchmark experiment.
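To make the idea of prototype-based partitioning under cosine dissimilarity concrete, here is a minimal sketch of a fixed-point iteration for the standard spherical k-means problem, written in plain Python. It is not the skmeans package's implementation or API: the function names (`normalize`, `cosine_dissimilarity`, `spherical_kmeans`), the random initialization, and the stopping rule are all assumptions made for illustration, and term weighting of the documents is assumed to have been done beforehand.

```python
import math
import random


def normalize(v):
    """Scale a vector to unit Euclidean length (zero vectors are returned as-is)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def cosine_dissimilarity(x, p):
    """Return 1 - cos(x, p) for two non-zero vectors."""
    ux, up = normalize(x), normalize(p)
    return 1.0 - sum(a * b for a, b in zip(ux, up))


def spherical_kmeans(docs, k, n_iter=100, seed=0):
    """Hypothetical helper: fixed-point iteration for spherical k-means.

    docs is a list of term weight vectors; returns (labels, prototypes).
    """
    rng = random.Random(seed)
    data = [normalize(d) for d in docs]
    # Assumption: initialize prototypes with k randomly chosen documents.
    prototypes = [list(p) for p in rng.sample(data, k)]
    labels = None
    for _ in range(n_iter):
        # Assignment step: pick the prototype with the smallest cosine dissimilarity.
        new_labels = [
            min(range(k), key=lambda j: cosine_dissimilarity(x, prototypes[j]))
            for x in data
        ]
        if new_labels == labels:
            break  # assignments no longer change
        labels = new_labels
        # Update step: each prototype becomes the normalized sum of its members.
        for j in range(k):
            members = [x for x, lab in zip(data, labels) if lab == j]
            if members:  # an empty cluster keeps its previous prototype
                total = [sum(col) for col in zip(*members)]
                prototypes[j] = normalize(total)
    return labels, prototypes


# Toy term weight vectors: two documents per "topic".
docs = [[1.0, 0.0, 0.0], [2.0, 0.1, 0.0], [0.0, 1.0, 0.0], [0.0, 3.0, 0.2]]
labels, prototypes = spherical_kmeans(docs, k=2)
```

Because every vector is projected onto the unit sphere, documents that differ only in length (for example, the same text repeated twice) end up with zero dissimilarity, which is the usual motivation for using cosine dissimilarity on text.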
