z-logo
open-access-imgOpen Access
Sphericalk-Means Clustering
Author(s) -
Kurt Hornik,
Ingo Feinerer,
Martin Kober,
Christian Buchta
Publication year - 2012
Publication title -
journal of statistical software
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 7.636
H-Index - 145
ISSN - 1548-7660
DOI - 10.18637/jss.v050.i10
Subject(s) - cluster analysis , benchmark (surveying) , algorithm , computer science , machine learning , mathematics , geodesy , geography
Clustering text documents is a fundamental task in modern data analysis, requiring approaches which perform well both in terms of solution quality and computational efficiency. Spherical k-means clustering is one approach to address both issues, employing cosine dissimilarities to perform prototype-based partitioning of term weight representations of the documents. This paper presents the theory underlying the standard spherical k-means problem and suitable extensions, and introduces the R extension package skmeans which provides a computational environment for spherical k-means clustering featuring several solvers: a fixed-point and genetic algorithm, and interfaces to two external solvers (CLUTO and Gmeans). Performance of these solvers is investigated by means of a large scale benchmark experiment.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom