Clustering Mixed-Attribute Data using Random Walk
Author(s) -
Andrew Skabar
Publication year - 2017
Publication title -
procedia computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2017.05.083
Subject(s) - computer science , cluster analysis , data mining , similarity measure , similarity (geometry) , fuzzy clustering , consensus clustering , centroid , correlation clustering , measure (data warehouse) , range (aeronautics) , single linkage clustering , artificial intelligence , cure data clustering algorithm , pattern recognition (psychology) , materials science , composite material , image (mathematics)
Most clustering algorithms rely in some fundamental way on a measure of either similarity or distance — either between objects themselves, or between objects and cluster centroids. When the dataset contains mixed attributes, defining a suitable measure can be problematic. This paper presents a general graph-based method for clustering mixed-attribute datasets that does not require any explicit measure of similarity or distance. Empirical results on a range of well-known datasets using a range of evaluation measures show that the method achieves performance competitive with traditional clustering algorithms that require explicit calculation of distance or similarity, as well as with more recently proposed clustering algorithms based on matrix factorization.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom