z-logo
open-access-imgOpen Access
Extension of K-Modes Algorithm for Generating Clusters Automatically
Author(s) -
Anupama Chadha,
Suresh Kumar
Publication year - 2016
Publication title -
international journal of information technology and computer science
Language(s) - English
Resource type - Journals
eISSN - 2074-9015
pISSN - 2074-9007
DOI - 10.5815/ijitcs.2016.03.06
Subject(s) - categorical variable , simplicity , computer science , extension (predicate logic) , algorithm , cluster analysis , euclidean distance , simple (philosophy) , set (abstract data type) , matching (statistics) , dependency (uml) , euclidean geometry , measure (data warehouse) , data mining , mathematics , artificial intelligence , machine learning , statistics , programming language , philosophy , geometry , epistemology
K-Modes is an eminent algorithm for clustering data set with categorical attributes. This algorithm is famous for its simplicity and speed. The KModes is an extension of the K-Means algorithm for categorical data. Since K-Modes is used for categorical data so ‘Simple Matching Dissimilarity’ measure is used instead of Euclidean distance and the ‘Modes’ of clusters are used instead of ‘Means’. However, one major limitation of this algorithm is dependency on prior input of number of clusters K, and sometimes it becomes practically impossible to correctly estimate the optimum number of clusters in advance. In this paper we have proposed an algorithm which will overcome this limitation while maintaining the simplicity of K-Modes algorithm.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom