A genetic algorithm using Calinski-Harabasz index for automatic clustering problem
Author(s) -
Suzane Pereira Lima,
M. D. Soler Cruz
Publication year - 2020
Publication title -
revista brasileira de computação aplicada
Language(s) - English
Resource type - Journals
ISSN - 2176-6649
DOI - 10.5335/rbca.v12i3.11117
Subject(s) - cluster analysis , determining the number of clusters in a data set , fitness function , computer science , algorithm , cluster (spacecraft) , data mining , genetic algorithm , correlation clustering , rand index , index (typography) , single linkage clustering , function (biology) , cure data clustering algorithm , mathematics , artificial intelligence , machine learning , evolutionary biology , world wide web , biology , programming language
Data clustering is a technique that aims to represent a dataset in clusters according to their similarities. In clustering algorithms, it is usually assumed that the number of clusters is known. Unfortunately, the optimal number of clusters is unknown for many applications. This kind of problem is called Automatic Clustering. There are several cluster validity indexes for evaluating solutions and it is known that the quality of a result is influenced by the chosen function. From this, a genetic algorithm is described in this article for the resolution for automatic clustering using the Calinski-Harabasz Index as a form of evaluation. Comparisons between the results and other algorithms in literature are also presented. In a first analysis, fitness values equivalent or higher are found in at least 58% of the cases for each comparison. Our algorithm could also find the correct number of clusters or close values in 33 cases out of 48. In another comparison, some fitness values are lower, even with the correct number of clusters, but graphically the partitioning are adequate. Thus, it is observed that our proposal is justified and that improvements can be studied for cases in which the correct number of clusters is not found.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom