z-logo
open-access-imgOpen Access
Metaheuristics Based Clustering Algorithms on Document Clustering
Author(s) -
Aytuğ Onan
Publication year - 2019
Publication title -
akıllı sistemler ve uygulamaları dergisi
Language(s) - English
Resource type - Journals
ISSN - 2667-6893
DOI - 10.54856/jiswa.201905059
Subject(s) - cluster analysis , computer science , correlation clustering , data mining , cure data clustering algorithm , automatic summarization , document clustering , metaheuristic , canopy clustering algorithm , cuckoo search , firefly algorithm , fuzzy clustering , single linkage clustering , particle swarm optimization , data stream clustering , artificial intelligence , algorithm
Cluster analysis is an important exploratory data analysis technique which divides data into groups based on their similarity. Document clustering is the process of employing clustering algorithms on textual data so that text documents can be retrieved, organized, navigated and summarized in an efficient way. Document clustering can be utilized in the organization, summarization and classification of text documents. Metaheuristic algorithms have been successfully utilized to deal with complex optimization problems, including cluster analysis. In this paper, we analyze the clustering quality of five metaheuristic clustering algorithms (namely, particle swarm optimization, genetic algorithm, cuckoo search, firefly algorithm and yarasa algorithm) on fifteen text collections in term of F-measure. In the empirical analysis, two conventional clustering algorithms (K-means and bi-secting k-means) are also considered. The experimental analysis indicates that swarm-based clustering algorithms outperform conventional clustering algorithms on text document clustering.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here