Solving text clustering problem using a memetic differential evolution algorithm | Zendy

Hossam M. J. Mustafa | Zendy; Masri Ayob | Zendy; Dheeb Albashish | Zendy; Sawsan Abu-Taleb | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Solving text clustering problem using a memetic differential evolution algorithm

Author(s) -

Hossam M. J. Mustafa,

Masri Ayob,

Dheeb Albashish,

Sawsan Abu-Taleb

Publication year - 2020

Publication title -

plos one

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.99

H-Index - 332

ISSN - 1932-6203

DOI - 10.1371/journal.pone.0232816

Subject(s) - cluster analysis , benchmark (surveying) , computer science , memetic algorithm , data mining , cure data clustering algorithm , correlation clustering , hierarchical clustering , canopy clustering algorithm , differential evolution , document clustering , consensus clustering , artificial intelligence , clustering high dimensional data , algorithm , machine learning , evolutionary algorithm , geodesy , geography

The text clustering is considered as one of the most effective text document analysis methods, which is applied to cluster documents as a consequence of the expanded big data and online information. Based on the review of the related work of the text clustering algorithms, these algorithms achieved reasonable clustering results for some datasets, while they failed on a wide variety of benchmark datasets. Furthermore, the performance of these algorithms was not robust due to the inefficient balance between the exploitation and exploration capabilities of the clustering algorithm. Accordingly, this research proposes a Memetic Differential Evolution algorithm (MDETC) to solve the text clustering problem, which aims to address the effect of the hybridization between the differential evolution (DE) mutation strategy with the memetic algorithm (MA). This hybridization intends to enhance the quality of text clustering and improve the exploitation and exploration capabilities of the algorithm. Our experimental results based on six standard text clustering benchmark datasets (i.e. the Laboratory of Computational Intelligence (LABIC)) have shown that the MDETC algorithm outperformed other compared clustering algorithms based on AUC metric, F-measure, and the statistical analysis. Furthermore, the MDETC is compared with the state of art text clustering algorithms and obtained almost the best results for the standard benchmark datasets.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research