Premium
An improved genetic algorithm with Lagrange and density method for clustering
Author(s) -
Li Ling,
Zhou Xiangbing,
Li Yiping,
Gu Jiangang,
Shen Shaopeng
Publication year - 2020
Publication title -
concurrency and computation: practice and experience
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.309
H-Index - 67
eISSN - 1532-0634
pISSN - 1532-0626
DOI - 10.1002/cpe.5969
Subject(s) - cluster analysis , fitness function , computer science , genetic algorithm , mathematical optimization , correlation clustering , cure data clustering algorithm , population , canopy clustering algorithm , data mining , algorithm , mathematics , artificial intelligence , demography , sociology
Summary To overcome the shortcomings of K‐means clustering including clustering numbers, sensitivity to clustering center (seeds) and local optimization, this article proposes an improved genetic algorithm (GA) with a novel Lagrange‐based fitness function and an initial population technique(called NicheClust algorithm); the NicheClust can determine the best chromosomes and then feeds these into K‐means as initial seeds to achieve higher‐quality clustering results by allowing the initial seeds to readjust in terms of clustering demands. The GA approach is proposed to search for a global optimally solution. The initial population method is presented to automatically capture the appropriate number of clusters and find the initial seeds. The Lagrange‐based approach is used to prevent the fitness function from prematurely converging and capture global optimization for K‐means clustering results. Experimental results based on six taxi Global Positioning System (GPS) datasets verify the higher performance of NicheClust compared to other clustering methods and validate the effectiveness with statistical analysis method.