Text Clustering Based on a Divide and Merge Strategy | Zendy

Man Yuan | Zendy; Yong Shi | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Text Clustering Based on a Divide and Merge Strategy

Author(s) -

Man Yuan,

Yong Shi

Publication year - 2015

Publication title -

procedia computer science

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.334

H-Index - 76

ISSN - 1877-0509

DOI - 10.1016/j.procs.2015.07.153

Subject(s) - computer science , merge (version control) , cluster analysis , data mining , correlation clustering , benchmark (surveying) , artificial intelligence , pattern recognition (psychology) , information retrieval , geodesy , geography

A text clustering algorithm is proposed to overcome the drawback of division based clustering method on sensitivity of estimated class number. Complex features including synonym and co-occurring words are extracted to make a feature space containing more semantic information. Then the divide and merge strategy helps the iteration converge to a reasonable cluster number. Experimental results showed that the dynamically updated center number prevent the deterioration of clustering result when k deviates from the real class numbers. When k is too small or large, the difference of clustering results between FC-DM and k-means is more obvious and FC-DM also outperformed other benchmark algorithms

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research