z-logo
open-access-imgOpen Access
An Unsupervised Learning Short Text Clustering Method
Author(s) -
Dai Zu-hua,
Kelong Li,
Hongyi Li,
Xiaoting Li
Publication year - 2020
Publication title -
journal of physics. conference series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/1650/3/032090
Subject(s) - cluster analysis , computer science , artificial intelligence , sentence , embedding , pattern recognition (psychology) , unsupervised learning , representation (politics) , correlation clustering , feature learning , natural language processing , politics , political science , law
Due to the continuous development of Natural Language Processing (NLP), the task of short text categorization has been paid more and more attention. In short text clustering, the high-dimensional sparseness of text representation matrix becomes a challenging problem. This paper proposes a deep embedded method for feature extraction and clustering allocation using auto encoder of sentence distributed embedding. This method maps from data space to low-dimensional feature space and iteratively optimizes clustering targets. Experimental results on three short Chinese text data sets verify the effectiveness of the method. Moreover, it is superior to the existing correlation clustering methods.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here