Microblog Hot Topics Detection Based on VSM and HMBTM Model Fusion
Author(s) -
Qiu Liqing,
Wei Jia,
Haiyan Liu,
Xin Fan
Publication year - 2019
Publication title -
ieee access
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.587
H-Index - 127
ISSN - 2169-3536
DOI - 10.1109/access.2019.2932458
Subject(s) - microblogging , computer science , fuse (electrical) , social media , cluster analysis , word (group theory) , key (lock) , vector space model , data mining , artificial intelligence , matrix (chemical analysis) , sensor fusion , world wide web , mathematics , materials science , geometry , computer security , electrical engineering , composite material , engineering
With the rapid development of social media such as Twitter and Sina microblog, short texts are becoming more and more popular. However, because of the sparsity of word co-occurrence patterns in short texts, it is always a challenge to infer topics from the short texts. To solve that sparse problem of the short text data, in this paper, we present a HVCH (Hot topic detection based on the VSM Combined HMBTM) fusion model algorithm. Taking heat as the key factor, the heat matrix of calculated words and pairs is introduced to improve the BTM(Biterm Topic Model) model, and the semantic relationship between words is mined through the common heat between words, which is applied to the detection of hot topics in short text. Subsequently, we fuse the VSM(Vector Space Model) model with the HMBTM(Heat Matrix based BTM) model and uses the optimized Single-Pass clustering algorithm to obtain hot topics. At last, we conduct extensive experiments over real data sets, which demonstrate that our proposal achieves excellent performance to other related algorithms.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom