
A Hybrid Model for Microblog Real‐Time Filtering
Author(s) -
Han Zhongyuan,
Yang Muyun,
Kong Leilei,
Qi Haoliang,
Li Sheng
Publication year - 2016
Publication title -
chinese journal of electronics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.267
H-Index - 25
eISSN - 2075-5597
pISSN - 1022-4653
DOI - 10.1049/cje.2016.05.007
Subject(s) - microblogging , social media , computer science , real time computing , world wide web
The task of real‐time microblog filtering is to decide if the subsequently posted tweets are relevant to a given query representing special information needs. The filters based on the retrieval model or the text classification model are the main solutions for this task. To best exploit the strengths of the two models, a hybrid model using the retrieval model as prior knowledge to rectify the hyperplane of classification is proposed. The hybrid filtering model incorporates the language model and the logistic regression model. Evaluated on the Text RetriEval Conference (TREC) 2012 microblog real‐time filtering track dataset, the experimental results show that the proposed model is significantly better than the logistic regression model and the language model. Especially, it outperforms the best method of the TREC 2012 microblog real‐time filtering track.