Premium
Improving explicit term matching with implicit topic matching for short text conversation
Author(s) -
Wang Jianqiang,
Sun Ying
Publication year - 2019
Publication title -
proceedings of the association for information science and technology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.193
H-Index - 14
ISSN - 2373-9231
DOI - 10.1002/pra2.23
Subject(s) - computer science , matching (statistics) , ranking (information retrieval) , conversation , information retrieval , task (project management) , term (time) , vocabulary , key (lock) , natural language processing , artificial intelligence , linguistics , mathematics , statistics , philosophy , physics , quantum mechanics , management , computer security , economics
Short Text Conversation (STC) is the task of retrieving comments made to a post and reuse them as appropriate comments to a later post. Large vocabulary variations and very limited length of posts and comments pose great challenges to traditional retrieval techniques of matching terms. To address these problems, we propose a new technique of re‐ranking comments initially retrieved through explicit term matching by using evidence of implicit topic matching gained from topic models of large collections of posts and comments. The key of our technique lies in (1) combining each post with the comments responding to it into a longer pseudo‐document for topic modeling purposes, and (2) carrying out implicit topic matching by comparing the most probable topics contained in the topic composition of a comment and that of the query post. Experiments using the NTCIR‐13 STC Chinese WeiBo collection showed that with properly tuned parameters, our technique can lead to improved retrieval effectiveness. The paper analyzes and discusses the results and outlines future work.