Premium
Complementary QA network analysis for QA retrieval in social question‐answering websites
Author(s) -
Liu DuenRen,
Chen YuHsuan,
Shen Minxin,
Lu PeiJung
Publication year - 2015
Publication title -
journal of the association for information science and technology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.903
H-Index - 145
eISSN - 2330-1643
pISSN - 2330-1635
DOI - 10.1002/asi.23155
Subject(s) - computer science , question answering , complementarity (molecular biology) , information retrieval , information overload , helpfulness , novelty , the internet , world wide web , similarity (geometry) , construct (python library) , artificial intelligence , psychology , social psychology , philosophy , theology , biology , image (mathematics) , genetics , programming language
With the ubiquity of the Internet and the rapid development of Web 2.0 technology, social question and answering ( SQA ) websites have become popular knowledge‐sharing platforms. As the number of posted questions and answers ( QAs ) continues to increase rapidly, the massive amount of question‐answer knowledge is causing information overload. The problem is compounded by the growing number of redundant QAs . SQA websites such as Yahoo! Answers are open platforms where users can freely ask or answer questions. Users also may wish to learn more about the information provided in an answer so they can use related keywords in the answer to search for extended, complementary information. In this article, we propose a novel approach to identify complementary QAs ( CQAs ) of a target QA . We define two types of complementarity: partial complementarity and extended complementarity. First, we utilize a classification‐based approach to predict complementary relationships between QAs based on three measures: question similarity, answer novelty, and answer correlation. Then we construct a CQA network based on the derived complementary relationships. In addition, we introduce a CQA network analysis technique that searches the QA network to find direct and indirect CQAs of the target QA . The results of experiments conducted on the data collected from Yahoo! Answers T aiwan show that the proposed approach can more effectively identify CQAs than can the conventional similarity‐based method. Case and user study results also validate the helpfulness and the effectiveness of our approach.