Learning to rank for robust question answering
Author(s) -
Arvind Agarwal,
Hema Raghavan,
Karthik Subbian,
Prem Melville,
Richard D. Lawrence,
David Gondek,
James Fan
Publication year - 2012
Publication title -
citeseer x (the pennsylvania state university)
Language(s) - English
Resource type - Conference proceedings
DOI - 10.1145/2396761.2396867
Subject(s) - ranking (information retrieval) , computer science , question answering , learning to rank , baseline (sea) , rank (graph theory) , artificial intelligence , domain (mathematical analysis) , machine learning , information retrieval , mathematics , mathematical analysis , oceanography , combinatorics , geology
This paper aims to solve the problem of improving the ranking of answer candidates for factoid based questions in a state-of-the-art Question Answering system. We first provide an extensive comparison of 5 ranking algorithms on two datasets -- from the Jeopardy quiz show and a medical domain. We then show the effectiveness of a cascading approach, where the ranking produced by one ranker is used as input to the next stage. The cascading approach shows sizeable gains on both datasets. We finally evaluate several rank aggregation techniques to combine these algorithms, and find that Supervised Kemeny aggregation is a robust technique that always beats the baseline ranking approach used by Watson for the Jeopardy competition. We further corroborate our results on TREC Question Answering datasets.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom