z-logo
Premium
Learning to expand queries using entities
Author(s) -
Brandão Wladmir C.,
Santos Rodrygo L. T.,
Ziviani Nivio,
Moura Edleno S.,
Silva Altigran S.
Publication year - 2014
Publication title -
journal of the association for information science and technology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.903
H-Index - 145
eISSN - 2330-1643
pISSN - 2330-1635
DOI - 10.1002/asi.23084
Subject(s) - computer science , exploit , discriminative model , query expansion , information retrieval , web search query , term (time) , rank (graph theory) , web query classification , learning to rank , data mining , search engine , ranking (information retrieval) , machine learning , mathematics , computer security , physics , quantum mechanics , combinatorics
A substantial fraction of web search queries contain references to entities, such as persons, organizations, and locations. Recently, methods that exploit named entities have been shown to be more effective for query expansion than traditional pseudorelevance feedback methods. In this article, we introduce a supervised learning approach that exploits named entities for query expansion using Wikipedia as a repository of high‐quality feedback documents. In contrast with existing entity‐oriented pseudorelevance feedback approaches, we tackle query expansion as a learning‐to‐rank problem. As a result, not only do we select effective expansion terms but we also weigh these terms according to their predicted effectiveness. To this end, we exploit the rich structure of Wikipedia articles to devise discriminative term features, including each candidate term's proximity to the original query terms, as well as its frequency across multiple article fields and in category and infobox descriptors. Experiments on three Text RE trieval Conference web test collections attest the effectiveness of our approach, with gains of up to 23.32% in terms of mean average precision, 19.49% in terms of precision at 10, and 7.86% in terms of normalized discounted cumulative gain compared with a state‐of‐the‐art approach for entity‐oriented query expansion.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here