z-logo
open-access-imgOpen Access
Processing Long Queries Against Short Text
Author(s) -
Dongxiang Zhang,
Yuchen Li,
Ju Fan,
Lianli Gao,
Fumin Shen,
Heng Tao Shen
Publication year - 2017
Publication title -
acm transactions on office information systems
Language(s) - English
Resource type - Journals
eISSN - 1558-1152
pISSN - 0734-2047
DOI - 10.1145/3052772
Subject(s) - computer science , block (permutation group theory) , inverted index , rank (graph theory) , boosting (machine learning) , upper and lower bounds , scheme (mathematics) , information retrieval , index (typography) , learning to rank , data mining , block size , search engine indexing , artificial intelligence , world wide web , ranking (information retrieval) , key (lock) , mathematical analysis , mathematics , geometry , combinatorics , computer security
Many real applications in real-time news stream advertising call for efficient processing of long queries against short text. In such applications, dynamic news feeds are regarded as queries to match against an advertisement (ad) database for retrieving the k most relevant ads. The existing approaches to keyword retrieval cannot work well in this search scenario when queries are triggered at a very high frequency. To address the problem, we introduce new techniques to significantly improve search performance. First, we devise a two-level partitioning for tight upper bound estimation and a lazy evaluation scheme to delay full evaluation of unpromising candidates, which can bring three to four times performance boosting in a database with 7 million ads. Second, we propose a novel rank-aware block-oriented inverted index to further improve performance. In this index scheme, each entry in an inverted list is assigned a rank according to its importance in the ad. Then, we introduce a block-at-a-time search strategy based on the index scheme to support a much tighter upper bound estimation and a very early termination. We have conducted experiments with real datasets, and the results show that the rank-aware method can further improve performance by an order of magnitude.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom