z-logo
open-access-imgOpen Access
Distribution-Aware Compressed Full-Text Indexes
Author(s) -
Paolo Ferragina,
Jouni Sirén,
Rossano Venturini
Publication year - 2013
Publication title -
algorithmica
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.647
H-Index - 78
eISSN - 1432-0541
pISSN - 0178-4617
DOI - 10.1007/s00453-013-9782-3
Subject(s) - theory of computation , upper and lower bounds , computer science , graph , index (typography) , path (computing) , distribution (mathematics) , reduction (mathematics) , algorithm , directed acyclic graph , mathematical optimization , space (punctuation) , mathematics , theoretical computer science , combinatorics , mathematical analysis , geometry , world wide web , programming language , operating system
In this paper we address the problem of building a compressed self-index that, given a distribution for the pattern queries and a bound on the space occupancy, minimizes the expected query-time within that index-space bound. We solve this problem by exploiting a reduction to the problem of finding a minimum weight K-link path in a particular Directed Acyclic Graph. Interestingly enough, our solution is independent of the underlying compressed index in use. Our experiments compare this optimal strategy with several other standard approaches, showing its effectiveness in practice.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom