z-logo
open-access-imgOpen Access
Better external memory suffix array construction
Author(s) -
Roman Dementiev,
Juha Kärkkäinen,
Jens Mehnert,
Peter Sanders
Publication year - 2008
Publication title -
acm journal of experimental algorithmics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.494
H-Index - 37
ISSN - 1084-6654
DOI - 10.1145/1227161.1402296
Subject(s) - compressed suffix array , suffix array , suffix , computer science , suffix tree , construct (python library) , simple (philosophy) , data structure , algorithm , asymptotically optimal algorithm , theoretical computer science , parallel computing , programming language , linguistics , philosophy , epistemology
Suffix arrays are a simple and powerful data structure for text processing that can be used for full text indexes, data compression, and many other applications, in particular, in bioinformatics. However, so far, it has appeared prohibitive to build suffix arrays for huge inputs that do not fit into main memory. This paper presents design, analysis, implementation, and experimental evaluation of several new and improved algorithms for suffix array construction. The algorithms are asymptotically optimal in the worst case or on average. Our implementation can construct suffix arrays for inputs of up to 4-GB in hours on a low-cost machine. As a tool of possible independent interest, we present a systematic way to design, analyze, and implement pipelined algorithms.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom