RMIT University
Browse

Optimized succinct data structures for massive data

journal contribution
posted on 2024-11-01, 18:56 authored by Simon Gog, Matthias Petri
Succinct data structures provide the same functionality as their corresponding traditional data structure in compact space. We improve on functions rank and select, which are the basic building blocks of FM-indexes and other succinct data structures. First, we present a cache-optimal, uncompressed bitvector representation that outperforms all existing approaches. Next, we improve, in both space and time, on a recent result by Navarro and Providel on compressed bitvectors. Last, we show techniques to perform rank and select on 64-bit words that are up to three times faster than existing methods. In our experimental evaluation, we first show how our improvements affect cache and runtime performance of both operations on data sets larger than commonly used in the evaluation of succinct data structures. Our experiments show that our improvements to these basic operations significantly improve the runtime performance and compression effectiveness of FM-indexes on small and large data sets. To our knowledge, our improvements result in FM-indexes that are either smaller or faster than all current state of the art implementations.

History

Journal

Software: Practice and Experience

Volume

44

Issue

11

Start page

1287

End page

1314

Total pages

28

Publisher

John Wiley and Sons

Place published

United Kingdom

Language

English

Copyright

© 2013 John Wiley and Sons, Ltd.

Former Identifier

2006052645

Esploro creation date

2020-06-22

Fedora creation date

2015-04-30