RMIT University
Browse

B-tries for disk-based string management

journal contribution
posted on 2024-11-01, 06:07 authored by Nikolas Askitis, Justin Zobel
A wide range of applications require that large quantities of data be maintained in sort order on disk. The B-tree, and its variants, are an efficient general-purpose disk-based data structure that is almost universally used for this task. The B-trie has the potential to be a competitive alternative for the storage of data where strings are used as keys, but has not previously been thoroughly described or tested. We propose new algorithms for the insertion, deletion, and equality search of variable-length strings in a disk-resident B-trie, as well as novel splitting strategies which are a critical element of a practical implementation. We experimentally compare the B-trie against variants of B-tree on several large sets of strings with a range of characteristics. Our results demonstrate that, although the B-trie uses more memory, it is faster, more scalable, and requires less disk space.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1007/s00778-008-0094-1
  2. 2.
    ISSN - Is published in 10668888

Journal

VLDB Journal

Volume

18

Issue

1

Start page

157

End page

179

Total pages

23

Publisher

Springer

Place published

New York

Language

English

Copyright

© 2008 Springer-Verlag.

Former Identifier

2006011858

Esploro creation date

2020-06-22

Fedora creation date

2010-11-19

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC