A locality-preserving cache-oblivious dynamic dictionary

doi:10.1016/J.JALGOR.2004.04.014

Journal Article•DOI•

Michael A. Bender¹, Ziyang Duan¹, John Iacono, Jing Wu¹

State University of New York System¹

01 Nov 2004-Journal of Algorithms (Academic Press, Inc.)-Vol. 53, Iss: 2, pp 115-136

TL;DR: The data structure presented here is a simplification of the cache-oblivious B-tree of Bender, Demaine, and Farach-Colton and has memory performance optimized for all levels of the memory hierarchy even though it has no memory-hierarchy-specific parameterization.

About: This article was published in Journal of Algorithms on 2004-11-01 and has received 93 citations. It focuses on the topics: Data structure & Memory hierarchy.

Citations

Proceedings Article•DOI•

Cache-oblivious streaming B-trees

Michael A. Bender¹, Martin Farach-Colton², Jeremy T. Fineman³, Yonatan R. Fogel¹, Bradley C. Kuszmaul³, Jelani Nelson³

Stony Brook University¹, Rutgers University², Massachusetts Institute of Technology³

09 Jun 2007

TL;DR: A cache-aware version of the COLA, the lookahead array, which achieves the same bounds as Brodal and Fagerberg's (cache-aware) B^ε-tree.

Abstract: A streaming B-tree is a dictionary that efficiently implements insertions and range queries. We present two cache-oblivious streaming B-trees, the shuttle tree, and the cache-oblivious lookahead array (COLA). For block-transfer size $B$ and on $N$ elements, the shuttle tree implements searches in optimal $O(\log_{B+1} N)$ transfers, range queries of $L$ successive elements in optimal $O(\log_{B+1} N + L/B)$ transfers, and insertions in $O((\log_{B+1} N)/B^{\Theta(1/(\log\log B)^2)} + (\log^2 N)/B)$ transfers, which is an asymptotic speedup over traditional B-trees if $B \ge (\log N)^{1 + c \log\log\log^2 N}$ for any constant $c > 1$. A COLA implements searches in $O(\log N)$ transfers, range queries in $O(\log N + L/B)$ transfers, and insertions in amortized $O((\log N)/B)$ transfers, matching the bounds for a (cache-aware) buffered repository tree. A partially deamortized COLA matches these bounds but reduces the worst-case insertion cost to $O(\log N)$ if memory size $M = \Omega(\log N)$. We also present a cache-aware version of the COLA, the lookahead array, which achieves the same bounds as Brodal and Fagerberg's (cache-aware) $B^\varepsilon$-tree. We compare our COLA implementation to a traditional B-tree. Our COLA implementation runs 790 times faster for random insertions, 3.1 times slower for insertions of sorted data, and 3.5 times slower for searches.
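The level-merging idea behind a lookahead array can be illustrated with a minimal Python sketch. It assumes a simplified structure in which level k holds either 0 or 2^k sorted keys and an insertion cascades merges like a binary counter. The class name `SimpleCOLA` and its methods are hypothetical, and the sketch omits the lookahead pointers, so a search here does one binary search per level rather than meeting the $O(\log N)$ transfer bound stated in the abstract.

```python
import bisect
from heapq import merge


class SimpleCOLA:
    """Simplified lookahead-array sketch: level k is empty or holds 2**k sorted keys.

    Assumption: no lookahead pointers and no deamortization, unlike the
    structure described in the abstract.
    """

    def __init__(self):
        self.levels = []  # levels[k] is a sorted list of length 0 or 2**k

    def insert(self, key):
        carry = [key]  # sorted run being pushed down the levels
        k = 0
        while True:
            if k == len(self.levels):
                self.levels.append([])
            if not self.levels[k]:
                # Empty level found: the run (length 2**k) settles here.
                self.levels[k] = carry
                return
            # Level is full: merge it with the run and carry to the next level.
            carry = list(merge(self.levels[k], carry))
            self.levels[k] = []
            k += 1

    def contains(self, key):
        # One binary search per level (lookahead pointers would avoid this).
        for level in self.levels:
            i = bisect.bisect_left(level, key)
            if i < len(level) and level[i] == key:
                return True
        return False


cola = SimpleCOLA()
for x in [5, 3, 8, 1, 9]:
    cola.insert(x)
```

Each key takes part in at most one merge per level, and every merge is a sequential scan, which is the intuition behind the amortized $O((\log N)/B)$ insertion cost.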

162 citations

Cites background from "A locality-preserving cache-oblivio..."

...Of the extant cache-oblivious dictionaries, the most well-studied is the cache-oblivious B-tree [6, 7, 11], which supports searches in $O(\log_{B+1} N)$ transfers, insertions in amortized $O(\log_{B+1} N + (\log^2 N)/B)$ transfers, and range queries returning $L$ elements in $O(\log_{B+1} N + L/B)$ transfers....

Proceedings Article•DOI•

Cache oblivious search trees via binary trees of small height

Gerth Stølting Brodal¹, Rolf Fagerberg¹, Riko Jacob¹

Aarhus University¹

06 Jan 2002

TL;DR: A version of cache oblivious search trees which is simpler than the previous proposal of Bender, Demaine and Farach-Colton and has the same complexity bounds is proposed, and can be implemented as just a single array of data elements without the use of pointers.

Abstract: We propose a version of cache oblivious search trees which is simpler than the previous proposal of Bender, Demaine and Farach-Colton and has the same complexity bounds. In particular, our data structure avoids the use of weight balanced B-trees, and can be implemented as just a single array of data elements, without the use of pointers. The structure also improves space utilization. For storing $n$ elements, our proposal uses $(1 + \varepsilon)n$ times the element size of memory, and performs searches in worst case $O(\log_B n)$ memory transfers, updates in amortized $O((\log^2 n)/(\varepsilon B))$ memory transfers, and range queries in worst case $O(\log_B n + k/B)$ memory transfers, where $k$ is the size of the output. The basic idea of our data structure is to maintain a dynamic binary tree of height $\log n + O(1)$ using existing methods, embed this tree in a static binary tree, which in turn is embedded in an array in a cache oblivious fashion, using the van Emde Boas layout of Prokop. We also investigate the practicality of cache obliviousness in the area of search trees, by providing an empirical comparison of different methods for laying out a search tree in memory.
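The van Emde Boas layout mentioned in the abstract can be sketched in a few lines of Python. The sketch assumes a complete binary tree whose nodes carry heap-style indices (root 1, children 2i and 2i+1) and splits the height so the top part gets the floor of half. Other rounding conventions exist, and the function name `veb_order` is hypothetical rather than taken from the paper.

```python
def veb_order(height, root=1):
    """Return the heap indices of a complete binary tree in van Emde Boas order.

    Assumptions: nodes are numbered heap-style (root=1, children 2i, 2i+1),
    and the top recursive subtree takes height // 2 levels.
    """
    if height == 1:
        return [root]
    top_h = height // 2          # levels in the top subtree
    bot_h = height - top_h       # levels in each bottom subtree
    order = veb_order(top_h, root)
    # Leaves of the top subtree sit top_h - 1 levels below `root`.
    first_leaf = root << (top_h - 1)
    for leaf in range(first_leaf, first_leaf + (1 << (top_h - 1))):
        # Lay out the bottom subtrees hanging off each top leaf, left to right.
        order += veb_order(bot_h, 2 * leaf)
        order += veb_order(bot_h, 2 * leaf + 1)
    return order


layout = veb_order(4)  # positions in the array, listed by heap index
```

Because every recursive subtree occupies a contiguous run of the array, a root-to-leaf walk touches few blocks for any block size, which is why no block-size parameter appears in the layout.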

160 citations

Cites methods from "A locality-preserving cache-oblivio..."

...In [9], a matching $\Omega(\log^2 n)$ lower bound for algorithms using this primitive was given....

Patent•

High-Performance Streaming Dictionary

Michael A. Bender, Martin Farach-Colton, Yonatan R. Fogel, Zardosht Kasheff, Bradley C. Kuszmaul, Vincenzo Liberatore, Barry Perlman, Richard F. Prohaska, David S. Wells

06 Apr 2010

TL;DR: In this article, a high-performance dictionary data structure is defined for storing data in a disk storage system, which supports full transactional semantics, concurrent access from multiple transactions, and logging and recovery.

Abstract: A method, apparatus and computer program product for storing data in a disk storage system is presented. A high-performance dictionary data structure is defined. The dictionary data structure is stored on a disk storage system. Key-value pairs can be inserted into and deleted from the dictionary data structure. Updates run faster than one insertion per disk-head movement. The structure can also be stored on any system with two or more levels of memory. The dictionary is high performance and supports full transactional semantics, concurrent access from multiple transactions, and logging and recovery. Keys can be looked up with only a logarithmic number of transfers, even for keys that have been recently inserted or deleted. Queries can be performed on ranges of key-value pairs, including recently inserted or deleted pairs, at a constant fraction of the bandwidth of the disk.

146 citations

A New Data Structure for Representing Sorted Lists

Kurt Mehlhorn¹, Hartmut Noltemeier

Max Planck Society¹

01 Jan 1980

TL;DR: In this article, the worst case cost of sequences of insertions and deletions in weak B-trees is analyzed, where each node has at least a and at most b sons where 2a ≤ b.

Abstract: In this paper we explore the use of weak B-trees to represent sorted lists. In weak B-trees each node has at least a and at most b sons where 2a ≤ b. We analyse the worst case cost of sequences of insertions and deletions in weak B-trees. This leads to a new data structure (level-linked weak B-trees) for representing sorted lists when the access pattern exhibits a (time-varying) locality of reference. Our structure is substantially simpler than the one proposed by Guibas, McCreight, Plass and Roberts, yet it has many of its properties. Our structure is as simple as the one proposed by Brown/Tarjan, but our structure can treat arbitrary sequences of insertions and deletions whilst theirs can only treat non-interacting insertions and deletions.

132 citations

Proceedings Article•DOI•

Monotone minimal perfect hashing: searching a sorted table with O(1) accesses

Djamal Belazzougui, Paolo Boldi¹, Rasmus Pagh², Sebastiano Vigna¹

University of Milan¹, University of Copenhagen²

04 Jan 2009

TL;DR: This work considers the problem of monotone minimal perfect hashing, in which the bijection is required to preserve the lexicographical ordering of the keys, and shows how to compute the predecessor (in the sorted order of S) of an arbitrary element, using O(1) accesses in expectation and an index of O(n log w) bits.

Abstract: A minimal perfect hash function maps a set $S$ of $n$ keys into the set $\{0, 1, \ldots, n-1\}$ bijectively. Classical results state that minimal perfect hashing is possible in constant time using a structure occupying space close to the lower bound of $\log e$ bits per element. Here we consider the problem of monotone minimal perfect hashing, in which the bijection is required to preserve the lexicographical ordering of the keys. A monotone minimal perfect hash function can be seen as a very weak form of index that provides ranking just on the set $S$ (and answers randomly outside of $S$). Our goal is to minimise the description size of the hash function: we show that, for a set $S$ of $n$ elements out of a universe of $2^w$ elements, $O(n \log\log w)$ bits are sufficient to hash monotonically with evaluation time $O(\log w)$. Alternatively, we can get space $O(n \log w)$ bits with $O(1)$ query time. Both of these data structures improve a straightforward construction with $O(n \log w)$ space and $O(\log w)$ query time. As a consequence, it is possible to search a sorted table with $O(1)$ accesses to the table (using additional $O(n \log\log w)$ bits). Our results are based on a structure (of independent interest) that represents a trie in a very compact way, but admits errors. As a further application of the same structure, we show how to compute the predecessor (in the sorted order of $S$) of an arbitrary element, using $O(1)$ accesses in expectation and an index of $O(n \log w)$ bits, improving the trivial result of $O(nw)$ bits. This implies an efficient index for searching a blocked memory.

124 citations

Cites background from "A locality-preserving cache-oblivio..."

...Indeed, several recent papers on range queries propose to keep keys in a sorted table (with gaps), rather than in a normal search tree, exactly for this reason [24, 2, 1]....
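The idea of keeping keys in a sorted table with gaps can be illustrated with a small Python sketch. It is a toy under stated assumptions, not the packed-memory structure of any cited paper: the class name `GappedSortedTable`, the density limit of one half, and the linear scan used to find the insertion point are all choices made here for brevity.

```python
class GappedSortedTable:
    """Toy sorted array with gaps (None); keys stay in sorted order, left to right.

    Assumptions: density is kept at or below 1/2 by doubling and re-spreading,
    and positions are found by linear scan instead of an index structure.
    """

    def __init__(self, capacity=4):
        self.slots = [None] * capacity
        self.count = 0

    def keys(self):
        # A range scan is a single left-to-right pass over the array.
        return [k for k in self.slots if k is not None]

    def _respread(self, capacity):
        keys = self.keys()
        self.slots = [None] * capacity
        for i, k in enumerate(keys):
            self.slots[i * capacity // len(keys)] = k  # even spacing

    def insert(self, key):
        if 2 * (self.count + 1) > len(self.slots):
            self._respread(2 * len(self.slots))
        n = len(self.slots)
        # First occupied slot holding a key greater than `key` (or n if none).
        pos = next((i for i, k in enumerate(self.slots)
                    if k is not None and k > key), n)
        if pos > 0 and self.slots[pos - 1] is None:
            self.slots[pos - 1] = key          # a gap is already in place
        else:
            gap = next((i for i in range(pos, n) if self.slots[i] is None), None)
            if gap is not None:
                # Shift the run [pos, gap) one slot right into the gap.
                self.slots[pos + 1:gap + 1] = self.slots[pos:gap]
                self.slots[pos] = key
            else:
                # No gap to the right: shift the run (gap, pos) one slot left.
                gap = max(i for i in range(pos) if self.slots[i] is None)
                self.slots[gap:pos - 1] = self.slots[gap + 1:pos]
                self.slots[pos - 1] = key
        self.count += 1


table = GappedSortedTable()
for x in [5, 3, 8, 1, 9]:
    table.insert(x)
```

The gaps mean an insertion usually moves only a short run of neighbouring keys, while the keys remain physically contiguous in sorted order, which is the property the quoted passage credits for fast range queries.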
References

Book•

The Art of Computer Programming

Donald Ervin Knuth

01 Jan 1968

17,939 citations

Book•

The art of computer programming, volume 3: (2nd ed.) sorting and searching

Donald E. Knuth¹

Stanford University¹

01 Jan 1998

2,322 citations

Journal Article•DOI•

Ubiquitous B-Tree

Douglas Comer¹

Purdue University¹

01 Jun 1979-ACM Computing Surveys

TL;DR: The major variations of the B-tree are discussed, especially the B+-tree, contrasting the merits and costs of each implementation and illustrating a general purpose access method that uses a B-tree.

Abstract: B-trees have become, de facto, a standard for file organization. File indexes of users, dedicated database systems, and general-purpose access methods have all been proposed and implemented using B-trees. This paper reviews B-trees and shows why they have been so successful. It discusses the major variations of the B-tree, especially the B+-tree, contrasting the relative merits and costs of each implementation. It illustrates a general purpose access method which uses a B-tree.

2,032 citations

"A locality-preserving cache-oblivio..." refers methods in this paper

...A Locality-Preserving Cache-Oblivious Dynamic Dictionary Michael A. Bender*t Ziyang Duan *t John Iacono§ Jing Wu ....

Book•

Sorting and Searching

Donald E. Knuth

01 Jan 1973

TL;DR: The first revision of this third volume is a survey of classical computer techniques for sorting and searching that extends the treatment of data structures to consider both large and small databases and internal and external memories.

Abstract: The first revision of this third volume is a survey of classical computer techniques for sorting and searching. It extends the treatment of data structures in Volume 1 to consider both large and small databases and internal and external memories.

1,716 citations

"A locality-preserving cache-oblivio..." refers background in this paper

...The B-tree [12,25,32,37] is the classic external-memory search tree, and it is widely used in both theory and practice....

Journal Article•DOI•

The input/output complexity of sorting and related problems

Alok Aggarwal¹, Jeffrey S. Vitter²

IBM¹, Brown University²

01 Sep 1988-Communications of The ACM

TL;DR: Tight upper and lower bounds are provided for the number of inputs and outputs (I/Os) between internal memory and secondary storage required for five sorting-related problems: sorting, the fast Fourier transform (FFT), permutation networks, permuting, and matrix transposition.

Abstract: We provide tight upper and lower bounds, up to a constant factor, for the number of inputs and outputs (I/Os) between internal memory and secondary storage required for five sorting-related problems: sorting, the fast Fourier transform (FFT), permutation networks, permuting, and matrix transposition. The bounds hold both in the worst case and in the average case, and in several situations the constant factors match. Secondary storage is modeled as a magnetic disk capable of transferring $P$ blocks each containing $B$ records in a single time unit; the records in each block must be input from or output to $B$ contiguous locations on the disk. We give two optimal algorithms for the problems, which are variants of merge sorting and distribution sorting. In particular we show for $P = 1$ that the standard merge sorting algorithm is an optimal external sorting method, up to a constant factor in the number of I/Os. Our sorting algorithms use the same number of I/Os as does the permutation phase of key sorting, except when the internal memory size is extremely small, thus affirming the popular adage that key sorting is not faster. We also give a simpler and more direct derivation of Hong and Kung's lower bound for the FFT for the special case $B = P = O(1)$.

1,344 citations