Parallel Wavelet Tree Construction

doi:10.1109/DCC.2015.7

Open AccessProceedings ArticleDOI

Parallel Wavelet Tree Construction

- pp 63-72

TLDR

These algorithms improve upon the linear depth of the recent parallel algorithms by Fuentes-Sepulveda et al. and achieve up to 27x speedup over the sequential algorithm on a variety of real-world and artificial inputs.

Abstract:

We present parallel algorithms for wavelet tree construction with polylogarithmic depth, improving upon the linear depth of the recent parallel algorithms by Fuentes-Sepulveda et al. We experimentally show that on a 40-core machine with two-way hyper-threading, we outperform the existing parallel algorithms by 1.3--5.6x and achieve up to 27x speedup over the sequential algorithm on a variety of real-world and artificial inputs. Our algorithms show good scalability with increasing thread count, input size and alphabet size. We also discuss extensions to variants of the standard wavelet tree.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Log(graph): a near-optimal high-performance graph representation

Maciej Besta, +5 more

TL;DR: Log(Graph as mentioned in this paper is a graph representation that combines high compression ratios with very low-overhead decompression to enable cheaper and faster graph processing, which can improve the design of various graph processing engines or libraries on single NUMA nodes.

...read moreread less

Proceedings Article

The Cilk++ concurrency platform

Leiserson

TL;DR: This paper overviews the Cilk++ programming environment, which incorporates a compiler, a runtime system, and a race-detection tool, and provides a “hyperobject” library which allows races on nonlocal variables to be mitigated without lock contention or substantial code restructuring.

...read moreread less

Journal ArticleDOI

Parallel lightweight wavelet tree, suffix array and FM-index construction

Julian Labeit, +2 more

- 01 Mar 2017 -

Journal of Discrete Algorithms

TL;DR: The work and depth of the first parallel wavelet tree algorithm match those of the best existing parallel algorithm while requiring asymptotically less memory and the second algorithm achieves the same asymPTotic bounds for small alphabet sizes.

...read moreread less

Journal ArticleDOI

Parallel construction of wavelet trees on multicore architectures

José Fuentes-Sepúlveda, +3 more

- 01 Jun 2017 -

Knowledge and Information Systems

TL;DR: Two algorithms are introduced that reduce the time complexity of a wavelet tree’s construction by taking advantage of nowadays ubiquitous multicore machines and report good speedup for large real datasets.

...read moreread less

Proceedings ArticleDOI

Improved Parallel Construction of Wavelet Trees and Rank/Select Structures

Julian Shun

TL;DR: In this article, a parallel algorithm for wavelet tree construction with improved work complexity was presented, which has O(n log log n [log σ/√ log n log log N] work and polylogarithmic depth.

...read moreread less

References

PDF

Open Access

More filters

A Block-sorting Lossless Data Compression Algorithm

Michael Burrows, +1 more

TL;DR: A block-sorting, lossless data compression algorithm, and the implementation of that algorithm and the performance of the implementation with widely available data compressors running on the same hardware are compared.

...read moreread less

Book

An introduction to parallel algorithms

Joseph JaJa

TL;DR: This book provides an introduction to the design and analysis of parallel algorithms, with the emphasis on the application of the PRAM model of parallel computation, with all its variants, to algorithm analysis.

...read moreread less

Proceedings ArticleDOI

High-order entropy-compressed text indexes

Roberto Grossi, +2 more

TL;DR: A novel implementation of compressed suffix arrays exhibiting new tradeoffs between search time and space occupancy for a given text (or sequence) of n symbols over an alphabet σ, where each symbol is encoded by lg|σ| bits.

...read moreread less

Dissertation

Compact pat trees

David Richard Clark

TL;DR: This thesis presents several related new representations for a close relative of the suffix tree, the PAT tree, that retain the functionality of suffix trees while requiring a fraction of the storage used by current methods.

...read moreread less

Journal ArticleDOI

Compressed representations of sequences and full-text indexes

Paolo Ferragina, +3 more

- 01 May 2007 -

ACM Transactions on Algorithms

TL;DR: The FM-index is the first that removes the alphabet-size dependance from all query times and the compressed representation of integer sequences with a compression boosting technique to design compressed full-text indexes that scale well with the size of the input alphabet Σ.

...read moreread less

Collapse

Parallel Wavelet Tree Construction

Citations

Log(graph): a near-optimal high-performance graph representation

The Cilk++ concurrency platform

Parallel lightweight wavelet tree, suffix array and FM-index construction

Parallel construction of wavelet trees on multicore architectures

Improved Parallel Construction of Wavelet Trees and Rank/Select Structures

References

A Block-sorting Lossless Data Compression Algorithm

An introduction to parallel algorithms

High-order entropy-compressed text indexes

Compact pat trees

Compressed representations of sequences and full-text indexes

Related Papers (5)

Practical Rank/Select Queries over Arbitrary Sequences

High-order entropy-compressed text indexes

Wavelet trees: A survey

From Theory to Practice: Plug and Play with Succinct Data Structures

Wavelet trees for all