Tight Bounds on the Complexity of Parallel Sorting

doi:10.1109/TC.1985.5009385

Journal Article

Tom Leighton

- 01 Apr 1985 -

IEEE Transactions on Computers

- Vol. 34, Iss: 4, pp 344-354

TL;DR

Tight upper and lower bounds are proved on the number of processors, information transfer, wire area, and time needed to sort N numbers in a bounded-degree fixed-connection network.

Abstract:

In this paper, we prove tight upper and lower bounds on the number of processors, information transfer, wire area, and time needed to sort N numbers in a bounded-degree fixed-connection network. Our most important new results are: 1) the construction of an N-node degree-3 network capable of sorting N numbers in O(log N) word steps; 2) a proof that any network capable of sorting N (7 log N)-bit numbers in T bit steps requires area A where AT² = Ω(N² log² N); and 3) the construction of a "small-constant-factor" bounded-degree network that sorts N Θ(log N)-bit numbers in T = Θ(log N) bit steps with A = Θ(N²) area.
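
As a quick consistency check on results 2) and 3), here is a worked step that is not taken from the paper itself. It assumes the bounds read as an Ω lower bound on AT² and Θ bounds on T and A for the constructed network. Substituting the construction's parameters into the area-time product gives:

```latex
A\,T^{2} \;=\; \Theta(N^{2}) \cdot \bigl(\Theta(\log N)\bigr)^{2} \;=\; \Theta(N^{2}\log^{2} N)
```

Under that reading, the construction in 3) meets the lower bound in 2) up to a constant factor, which appears to be the sense in which the bounds are "tight". The lower bound is stated for (7 log N)-bit numbers and the construction for Θ(log N)-bit numbers, so the comparison holds at that word size.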

Citations

Journal Article

A bridging model for parallel computation

Leslie G. Valiant

- 01 Aug 1990 -

Communications of The ACM

TL;DR: The bulk-synchronous parallel (BSP) model is introduced as a candidate bridging model for parallel computation, and results are given quantifying its efficiency both in implementing high-level language features and algorithms and in being implemented in hardware.

Journal Article

The input/output complexity of sorting and related problems

Alok Aggarwal, +1 more

- 01 Sep 1988 -

Communications of The ACM

TL;DR: Tight upper and lower bounds are provided for the number of inputs and outputs (I/Os) between internal memory and secondary storage required for five sorting-related problems: sorting, the fast Fourier transform (FFT), permutation networks, permuting, and matrix transposition.

Book

Fat-trees: universal networks for hardware-efficient supercomputing

Charles E. Leiserson

TL;DR: In this article, the authors presented a new class of universal routing networks, called fat-trees, which might be used to interconnect the processors of a general-purpose parallel supercomputer, and proved that a fat-tree of a given size is nearly the best routing network of that size.

Journal Article

Fat-trees: Universal networks for hardware-efficient supercomputing

Charles E. Leiserson

- 01 Oct 1985 -

IEEE Transactions on Computers

TL;DR: In this article, the authors presented a new class of universal routing networks, called fat-trees, which might be used to interconnect the processors of a general-purpose parallel supercomputer, and proved that a fat-tree of a given size is nearly the best routing network of that size.

Journal Article

External memory algorithms and data structures: dealing with massive data

Jeffrey Scott Vitter

- 01 Jun 2001 -

ACM Computing Surveys

TL;DR: This survey covers the state of the art in the design and analysis of external memory algorithms and data structures, where the goal is to exploit locality in order to reduce I/O costs.

References

Proceedings Article

Sorting networks and their applications

Kenneth E. Batcher

TL;DR: To achieve high throughput rates, today's computers perform several operations simultaneously; not only are I/O operations performed concurrently with computing, but also, in multiprocessors, several computing operations are done concurrently.

Journal Article

Parallel Processing with the Perfect Shuffle

Harold S. Stone

- 01 Feb 1971 -

IEEE Transactions on Computers

TL;DR: Given a vector of N elements, the perfect shuffle of this vector is a permutation of the elements that is identical to a perfect shuffle of a deck of cards.
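
The permutation described in this summary can be sketched in a few lines of Python. This is an illustrative sketch rather than code from the paper; the function name `perfect_shuffle` and the even-length check are assumptions made here.

```python
def perfect_shuffle(v):
    """Interleave the two halves of v, as when a deck is cut exactly in
    half and riffled so that cards alternate perfectly.

    Hypothetical helper for illustration; requires an even-length input.
    """
    n = len(v)
    if n % 2 != 0:
        raise ValueError("perfect shuffle needs an even number of elements")
    half = n // 2
    out = []
    # Alternate: one element from the first half, then one from the second.
    for top, bottom in zip(v[:half], v[half:]):
        out.append(top)
        out.append(bottom)
    return out


if __name__ == "__main__":
    print(perfect_shuffle(list(range(8))))  # [0, 4, 1, 5, 2, 6, 3, 7]
```

For N = 8, the element at index i lands at index 2i mod 7 (with the last element fixed), which is a cyclic left rotation of the 3-bit index. Applying the shuffle three times therefore restores the original order.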

Proceedings Article

Universal schemes for parallel communication

Leslie G. Valiant, +1 more

TL;DR: This paper shows that there exists an N-processor computer that can simulate arbitrary N-processor parallel computations with only a factor of O(log N) loss of runtime efficiency, and isolates a combinatorial problem that lies at the heart of this question.

Proceedings Article

An O(n log n) sorting network

Miklós Ajtai, +2 more

TL;DR: A sorting network of size O(n log n) and depth O(log n) is described; a derived procedure (ε-nearsort) is also described, and the sorting network is centered around these elementary steps.

Proceedings Article

Area-time complexity for VLSI

C. D. Thompson

TL;DR: The complexity of the Discrete Fourier Transform is studied with respect to a new model of computation appropriate to VLSI technology, which focuses on two key parameters, the amount of silicon area and time required to implement a DFT on a single chip.

Communications of The ACM

The Art of Computer Programming

Donald Ervin Knuth

The Art of Computer Programming: Volume 3: Sorting and Searching

Donald E. Knuth

Related Papers (5)

Sorting networks and their applications

Introduction to Parallel Algorithms and Architectures: Arrays, Trees, Hypercubes

The input/output complexity of sorting and related problems

The Art of Computer Programming

The Art of Computer Programming: Volume 3: Sorting and Searching