Proceedings ArticleDOI

A fast probabilistic parallel sorting algorithm

28 Oct 1981-pp 212-219
TL;DR: A probabilistic parallel algorithm sorts n keys drawn from an arbitrary totally ordered set with average runtime bounded by O(log n), so the product of time and number of processors meets the information-theoretic lower bound for sorting.
Abstract: We describe a probabilistic parallel algorithm to sort n keys drawn from an arbitrary totally ordered set. This algorithm can be implemented on a parallel computer consisting of n RAMs, each with small private memory, and a common memory of size O(n), such that the average runtime is bounded by O(log n). Hence for this algorithm the product of time and number of processors meets the information-theoretic lower bound for sorting.
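The core idea behind such probabilistic sorting schemes is to split the keys by a small random sample of splitters so that, with high probability, every bucket is small and can be handled independently. The following is a minimal sequential sketch of that sampling idea only; the function name, bucket count, and the sequential per-bucket sorts are illustrative assumptions, not the paper's actual construction, in which the buckets are processed by groups of processors in parallel.

    import random
    from bisect import bisect_right

    def sample_sort(keys, num_buckets):
        """Illustrative sample-based sort: pick random splitters, partition the
        keys into buckets, then sort each bucket.  In a parallel setting each
        bucket would be handled by its own group of processors; here everything
        runs sequentially."""
        if len(keys) <= 1 or num_buckets <= 1:
            return sorted(keys)
        # Draw a random sample and use its sorted order as bucket boundaries.
        sample = sorted(random.sample(keys, min(len(keys), num_buckets - 1)))
        buckets = [[] for _ in range(len(sample) + 1)]
        for k in keys:
            buckets[bisect_right(sample, k)].append(k)
        # With high probability all buckets are small, so the per-bucket work
        # (done in parallel in the real algorithm) stays balanced.
        result = []
        for b in buckets:
            result.extend(sorted(b))
        return result

    if __name__ == "__main__":
        data = [random.randint(0, 10**6) for _ in range(10_000)]
        assert sample_sort(data, 100) == sorted(data)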
Citations
Posted Content
TL;DR: A major contribution of this paper is to reduce the query complexity to linear or even sublinear in $n$ when mild side information is provided by a machine, even in the presence of crowd errors that are not correctable via resampling.
Abstract: In recent years, crowdsourcing, aka human-aided computation, has emerged as an effective platform for solving problems that are considered complex for machines alone. Using humans is time-consuming and costly due to monetary compensation. Therefore, a crowd-based algorithm must judiciously use any information computed through an automated process, and ask a minimum number of questions to the crowd adaptively. One such problem which has received significant attention is {\em entity resolution}. Formally, we are given a graph $G=(V,E)$ with unknown edge set $E$, where $G$ is a union of $k$ (again unknown, but typically large, $O(n^\alpha)$ for $\alpha>0$) disjoint cliques $G_i(V_i, E_i)$, $i =1, \dots, k$. The goal is to retrieve the sets $V_i$ by making a minimum number of pair-wise queries $V \times V\to\{\pm1\}$ to an oracle (the crowd). When the answer to each query is correct, e.g. via resampling, this reduces to finding connected components in a graph. On the other hand, when crowd answers may be incorrect, it corresponds to clustering over a minimum number of noisy inputs. Even with perfect answers, a simple lower and upper bound of $\Theta(nk)$ on the query complexity can be shown. A major contribution of this paper is to reduce the query complexity to linear or even sublinear in $n$ when mild side information is provided by a machine, even in the presence of crowd errors that are not correctable via resampling. We develop new information-theoretic lower bounds on the query complexity of clustering with side information and errors, and our upper bounds closely match them. Our algorithms are naturally parallelizable, and also give near-optimal bounds on the number of adaptive rounds required to match the query complexity.
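To make the $\Theta(nk)$ baseline concrete: with a noiseless oracle, each element only needs to be compared against one representative of every cluster found so far, i.e. at most $k$ queries per element and $O(nk)$ queries overall. A minimal sketch of that baseline (the function name and oracle interface are assumptions for illustration, not the paper's algorithm):

    def cluster_with_perfect_oracle(elements, same_cluster):
        """Illustrative baseline: with a noiseless pairwise oracle
        same_cluster(u, v) -> bool, each element is compared against one
        representative of every cluster found so far, giving at most k
        queries per element and Theta(n*k) queries in total."""
        clusters = []  # each cluster is a list; clusters[i][0] is its representative
        for x in elements:
            for c in clusters:
                if same_cluster(x, c[0]):
                    c.append(x)
                    break
            else:
                clusters.append([x])
        return clusters

    if __name__ == "__main__":
        # Toy ground truth: elements belong to the same cluster iff equal mod 3.
        truth = lambda u, v: u % 3 == v % 3
        print(cluster_with_perfect_oracle(range(10), truth))  # three clusters: residues 0, 1, 2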

25 citations


Cites methods from "A fast probabilistic parallel sorti..."

  • ...There is an extensive literature of algorithms in the TCS community where the goal is to do either selection or sorting with O(n) comparisons in the fewest interactive rounds, aka parallel algorithms for sorting [49, 47, 5, 6, 4, 9]....


Proceedings ArticleDOI
27 Oct 1986
TL;DR: The lower bound improves on Häggkvist-Hell's lower bound, and it is shown that "approximate sorting" in time 1 requires asymptotically more than n log n processors.
Abstract: The time complexity of sorting n elements using p ≥ n processors on Valiant's parallel comparison tree model is considered. The following results are obtained. 1. We show that this time complexity is Θ(log n / log(1 + p/n)). This complements the AKS sorting network in settling the wider problem of comparison sort of n elements by p processors, where the problem for p ≤ n was resolved. To prove the lower bound, we show that to achieve time k ≤ log n, we need Ω(k n^{1+1/k}) comparisons. Häggkvist and Hell proved a similar result only for fixed k. 2. For every fixed time k, we show that: (a) Ω(n^{1+1/k} (log n)^{1/k}) comparisons are required (O(n^{1+1/k} log n) are known to be sufficient in this case), and (b) there exists a randomized algorithm for comparison sort in time k with an expected number of O(n^{1+1/k}) comparisons. This implies that for every fixed k, any deterministic comparison-sort algorithm must be asymptotically worse than this randomized algorithm. The lower bound improves on Häggkvist and Hell's lower bound. 3. We show that "approximate sorting" in time 1 requires asymptotically more than n log n processors. This settles a problem raised by M. Rabin.
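To read the Θ(log n / log(1 + p/n)) bound concretely, substituting a few processor counts (a direct calculation from the stated bound, not an additional result of the paper) gives:

    T(p) = \Theta\!\left(\frac{\log n}{\log(1 + p/n)}\right), \qquad
    T(n) = \Theta(\log n), \qquad
    T(n \log n) = \Theta\!\left(\frac{\log n}{\log\log n}\right), \qquad
    T(n^{1+\varepsilon}) = \Theta(1/\varepsilon) \text{ for any fixed } \varepsilon > 0.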

25 citations

Journal ArticleDOI
TL;DR: This paper presents an algorithm for sorting N elements in time O(N^{1/4}) on an SIMD multi-mesh architecture, thereby significantly improving the order of the time complexity.
Abstract: The shear-sort algorithm on an SIMD mesh model requires 4√N + o(√N) time for sorting N elements arranged on a √N × √N mesh. In this paper, we present an algorithm for sorting N elements in time O(N^{1/4}) on an SIMD multi-mesh architecture, thereby significantly improving the order of the time complexity. The multi-mesh architecture is built around n^2 blocks, where each block is an n × n mesh with n = N^{1/4}, so that each processor will uniformly have four neighbors in the final topology.
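The shear-sort baseline referred to above can be illustrated by a short sequential simulation of its row/column phases (a sketch of the classical procedure on a √N × √N grid, not the multi-mesh algorithm of the paper; grid size and test data are arbitrary):

    import math, random

    def shear_sort(grid):
        """Sequential simulation of shear-sort on an r x c mesh: alternately sort
        rows in snake (boustrophedon) order and sort columns top-down.  After
        ceil(log2 r) + 1 phases the grid is sorted in snake order.  On a real
        SIMD mesh each row/column sort runs in parallel in O(sqrt(N)) steps."""
        rows, cols = len(grid), len(grid[0])
        phases = math.ceil(math.log2(rows)) + 1 if rows > 1 else 1
        for _ in range(phases):
            for i in range(rows):                      # snake-wise row sort
                grid[i].sort(reverse=(i % 2 == 1))
            for j in range(cols):                      # column sort, top to bottom
                col = sorted(grid[i][j] for i in range(rows))
                for i in range(rows):
                    grid[i][j] = col[i]
        return grid

    if __name__ == "__main__":
        n = 16                                         # a sqrt(N) x sqrt(N) mesh with N = 256
        g = shear_sort([[random.randint(0, 999) for _ in range(n)] for _ in range(n)])
        snake = [x for i, row in enumerate(g) for x in (row if i % 2 == 0 else reversed(row))]
        assert snake == sorted(snake)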

25 citations


Cites background from "A fast probabilistic parallel sorti..."

  • ...SEVERAL interesting results on the complexity of parallel sorting are given in the literature [1], [2], [3], [4], [5]....


Book ChapterDOI
01 Jan 1985
TL;DR: Taking a graph-theoretic view of sorting, generalizations of the problem of serial sorting are motivated, and recent results on parallel sorting and sorting in rounds, which depend on deep graph-theoretic results, are described.
Abstract: We discuss connections between sorting algorithms and graph theory. Taking a graph-theoretic view of sorting, we motivate generalizations of the problem of serial sorting, and describe recent results on parallel sorting, and sorting in rounds, which depend on deep graph-theoretic results.

24 citations

Book ChapterDOI
John H. Reif
12 Jul 1982
TL;DR: It is shown that parallelism uniformly speeds up time-bounded probabilistic sequential RAM computations by nearly a quadratic factor, and that probabilistic choice can be eliminated from parallel computations by introducing nonuniformity.
Abstract: This paper introduces probabilistic choice to synchronous parallel machine models, in particular parallel RAMs. The power of probabilistic choice in parallel computations is illustrated by parallelizing some known probabilistic sequential algorithms. We characterize the computational complexity of time-, space-, and processor-bounded probabilistic parallel RAMs in terms of the computational complexity of probabilistic sequential RAMs. We show that parallelism uniformly speeds up time-bounded probabilistic sequential RAM computations by nearly a quadratic factor. We also show that probabilistic choice can be eliminated from parallel computations by introducing nonuniformity.

21 citations

References
Journal ArticleDOI
TL;DR: The worst-case time complexity of algorithms for multiprocessor computers with binary comparisons as the basic operations is investigated and the algorithm for finding the maximum is shown to be optimal for all values of k and n.
Abstract: The worst-case time complexity of algorithms for multiprocessor computers with binary comparisons as the basic operations is investigated. It is shown that for the problems of finding the maximum, sorting, and merging a pair of sorted lists, if n, the size of the input set, is not less than k, the number of processors, speedups of at least $O(k/\log \log k)$ can be achieved with respect to comparison operations. The algorithm for finding the maximum is shown to be optimal for all values of k and n.

412 citations

Journal ArticleDOI
TL;DR: A new selection algorithm is presented which is shown to be very efficient on the average, both theoretically and practically.
Abstract: A new selection algorithm is presented which is shown to be very efficient on the average, both theoretically and practically. The number of comparisons used to select the ith smallest of n numbers is n + min(i,n-i) + o(n). A lower bound within 9 percent of the above formula is also derived.
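A sampling-based selection in this spirit can be sketched as follows: draw a sample, pick two pivots that bracket the target rank with high probability, discard everything outside the bracket, and recurse on the small remaining band. This is an illustrative sketch only, not the paper's SELECT procedure; the sample size, margin, and cutoff are ad-hoc choices.

    import random

    def select(a, i):
        """Return the i-th smallest (0-based) element of the sequence a."""
        a = list(a)
        while len(a) > 64:
            n = len(a)
            sample = sorted(random.sample(a, int(n ** (2 / 3))))
            s = len(sample)
            # Estimated position of the target inside the sample, with a safety margin.
            pos = i * s // n
            margin = int(s ** 0.5) + 1
            lo = sample[max(0, pos - margin)]
            hi = sample[min(s - 1, pos + margin)]
            below = [x for x in a if x < lo]
            middle = [x for x in a if lo <= x <= hi]
            if len(middle) == n:               # degenerate case (e.g. all keys equal): fall back
                break
            if i < len(below):
                a = below                      # target lies below the bracket (rare)
            elif i < len(below) + len(middle):
                i -= len(below)
                a = middle                     # typical case: only a small middle band remains
            else:
                i -= len(below) + len(middle)
                a = [x for x in a if x > hi]   # target lies above the bracket (rare)
        return sorted(a)[i]

    if __name__ == "__main__":
        data = [random.randrange(10**6) for _ in range(10**5)]
        assert select(data, 1234) == sorted(data)[1234]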

319 citations

Journal ArticleDOI
TL;DR: A family of parallel-sorting algorithms for a multiprocessor system is described; the algorithms are enumeration sortings whose basic novelty is the use of parallel merging to implement count acquisition, matching the performance of Hirschberg's algorithm, which, however, is not free of fetch conflicts.
Abstract: In this paper, we describe a family of parallel-sorting algorithms for a multiprocessor system. These algorithms are enumeration sortings and comprise the following phases: 1) count acquisition: the keys are subdivided into subsets and for each key we determine the number of smaller keys (count) in every subset; 2) rank determination: the rank of a key is the sum of the previously obtained counts; 3) data rearrangement: each key is placed in the position specified by its rank. The basic novelty of the algorithms is the use of parallel merging to implement count acquisition. By using Valiant's merging scheme, we show that n keys can be sorted in parallel with n log₂ n processors in time C log₂ n + o(log₂ n); in addition, if memory fetch conflicts are not allowed, using a modified version of Batcher's merging algorithm to implement phase 1), we show that n keys can be sorted with n^{1+α} processors in time (C'/α) log₂ n + o(log₂ n), thereby matching the performance of Hirschberg's algorithm, which, however, is not free of fetch conflicts.
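The three phases read naturally as the following sequential sketch of an enumeration sort (a plain illustration of the idea, not the parallel merging-based count acquisition described in the paper):

    def enumeration_sort(keys):
        """Sequential illustration of the three phases of an enumeration sort.
        In the parallel algorithms above, phase 1 is what gets distributed across
        processors (via parallel merging); phases 2 and 3 are cheap."""
        n = len(keys)
        # 1) Count acquisition: for each key, count how many keys precede it.
        #    Ties are broken by index so that equal keys receive distinct ranks.
        counts = [sum((keys[j], j) < (keys[i], i) for j in range(n)) for i in range(n)]
        # 2) Rank determination: with a single subset, the rank is exactly the count.
        ranks = counts
        # 3) Data rearrangement: place each key at the position given by its rank.
        out = [None] * n
        for i, r in enumerate(ranks):
            out[r] = keys[i]
        return out

    if __name__ == "__main__":
        assert enumeration_sort([5, 3, 5, 1]) == [1, 3, 5, 5]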

169 citations