scispace - formally typeset
Book ChapterDOI

Limits on the Power of Parallel Random Access Machines with Weak Forms of Write Conflict Resolution

TLDR
A lower bound on the time required by the ROBUST PRAM to compute Boolean functions in terms of the number of different values each memory cell of the PRAM can contain and the degree of the function when expressed as a polynomial over a finite field is proved.
Abstract
The ROBUST PRAM is a concurrent-read concurrent-write (CRCW) parallel random access machine in which any value might appear in a memory cell as a result of a write conflict. This paper addresses the question of whether a PRAM with such a weak form of write conflict resolution can compute functions faster than the concurrent-read exclusive-write (CREW) PRAM. We prove a lower bound on the time required by the ROBUST PRAM to compute Boolean functions in terms of the number of different values each memory cell of the PRAM can contain and the degree of the function when expressed as a polynomial over a finite field. In the case of 1-bit memory cells, our lower bound for the problem of computing the OR of n Boolean variables exactly matches Cook, Dwork, and Reischuk's upper bound on the CREW PRAM. We extend our result to obtain a lower bound, depending on the number of processors, for computing Boolean functions on the ROBUST PRAM, even with memory cells of unbounded size. A particular consequence is that the ROBUST PRAM with 2 O( log n) processors requires 0(log n) steps to compute OR. These results are obtained by defining a class of CRCW PRAMs, the fixed adversary PRAMs, all of which are at least as powerful as the ROBUST PRAM. We prove our lower bounds using carefully chosen PRAMs from this class. We also show the limitations of this technique by describing how, with n-bit memory cells, any fixed adversary PRAM can compute OR and, more generally, simulate a PRIORITY PRAM in constant time. Finally, we consider the effect of adding randomization to the ROBUST PRAM. For any algorithm that computes OR without error, its expected running time on its worst input is no better than the worst case deterministic time complexity of computing OR. However, allowing a small probability of error enables the ROBUST PRAM with single bit memory cells to compute OR in almost constant time. ] 1996 Academic Press, Inc.

read more

Citations
More filters
Proceedings ArticleDOI

Methods for message routing in parallel machines

TL;DR: The problem of getting the right data to the right place within a reasonable amount of time is one of the most challenging and important tasks facing the designer (and, in some cases, the user) of a large-scale general-purpose parallel machine.
Proceedings ArticleDOI

The wakeup problem in synchronous broadcast systems (extended abstract)

TL;DR: This paper considers the fundamental problem of waking up all of n processors of a completely connected broadcast system, and proposes randomized and deterministic algorithms for the problem, as well as lower bounds in some of the cases.
Book ChapterDOI

Retrieval of scattered information by EREW, CREW and CRCW PRAMs

TL;DR: It is shown that any EREW PRAM that solves the k-compaction problem requires Ω(√log n) time, even if the number of processors is arbitrarily large and k=2, and that O(log k) time can be achieved on the ROBUST PRAM, a very weak CRCWPRAM model.
Journal ArticleDOI

ERCW PRAMs and optical communication

TL;DR: Algorithms and lower bounds for several fundamental problems on the Exclusive Read, Concurrent Write Parallel Random Access Machine (ERCW PRAM) and some results for unbounded fan-in, bounded fan-out (or ''BFO'') circuits are presented.
Book ChapterDOI

ERCW PRAMs and Optical Communication

TL;DR: Algorithms and lower bounds for several fundamental problems on the ERCW PRAM and some results for unbounded fan-in, bounded fan-out (or ‘BFO’) circuits are presented.
References
More filters
Proceedings ArticleDOI

Parallelism in random access machines

TL;DR: A model of computation based on random access machines operating in parallel and sharing a common memory is presented and can accept in polynomial time exactly the sets accepted by nondeterministic exponential time bounded Turing machines.
Proceedings ArticleDOI

Algebraic methods in the theory of lower bounds for Boolean circuit complexity

TL;DR: It is proved that depth k circuits with gates NOT, OR and MODp where p is a prime require Exp(&Ogr;(n1/2k)) gates to calculate MODr functions for any r ≠ pm.
Journal ArticleDOI

Upper and lower time bounds for parallel random access machines without simultaneous writes

TL;DR: It is shown that even if the authors allow nonuniform algorithms, an arbitrary number of processors, and arbitrary instruction sets, $\Omega (\log n)$ is a lower bound on the time required to compute various simple functions, including sorting n keys and finding the logical “or” of n bits.
Journal ArticleDOI

A universal interconnection pattern for parallel computers

TL;DR: It is argued that conglomerates include all parallel machines which could feasibly be built with fixed connections, and a universal structure is developed which can simulate any other basic interconnection pattern within linear time.
Related Papers (5)