Showing papers in "Journal of Parallel and Distributed Computing in 1987"

Journal Article•DOI•

Image algebra techniques for parallel image processing

Gerhard X. Ritter¹, Paul D. Gader¹•Institutions (1)

01 Feb 1987-Journal of Parallel and Distributed Computing

TL;DR: This paper shows how the image algebra suggests a general-purpose cellular pyramid array processor for real-time image processing tasks and demonstrates how algebraic techniques can be used to develop systematic methods for deriving parallel algorithms.

227 citations

Journal Article•DOI•

On mapping parallel algorithms into parallel architectures

Francine Berman¹, Lawrence H. Snyder²•Institutions (2)

University of California, San Diego¹, University of Washington²

01 Oct 1987-Journal of Parallel and Distributed Computing

TL;DR: Presents a solution to the mapping problem under topological and cardinality variations for a commonly used class of parallel interconnection structures, which includes shuffle-exchange networks, hypercubes, square meshes, linear systolic arrays, cube-connected cycles, and complete binary trees.

179 citations

Journal Article•DOI•

Efficient computation of optimal assignments for distributed tasks

J. B. Sinclair¹•Institutions (1)

Rice University¹

01 Aug 1987-Journal of Parallel and Distributed Computing

TL;DR: Presents a branch-and-bound-with-underestimates algorithm that reduces the size of the search tree, and evaluates its average time and space complexity for two underestimating functions through simulation; one of them, the minimum independent assignment cost underestimate (MIACU), performs extremely well over a wide range of values of program model parameters.

110 citations

Journal Article•DOI•

Supporting divide-and-conquer algorithms for image processing

Quentin F. Stout¹•Institutions (1)

University of Michigan¹

01 Feb 1987-Journal of Parallel and Distributed Computing

TL;DR: Some characteristics of divide-and-conquer algorithms are examined, along with some of their implications for the design of machines and languages that can support the efficient programming and execution of such algorithms.

60 citations

Journal Article•DOI•

Costs of quadtree representation of nondense matrices

David S. Wise¹, John Franco¹•Institutions (1)

Indiana University¹

01 Aug 1987-Journal of Parallel and Distributed Computing

TL;DR: This paper presents worst-case and average-case resource requirements for storing and retrieving familiar families of patterned matrices: packed, symmetric, triangular, Toeplitz, and banded.

40 citations

Journal Article•DOI•

Vision algorithms for hypercube machines

Trevor Mudge¹, T. S. Abdel-Rahman¹•Institutions (1)

University of Michigan¹

01 Feb 1987-Journal of Parallel and Distributed Computing

TL;DR: This paper develops a general model for hypercube machines and uses it to show how vision algorithms can be executed on hypercubes; the steps in the problem of thick-film inspection serve as a concrete example.

31 citations

Journal Article•DOI•

P³E: New life for projection-based image processing

Eric B. Hinkle¹, Jorge L. C. Sanz¹, Anil K. Jain², Dragutin Petkovic¹•Institutions (2)

IBM¹, University of California, Davis²

01 Feb 1987-Journal of Parallel and Distributed Computing

TL;DR: There is also an extensive list of key image analysis algorithms that are supported by P³E, thus making it a profound and versatile tool for projection-based computer vision.

30 citations

Journal Article•DOI•

Gaussian elimination on a hypercube automaton

Peter R. Cappello¹•Institutions (1)

University of California, Santa Barbara¹

01 Jun 1987-Journal of Parallel and Distributed Computing

TL;DR: It is shown that n²/log n processors suffice for achieving O(n log n) time, which improves the best previous result by a factor of log n processors, and is asymptotically optimal.

27 citations

Journal Article•DOI•

A fault tolerant massively parallel processing architecture

Vijay Balasubramanian¹, Prithviraj Banerjee¹•Institutions (1)

University of Illinois at Urbana–Champaign¹

01 Aug 1987-Journal of Parallel and Distributed Computing

TL;DR: A distributed diagnostic and structuring algorithm for the RECBAR is presented that enables the architecture to detect faults and structure itself accordingly within 2 · log₂(L) + 1 time steps, thus making it a truly fault tolerant architecture.

27 citations

Journal Article•DOI•

Performance analysis and optimization of VLSI dataflow arrays

Sun-Yuan Kung¹, P. S. Lewis¹, S. C. Lo¹•Institutions (1)

University of Southern California¹

01 Dec 1987-Journal of Parallel and Distributed Computing

TL;DR: A dataflow graph model for the timing analysis of general (cyclic or acyclic), decision-free asynchronous architectures is introduced, and it is shown how the results of this analysis can be used to synthesize optimal special-purpose hardware implementations of both general dataflow arrays and regular wavefront arrays.

24 citations

Journal Article•DOI•

Analysis of interconnection networks with different arbiter designs

Laxmi N. Bhuyan

01 Aug 1987-Journal of Parallel and Distributed Computing

TL;DR: This paper illustrates that this arbitration policy discriminates against remote or less frequent requests because it rejects them most of the time.

Journal Article•DOI•

A probabilistic approach to the load-sharing problem in distributed systems

Eli Shamir¹, Eli Upfal²•Institutions (2)

Hebrew University of Jerusalem¹, IBM²

01 Oct 1987-Journal of Parallel and Distributed Computing

Journal Article•DOI•

Analysis of a distributed algorithm for extrema finding in a ring

Doron Rotem¹, Ephraim Korach², Nicola Santoro³•Institutions (3)

University of Waterloo¹, Technion – Israel Institute of Technology², Carleton University³

01 Dec 1987-Journal of Parallel and Distributed Computing

TL;DR: This analysis shows that this simple algorithm, which is known to be average case optimal, compares very favorably with all the other known algorithms as it requires O(n log n) messages with probability tending to one.
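
As an illustration of the kind of algorithm analyzed here, the sketch below simulates unidirectional extrema finding on a ring in the style of Chang and Roberts. The listing itself does not name the algorithm, so that choice, the function name `chang_roberts`, and the round-based simulation are all assumptions made for illustration. Node identifiers are assumed to be distinct.

```python
def chang_roberts(ids):
    """Simulate unidirectional max-finding on a ring (illustrative sketch).

    ids: distinct identifiers, listed in ring order; messages travel from
    position i to position (i + 1) % n.
    Returns (leader_id, total_messages_sent).
    """
    n = len(ids)
    # Every node starts by sending its own id to its successor.
    # Each entry is (position of the receiving node, id in transit).
    in_transit = [((i + 1) % n, ids[i]) for i in range(n)]
    messages = n
    leader = None
    while in_transit:
        next_round = []
        for pos, cand in in_transit:
            if cand == ids[pos]:
                # The id travelled all the way round, so it is the maximum.
                leader = cand
            elif cand > ids[pos]:
                # Larger ids are forwarded to the next node.
                next_round.append(((pos + 1) % n, cand))
                messages += 1
            # Smaller ids are swallowed and generate no further traffic.
        in_transit = next_round
    return leader, messages
```

Under these assumptions, an arrangement in which identifiers decrease along the direction of travel costs n(n + 1)/2 messages, while typical random arrangements cost far fewer, which is the gap a probabilistic analysis of this kind addresses.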

Journal Article•DOI•

Low-level image analysis tasks on fine-grained tree-structured SIMD machines

Hussein A. H. Ibrahim¹, John R. Kender¹, David E. Shaw¹•Institutions (1)

Columbia University¹

01 Dec 1987-Journal of Parallel and Distributed Computing

TL;DR: Novel algorithmic techniques are described, such as vertical pipelining, subproblem partitioning, associative matching, and data duplication, that effectively exploit the massive parallelism available in fine-grained SIMD tree machines while avoiding communication bottlenecks.

Journal Article•DOI•

Designing cellular permutation networks through coset decompositions of symmetric groups

A. Yavuz Oruç¹•Institutions (1)

Rensselaer Polytechnic Institute¹

01 Aug 1987-Journal of Parallel and Distributed Computing

TL;DR: A group theoretic representation of these networks is given and it is shown that all existing cellular permutation arrays are the network realizations of the recursive coset decompositions of symmetric groups.

Journal Article•DOI•

A systolic array for cyclic-by-rows Jacobi algorithms

Uwe Schwiegelshohn¹, Lothar Thiele¹•Institutions (1)

Technische Universität München¹

01 Jun 1987-Journal of Parallel and Distributed Computing

TL;DR: A systolic array is derived which requires (n + 1)²/4 processing cells and has a time complexity of O(n) for each sweep.

Journal Article•DOI•

The design and implementation of a Pascal-based language for array processor architectures

Ronald H. Perrott¹, R. W. Lyttle¹, P. S. Dhillon¹•Institutions (1)

Queen's University Belfast¹

01 Jun 1987-Journal of Parallel and Distributed Computing

TL;DR: Work on the implementation of a compiler for the ICL Distributed Array Processor (DAP), which has evolved from the Pascal-based parallel language Actus, is currently under way and some aspects of this implementation are described.

Journal Article•DOI•

Randomized parallel speedups for list ranking

Uzi Vishkin¹•Institutions (1)

Courant Institute of Mathematical Sciences¹

01 Jun 1987-Journal of Parallel and Distributed Computing

TL;DR: Using a recently published parallel prefix sums algorithm the list-ranking algorithm can be adapted to run on a concurrent-read concurrent-write parallel random access machine (CRCW PRAM) almost surely in time O(n/p + log n) using p processors.

Journal Article•DOI•

A residue arithmetic implementation of the FFT

Fred J. Taylor

01 Apr 1987-Journal of Parallel and Distributed Computing

TL;DR: The result of this analysis suggests that the new single-modulus complex RNS may be significantly superior to the alternative FFT design choices.

Journal Article•DOI•

A parallel first-order linear recurrence solver

Gerard G. L. Meyer¹, Louis J. Podrazik¹•Institutions (1)

Johns Hopkins University¹

01 Apr 1987-Journal of Parallel and Distributed Computing

TL;DR: A parallel procedure for the solution of first-order linear recurrence systems of size N when the number of processors p is small in relation to N is presented and achieves the lower bound 2(N − 1)/(p + 1) for solving the parallel prefix problem on a p processor machine.
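
To make the connection between first-order linear recurrences and parallel prefix concrete, here is a minimal sketch of a generic three-phase blocked scheme for x[i] = a[i]·x[i−1] + b[i] with p blocks. It is not the procedure from the paper and does not claim to meet the stated bound; the names `compose` and `solve_recurrence_blocked` are hypothetical, and the "parallel" phases are written as ordinary loops for readability.

```python
from functools import reduce


def compose(f, g):
    """Compose two affine maps stored as (a, b) for x -> a*x + b: apply f, then g."""
    a1, b1 = f
    a2, b2 = g
    return (a2 * a1, a2 * b1 + b2)


def solve_recurrence_blocked(a, b, x0, p):
    """Solve x[i] = a[i]*x[i-1] + b[i], with x0 as the value before index 0.

    The work is split into at most p contiguous blocks (p >= 1).
    """
    n = len(a)
    if n == 0:
        return []
    size = -(-n // p)  # ceiling division: elements per block
    blocks = [list(zip(a[s:s + size], b[s:s + size])) for s in range(0, n, size)]
    # Phase 1 (independent per block): summarise each block as one affine map.
    summaries = [reduce(compose, blk) for blk in blocks]
    # Phase 2 (sequential, one step per block): value entering each block.
    starts = [x0]
    for sa, sb in summaries[:-1]:
        starts.append(sa * starts[-1] + sb)
    # Phase 3 (independent per block): replay each block from its start value.
    out = []
    for blk, x in zip(blocks, starts):
        for ai, bi in blk:
            x = ai * x + bi
            out.append(x)
    return out
```

The scheme works because composition of affine maps is associative, so each block can be summarised without knowing the value that enters it; only phase 2 is inherently sequential, and its length grows with p rather than N.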

Journal Article•DOI•

Iteration-level parallel execution of do loops with a reduced set of dependence relations

Zen Chen¹, Chih-Chih Chang¹•Institutions (1)

National Chiao Tung University¹

01 Oct 1987-Journal of Parallel and Distributed Computing

TL;DR: It is shown that in this model only one kind of dependence relation is needed, and this fact leads to a smaller set of dependence relations and, therefore, results in better parallel performance.

Journal Article•

Special Issue on Parallel Image Processing and Pattern Recognition.

Leah H. Jamieson, Steven L. Tanimoto

01 Jan 1987-Journal of Parallel and Distributed Computing

Journal Article•DOI•

An architecture and an interconnection scheme for time-sliced buses

A. Kovaleski, S. Ratheal¹, Fabrizio Lombardi²•Institutions (2)

Sandia National Laboratories¹, University of Colorado Boulder²

01 Apr 1987-Journal of Parallel and Distributed Computing

TL;DR: In this article, an interconnection scheme based on a bus network consisting of high-speed time-sliced buses and interbus links of matching bandwidths is described, and two contrasting approaches to simulating such a machine are discussed.

Journal Article•DOI•

Notes on the complexity of systolic programs

Stephen Taylor¹, Lisa Hellerstein¹, Shmuel Safra¹, Ehud Shapiro¹•Institutions (1)

Weizmann Institute of Science¹

01 Jun 1987-Journal of Parallel and Distributed Computing

TL;DR: In this article, the authors discuss basic theoretical issues concerning systolic programming methodology and demonstrate simple techniques which can be used to structure communication and show how the complexity of two simple algorithms is adversely affected by the cost of data movement in a parallel system.

Journal Article•DOI•

Distributed process manager for an engineering network computer

Jason Gait

01 Aug 1987-Journal of Parallel and Distributed Computing

TL;DR: MP is a manager for systems of cooperating processes in a local area network of engineering workstations that exhibits real-time behavior.

Journal Article•DOI•

A VLSI multiprecision matrix multiplier and polynomial evaluator

Darrell Makarenko¹, Jonathan Schaeffer¹•Institutions (1)

University of Alberta¹

01 Dec 1987-Journal of Parallel and Distributed Computing

TL;DR: A VLSI design of a multiprecision matrix multiplier and polynomial evaluator that addresses the issues of implementation is described, consisting of a two-dimensional array of bit-serial multiply and accumulate cells, each implemented as an accumulator and a pair of registers.

Journal Article•DOI•

Interval arithmetic block cyclic reduction on vector computers

Hartmut Schwandt

01 Oct 1987-Journal of Parallel and Distributed Computing

TL;DR: Several algorithms for interval arithmetic block cyclic reduction for efficient application to vector computers under the condition that interval arithmetic inclusion properties be preserved are introduced.

Journal Article•DOI•

Parallel algorithms for message decomposition

Shang-Hua Teng¹, Bin Wang¹•Institutions (1)

University of Southern California¹

01 Jun 1987-Journal of Parallel and Distributed Computing

TL;DR: Presents an optimal parallel algorithm to decompose prefix-coded messages and uniquely decipherable-coded messages in O(n/P) time, using O(P) processors on the weakest version of parallel random access machines, in which concurrent read and concurrent write to a cell in the common memory are not allowed.

Journal Article•DOI•

Distributed implementation of nested communicating sequential processes: communication and termination

Fabrizio Baiardi, A. Fantechi, Marco Vanneschi, A. Tomasi

01 Dec 1987-Journal of Parallel and Distributed Computing

TL;DR: A protocol for process termination handling that guarantees the consistency of distributed structures and aims at optimizing the rate of communications among virtual processors is presented.

Journal Article•DOI•

Finding test-and-treatment procedures using parallel computation

Louis D. Duval¹, Robert A. Wagner¹, Yijie Han¹, Donald W. Loveland¹•Institutions (1)

Duke University¹

01 Jun 1987-Journal of Parallel and Distributed Computing

TL;DR: A parallel algorithm for the NP-hard problem test-and-treatment is presented for a machine whose number of connections is 3p(2 squared), where p is the number of processing elements (PEs), and where the PEs are simple enough that a machine with 2^20 PEs is currently implementable and a machine with 2^30 PEs is feasible.
