Showing papers in "Journal of Parallel and Distributed Computing in 1995"

Journal Article•DOI•

Distributed loop computer networks: a survey

Jean-Claude Bermond, Francesc Comellas¹, D. F. Hsu²•Institutions (2)

Polytechnic University of Catalonia¹, Fordham University²

11 Jan 1995-Journal of Parallel and Distributed Computing

TL;DR: Surveys recent results on distributed loop computer networks, including the computation of the minimum diameter and the construction of loop networks that achieve this optimum.

382 citations

Journal Article•DOI•

Load balancing and data locality in adaptive hierarchical N-body methods: Barnes-Hut, fast multipole, and radiosity

Jaswinder Pal Singh¹, Chris Holt¹, Takashi Totsuka¹, Anoop Gupta¹, John L. Hennessy¹•Institutions (1)

Stanford University¹

01 Jun 1995-Journal of Parallel and Distributed Computing

TL;DR: This paper examines the partitioning and scheduling techniques required to obtain effective parallel performance on applications that use a range of hierarchical N-body methods, and examines a recent hierarchical method for radiosity calculations in computer graphics.

205 citations

Journal Article•DOI•

FORTRAN M - A Language for Modular Parallel Programming

Ian Foster¹, K.M. Chandy¹•Institutions (1)

Argonne National Laboratory¹

01 Apr 1995-Journal of Parallel and Distributed Computing

TL;DR: FORTRAN M is a small set of extensions to FORTRAN 77 that supports a modular approach to the design of message-passing programs that can be compiled efficiently for uniprocessors, shared-memory computers, distributed-memory computers, and networks of workstations.

181 citations

Journal Article•DOI•

Channel Allocation under Batching and VCR Control in Video-on-Demand Systems

Asit Dan¹, Perwez Shahabuddin¹, Dinkar Sitaram¹, D. Towsley¹•Institutions (1)

IBM¹

01 Nov 1995-Journal of Parallel and Distributed Computing

TL;DR: An analytical model is developed that predicts the reneging probability and expected resume delay, and this model is used to optimally allocate channels for batching, on-demand playback, and contingency. The effectiveness of the proposed policy over a scheme with no contingency channels and no batching is demonstrated.

128 citations

Journal Article•DOI•

Static and Dynamic Processor Scheduling Disciplines in Heterogeneous Parallel Architectures

Daniel A. Menascé, Debanjan Saha, S.C.D. Porto, Virgilio Almeida, Satish K. Tripathi

01 Jul 1995-Journal of Parallel and Distributed Computing

TL;DR: A new static processor assignment policy, called Largest Task First Minimum Finish Time (LTFMFT), is introduced and the analysis shows that this policy is very sensitive to the degree of heterogeneity of the architecture, and that it outperforms all other policies analyzed.

124 citations

Journal Article•DOI•

Generating Local Addresses and Communication Sets for Data-Parallel Programs

Siddhartha Chatterjee, John R. Gilbert, Fred Long, Robert Schreiber, Shang-Hua Teng

01 Apr 1995-Journal of Parallel and Distributed Computing

TL;DR: This work demonstrates a storage scheme for an array A affinely aligned to a template that is distributed across p processors with a cyclic(k) distribution that does not waste any storage, and shows that the local memory access sequence of any processor for a computation involving the regular section A(l:h:s) is characterized by a finite state machine of at most k states.

121 citations

Journal Article•DOI•

Design of the Munin distributed shared memory system

John B. Carter¹•Institutions (1)

University of Utah¹

01 Sep 1995-Journal of Parallel and Distributed Computing

TL;DR: A detailed description of the design and implementation of the Munin prototype, with special emphasis given to its novel write shared protocol.

95 citations

Journal Article•DOI•

Optimal communication algorithms on star graphs using spanning tree constructions

Paraskevi Fragopoulou¹, Selim G. Akl¹•Institutions (1)

Queen's University¹

11 Jan 1995-Journal of Parallel and Distributed Computing

TL;DR: All the communication algorithms presented in this paper are based on the construction of spanning trees with special properties on the star graph to fit different communication needs, and are designed in terms of both time and number of message transmissions.

91 citations

Journal Article•DOI•

“Hypermeshes”: optical interconnection networks for parallel computing

Ted H. Szymanski¹•Institutions (1)

McGill University¹

01 Apr 1995-Journal of Parallel and Distributed Computing

TL;DR: Hypermeshes are shown to have high bisection bandwidths, thereby minimizing the time for many common algorithms such as parallel sorting, and are considerably more powerful computational models than meshes, generalized hypercubes, and other orthogonal graphs.

87 citations

Journal Article•DOI•

A Cost Calculus for Parallel Functional Programming

David B. Skillicorn¹, Wentong Cai¹•Institutions (1)

Nanyang Technological University¹

01 Jul 1995-Journal of Parallel and Distributed Computing

TL;DR: This work presents a strategy for building cost calculi for skeleton-based programming languages which can be used for derivational software development and which deals in a pragmatic way with the difficulties of composition.

87 citations

Journal Article•DOI•

An optimal sorting algorithm on reconfigurable mesh

Ju-wook Jang, Viktor K. Prasanna

15 Feb 1995-Journal of Parallel and Distributed Computing

TL;DR: Nontrivial ways to use the Reconfigurable Mesh to solve several basic arithmetic problems in constant time are shown by novel ways to represent numbers and by exploiting the reconfigurability of the architecture.

Journal Article•DOI•

Evaluating the Performance of Cache-Affinity Scheduling in Shared-Memory Multiprocessors

Josep Torrellas¹, Andrew Tucker¹, Ashish Gupta¹•Institutions (1)

Stanford University¹

01 Feb 1995-Journal of Parallel and Distributed Computing

TL;DR: This paper explores affinity scheduling, a technique that helps reduce cache misses by preferentially scheduling a process on a processor where it has run recently, and shows that it is extremely simple to add to existing schedulers.

Journal Article•DOI•

Performance of a Mass-Storage System for Video-on-Demand

J.W. Hsieh¹, Mengjou Lin¹, Jonathan C. L. Liu¹, David H. C. Du¹, Thomas M. Ruwart¹•Institutions (1)

University of Minnesota¹

01 Nov 1995-Journal of Parallel and Distributed Computing

TL;DR: The experimental results indicate that the storage system of the Onyx machine can potentially provide about 360 concurrent video accesses with guaranteed quality of service; the impact of different concurrent access patterns on the performance of a server is also studied.

Journal Article•DOI•

The Hough transform on a reconfigurable multi-ring network

Suchendra M. Bhandarkar¹, Hamid R. Arabnia¹•Institutions (1)

University of Georgia¹

11 Jan 1995-Journal of Parallel and Distributed Computing

TL;DR: The RMRN is shown to be a truly scalable network, in that each node in the network has a fixed degree of connectivity and the reconfiguration mechanism ensures a network diameter of O(log 2 N) for an N-processor network.

Journal Article•DOI•

Horizons of parallel computation

Gianfranco Bilardi¹, Franco P. Preparata²•Institutions (2)

University of Padua¹, Brown University²

01 Jun 1995-Journal of Parallel and Distributed Computing

TL;DR: The ultimate impact of fundamental physical limitations on parallel computing machines is considered, and it is found that scalability holds only for neighborly interconnections of bounded-size synchronous modules, presumably of the area-universal type.

Journal Article•DOI•

A Binding Architecture for Multimedia Networks

Aurel A. Lazar, Shailendra K. Bhonsle, Koon-Seng Lim

01 Nov 1995-Journal of Parallel and Distributed Computing

TL;DR: An open architecture that achieves seamless binding between networking and multimedia devices is proposed and is embedded into a reference model for multimedia networking architectures that supports a clean separation between binding interfaces and binding algorithms.

Journal Article•DOI•

The generalized dimension exchange method for load balancing in k-ary n-cubes and variants

Cheng-Zhong Xu¹, Francis C. M. Lau¹•Institutions (1)

Shantou University¹

11 Jan 1995-Journal of Parallel and Distributed Computing

TL;DR: This paper derives the optimal λ values for the k-ary n-cube network and its variants (the ring, the torus, the chain, and the mesh), and concludes that the GDE method favors high-dimensional k-ary n-cubes.

Journal Article•DOI•

Affine-by-statement scheduling of uniform and affine loop nests over parametric domains

Alain Darte¹, Yves Robert¹•Institutions (1)

École Normale Supérieure¹

15 Aug 1995-Journal of Parallel and Distributed Computing

TL;DR: A new, constructive and efficient method is presented to determine the optimal (i.e., with smallest latency) affine-by-statement scheduling, and it is shown that these schedules are asymptotically as efficient as parameter-dependent solutions while much more regular.

Journal Article•DOI•

Performance of the NAS Parallel Benchmarks on PVM-Based Networks

S. White¹, A. Alund¹, Vaidy S. Sunderam¹•Institutions (1)

Emory University¹

01 Apr 1995-Journal of Parallel and Distributed Computing

TL;DR: Results of porting and executing the NPB kernels in three different cluster environments using low- to medium-powered workstations on Ethernet and two types of FDDI networks indicate that mediocre to good performance could be obtained despite the communications-intensive nature of the applications.

Journal Article•DOI•

A Data-Parallel Approach for Real-Time MPEG-2 Video Encoding

Shahriar M. Akramullah¹, Ishfaq Ahmad¹, M.L. Liou¹•Institutions (1)

University of Hong Kong¹

01 Nov 1995-Journal of Parallel and Distributed Computing

TL;DR: A fine-grained parallel implementation of the MPEG-2 video encoder on the Intel Paragon XP/S parallel computer using a data-parallel approach and exploiting parallelism within each frame makes it suitable for real-time applications where the complete video sequence may not be present on the disk and may become available on a frame-by-frame basis with time.

Journal Article•DOI•

On Efficiently Implementing Global Time for Performance Evaluation on Multiprocessor Systems

E. Maillet, C. Tron

01 Jul 1995-Journal of Parallel and Distributed Computing

TL;DR: This paper familiarizes the reader with statistical global time estimation methods by presenting two methods, which have been introduced in the literature, and shows how a good balance between length of sample period and global time precision can be achieved through a detailed experimental analysis of the estimation error observed on samples.

Journal Article•DOI•

Parallel many-body simulations without all-to-all communication

Bruce Hendrickson¹, Steve Plimpton¹•Institutions (1)

Sandia National Laboratories¹

01 May 1995-Journal of Parallel and Distributed Computing

TL;DR: This work presents a new approach, suitable for direct simulations, that avoids all-to-all communication without requiring any geometric clustering, and proves to be fastest for simulations of up to several thousand particles.

Journal Article•DOI•

Using write caches to improve performance of cache coherence protocols in shared-memory multiprocessors

Fredrik Dahlgren¹, Per Stenström¹•Institutions (1)

Lund University¹

15 Apr 1995-Journal of Parallel and Distributed Computing

TL;DR: It is shown that update-based cache protocols can perform significantly better than write-invalidate protocols by incorporating a write cache in each processing node, and the memory-access penalty associated with coherence misses is drastically reduced.

Journal Article•DOI•

An evaluation of software-based release consistent protocols

P. Keleher¹, Alan L. Cox¹, Sandhya Dwarkadas¹, Willy Zwaenepoel¹•Institutions (1)

Rice University¹

01 Sep 1995-Journal of Parallel and Distributed Computing

TL;DR: This paper presents an evaluation of three software implementations of release consistency, which allow data communication to be aggregated and allow multiple writers to simultaneously modify a single page, and shows that the lazy protocols consistently outperform the eager protocol for all but one application and the lazy hybrid performs the best overall.

Journal Article•DOI•

Efficient self-simulation algorithms for reconfigurable arrays

Yosi Ben-Asher¹, Dan Gordon¹, Assaf Schuster²•Institutions (2)

University of Haifa¹, Technion – Israel Institute of Technology²

01 Oct 1995-Journal of Parallel and Distributed Computing

TL;DR: This work gives several positive answers to the self-simulation problem on dynamically reconfigurable meshes, showing that the simulation of a reconfiguring mesh by a smaller one can be carried out optimally, by using standard methods, on meshes such that buses are established along rows or along columns.

Journal Article•DOI•

Deadlock Models and a General Algorithm for Distributed Deadlock Detection

J. Brzezinski, M. Raynal, M. Singhal

01 Dec 1995-Journal of Parallel and Distributed Computing

TL;DR: In this paper, the problem of deadlock detection in asynchronous message passing systems in a system model that covers unspecified receptions and non-FIFO channels is dealt with, and a hierarchy of algorithms is presented.

Journal Article•DOI•

Concurrent Aggregates (CA)

Andrew A. Chien¹•Institutions (1)

University of Illinois at Urbana–Champaign¹

01 Mar 1995-Journal of Parallel and Distributed Computing

TL;DR: This paper describes and evaluates the use of aggregates in a programming language, and evaluates language support in CA for composing multiaccess data abstractions (delegation, first-class messages, and first-class and user-defined continuations).

Journal Article•DOI•

Detecting termination by weight-throwing in a faulty distributed system

Yu-Chee Tseng

15 Feb 1995-Journal of Parallel and Distributed Computing

TL;DR: This fault-tolerant termination detection algorithm for a distributed system in which processes tend to fail has fewer detection delays than existing algorithms in the literature and comparable performance in terms of message complexity.

Journal Article•DOI•

On Embedding Binary Trees into Hypercubes

W.K. Chen, Matthias F. Stallmann

01 Feb 1995-Journal of Parallel and Distributed Computing

TL;DR: A simple linear-time heuristic is presented which embeds an arbitrary binary tree into a hypercube with expansion 1 and average dilation no more than 2 and extends good embeddings for parity-balanced binary trees to arbitrary binary trees.

Journal Article•DOI•

Advanced compiler optimizations for sparse computations

Aart J. C. Bik¹, Harry A. G. Wijshoff¹•Institutions (1)

Leiden University¹

15 Nov 1995-Journal of Parallel and Distributed Computing

TL;DR: In this article, the compiler is presented with dense code and automatically converts it into code operating on sparse data structures, then the dependence information obtained by analysis of the original code can be used to exploit potential concurrency in the generated sparse code.
