
Showing papers by "Jean-Luc Gaudiot published in 2002"


Journal ArticleDOI
TL;DR: This paper evaluates the silicon overhead of SMT by performing a transistor/interconnect-level analysis of the layout and shows how the Instruction Set Architecture (ISA) and microarchitecture can have a large effect on the SMT overhead and performance.
Abstract: Simultaneous Multi-Threading (SMT) is a hardware technique that increases processor throughput by issuing instructions simultaneously from multiple threads. However, while SMT can be added to an existing microarchitecture with relatively low overhead, the additional chip area could instead be used for other resources such as more functional units, larger caches, or better branch predictors. How large is the SMT overhead, and at what point does SMT no longer pay off for maximum throughput compared with adding other architectural features? This paper evaluates the silicon overhead of SMT by performing a transistor/interconnect-level analysis of the layout. We discuss microarchitecture issues that impact SMT implementations and show how the Instruction Set Architecture (ISA) and microarchitecture can have a large effect on SMT overhead and performance. Results show that SMT yields large performance gains with small to moderate area overhead.
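
To make the area-versus-throughput question concrete, here is a minimal back-of-envelope sketch; it is not taken from the paper, and every speedup and overhead figure in it is a hypothetical placeholder. It compares the throughput gained per unit of extra silicon area for SMT against spending the same area on another resource:

```python
def throughput_per_area(speedup, area_overhead):
    """Relative throughput gain divided by relative area cost."""
    return (speedup - 1.0) / area_overhead

# Hypothetical design points (name, speedup over baseline, fractional area overhead).
candidates = [
    ("SMT (2 threads)", 1.40, 0.05),   # assumed: 40% more throughput for 5% more area
    ("Larger L2 cache", 1.10, 0.05),   # assumed: same area spent on cache instead
]

for name, speedup, overhead in candidates:
    print(f"{name}: {throughput_per_area(speedup, overhead):.1f} gain per unit area")
```

Under these assumed numbers, SMT returns several times more throughput per unit of area than the alternative, which is the kind of comparison the paper's layout-level analysis makes rigorous.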

60 citations


DOI
01 Jan 2002
TL;DR: A flow-sensitive alias analysis algorithm that computes safe and efficient alias sets in Java is proposed, along with a references-set representation of aliased elements, its type table, and its propagation rules.
Abstract: We propose a flow-sensitive alias analysis algorithm that computes safe and efficient alias sets in Java. To this end, we propose a references-set representation of aliased elements, its type table, and its propagation rules. When building the control flow graph, we model exception constructs by considering try/catch/finally blocks as well as statement nodes that may raise exceptions. Finally, for safe alias computation on the control flow graph, we present a structural-order traversal of each block and node.
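
As a rough illustration of the idea, a references-set state (each set grouping variables that may refer to the same object) can be updated flow-sensitively as statements are traversed. This is a minimal sketch with simplified transfer rules of my own; it is not the paper's actual propagation rules or type table:

```python
def new_object(sets, var):
    """Transfer for 'var = new ...': var now refers to a fresh object alone."""
    sets = [s - {var} for s in sets]           # kill the old binding of var
    return [s for s in sets if s] + [{var}]

def assign(sets, lhs, rhs):
    """Transfer for 'lhs = rhs': lhs leaves its old set and joins rhs's set."""
    sets = [s - {lhs} for s in sets]           # kill
    sets = [s for s in sets if s]
    for s in sets:                             # gen: merge lhs into rhs's set
        if rhs in s:
            s.add(lhs)
            return sets
    return sets + [{lhs, rhs}]

state = new_object([], "a")        # a = new A()
state = assign(state, "b", "a")    # b = a       -> a and b may alias
state = new_object(state, "b")     # b = new A() -> the alias is killed
print(state)                       # [{'a'}, {'b'}]
```

Flow-sensitivity is what lets the final statement kill the earlier alias; a flow-insensitive analysis would conservatively keep a and b aliased.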

12 citations


Book ChapterDOI
TL;DR: A methodology for parallel programming, along with MPI performance measurement and prediction in a class of distributed computing environments, namely networks of workstations, is presented, based on a two-level model in which analytical models represent the execution behavior of parallel communications and code segments.
Abstract: We present a methodology for parallel programming, along with MPI performance measurement and prediction, in a class of distributed computing environments, namely networks of workstations. Our approach is based on a two-level model: at the top level, a new parallel version of the timing-graph representation makes explicit the parallel communications and code segments of a given parallel program, while at the bottom level, analytical models represent the execution behavior of those communications and code segments. Measured execution times, together with the problem size and the number of nodes, are input to the model, which allows us to predict the performance of similar cluster computing systems with a different number of nodes. The analytical model is validated through experiments on a homogeneous cluster of workstations. Final results show that our approach produces accurate predictions, within 5% of actual results.
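
As a sketch of the bottom-level idea, an analytical expression for a code segment plus its communication can be evaluated at a different node count. The model form and every coefficient below are hypothetical, not the paper's; in practice the coefficients would be calibrated from measured runs:

```python
def predict_time(n, p, t_comp, t_latency, t_byte):
    """T(n, p): evenly divided computation plus a gather-style exchange.
    n: problem size, p: number of nodes (all coefficients assumed)."""
    compute = t_comp * n / p
    communicate = (p - 1) * (t_latency + t_byte * n / p)
    return compute + communicate

for p in (4, 8, 16):
    t = predict_time(n=1_000_000, p=p, t_comp=4.0e-5, t_latency=1e-3, t_byte=1e-7)
    print(f"p={p:2d}: predicted {t:.2f} s")
```

Once calibrated on one cluster size, such a closed form is what lets the methodology extrapolate to a different number of nodes.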

10 citations


Proceedings ArticleDOI
16 Jun 2002
TL;DR: This work presents an experimental evaluation of thread migration's ability to reduce the impact of remote array accesses across distributed-memory computers and compares the alternatives using various array access patterns.
Abstract: Thread migration is one approach to remote memory access on distributed-memory parallel computers. In thread migration, threads of control migrate between processors to access data local to those processors, whereas conventional approaches move data to the threads that need it. Migration approaches enhance spatial locality by making large address spaces local, but are less adept at exploiting temporal locality. Data-moving approaches, such as cached remote memory fetches or distributed shared memory, can exploit both types of locality. We present an experimental evaluation of thread migration's ability to reduce the impact of remote array accesses across distributed-memory computers. Nomadic Threads uses compiler-generated fine-grain threads that either migrate to make data local or fetch cache lines, tolerating latency through multithreading. We compare these alternatives using various array access patterns.
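
A back-of-envelope cost model, assumed here purely for illustration and not drawn from Nomadic Threads itself, shows why migration can win when many consecutive accesses target the same remote node: one message moves the thread, versus one fetch per cache line touched. All byte counts and network parameters are hypothetical:

```python
import math

def migration_cost(state_bytes, t_latency, t_byte):
    """One message carrying the thread's continuation to the data's node."""
    return t_latency + t_byte * state_bytes

def fetch_cost(k_accesses, line_elems, elem_bytes, t_latency, t_byte):
    """One remote fetch per cache line touched."""
    lines = math.ceil(k_accesses / line_elems)
    return lines * (t_latency + t_byte * line_elems * elem_bytes)

net = dict(t_latency=5e-6, t_byte=1e-9)        # assumed network parameters
print("migrate once:", migration_cost(256, **net))
print("fetch lines: ", fetch_cost(1024, line_elems=16, elem_bytes=8, **net))
```

With these assumed numbers a single migration is far cheaper than fetching 64 cache lines, while for a few scattered accesses the inequality reverses, which is why the access pattern drives the comparison.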

9 citations


Journal ArticleDOI
TL;DR: Improved load balance clearly leads to improved execution time, and load balancing on computers with heterogeneous processing capacities is more challenging than in the homogeneous case.
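
As a minimal illustration of the heterogeneous case (the capacities and task count below are hypothetical), work can be assigned in proportion to each node's processing capacity so that all nodes finish at roughly the same time:

```python
def balance(total_tasks, capacities):
    """Split tasks in proportion to relative node speeds."""
    total = sum(capacities)
    shares = [round(total_tasks * c / total) for c in capacities]
    shares[-1] += total_tasks - sum(shares)    # absorb any rounding remainder
    return shares

print(balance(1000, [1.0, 1.0, 2.0, 4.0]))     # -> [125, 125, 250, 500]
```

An equal split of 250 tasks per node would leave the fastest node idle while the slowest ones lag, which is the imbalance the heterogeneous case must avoid.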

6 citations