James Laudon

Researcher at Google

Publications - 66

Citations - 12478

James Laudon is an academic researcher from Google. The author has contributed to research in topics: Computer science & Cache. The author has an h-index of 27 and has co-authored 60 publications receiving 10444 citations. Previous affiliations of James Laudon include Oracle Corporation & Sun Microsystems.

Papers

Posted Content

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, +74 more

- 16 Apr 2017 -

arXiv: Hardware Architecture

TL;DR: This paper evaluates a custom ASIC, called a Tensor Processing Unit (TPU), deployed in datacenters since 2015 that accelerates the inference phase of neural networks (NN) and compares it to a server-class Intel Haswell CPU and an Nvidia K80 GPU, which are contemporaries deployed in the same datacenters.

Proceedings ArticleDOI

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, +75 more

TL;DR: The Tensor Processing Unit (TPU) is a custom ASIC deployed in datacenters since 2015 that accelerates the inference phase of neural networks (NN) using a 65,536 8-bit MAC matrix multiply unit that offers a peak throughput of 92 TeraOps/second (TOPS).

Proceedings ArticleDOI

Memory consistency and event ordering in scalable shared-memory multiprocessors

Kourosh Gharachorloo, +5 more

TL;DR: This paper introduces release consistency, a new model of memory consistency that allows for more buffering and pipelining than previously proposed models, and shows it to be equivalent to the sequential consistency model for parallel programs with sufficient synchronization.

Journal ArticleDOI

The Stanford Dash multiprocessor

Daniel E. Lenoski, +7 more

- 01 Mar 1992 -

IEEE Computer

TL;DR: The Directory Architecture for Shared Memory (Dash) allows shared data to be cached, significantly reducing the latency of memory accesses and yielding higher processor utilization and higher overall performance. A distributed directory-based protocol provides cache coherence without compromising scalability.

Proceedings ArticleDOI

The SGI Origin: a ccNUMA highly scalable server

James Laudon, +1 more

TL;DR: This paper discusses the motivation for building the Origin 2000, describes the architecture and implementation of the multiprocessor, and presents performance results for the NAS Parallel Benchmarks V2.2 and the SPLASH2 applications.
