Showing papers by "Jeffrey Dean published in 2020"

PDF

Open Access

Posted Content•

Chip Placement with Deep Reinforcement Learning

[...]

Azalia Mirhoseini, Anna Goldie, Mustafa Yazgan, Joe Jiang, Ebrahim M. Songhori, Shen Wang, Young-Joon Lee¹, Eric Johnson, Omkar Pathak, Sungmin Bae, Azade Nazi, Jiwoo Pak, Andy Tong, Kavya Srinivasa, William Hang, Emre Tuncer, Anand Babu, Quoc V. Le, James Laudon, C. Richard Ho, Roger Carpenter, Jeffrey Dean - Show less +18 more•Institutions (1)

Google¹

22 Apr 2020-arXiv: Learning

TL;DR: This work presents a learning-based approach to chip placement, and shows that, in under 6 hours, this method can generate placements that are superhuman or comparable on modern accelerator netlists, whereas existing baselines require human experts in the loop and take several weeks.

...read moreread less

Abstract: In this work, we present a learning-based approach to chip placement, one of the most complex and time-consuming stages of the chip design process. Unlike prior methods, our approach has the ability to learn from past experience and improve over time. In particular, as we train over a greater number of chip blocks, our method becomes better at rapidly generating optimized placements for previously unseen chip blocks. To achieve these results, we pose placement as a Reinforcement Learning (RL) problem and train an agent to place the nodes of a chip netlist onto a chip canvas. To enable our RL policy to generalize to unseen blocks, we ground representation learning in the supervised task of predicting placement quality. By designing a neural architecture that can accurately predict reward across a wide variety of netlists and their placements, we are able to generate rich feature embeddings of the input netlists. We then use this architecture as the encoder of our policy and value networks to enable transfer learning. Our objective is to minimize PPA (power, performance, and area), and we show that, in under 6 hours, our method can generate placements that are superhuman or comparable on modern accelerator netlists, whereas existing baselines require human experts in the loop and take several weeks.

...read moreread less

139 citations

Proceedings Article•DOI•

1.1 The Deep Learning Revolution and Its Implications for Computer Architecture and Chip Design

[...]

Jeffrey Dean¹•Institutions (1)

Google¹

01 Feb 2020

TL;DR: This paper provides a sketch of at least one interesting direction towards much larger-scale multi-task models that are sparsely activated and employ much more dynamic, exampleand task-based routing than the machine learning models of today.

...read moreread less

Abstract: The past decade has seen a remarkable series of advances in machine learning, and in particular deeplearning approaches based on artificial neural networks, to improve our abilities to build more accurate systems across a broad range of areas, including computer vision, speech recognition, language translation, and natural language understanding tasks. This paper is a companion paper to a keynote talk at the 2020 International Solid-State Circuits Conference (ISSCC) discussing some of the advances in machine learning, and their implications on the kinds of computational devices we need to build, especially in the post-Moore's Lawera. It also discusses some of the ways that machine learning may be able to help with some aspects of the circuit design process. Finally, it provides a sketch of at least one interesting direction towards much larger-scale multi-task models that are sparsely activated and employ much more dynamic, exampleand task-based routing than the machine learning models of today.

...read moreread less

36 citations

Journal Article•DOI•

Customization Scenarios for De-identification of Clinical Notes

[...]

Tzvika Hartman¹, Michael D. Howell¹, Jeffrey Dean¹, Shlomo Hoory¹, Ronit Slyper¹, Itay Laish¹, Oren Gilon¹, Danny Vainstein¹, Greg S. Corrado¹, Katherine Chou¹, Ming Jack Po¹, Jutta Williams, Scott Ellis¹, Gavin Edward Bee¹, Avinatan Hassidim¹, Rony Amira¹, Genady Beryozkin¹, Idan Szpektor¹, Yossi Matias¹ - Show less +15 more•Institutions (1)

Google¹

30 Jan 2020-BMC Medical Informatics and Decision Making

TL;DR: Health organizations should be aware of the levels of customization available when selecting a de-identification deployment solution, in order to choose the one that best matches their resources and target performance level.

...read moreread less

Abstract: Automated machine-learning systems are able to de-identify electronic medical records, including free-text clinical notes. Use of such systems would greatly boost the amount of data available to researchers, yet their deployment has been limited due to uncertainty about their performance when applied to new datasets. We present practical options for clinical note de-identification, assessing performance of machine learning systems ranging from off-the-shelf to fully customized. We implement a state-of-the-art machine learning de-identification system, training and testing on pairs of datasets that match the deployment scenarios. We use clinical notes from two i2b2 competition corpora, the Physionet Gold Standard corpus, and parts of the MIMIC-III dataset. Fully customized systems remove 97–99% of personally identifying information. Performance of off-the-shelf systems varies by dataset, with performance mostly above 90%. Providing a small labeled dataset or large unlabeled dataset allows for fine-tuning that improves performance over off-the-shelf systems. Health organizations should be aware of the levels of customization available when selecting a de-identification deployment solution, in order to choose the one that best matches their resources and target performance level.

...read moreread less

22 citations

Posted Content•

Interlocking Backpropagation: Improving depthwise model-parallelism.

[...]

Aidan N. Gomez¹, Oscar Key, Stephen Gou, Nicholas Frosst, Jeffrey Dean², Yarin Gal¹ - Show less +2 more•Institutions (2)

University of Oxford¹, Google²

08 Oct 2020-arXiv: Learning

TL;DR: This work introduces a class of intermediary strategies between local and global learning referred to as interlocking backpropagation, which preserves many of the compute-efficiency advantages of local optimisation, while recovering much of the task performance achieved by global optimisation.

...read moreread less

Abstract: The number of parameters in state of the art neural networks has drastically increased in recent years. This surge of interest in large scale neural networks has motivated the development of new distributed training strategies enabling such models. One such strategy is model-parallel distributed training. Unfortunately, model-parallelism suffers from poor resource utilisation, which leads to wasted resources. In this work, we improve upon recent developments in an idealised model-parallel optimisation setting: local learning. Motivated by poor resource utilisation, we introduce a class of intermediary strategies between local and global learning referred to as interlocking backpropagation. These strategies preserve many of the compute-efficiency advantages of local optimisation, while recovering much of the task performance achieved by global optimisation. We assess our strategies on both image classification ResNets and Transformer language models, finding that our strategy consistently out-performs local learning in terms of task performance, and out-performs global learning in training efficiency.

...read moreread less

3 citations