Topic

Pipeline (computing)

About: Pipeline (computing) is a research topic. Over its lifetime, 26,760 publications have been published within this topic, receiving 204,305 citations. The topic is also known as: data pipeline and computational pipeline.


Papers
Proceedings Article
01 Jan 2019
TL;DR: GPipe as mentioned in this paper is a pipeline parallelism library that allows scaling any network that can be expressed as a sequence of layers by pipelining different sub-sequences of layers on separate accelerators.
Abstract: Scaling up deep neural network capacity has been known as an effective approach to improving model quality for several different machine learning tasks. In many cases, increasing model capacity beyond the memory limit of a single accelerator has required developing special algorithms or infrastructure. These solutions are often architecture-specific and do not transfer to other machine learning tasks. To address the need for efficient and task-independent model parallelism, we introduce GPipe, a pipeline parallelism library that allows scaling any network that can be expressed as a sequence of layers. By pipelining different sub-sequences of layers on separate accelerators, GPipe provides the flexibility of scaling a variety of different networks to gigantic sizes efficiently. Moreover, GPipe utilizes a novel batch-splitting pipelining algorithm, resulting in almost linear speedup when a model is partitioned across multiple accelerators. We demonstrate the advantages of GPipe by training large-scale neural networks on two different tasks with distinct network architectures: (i) Image Classification: we train a 557-million-parameter AmoebaNet model and attain a top-1 accuracy of 84.4% on ImageNet-2012; (ii) Multilingual Neural Machine Translation: we train a single 6-billion-parameter, 128-layer Transformer model on a corpus spanning over 100 languages and achieve better quality than all bilingual models.

486 citations
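The near-linear speedup claimed above comes from the batch-splitting schedule: a mini-batch is split into M micro-batches, and at tick t stage k can work on micro-batch t - k, so S stages finish in S + M - 1 ticks rather than S * M. Below is a minimal sketch of that schedule, assuming hypothetical stage and micro-batch counts; it illustrates the scheduling idea only, not GPipe's actual implementation.

```python
# Toy model of batch-splitting pipeline parallelism: with S stages and
# M micro-batches, stage k processes micro-batch (t - k) at tick t, so
# the whole mini-batch drains in S + M - 1 ticks instead of S * M.
# S and M are hypothetical, chosen only for illustration.

S, M = 4, 8  # hypothetical: 4 accelerator stages, 8 micro-batches

for t in range(S + M - 1):
    active = [f"stage{k} <- mb{t - k}" for k in range(S) if 0 <= t - k < M]
    print(f"tick {t:2d}: " + "   ".join(active))

useful, capacity = S * M, S * (S + M - 1)
print(f"pipelined ticks: {S + M - 1}, sequential ticks: {S * M}, "
      f"utilization: {useful / capacity:.0%}")
```

The "bubble" (idle stage-ticks at pipeline fill and drain) shrinks relative to useful work as M grows, which is why splitting the batch more finely pushes the speedup toward linear.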

Proceedings ArticleDOI
16 Apr 1998
TL;DR: This paper introduces a hardware mechanism called pipeline gating to control rampant speculation in the pipeline, and presents inexpensive mechanisms for determining when a branch is likely to mispredict, and for stopping wrong-path instructions from entering the pipeline.
Abstract: Branch prediction has enabled microprocessors to increase instruction level parallelism (ILP) by allowing programs to speculatively execute beyond control boundaries. Although speculative execution is essential for increasing instructions per cycle (IPC), it comes at a cost: a large amount of unnecessary work results from wrong-path instructions entering the pipeline due to branch misprediction. Results generated with the SimpleScalar tool set, using a 4-way issue pipeline and various branch predictors, show an instruction overhead of 16% to 105% for every instruction committed. This overhead will increase in the future as processors use more aggressive speculation and wider issue widths. In this paper we present an innovative method for power reduction which, unlike previous work that sacrificed flexibility or performance, reduces power in high-performance microprocessors without impacting performance. In particular, we introduce a hardware mechanism called pipeline gating to control rampant speculation in the pipeline. We present inexpensive mechanisms for determining when a branch is likely to mispredict, and for stopping wrong-path instructions from entering the pipeline. Results show up to a 38% reduction in wrong-path instructions with a negligible performance loss (≈1%). Best of all, even in programs with high branch prediction accuracy, performance does not noticeably degrade. Our analysis indicates that there is little risk in implementing this method in existing processors, since it does not impact performance and can benefit energy reduction.

471 citations
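The gating policy the paper describes is compact: track how many unresolved low-confidence branches are in flight, and stall fetch whenever that count reaches a threshold, so probable wrong-path instructions never enter the pipeline. The toy simulation below illustrates that policy; the branch frequency, confidence rate, resolution timing, and threshold are all hypothetical stand-ins, not the paper's measured hardware parameters.

```python
# Toy simulation of pipeline gating: fetch is stalled while the number of
# unresolved low-confidence branches in flight is at or above a threshold.
# All probabilities and timings below are hypothetical illustrations.
import random

GATE_THRESHOLD = 2      # max low-confidence branches tolerated in flight
random.seed(0)

low_conf_in_flight = 0
fetch_cycles = gated_cycles = 0

for cycle in range(1000):
    # Hypothetical timing: one outstanding branch resolves every 3rd cycle.
    if low_conf_in_flight and cycle % 3 == 0:
        low_conf_in_flight -= 1

    if low_conf_in_flight >= GATE_THRESHOLD:
        gated_cycles += 1   # gate: stop fetching down a doubtful path
        continue

    fetch_cycles += 1
    is_branch = random.random() < 0.2            # ~1 in 5 instructions
    if is_branch and random.random() < 0.3:      # ~30% flagged low-confidence
        low_conf_in_flight += 1

print(f"fetched on {fetch_cycles} cycles, gated on {gated_cycles} cycles "
      f"({100 * gated_cycles / 1000:.1f}% gated)")
```

Because the machine only stalls when several doubtful branches stack up, well-predicted code is almost never gated, which matches the paper's observation that performance barely degrades even at high prediction accuracy.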

Patent
09 Jun 2003
TL;DR: In this article, a deferred shading graphics pipeline processor and method are provided encompassing numerous substructures, including one or more of deferred shading, a tiled frame buffer, and multiple-stage hidden surface removal processing.
Abstract: A deferred shading graphics pipeline processor and method are provided encompassing numerous substructures. Embodiments of the processor and method may include one or more of deferred shading, a tiled frame buffer, and multiple-stage hidden surface removal processing. In the deferred shading graphics pipeline, hidden surface removal is completed before pixel coloring is done. The pipeline processor comprises a command fetch and decode unit, a geometry unit, a mode extraction unit, a sort unit, a setup unit, a cull unit, a mode injection unit, a fragment unit, a texture unit, a Phong lighting unit, a pixel unit, and a backend unit.

468 citations
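The ordering that defines this pipeline (hidden surface removal runs to completion before any pixel coloring) can be sketched briefly: resolve depth per pixel first, then invoke the expensive shading function once per visible fragment. The fragment record and stand-in shading function below are hypothetical simplifications, not the patent's actual units.

```python
# Sketch of deferred shading: pass 1 finishes hidden surface removal,
# pass 2 shades only the surviving (visible) fragment of each pixel.
# Fragment layout and the shade() stand-in are hypothetical.
from typing import Dict, NamedTuple, Tuple

class Fragment(NamedTuple):
    pixel: Tuple[int, int]              # (x, y) position in the frame buffer
    depth: float                        # smaller = closer to the camera
    color: Tuple[float, float, float]

def shade(frag: Fragment) -> Tuple[float, float, float]:
    # Stand-in for the expensive per-fragment work (Phong lighting, texturing).
    return tuple(0.8 * c for c in frag.color)

def deferred_render(fragments):
    # Pass 1: hidden surface removal, keep only the nearest fragment per pixel.
    nearest: Dict[Tuple[int, int], Fragment] = {}
    for f in fragments:
        if f.pixel not in nearest or f.depth < nearest[f.pixel].depth:
            nearest[f.pixel] = f
    # Pass 2: one shade() call per pixel, regardless of depth complexity.
    return {px: shade(f) for px, f in nearest.items()}

frags = [Fragment((0, 0), 0.9, (1.0, 0.0, 0.0)),   # occluded at (0, 0)
         Fragment((0, 0), 0.2, (0.0, 1.0, 0.0)),   # visible at (0, 0)
         Fragment((1, 0), 0.5, (0.0, 0.0, 1.0))]
print(deferred_render(frags))
```

The payoff is that shading cost scales with visible pixels rather than with scene depth complexity, which is the motivation for completing hidden surface removal before coloring.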

Book ChapterDOI
28 Nov 2012
TL;DR: This paper proposes a new class of machine learning algorithms whose predictions can be expressed as polynomials of bounded degree, along with confidential algorithms for binary classification based on polynomial approximations to least-squares solutions obtained by a small number of gradient descent steps.
Abstract: We demonstrate that, by using a recently proposed leveled homomorphic encryption scheme, it is possible to delegate the execution of a machine learning algorithm to a computing service while retaining confidentiality of the training and test data. Since the computational complexity of the homomorphic encryption scheme depends primarily on the number of levels of multiplications to be carried out on the encrypted data, we define a new class of machine learning algorithms in which the algorithm's predictions, viewed as functions of the input data, can be expressed as polynomials of bounded degree. We propose confidential algorithms for binary classification based on polynomial approximations to least-squares solutions obtained by a small number of gradient descent steps. We present experimental validation of the confidential machine learning pipeline and discuss the trade-offs regarding computational complexity, prediction accuracy and cryptographic security.

440 citations
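The reason the paper restricts predictions to bounded-degree polynomials is that a leveled homomorphic scheme supports only a fixed number of nested multiplications, so training must avoid divisions and run for a small, fixed number of steps. Below is a plaintext sketch of that idea, assuming hypothetical data, step size, and step count; no encryption is performed, it only shows that each gradient step uses additions and multiplications alone.

```python
# Plaintext sketch of division-free training for a confidential classifier:
# a fixed number of gradient-descent steps for linear least squares, using
# only + and *, so the final prediction <w, x> is a polynomial of bounded
# degree in the inputs. Data, eta, and STEPS are hypothetical.
X = [[1.0, 2.0], [1.0, -1.0], [1.0, 0.5], [1.0, -2.0]]   # rows with a bias term
y = [1.0, -1.0, 1.0, -1.0]                               # labels in {-1, +1}
eta, STEPS = 0.05, 3    # fixed, public step size and step count

w = [0.0, 0.0]
for _ in range(STEPS):
    # grad = X^T (X w - y): built from + and * only, hence encryptable
    residual = [sum(xi * wi for xi, wi in zip(row, w)) - yi
                for row, yi in zip(X, y)]
    grad = [sum(r * row[j] for r, row in zip(residual, X))
            for j in range(len(w))]
    w = [wi - eta * g for wi, g in zip(w, grad)]

# Each step multiplies data values into w a bounded number of times, so the
# score below has degree bounded by the (public) step count.
score = lambda x: sum(wi * xi for wi, xi in zip(w, x))
print([1 if score(x) > 0 else -1 for x in X])   # -> [1, -1, 1, -1]
```

Keeping the step count small keeps the multiplicative depth, and hence the cost of homomorphic evaluation, low; this is the trade-off between computational complexity and prediction accuracy that the abstract highlights.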

Journal ArticleDOI
Tracy Camp

429 citations


Network Information
Related Topics (5)
Cache: 59.1K papers, 976.6K citations, 86% related
Scalability: 50.9K papers, 931.6K citations, 85% related
Server: 79.5K papers, 1.4M citations, 82% related
Electronic circuit: 114.2K papers, 971.5K citations, 82% related
CMOS: 81.3K papers, 1.1M citations, 81% related
Performance Metrics
No. of papers in the topic in previous years:

Year    Papers
2022    18
2021    1,066
2020    1,556
2019    1,793
2018    1,754
2017    1,548