PyTorch: An Imperative Style, High-Performance Deep Learning Library

Open AccessProceedings Article

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, +20 more

- Vol. 32, pp 8026-8037

Chats0

TLDR

This paper details the principles that drove the implementation of PyTorch and how they are reflected in its architecture, and explains how the careful and pragmatic implementation of the key components of its runtime enables them to work together to achieve compelling performance.

Abstract:

Deep learning frameworks have often focused on either usability or speed, but not both. PyTorch is a machine learning library that shows that these two goals are in fact compatible: it was designed from first principles to support an imperative and Pythonic programming style that supports code as a model, makes debugging easy and is consistent with other popular scientific computing libraries, while remaining efficient and supporting hardware accelerators such as GPUs. In this paper, we detail the principles that drove the implementation of PyTorch and how they are reflected in its architecture. We emphasize that every aspect of PyTorch is a regular Python program under the full control of its user. We also explain how the careful and pragmatic implementation of the key components of its runtime enables them to work together to achieve compelling performance. We demonstrate the efficiency of individual subsystems, as well as the overall speed of PyTorch on several commonly used benchmarks.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Array programming with NumPy

Charles R. Harris, +28 more

- 16 Sep 2020 -

Nature

TL;DR: In this paper, the authors review how a few fundamental array concepts lead to a simple and powerful programming paradigm for organizing, exploring and analysing scientific data, and their evolution into a flexible interoperability layer between increasingly specialized computational libraries is discussed.

...read moreread less

Journal ArticleDOI

Array Programming with NumPy

Charles R. Harris, +28 more

- 18 Jun 2020 -

arXiv: Mathematical Software

TL;DR: How a few fundamental array concepts lead to a simple and powerful programming paradigm for organizing, exploring and analysing scientific data is reviewed.

...read moreread less

Posted Content

End-to-End Object Detection with Transformers

Nicolas Carion, +5 more

- 26 May 2020 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This work presents a new method that views object detection as a direct set prediction problem, and demonstrates accuracy and run-time performance on par with the well-established and highly-optimized Faster RCNN baseline on the challenging COCO object detection dataset.

...read moreread less

Journal ArticleDOI

nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation

Fabian Isensee, +7 more

- 01 Feb 2021 -

Nature Methods

TL;DR: nnU-Net as mentioned in this paper is a deep learning-based segmentation method that automatically configures itself, including preprocessing, network architecture, training and post-processing for any new task.

...read moreread less

Book ChapterDOI

End-to-End Object Detection with Transformers

Nicolas Carion, +5 more

TL;DR: DetR as mentioned in this paper proposes a set-based global loss that forces unique predictions via bipartite matching, and a transformer encoder-decoder architecture to directly output the final set of predictions in parallel.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Automatic differentiation in PyTorch

Adam Paszke, +9 more

TL;DR: An automatic differentiation module of PyTorch is described — a library designed to enable rapid research on machine learning models that focuses on differentiation of purely imperative programs, with a focus on extensibility and low overhead.

...read moreread less

Posted Content

Caffe: Convolutional Architecture for Fast Feature Embedding

Yangqing Jia, +7 more

- 20 Jun 2014 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: Caffe as discussed by the authors is a BSD-licensed C++ library with Python and MATLAB bindings for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures.

...read moreread less

SciPy: Open Source Scientific Tools for Python

Eric Jones, +2 more

Proceedings ArticleDOI

Data Structures for Statistical Computing in Python

Wes McKinney

TL;DR: P pandas is a new library which aims to facilitate working with data sets common to finance, statistics, and other related fields and to provide a set of fundamental building blocks for implementing statistical models.

...read moreread less

Posted Content

Theano: A Python framework for fast computation of mathematical expressions

Rami Al-Rfou, +111 more

- 09 May 2016 -

arXiv: Symbolic Computation

TL;DR: The performance of Theano is compared against Torch7 and TensorFlow on several machine learning models and recently-introduced functionalities and improvements are discussed.

...read moreread less

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Citations

Array programming with NumPy

Array Programming with NumPy

End-to-End Object Detection with Transformers

nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation

End-to-End Object Detection with Transformers

References

Automatic differentiation in PyTorch

Caffe: Convolutional Architecture for Fast Feature Embedding

SciPy: Open Source Scientific Tools for Python

Data Structures for Statistical Computing in Python

Theano: A Python framework for fast computation of mathematical expressions

Related Papers (5)

Deep Residual Learning for Image Recognition

Adam: A Method for Stochastic Optimization

ImageNet: A large-scale hierarchical image database

Attention is All you Need

ImageNet Classification with Deep Convolutional Neural Networks

Trending Questions (1)