Journal Article

Algorithm 799: revolve: an implementation of checkpointing for the reverse or adjoint mode of computational differentiation

TL;DR: This article presents the function revolve, which generates checkpointing schedules that are provably optimal with regard to a primary and a secondary criterion and is intended to be used as an explicit “controller” for running a time-dependent applications program.
Abstract
In its basic form, the reverse mode of computational differentiation yields the gradient of a scalar-valued function at a cost that is a small multiple of the computational work needed to evaluate the function itself. However, the corresponding memory requirement is proportional to the run-time of the evaluation program. Therefore, the practical applicability of the reverse mode in its original formulation is limited despite the availability of ever larger memory systems. This observation leads to the development of checkpointing schedules to reduce the storage requirements. This article presents the function revolve, which generates checkpointing schedules that are provably optimal with regard to a primary and a secondary criterion. This routine is intended to be used as an explicit “controller” for running a time-dependent applications program.
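The memory trade-off described in the abstract can be illustrated with a minimal sketch. It is not the revolve schedule: it uses plain equidistant checkpoints, and the toy model `step`, its adjoint `step_adjoint`, and the `interval` parameter are all assumptions made for illustration. The store-all variant keeps every intermediate state, so its memory grows with the number of time steps; the checkpointed variant keeps only every `interval`-th state and recomputes each segment during the reverse sweep.

```python
import math

def step(x, dt=0.1):
    """One forward step of a hypothetical toy model: x_{k+1} = x_k + dt*sin(x_k)."""
    return x + dt * math.sin(x)

def step_adjoint(x, xbar, dt=0.1):
    """Adjoint of `step`: pull xbar back through the step taken at state x."""
    return xbar * (1.0 + dt * math.cos(x))

def gradient_store_all(x0, n_steps):
    """Basic reverse mode: store every state (memory proportional to n_steps)."""
    states = [x0]
    for _ in range(n_steps):
        states.append(step(states[-1]))
    xbar = 1.0  # seed: d(x_n)/d(x_n)
    for k in reversed(range(n_steps)):
        xbar = step_adjoint(states[k], xbar)
    return xbar

def gradient_checkpointed(x0, n_steps, interval):
    """Equidistant checkpointing (NOT revolve's optimal schedule).

    Returns the gradient d(x_n)/d(x_0) and the peak number of stored states.
    """
    checkpoints = {}
    x = x0
    for k in range(n_steps):
        if k % interval == 0:
            checkpoints[k] = x
        x = step(x)
    xbar = 1.0
    peak = len(checkpoints)
    for start in sorted(checkpoints, reverse=True):
        end = min(start + interval, n_steps)
        # Recompute the states x_start .. x_{end-1} from the checkpoint.
        segment = [checkpoints[start]]
        for _ in range(start, end - 1):
            segment.append(step(segment[-1]))
        peak = max(peak, len(checkpoints) + len(segment))
        # Reverse sweep over this segment only.
        for xk in reversed(segment):
            xbar = step_adjoint(xk, xbar)
    return xbar, peak

g_full = gradient_store_all(0.5, 100)
g_ckpt, peak = gradient_checkpointed(0.5, 100, 10)
print(g_full, g_ckpt, peak)
```

Both variants compute the same gradient. In this sketch the checkpointed one holds about 20 states at its peak instead of 101, at the price of roughly one extra forward sweep. A schedule generator such as revolve chooses the checkpoint positions so that the recomputation is minimal for a given number of checkpoints.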


Citations
Book Chapter

Semantic Segmentation Using Neural Ordinary Differential Equations

TL;DR: In this paper, the authors propose a neural ODE network for semantic segmentation that achieves state-of-the-art results using 57% less memory for training, 42% less memory for testing, and 68% fewer parameters.
Posted Content

Efficient Differentiable Simulation of Articulated Bodies

TL;DR: In this article, the authors present a method for efficient differentiable simulation of articulated body dynamics using spatial algebra and the adjoint method, which is an order of magnitude faster than autodiff tools.

Fine-Grained Code-Comment Semantic Interaction Analysis

TL;DR: Fosterer is proposed to build fine-grained semantic interactions between code statements and comment tokens. Mimicking the comprehension practice of developers, it treats code statements as image patches and comments as text, and uses contrastive learning to match the semantically related parts of the visual and textual information.
Journal Article

A Survey on Auto-Parallelism of Large-Scale Deep Learning Training

TL;DR: In this article, the authors survey the challenges, foundations, and strategy-search methods of auto-parallelism in DL training and discuss several trends for further research.
References
Book

Numerical methods for conservation laws

TL;DR: In this book, the authors describe the derivation of conservation laws and apply them to linear systems, including the linear advection equation, the Euler equation, and the Riemann problem.
Book

Optimal Control of Systems Governed by Partial Differential Equations

TL;DR: In this book, the authors consider the problem of minimizing the sum of a differentiable and a non-differentiable function in the context of a system governed by a Dirichlet problem.
Book

Evaluating Derivatives: Principles and Techniques of Algorithmic Differentiation

TL;DR: This second edition has been updated and expanded to cover recent developments in applications and theory, including an elegant NP-completeness argument by Uwe Naumann and a brief introduction to scarcity, a generalization of sparsity.
Journal Article

Upwind difference schemes for hyperbolic systems of conservation laws

TL;DR: In this article, a new upwind finite difference approximation to systems of nonlinear hyperbolic conservation laws is derived. The scheme has desirable properties for shock calculations, such as unique and sharp shocks.
Journal Article

Achieving logarithmic growth of temporal and spatial complexity in reverse automatic differentiation

TL;DR: It is shown here that, by a recursive scheme related to the multilevel differentiation approach of Volin and Ostrovskii, the growth in both temporal and spatial complexity can be limited to a fixed multiple of log(T).
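The logarithmic growth mentioned here can be made concrete with a short sketch. It assumes the binomial bound commonly quoted in the checkpointing literature: with `snaps` checkpoints and at most `reps` forward evaluations of any single step, up to C(snaps + reps, snaps) time steps can be reversed. The function names `max_steps` and `min_reps` are hypothetical and are not part of revolve's interface.

```python
from math import comb

def max_steps(snaps, reps):
    """Assumed binomial bound: steps reversible with `snaps` checkpoints
    and at most `reps` forward evaluations of any single step."""
    return comb(snaps + reps, snaps)

def min_reps(n_steps, snaps):
    """Smallest repetition count whose bound covers `n_steps` time steps."""
    reps = 0
    while max_steps(snaps, reps) < n_steps:
        reps += 1
    return reps

print(max_steps(10, 10))   # steps covered by 10 checkpoints and 10 repetitions
print(min_reps(1000, 10))  # repetitions needed for 1000 steps with 10 checkpoints
print(min_reps(1000, 3))   # the same with only 3 checkpoints
```

Because the bound grows exponentially when `snaps` and `reps` increase together, both the number of checkpoints and the recomputation factor need only grow like log(T) to cover T time steps.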