Journal ArticleDOI

Agnostic Physics-Driven Deep Learning

Benjamin Scellier, +3 more
30 May 2022 · Vol. abs/2205.15021
TLDR
This work establishes that a physical system can perform statistical learning without gradient computations, via an Agnostic Equilibrium Propagation procedure that combines energy minimization, homeostatic control, and nudging towards the correct response.
Abstract
This work establishes that a physical system can perform statistical learning without gradient computations, via an Agnostic Equilibrium Propagation (Æqprop) procedure that combines energy minimization, homeostatic control, and nudging towards the correct response. In Æqprop, the specifics of the system do not have to be known: the procedure is based only on external manipulations, and produces a stochastic gradient descent without explicit gradient computations. Thanks to nudging, the system performs a true, order-one gradient step for each training sample, in contrast with order-zero methods like reinforcement or evolutionary strategies, which rely on trial and error. This procedure considerably widens the range of potential hardware for statistical learning to any system with enough controllable parameters, even if the details of the system are poorly known. Æqprop also establishes that in natural (bio)physical systems, genuine gradient-based statistical learning may result from generic, relatively simple mechanisms, without backpropagation and its requirement for analytic knowledge of partial derivatives.
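To make the two-phase mechanism concrete, here is a minimal numerical sketch in the spirit of equilibrium propagation: the system relaxes to an energy minimum, is nudged toward the target, and the parameter update is read off from the contrast between the two equilibria. The quadratic toy energy, the `settle` relaxation routine, and all constants are illustrative assumptions, not the paper's setup; in Æqprop itself the update arises from homeostatic control of the parameters rather than from an explicit formula for ∂E/∂θ.

```python
# Minimal sketch of a two-phase, nudging-based update (equilibrium
# propagation style). Everything here is a hypothetical toy, not the
# paper's experimental setup.
import numpy as np

rng = np.random.default_rng(0)

def energy(theta, s, x):
    # Toy energy: a linear coupling pulling the internal state s toward
    # theta @ x, plus a quadratic containment term keeping s bounded.
    return 0.5 * np.sum((s - theta @ x) ** 2) + 0.05 * np.sum(s ** 2)

def cost(s, y):
    # Squared-error cost used to nudge the state toward the target y.
    return 0.5 * np.sum((s - y) ** 2)

def settle(theta, x, y=None, beta=0.0, steps=200, lr=0.1):
    # Let the "physical system" relax: gradient flow on E + beta * C in s.
    s = np.zeros(theta.shape[0])
    for _ in range(steps):
        grad = (s - theta @ x) + 0.1 * s
        if y is not None:
            grad += beta * (s - y)
        s -= lr * grad
    return s

def dE_dtheta(theta, s, x):
    # Partial derivative of the toy energy with respect to the parameters.
    return np.outer(theta @ x - s, x)

theta = rng.normal(size=(2, 3))
x, y = rng.normal(size=3), rng.normal(size=2)
beta, eta = 0.1, 0.02

for _ in range(100):
    s_free = settle(theta, x)             # free phase: minimize E
    s_nudged = settle(theta, x, y, beta)  # nudged phase: minimize E + beta*C
    # Contrastive, order-one update: a finite-difference estimate of the
    # loss gradient, with no backpropagation through the dynamics.
    theta -= (eta / beta) * (dE_dtheta(theta, s_nudged, x) - dE_dtheta(theta, s_free, x))

print("final cost:", cost(settle(theta, x), y))
```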



Citations
Journal ArticleDOI

Beyond Backpropagation: Bilevel Optimization Through Implicit Differentiation and Equilibrium Propagation

TL;DR: This review examines gradient-based techniques for solving bilevel optimization problems, leveraging the toolbox of implicit differentiation and, applied to this setting for the first time, the equilibrium propagation theorem.
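As a concrete instance of the implicit-differentiation toolbox the review surveys, the sketch below differentiates through the argmin of a quadratic inner problem via the implicit function theorem and checks the resulting hypergradient against finite differences. The inner energy, the target, and all names are hypothetical illustrations.

```python
# Hedged sketch of implicit differentiation for bilevel optimization:
# differentiate the outer cost through the argmin of an inner problem.
import numpy as np

rng = np.random.default_rng(1)
A = rng.normal(size=(3, 3)); A = A @ A.T + 3 * np.eye(3)  # SPD inner Hessian
b = rng.normal(size=3)
target = rng.normal(size=3)

def inner_solution(lam):
    # s*(lam) = argmin_s 0.5 s^T A s - (b + lam)^T s  =>  A s = b + lam
    return np.linalg.solve(A, b + lam)

def outer_cost(lam):
    return 0.5 * np.sum((inner_solution(lam) - target) ** 2)

def hypergradient(lam):
    # Implicit function theorem: ds*/dlam = A^{-1}, so the outer gradient
    # is A^{-T} (s* - target).
    s = inner_solution(lam)
    return np.linalg.solve(A.T, s - target)

lam = np.zeros(3)
fd = np.array([  # finite-difference check of the hypergradient
    (outer_cost(lam + 1e-6 * e) - outer_cost(lam - 1e-6 * e)) / 2e-6
    for e in np.eye(3)
])
print(np.allclose(fd, hypergradient(lam), atol=1e-5))  # True
```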
Journal ArticleDOI

Frequency propagation: Multi-mechanism learning in nonlinear physical networks

TL;DR: In this article, the authors introduce frequency propagation, a learning algorithm for nonlinear physical networks in which an activation current is applied at a set of input nodes at one frequency and an error current is applied at output nodes at another frequency; each conductance is then updated in proportion to the product of the two frequency coefficients.
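The mechanism summarized above lends itself to a small signal-processing illustration: recover each frequency's coefficient from a mixed local signal by lock-in correlation, then form the product update. This is a toy demonstration of the two-frequency readout only, assuming made-up frequencies and amplitudes; it does not model the nonlinear physical network itself.

```python
# Toy lock-in readout: recover the amplitude of each frequency component
# of a local signal, then form the product update for one conductance.
import numpy as np

f_act, f_err = 5.0, 9.0       # activation and error frequencies (Hz), made up
a_act, a_err = 0.7, -0.3      # "true" local amplitudes at each frequency
T, n = 2.0, 4000              # integration window (s) and sample count
t = np.linspace(0, T, n, endpoint=False)

# Local voltage-like signal seen by one element: both tones plus noise.
v = a_act * np.sin(2 * np.pi * f_act * t) + a_err * np.sin(2 * np.pi * f_err * t)
v += 0.01 * np.random.default_rng(2).normal(size=n)

def lockin(signal, freq):
    # Correlate with a unit reference tone; an integer number of periods
    # in the window isolates that frequency's coefficient.
    ref = np.sin(2 * np.pi * freq * t)
    return 2 * np.mean(signal * ref)

c_act, c_err = lockin(v, f_act), lockin(v, f_err)
lr = 0.1
delta_g = -lr * c_act * c_err  # conductance update ~ product of coefficients
print(c_act, c_err, delta_g)   # ~0.7, ~-0.3, ~0.021
```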
References

Automatic differentiation in PyTorch

TL;DR: The automatic differentiation module of PyTorch is described: a library designed to enable rapid research on machine learning models, which differentiates purely imperative programs with an emphasis on extensibility and low overhead.
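A standard illustration of what this reference describes, assuming `torch` is installed: autograd differentiates an ordinary imperative program, control flow included, by recording exactly the operations that execute.

```python
# PyTorch autograd on imperative code with a loop and a branch.
import torch

x = torch.tensor([1.5, -2.0, 0.5], requires_grad=True)

# Autograd records the tape as this ordinary Python code runs.
y = x
for _ in range(3):
    y = torch.tanh(y @ torch.ones(3, 3) / 3)
loss = (y ** 2).sum() if y.sum() > 0 else (y ** 2).mean()

loss.backward()  # reverse-mode differentiation of exactly what executed
print(x.grad)
```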

Gradient-based learning applied to document recognition

TL;DR: This paper reviews various methods applied to handwritten character recognition and compares them on a standard handwritten digit recognition task; convolutional neural networks are shown to outperform all other techniques.
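For concreteness, here is a minimal LeNet-style convolutional classifier of the kind the reference studies, sketched with illustrative layer sizes rather than the paper's exact architecture.

```python
# Minimal convolutional classifier: conv + pooling stages feeding a
# small fully connected head. Sizes are illustrative.
import torch
import torch.nn as nn

model = nn.Sequential(
    nn.Conv2d(1, 6, kernel_size=5, padding=2), nn.ReLU(), nn.MaxPool2d(2),
    nn.Conv2d(6, 16, kernel_size=5), nn.ReLU(), nn.MaxPool2d(2),
    nn.Flatten(),
    nn.Linear(16 * 5 * 5, 120), nn.ReLU(),
    nn.Linear(120, 10),  # 10 digit classes
)

logits = model(torch.randn(8, 1, 28, 28))  # a batch of 28x28 grayscale digits
print(logits.shape)                        # torch.Size([8, 10])
```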
Journal ArticleDOI

Memristor-The missing circuit element

TL;DR: In this article, the memristor is introduced as the fourth basic circuit element, defined by a relationship between charge and flux-linkage, and an electromagnetic-field interpretation of this relationship in terms of a quasi-static expansion of Maxwell's equations is presented.
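The defining relation can be sketched numerically: because flux linkage is a function of charge, the element acts as a charge-dependent resistance, v = M(q)·i. The memristance function below is a made-up example, chosen only to exhibit the characteristic pinched hysteresis.

```python
# Hedged simulation of a charge-controlled memristor: v = M(q) * i,
# with q the time integral of the drive current.
import numpy as np

def M(q):
    # Hypothetical charge-controlled memristance (ohms).
    return 100.0 + 50.0 * np.tanh(q)

dt, q = 1e-3, 0.0
t = np.arange(0, 1.0, dt)
i = 0.1 * np.sin(2 * np.pi * 2 * t)  # sinusoidal drive current (A)

v = np.empty_like(t)
for k, ik in enumerate(i):
    v[k] = M(q) * ik                 # instantaneous Ohm-like law
    q += ik * dt                     # charge accumulates with current

# The (i, v) trajectory traces the pinched hysteresis loop of a
# memristor: v = 0 whenever i = 0, but the slope depends on history.
print(v[:3])
```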
Posted Content

Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms

TL;DR: Fashion-MNIST is intended to serve as a direct drop-in replacement for the original MNIST dataset for benchmarking machine learning algorithms, as it shares the same image size, data format, and train/test split structure.
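The drop-in property is visible in code: with torchvision, switching from MNIST to Fashion-MNIST changes only the dataset class name. The `"data"` root path below is an arbitrary choice.

```python
# Fashion-MNIST as a drop-in for MNIST via torchvision: same 28x28 images,
# same format, same 60k/10k train/test split.
from torchvision import datasets, transforms

to_tensor = transforms.ToTensor()
train = datasets.FashionMNIST("data", train=True, download=True, transform=to_tensor)
test = datasets.FashionMNIST("data", train=False, download=True, transform=to_tensor)

print(len(train), len(test), train[0][0].shape)  # 60000 10000 torch.Size([1, 28, 28])
```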
Book ChapterDOI

Large-Scale Machine Learning with Stochastic Gradient Descent

Léon Bottou
TL;DR: A more precise analysis uncovers qualitatively different tradeoffs for small-scale and large-scale learning problems.
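For reference, a bare-bones stochastic gradient descent loop of the kind this analysis concerns, on an illustrative linear-regression objective: one randomly sampled example per update, with a decaying step size. All constants are made up.

```python
# Plain SGD on least-squares regression: one random example per step.
import numpy as np

rng = np.random.default_rng(3)
X = rng.normal(size=(1000, 5))
w_true = rng.normal(size=5)
y = X @ w_true + 0.01 * rng.normal(size=1000)

w, lr = np.zeros(5), 0.05
for step in range(5000):
    k = rng.integers(len(X))            # sample one example at random
    grad = (X[k] @ w - y[k]) * X[k]     # gradient of 0.5 * (x @ w - y)^2
    w -= lr / (1 + 0.01 * step) * grad  # decaying step size
print(np.linalg.norm(w - w_true))       # small residual
```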