Open Access · Posted Content

Deep Neural Network Approximation Theory

TL;DR
Deep networks provide exponential approximation accuracy, i.e., the approximation error decays exponentially in the number of nonzero weights in the network, for the multiplication operation, polynomials, sinusoidal functions, and certain smooth functions.
Abstract
This paper develops fundamental limits of deep neural network learning by characterizing what is possible when no constraints are imposed on the learning algorithm or on the amount of training data. Concretely, we consider Kolmogorov-optimal approximation through deep neural networks, with the guiding theme being a relation between the complexity of the function (class) to be approximated and the complexity of the approximating network in terms of connectivity and memory requirements for storing the network topology and the associated quantized weights. The theory we develop establishes that deep networks are Kolmogorov-optimal approximants for markedly different function classes, such as unit balls in Besov spaces and modulation spaces. In addition, deep networks provide exponential approximation accuracy, i.e., the approximation error decays exponentially in the number of nonzero weights in the network, for the multiplication operation, polynomials, sinusoidal functions, and certain smooth functions. Moreover, this holds true even for one-dimensional oscillatory textures and the Weierstrass function, a fractal function, for neither of which methods achieving exponential approximation accuracy were previously known. We also show that, in the approximation of sufficiently smooth functions, finite-width deep networks require strictly smaller connectivity than finite-depth wide networks.
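To make the exponential-accuracy claim concrete, here is a minimal NumPy sketch in the spirit of the classical sawtooth construction (an illustration, not the paper's exact networks): a depth-m ReLU approximation of x^2 on [0, 1], from which multiplication follows via xy = ((x+y)/2)^2 - ((x-y)/2)^2. The sup-norm error drops by a factor of 4 per refinement step, while the number of nonzero weights grows only linearly in m.

```python
# Minimal NumPy sketch (not the paper's exact construction) of the classical
# sawtooth-based ReLU approximation of x^2 on [0, 1]; multiplication follows
# from the identity xy = ((x + y)/2)^2 - ((x - y)/2)^2 for x, y in [0, 1].
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def hat(x):
    # tent function on [0, 1] realized with three ReLUs
    return 2.0 * relu(x) - 4.0 * relu(x - 0.5) + 2.0 * relu(x - 1.0)

def square_approx(x, m):
    # f_m(x) = x - sum_{k=1}^m g_k(x) / 4^k, with g_k the k-fold composition of hat;
    # f_m interpolates x^2 on a dyadic grid, with sup-norm error 4^{-(m+1)}
    g, out = x.copy(), x.copy()
    for k in range(1, m + 1):
        g = hat(g)                       # one extra constant-width ReLU layer per step
        out = out - g / 4.0 ** k
    return out

def multiply_approx(x, y, m):
    # xy via two squaring networks (inputs assumed to lie in [0, 1])
    return square_approx((x + y) / 2.0, m) - square_approx(np.abs(x - y) / 2.0, m)

x = np.linspace(0.0, 1.0, 1001)
for m in (2, 4, 6, 8):
    err = np.max(np.abs(square_approx(x, m) - x ** 2))
    print(f"m = {m}: sup-error ~ {err:.2e}   (theory: {4.0 ** -(m + 1):.2e})")

a, b = np.meshgrid(x[::50], x[::50])
print(f"multiplication sup-error (m = 8): {np.max(np.abs(multiply_approx(a, b, 8) - a * b)):.2e}")
```

Each refinement step adds a fixed number of units and weights, yet halves the grid spacing, so the printed errors shrink geometrically in the network size, which is exactly the exponential-accuracy regime described in the abstract.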


Citations
Journal Article · DOI

Scientific Machine Learning Through Physics–Informed Neural Networks: Where we are and What’s Next

TL;DR: This article provides a comprehensive review of the literature on physics-informed neural networks; the primary goal of the study is to characterize these networks and their advantages and disadvantages, and to incorporate publications on the broader class of collocation-based physics-informed neural networks.
Journal Article · DOI

A Theoretical Analysis of Deep Neural Networks and Parametric PDEs

TL;DR: The existence of a small reduced basis is used to construct neural networks that approximate the parametric solution maps, with network sizes that depend essentially only on the size of the reduced basis.
Journal Article · DOI

Error bounds for approximations with deep ReLU neural networks in W^{s,p} norms

TL;DR: In this paper, the authors analyze to what extent deep ReLU neural networks can efficiently approximate Sobolev-regular functions when the approximation error is measured with respect to weaker Sobolev norms.
Journal Article · DOI

Deep ReLU networks and high-order finite element methods

TL;DR: Approximation rate bounds are established for emulations of real-valued functions on intervals by deep neural networks (DNNs), and results are given for DNNs based on the ReLU activation.
Posted Content

Nearly-tight VC-dimension and pseudodimension bounds for piecewise linear neural networks

TL;DR: The authors showed that the VC-dimension of deep neural networks with the ReLU activation function is O(W L \log(W)), where W is the number of weights and L is the number of layers.
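As an illustrative back-of-the-envelope reading of that bound (the constant hidden in the O(·) is suppressed, and the numbers below are assumptions chosen only for scale):

```latex
\mathrm{VCdim} = O\!\left(W L \log W\right), \qquad
W = 10^{6},\ L = 10 \;\Rightarrow\; W L \log W \approx 10^{7} \cdot \log\!\left(10^{6}\right) \approx 1.4 \times 10^{8}.
```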
References
Proceedings Article · DOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously; the resulting model won 1st place in the ILSVRC 2015 classification task.
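As a minimal NumPy sketch of the residual idea (illustrative only; the shapes and initialization are arbitrary choices, not the paper's architecture), a block computes y = F(x) + x, so it can default to the identity map, which is what makes substantially deeper stacks easier to train:

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def residual_block(x, W1, W2):
    # two-layer transform F(x) plus the identity shortcut: y = F(x) + x;
    # if W1 and W2 shrink toward zero, the block reduces to the identity map
    return relu(x @ W1) @ W2 + x

rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))          # a batch of 4 feature vectors of width 8
W1, W2 = 0.1 * rng.standard_normal((8, 8)), 0.1 * rng.standard_normal((8, 8))
print(residual_block(x, W1, W2).shape)   # (4, 8): same shape, so blocks stack freely
```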
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: The authors achieved state-of-the-art image-classification performance with a deep convolutional neural network consisting of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
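A minimal PyTorch sketch of an architecture matching that description (five convolutional layers, some followed by max-pooling, three fully-connected layers, and a 1000-way output); the specific widths and kernel sizes below are illustrative assumptions rather than the exact original hyperparameters, and PyTorch is assumed to be installed:

```python
import torch
import torch.nn as nn

# AlexNet-style sketch: 5 conv layers (some followed by max-pooling),
# then 3 fully-connected layers producing 1000-way class logits.
class AlexNetLike(nn.Module):
    def __init__(self, num_classes: int = 1000):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 64, kernel_size=11, stride=4, padding=2), nn.ReLU(),
            nn.MaxPool2d(kernel_size=3, stride=2),
            nn.Conv2d(64, 192, kernel_size=5, padding=2), nn.ReLU(),
            nn.MaxPool2d(kernel_size=3, stride=2),
            nn.Conv2d(192, 384, kernel_size=3, padding=1), nn.ReLU(),
            nn.Conv2d(384, 256, kernel_size=3, padding=1), nn.ReLU(),
            nn.Conv2d(256, 256, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d(kernel_size=3, stride=2),
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(256 * 6 * 6, 4096), nn.ReLU(),
            nn.Linear(4096, 4096), nn.ReLU(),
            nn.Linear(4096, num_classes),   # logits; the 1000-way softmax lives in the loss
        )

    def forward(self, x):
        return self.classifier(self.features(x))

logits = AlexNetLike()(torch.randn(1, 3, 224, 224))
print(logits.shape)  # torch.Size([1, 1000])
```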
Journal Article · DOI

Deep learning

TL;DR: Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.
Journal Article · DOI

Learning representations by back-propagating errors

TL;DR: Back-propagation repeatedly adjusts the weights of the connections in the network so as to minimize a measure of the difference between the actual output vector of the net and the desired output vector, which helps to represent important features of the task domain.
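The following is a minimal NumPy sketch of that procedure on a toy single-hidden-layer network (the data, sizes, and learning rate are arbitrary choices, not the paper's experiments): the forward pass computes the output, the backward pass propagates the error derivative through the layers via the chain rule, and the weights are repeatedly adjusted to reduce the squared difference between actual and desired outputs.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((100, 2))
t = (X[:, :1] * X[:, 1:2] > 0).astype(float)      # toy target: sign pattern of x1 * x2

W1, b1 = 0.5 * rng.standard_normal((2, 8)), np.zeros(8)
W2, b2 = 0.5 * rng.standard_normal((8, 1)), np.zeros(1)
sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

lr = 0.5
for step in range(2000):
    h = sigmoid(X @ W1 + b1)                      # forward pass
    y = sigmoid(h @ W2 + b2)
    err = y - t                                   # dE/dy for E = 0.5 * sum((y - t)^2)
    dy = err * y * (1 - y)                        # backprop through the output sigmoid
    dh = (dy @ W2.T) * h * (1 - h)                # backprop through the hidden layer
    W2 -= lr * h.T @ dy / len(X); b2 -= lr * dy.mean(0)
    W1 -= lr * X.T @ dh / len(X); b1 -= lr * dh.mean(0)

print(f"mean squared error after training: {np.mean((y - t) ** 2):.3f}")
```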
Journal Article · DOI

Multilayer feedforward networks are universal approximators

TL;DR: It is rigorously established that standard multilayer feedforward networks with as few as one hidden layer using arbitrary squashing functions are capable of approximating any Borel measurable function from one finite dimensional space to another to any desired degree of accuracy, provided sufficiently many hidden units are available.
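A constructive NumPy sketch of that statement (illustrative; the target function, grid, and steepness are arbitrary assumptions): a single hidden layer of steep sigmoidal "squashing" units approximates a continuous function on [0, 1], and the sup-norm error shrinks as more hidden units are used.

```python
import numpy as np

# Each hidden unit contributes one steep sigmoid "step"; summing the steps
# reproduces the target up to an error that decreases with the number of units.
sigmoid = lambda z: 1.0 / (1.0 + np.exp(-np.clip(z, -60.0, 60.0)))
f = lambda x: np.sin(2 * np.pi * x) + 0.3 * x      # any continuous target on [0, 1]

def shallow_approx(x, n_units):
    knots = np.linspace(0.0, 1.0, n_units + 1)
    steepness = 50.0 * n_units                     # steeper sigmoids for finer grids
    out = np.full_like(x, f(knots[0]))
    for k in range(n_units):
        out += (f(knots[k + 1]) - f(knots[k])) * sigmoid(steepness * (x - knots[k]))
    return out

x = np.linspace(0.0, 1.0, 2001)
for n in (8, 32, 128):
    print(f"{n:4d} hidden units: sup-error ~ {np.max(np.abs(shallow_approx(x, n) - f(x))):.3f}")
```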