Open Access · Posted Content
Quantitative Weak Convergence for Discrete Stochastic Processes
TLDR
This work shows that the iterates of these stochastic processes converge to an invariant distribution at a rate of $\tilde{O}(1/\sqrt{k})$, where $k$ is the number of steps; this rate is provably tight up to log factors.
Abstract:
In this paper, we establish quantitative convergence in $W_2$ for a family of Langevin-like stochastic processes that includes stochastic gradient descent and related gradient-based algorithms. Under certain regularity assumptions, we show that the iterates of these stochastic processes converge to an invariant distribution at a rate of $\tilde{O}(1/\sqrt{k})$, where $k$ is the number of steps; this rate is provably tight up to log factors. Our result reduces to a quantitative form of the classical Central Limit Theorem in the special case when the potential is quadratic.
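The quadratic-potential special case mentioned in the abstract can be illustrated numerically. The sketch below (an illustration, not the paper's proof or method; all parameter values and the noise model are assumptions) runs many parallel SGD chains on $f(x) = x^2/2$ with additive Gaussian gradient noise; for this linear recursion the invariant law is Gaussian with an explicitly computable variance, matching the remark that the result reduces to a quantitative CLT:

```python
import numpy as np

# Illustrative SGD on the quadratic potential f(x) = x^2 / 2 with additive
# Gaussian gradient noise. Step size, noise scale, and chain counts are
# assumptions chosen for the demonstration, not values from the paper.
rng = np.random.default_rng(0)
eta = 0.1          # step size
sigma = 1.0        # gradient-noise scale
n_chains = 20000   # parallel chains, to estimate the law of the iterate x_k
n_steps = 500

x = np.zeros(n_chains)
for _ in range(n_steps):
    noisy_grad = x + sigma * rng.standard_normal(n_chains)  # grad f(x) + noise
    x = x - eta * noisy_grad

# The recursion x' = (1 - eta) x - eta * xi is linear with Gaussian noise,
# so its invariant law is Gaussian with variance eta * sigma^2 / (2 - eta).
var_theory = eta * sigma**2 / (2 - eta)
print(x.var(), var_theory)
```

After a few hundred steps the empirical variance across chains matches the Gaussian invariant law's variance closely, which is the discrete analogue of convergence to the invariant distribution.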
Citations
Posted Content
Where is the Information in a Deep Neural Network
TL;DR: A novel notion of effective information in the activations of a deep network is established, which is used to show that models with low (information) complexity not only generalize better, but are bound to learn invariant representations of future inputs.
Posted Content
Quantitative $W_1$ Convergence of Langevin-Like Stochastic Processes with Non-Convex Potential State-Dependent Noise
TL;DR: In this article, the authors prove quantitative convergence rates at which discrete Langevin-like processes converge to the invariant distribution of a related stochastic differential equation and apply their theoretical findings to studying the convergence of Stochastic Gradient Descent (SGD) for non-convex problems.
Posted Content
Analytic expressions for the output evolution of a deep neural network
TL;DR: A novel methodology based on a Taylor expansion of the network output for obtaining analytical expressions for the expected value of the network weights and output under stochastic training is presented.
References
Journal ArticleDOI
Acceleration of stochastic approximation by averaging
Boris T. Polyak, Anatoli Juditsky
TL;DR: Convergence with probability one is proved for a variety of classical optimization and identification problems and it is demonstrated for these problems that the proposed algorithm achieves the highest possible rate of convergence.
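The averaging technique from this reference can be sketched in a few lines. Below is a minimal illustration (the test problem, step-size exponent, and constants are assumptions, not from the paper): run SGD on a noisy quadratic with a slowly decaying step size, and track the running average of the iterates, which is the quantity Polyak-Ruppert averaging reports:

```python
import numpy as np

# Minimal sketch of iterate averaging: plain SGD on a noisy 1-D quadratic
# (x - target)^2 / 2, with step size k^{-0.6} (any exponent in (1/2, 1)
# works for the classical theory). The averaged iterate, not the last
# iterate, is the estimator.
rng = np.random.default_rng(1)
target = 3.0
x = 0.0
avg = 0.0
n_steps = 20000

for k in range(1, n_steps + 1):
    grad = (x - target) + rng.standard_normal()  # noisy gradient
    x -= grad / k**0.6                           # slowly decaying step size
    avg += (x - avg) / k                         # running average of iterates

print(avg)
```

The averaged iterate concentrates around `target` much more tightly than the raw iterate, which keeps fluctuating at the scale of the current step size.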
Journal ArticleDOI
Generalization of an Inequality by Talagrand and Links with the Logarithmic Sobolev Inequality
Felix Otto, Cédric Villani
TL;DR: In this paper, it was shown that transport inequalities, similar to the one derived by M. Talagrand (1996, Geom. Funct. Anal. 6, 587-600) for the Gaussian measure, are implied by logarithmic Sobolev inequalities.
Journal ArticleDOI
Stochastic Gradient Descent as Approximate Bayesian Inference
TL;DR: It is demonstrated that constant SGD gives rise to a new variational EM algorithm that optimizes hyperparameters in complex probabilistic models and a scalable approximate MCMC algorithm, the Averaged Stochastic Gradient Sampler is proposed.
Posted Content
Theoretical guarantees for approximate sampling from smooth and log-concave densities
TL;DR: This work establishes non-asymptotic bounds on the error of approximating the target distribution by the distribution obtained with the Langevin Monte Carlo method and its variants, and illustrates the effectiveness of the established guarantees.
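The Langevin Monte Carlo method from this reference is a one-line update, sketched below (an illustration under assumed parameters; the target here is the standard Gaussian, i.e. $f(x) = x^2/2$, so the quality of the approximation can be checked against a known answer):

```python
import numpy as np

# Unadjusted Langevin Monte Carlo for sampling from exp(-f(x)) with
# f(x) = x^2 / 2 (standard Gaussian target). Update rule:
#   x_{k+1} = x_k - eta * grad f(x_k) + sqrt(2 * eta) * xi_k
# Step size and chain counts are illustrative assumptions.
rng = np.random.default_rng(2)
eta = 0.01
n_chains, n_steps = 50000, 2000

x = np.zeros(n_chains)
for _ in range(n_steps):
    x = x - eta * x + np.sqrt(2 * eta) * rng.standard_normal(n_chains)

# The discretization's invariant variance is 1 / (1 - eta / 2), an
# O(eta) bias relative to the target variance 1 — the kind of
# discretization error the non-asymptotic bounds quantify.
print(x.var())
```

Shrinking `eta` shrinks the bias of the invariant law at the cost of slower mixing, which is the trade-off the quantitative guarantees make precise.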