Non-parametric Approximate Dynamic Programming via the Kernel Method

Open Access · Proceedings Article

Vol. 25, pp. 386-394

Abstract:

This paper presents a novel non-parametric approximate dynamic programming (ADP) algorithm that enjoys graceful approximation and sample complexity guarantees. In particular, we establish both theoretically and computationally that our proposal can serve as a viable alternative to state-of-the-art parametric ADP algorithms, freeing the designer from carefully specifying an approximation architecture. We accomplish this by developing a kernel-based mathematical program for ADP. Via a computational study on a controlled queueing network, we show that our procedure is competitive with parametric ADP approaches.
