Journal ArticleDOI

Lipschitz Continuity of Value Functions in Markovian Decision Processes

TLDR
Tools and guidelines for investigating Lipschitz continuity of the value functions in MDP’s are presented, using the Hausdorff metric and the Kantorovich metric for measuring the influence of the constraint set and the transition law, respectively.
Abstract
We present tools and guidelines for investigating Lipschitz continuity of the value functions in MDP’s, using the Hausdorff metric and the Kantorovich metric for measuring the influence of the constraint set and the transition law, respectively. The methods are explained by examples. Additional topics include an application to the discretization algorithm of Bertsekas (1975).
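The two metrics named in the abstract can be made concrete for finite objects. The sketch below (not from the paper; function names and the one-dimensional restriction are my own illustrative choices) computes the Hausdorff distance between two finite constraint sets on the real line and the Kantorovich (Wasserstein-1) distance between two finitely supported transition distributions, the latter via the classical identity W1 = ∫|F − G| for CDFs F, G on R:

```python
def hausdorff(A, B):
    """Hausdorff distance between two finite point sets on the real line:
    the largest distance from a point in one set to the nearest point of the other."""
    nearest = lambda a, S: min(abs(a - s) for s in S)
    return max(max(nearest(a, B) for a in A),
               max(nearest(b, A) for b in B))

def kantorovich_1d(xs, ps, ys, qs):
    """Kantorovich (W1) distance between two discrete distributions on R,
    given as support points xs, ys with probability weights ps, qs.
    Uses the 1-D identity W1(P, Q) = integral of |F_P - F_Q|."""
    pts = sorted(set(xs) | set(ys))
    px, qy = dict(zip(xs, ps)), dict(zip(ys, qs))
    F = G = total = 0.0
    for left, right in zip(pts, pts[1:]):
        F += px.get(left, 0.0)   # CDF of P just right of `left`
        G += qy.get(left, 0.0)   # CDF of Q just right of `left`
        total += abs(F - G) * (right - left)
    return total
```

For example, `hausdorff([0, 1], [0, 3])` is 2 (the point 3 is distance 2 from its nearest neighbour in the first set), and `kantorovich_1d([0, 1], [0.5, 0.5], [0, 2], [0.5, 0.5])` is 0.5, the cost of moving mass 0.5 from 1 to 2. These are exactly the kinds of perturbations of the constraint set and transition law whose effect on the value function the paper bounds.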


Citations
Posted Content

Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees

TL;DR: A novel algorithmic framework for designing and analyzing model-based RL algorithms with theoretical guarantees is introduced and a meta-algorithm with a theoretical guarantee of monotone improvement to a local maximum of the expected reward is designed.
Journal ArticleDOI

Policy gradient in Lipschitz Markov Decision Processes

TL;DR: This paper shows that both the expected return of a policy and its gradient are Lipschitz continuous w.r.t. policy parameters and defines policy-parameter updates that guarantee a performance improvement at each iteration.
Journal ArticleDOI

Approximation of Markov decision processes with general state space

TL;DR: A state and action discretization procedure for approximating the optimal value function and an optimal policy of the original control model is proposed and explicit bounds on the approximation errors are provided.
Posted Content

Lipschitz Continuity in Model-based Reinforcement Learning

TL;DR: The authors examine the impact of learning Lipschitz continuous models in the context of model-based reinforcement learning and provide a bound on multi-step prediction error using the Wasserstein metric.
Journal ArticleDOI

Finite Linear Programming Approximations of Constrained Discounted Markov Decision Processes

TL;DR: This work proposes a finite state approximation of the linear programming formulation of the constrained MDP to a finite-dimensional static optimization problem that can be used to obtain explicit numerical approximations of the corresponding optimal constrained cost.
References
Book

Real and abstract analysis

Book

Real analysis and probability

TL;DR: This book discusses set theory, vector spaces, and Taylor's theorem with remainder, as well as general topology, measure theory, and differentiation, and introduces probability theory.
Journal ArticleDOI

Integral Probability Metrics and Their Generating Classes of Functions

TL;DR: A unified study of integral probability metrics of the following type are given and how some interesting properties of these probability metrics arise directly from conditions on the generating class of functions is shown.
Book ChapterDOI

Controlled Markov Processes

TL;DR: This chapter introduces stochastic control processes, also known as Markov decision processes or Markov dynamic programs, and briefly discusses more general control systems, such as non-stationary CMP’s and semi-Markov control models.