Open Access Book

Constrained Markov Decision Processes

Eitan Altman
TLDR
In this paper, a unified approach for the study of constrained Markov decision processes with a countable state space and unbounded costs is presented. A single controller has several objectives, and the goal is to design a controller that minimizes one of the cost objectives, subject to inequality constraints on the other cost objectives.
Abstract
This report presents a unified approach for the study of constrained Markov decision processes with a countable state space and unbounded costs. We consider a single controller having several objectives; it is desirable to design a controller that minimizes one of the cost objectives, subject to inequality constraints on the other cost objectives. The objectives that we study are the expected average cost as well as the expected total cost (of which the discounted cost is a special case). We provide two frameworks: the case where costs are bounded below, and the contracting framework. We characterize the set of achievable expected occupation measures as well as the achievable performance vectors. This allows us to reduce the original dynamic control problem to an infinite linear program. We present a Lagrangian approach that enables us to obtain a sensitivity analysis. In particular, we obtain asymptotic results for the constrained control problem: convergence of both the value and the policies in the time horizon and in the discount factor. Finally, we present several state truncation algorithms that make it possible to approximate the solution of the original control problem via finite linear programs.
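
The reduction of a constrained control problem to a linear program over occupation measures can be illustrated on a small finite example. The sketch below (Python with NumPy and SciPy) is not taken from the book: the state/action sizes, random costs, discount factor, and the bound D are made-up assumptions, and only the finite, discounted case is shown; the optimal (possibly randomized) policy is recovered from the optimal occupation measure.

import numpy as np
from scipy.optimize import linprog

n_s, n_a, gamma = 3, 2, 0.9          # illustrative sizes and discount factor
rng = np.random.default_rng(0)

# P[s, a, s'] : transition kernel; c : cost to minimize; d : constrained cost.
P = rng.dirichlet(np.ones(n_s), size=(n_s, n_a))
c = rng.uniform(0.0, 1.0, size=(n_s, n_a))
d = rng.uniform(0.0, 1.0, size=(n_s, n_a))
mu0 = np.full(n_s, 1.0 / n_s)        # initial state distribution
D = 5.0                              # bound on the expected discounted d-cost (assumed)

# Decision variables rho[s, a] = expected discounted number of visits to (s, a).
# Flow constraints: sum_a rho(s', a) - gamma * sum_{s, a} P(s'|s, a) rho(s, a) = mu0(s').
A_eq = np.zeros((n_s, n_s * n_a))
for sp in range(n_s):
    for s in range(n_s):
        for a in range(n_a):
            A_eq[sp, s * n_a + a] = (1.0 if s == sp else 0.0) - gamma * P[s, a, sp]
b_eq = mu0

# Inequality constraint: expected discounted d-cost must not exceed D.
A_ub = d.reshape(1, -1)
b_ub = np.array([D])

res = linprog(c.reshape(-1), A_ub=A_ub, b_ub=b_ub,
              A_eq=A_eq, b_eq=b_eq, bounds=(0, None), method="highs")
if not res.success:
    raise RuntimeError(res.message)

rho = res.x.reshape(n_s, n_a)
policy = rho / rho.sum(axis=1, keepdims=True)   # stationary policy induced by rho
print("optimal constrained cost:", res.fun)
print("policy (rows = states, columns = actions):\n", policy)

The finite linear program above mirrors, in miniature, the infinite linear program of the book; the state truncation algorithms mentioned in the abstract are what justify approximating the countable-state problem by programs of this finite form.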


Citations
Journal Article

Markov Decision Processes

TL;DR: The theory of Markov Decision Processes is the theory of controlled Markov chains, as mentioned in this paper, and has found applications in areas such as computer science, engineering, operations research, biology, and economics.
Journal Article

Update or Wait: How to Keep Your Data Fresh

TL;DR: In this paper, the authors study how to optimally manage the freshness of information updates sent from a source node to a destination via a channel, develop efficient algorithms to find the optimal update policy among all causal policies, and establish necessary and sufficient conditions for the optimality of the zero-wait policy.
Book

A First Course in Stochastic Models

Henk Tijms
TL;DR: In this book, the author covers queueing models, useful tools in applied probability, useful probability distributions, generating functions, the discrete fast Fourier transform, Laplace transform theory, numerical Laplace inversion, and the root-finding problem.
Proceedings Article

Constrained policy optimization

TL;DR: Constrained Policy Optimization (CPO) as discussed by the authors is the first general-purpose policy search algorithm for constrained reinforcement learning with guarantees for near-constraint satisfaction at each iteration.
Proceedings Article

Delay-optimal computation task scheduling for mobile-edge computing systems

TL;DR: By analyzing the average delay of each task and the average power consumption at the mobile device, a power-constrained delay minimization problem is formulated, and an efficient one-dimensional search algorithm is proposed to find the optimal task scheduling policy.
References
Book

Matrix Analysis

TL;DR: In this article, the authors present results of both classic and recent matrix analysis, using canonical forms as a unifying theme, and demonstrate their importance in a variety of applications in linear algebra and matrix theory.
Book

Convergence of Probability Measures

TL;DR: Weak convergence in metric spaces, as discussed by the authors, is one of the most common modes of convergence of probability measures and is the central topic of this book.
Book

Markov Decision Processes: Discrete Stochastic Dynamic Programming

TL;DR: Puterman, as discussed by the authors, provides a uniquely up-to-date, unified, and rigorous treatment of the theoretical, computational, and applied research on Markov decision process models, focusing primarily on infinite-horizon discrete-time models and models with discrete state spaces, while also examining models with arbitrary state spaces, finite-horizon models, and continuous-time discrete-state models.
Book

Markov Chains and Stochastic Stability

TL;DR: This second edition reflects the same discipline and style that marked out the original and helped it to become a classic: proofs are rigorous and concise, the range of applications is broad and knowledgeable, and key ideas are accessible to practitioners with limited mathematical background.
Monograph

Markov Decision Processes

TL;DR: Markov Decision Processes covers recent research advances in such areas as countable state space models with average reward criterion, constrained models, and models with risk sensitive optimality criteria, and explores several topics that have received little or no attention in other books.