Percentile Optimization for Markov Decision Processes with Parameter Uncertainty

doi:10.1287/OPRE.1080.0685

Journal Article

Erick Delage, +1 more

- 01 Jan 2010 -

Operations Research

- Vol. 58, Iss: 1, pp 203-213

TL;DR

The paper presents a set of percentile criteria that are conceptually natural and representative of the trade-off between optimistic and pessimistic views of the question, and studies their use under different forms of uncertainty for both the rewards and the transitions.

Abstract:

Markov decision processes are an effective tool in modeling decision making in uncertain dynamic environments. Because the parameters of these models typically are estimated from data or learned from experience, it is not surprising that the actual performance of a chosen strategy often differs significantly from the designer's initial expectations due to unavoidable modeling ambiguity. In this paper, we present a set of percentile criteria that are conceptually natural and representative of the trade-off between optimistic and pessimistic views of the question. We study the use of these criteria under different forms of uncertainty for both the rewards and the transitions. Some forms are shown to be efficiently solvable and others highly intractable. In each case, we outline solution concepts that take parametric uncertainty into account in the process of decision making.
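To make the idea of a percentile criterion concrete, here is a minimal sketch, not the paper's algorithm. It assumes a fixed policy, known transitions, and reward vectors drawn from a Gaussian chosen purely for illustration. For a risk level eps, it estimates by Monte Carlo the largest value y such that the policy's expected discounted return is at least y with probability about 1 - eps over the uncertain rewards. The names `policy_value` and `percentile_value`, the two-state chain, and all numbers are hypothetical.

```python
import numpy as np

def policy_value(P_pi, r, q, gamma):
    """Expected discounted return of a fixed policy: q^T (I - gamma * P_pi)^{-1} r."""
    n = P_pi.shape[0]
    v = np.linalg.solve(np.eye(n) - gamma * P_pi, r)
    return float(q @ v)

def percentile_value(P_pi, reward_samples, q, gamma, eps):
    """Empirical eps-quantile of the policy's value over sampled reward vectors.

    Roughly a fraction 1 - eps of the sampled models give a value at least this large.
    """
    values = np.array([policy_value(P_pi, r, q, gamma) for r in reward_samples])
    return float(np.quantile(values, eps))

# Hypothetical two-state chain induced by some fixed policy (illustrative numbers).
rng = np.random.default_rng(0)
P_pi = np.array([[0.9, 0.1],
                 [0.2, 0.8]])
q = np.array([1.0, 0.0])        # initial state distribution
mean_r = np.array([1.0, 0.0])   # assumed mean reward per state
reward_samples = rng.normal(mean_r, 0.5, size=(2000, 2))  # assumed reward uncertainty

y_10 = percentile_value(P_pi, reward_samples, q, 0.9, 0.10)  # pessimistic view
y_50 = percentile_value(P_pi, reward_samples, q, 0.9, 0.50)  # median view
print(y_10, y_50)
```

Lowering eps makes the evaluation more pessimistic, while eps near 0.5 approaches a nominal, median-case view. Optimizing over policies, or handling uncertain transitions, is beyond this sketch.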

Citations

Journal Article

Theory and Applications of Robust Optimization

Dimitris Bertsimas, +2 more

- 01 Aug 2011 -

SIAM Review

TL;DR: This paper surveys the primary research, both theoretical and applied, in the area of robust optimization (RO), focusing on the computational attractiveness of RO approaches, as well as the modeling power and broad applicability of the methodology.

Journal Article

Theory and Applications of Robust Optimization

Dimitris Bertsimas, +2 more

- 01 Aug 2011 -

SIAM Journal on Control and Optimization

TL;DR: In this article, the authors survey the primary research, both theoretical and applied, in the area of robust optimization and highlight applications of RO across a wide spectrum of domains, including finance, statistics, learning, and various areas of engineering.

Journal Article

A comprehensive survey on safe reinforcement learning

Javier García, +1 more

- 01 Jan 2015 -

Journal of Machine Learning Research

TL;DR: This work categorizes and analyzes two approaches to Safe Reinforcement Learning: one based on modifying the optimality criterion (the classic discounted finite/infinite horizon) with a safety factor, and one based on incorporating external knowledge or the guidance of a risk metric.

Posted Content

Robust Adversarial Reinforcement Learning

Lerrel Pinto, +3 more

- 08 Mar 2017 -

arXiv: Learning

TL;DR: RARL is proposed, where an agent is trained to operate in the presence of a destabilizing adversary that applies disturbance forces to the system and the jointly trained adversary is reinforced - that is, it learns an optimal destabilization policy.

Posted Content

Theory and Applications of Robust Optimization

Dimitris Bertsimas, +2 more

- 26 Oct 2010 -

arXiv: Optimization and Control

TL;DR: In this paper, the authors survey the primary research, both theoretical and applied, in the area of robust optimization and highlight applications of RO across a wide spectrum of domains, including finance, statistics, learning, and various areas of engineering.

References

Book

Bayesian Data Analysis

Andrew Gelman, +5 more

TL;DR: Detailed notes on Bayesian computation, the basics of Markov chain simulation, regression models, and asymptotic theorems are provided.

Book

Markov Decision Processes: Discrete Stochastic Dynamic Programming

Martin L. Puterman

TL;DR: Puterman provides an up-to-date, unified, and rigorous treatment of the theoretical, computational, and applied research on Markov decision process models, focusing primarily on infinite-horizon discrete-time models with discrete state spaces while also examining models with arbitrary state spaces, finite-horizon models, and continuous-time discrete-state models.

Journal Article

Bayesian data analysis.

John K. Kruschke

- 01 Sep 2010 -

Wiley Interdisciplinary Reviews: Cogniti...

TL;DR: A fatal flaw of NHST is reviewed, some benefits of Bayesian data analysis are introduced, and illustrative examples of multiple comparisons in Bayesian analysis of variance and of Bayesian approaches to statistical power are presented.

Monograph

Markov Decision Processes

P. Whittle, +1 more

- 15 Apr 1994 -

Journal of The Royal Statistical Society...

TL;DR: Markov Decision Processes covers recent research advances in such areas as countable state space models with average reward criterion, constrained models, and models with risk sensitive optimality criteria, and explores several topics that have received little or no attention in other books.

Neuro-Dynamic Programming.

Dimitri P. Bertsekas

TL;DR: The authors present the first textbook that fully explains the neuro-dynamic programming/reinforcement learning methodology, a recent breakthrough in the practical application of neural networks and dynamic programming to complex problems of planning, optimal decision making, and intelligent control.

Related Papers (5)

Robust Control of Markov Decision Processes with Uncertain Transition Matrices

Arnab Nilim, +1 more

- 01 Sep 2005 -

Operations Research

Markov Decision Processes: Discrete Stochastic Dynamic Programming

Dynamic Programming and Optimal Control

Reinforcement Learning: An Introduction

Neuro-dynamic programming