Verification of general Markov decision processes by approximate similarity relations and policy refinement

doi:10.1137/16M1079397

Open Access Journal Article

Sofie Haesaert, +2 more

- 03 Aug 2017 -

SIAM Journal on Control and Optimization

- Vol. 55, Iss: 4, pp 2333-2367

TLDR

It is shown that the new probabilistic similarity relations, inspired by a notion of simulation developed for finite-state models, can be effectively employed over general Markov decision processes for verification purposes, and specifically for control refinement from abstract models.

Abstract:

In this work we introduce new approximate similarity relations that are shown to be key for policy (or control) synthesis over general Markov decision processes. The models of interest are discrete-time Markov decision processes, endowed with uncountably infinite state spaces and metric output (or observation) spaces. The new relations, underpinned by the use of metrics, allow, in particular, for a useful trade-off between deviations over probability distributions on states, and distances between model outputs. We show that the new probabilistic similarity relations, inspired by a notion of simulation developed for finite-state models, can be effectively employed over general Markov decision processes for verification purposes, and specifically for control refinement from abstract models.
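The trade-off described in the abstract can be illustrated on a toy finite example. The sketch below is not the paper's construction: it assumes two finite models sharing one state space, takes the relation to be plain equality of states, and uses scalar outputs with the absolute-value metric. Under that assumption, the largest mass a coupling can place on the relation is the overlap of the two distributions, so the probability deviation reduces to their total variation distance. All names (`lifting_delta_identity`, `output_epsilon`, the distributions `p`, `q`, and the output maps `h1`, `h2`) are hypothetical.

```python
from typing import Dict, Iterable, Tuple


def lifting_delta_identity(p: Dict[str, float], q: Dict[str, float]) -> float:
    """Probability deviation between p and q when the relation is equality.

    Assumption: both distributions live on the same finite state space.
    The best coupling puts mass sum(min(p, q)) on the diagonal, so the
    deviation is one minus that overlap (the total variation distance).
    """
    support = set(p) | set(q)
    overlap = sum(min(p.get(s, 0.0), q.get(s, 0.0)) for s in support)
    return 1.0 - overlap


def output_epsilon(
    h1: Dict[str, float],
    h2: Dict[str, float],
    relation: Iterable[Tuple[str, str]],
) -> float:
    """Largest output distance over all related state pairs (scalar outputs)."""
    return max(abs(h1[x1] - h2[x2]) for x1, x2 in relation)


# Hypothetical one-step transition distributions of the two models.
p = {"a": 0.5, "b": 0.3, "c": 0.2}
q = {"a": 0.4, "b": 0.4, "c": 0.2}

# Hypothetical output maps; related states have close but unequal outputs.
h1 = {"a": 0.0, "b": 1.0, "c": 2.0}
h2 = {"a": 0.05, "b": 1.1, "c": 1.9}
relation = [("a", "a"), ("b", "b"), ("c", "c")]

delta = lifting_delta_identity(p, q)        # probability deviation
epsilon = output_epsilon(h1, h2, relation)  # output deviation
print(f"delta = {delta:.3f}, epsilon = {epsilon:.3f}")
```

For a general relation, the probability deviation would instead require solving a small transport problem over couplings. The equality case is used here only because its answer has a closed form.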

Citations

Journal Article

Compositional Construction of Infinite Abstractions for Networks of Stochastic Control Systems

Abolfazl Lavaei, +3 more

- 01 Sep 2019 -

Automatica

TL;DR: In this article, a compositional approach for constructing infinite abstractions of interconnected discrete-time stochastic control systems is proposed, which is based on new notions of so-called storage functions.

Journal Article

Compositional (In)Finite Abstractions for Large-Scale Interconnected Stochastic Systems

Abolfazl Lavaei, +2 more

- 24 Feb 2020 -

IEEE Transactions on Automatic Control

TL;DR: An approach is proposed to construct finite MDPs as finite abstractions of concrete models, or of their reduced-order versions, that satisfy an incremental input-to-state stability property; for a particular class of nonlinear stochastic control systems, this property can be readily checked by matrix inequalities.

Posted Content

Certified Reinforcement Learning with Logic Guidance.

Mohammadhosein Hasanbeig, +2 more

- 02 Feb 2019 -

arXiv: Learning

TL;DR: This paper proposes the first model-free Reinforcement Learning (RL) framework to synthesise policies for unknown, continuous-state Markov Decision Processes (MDPs), such that a given linear temporal property is satisfied.

Proceedings Article

Formal Controller Synthesis for Continuous-Space MDPs via Model-Free Reinforcement Learning

Abolfazl Lavaei, +4 more

TL;DR: A key contribution of the paper is to leverage the classical convergence results for reinforcement learning on finite MDPs and provide control strategies maximizing the probability of satisfaction over unknown, continuous-space MDPs while providing probabilistic closeness guarantees.

Proceedings Article

From Dissipativity Theory to Compositional Construction of Finite Markov Decision Processes

Abolfazl Lavaei, +2 more

TL;DR: In this article, a compositional approach for constructing finite Markov decision processes of interconnected discrete-time stochastic control systems is proposed, which leverages the interconnection topology and a notion of so-called storage functions describing joint dissipativity-type properties of subsystems and their abstractions.

References

Book

Stochastic optimal control : the discrete time case

Dimitri P. Bertsekas, +1 more

TL;DR: This research monograph is the authoritative and comprehensive treatment of the mathematical foundations of stochastic optimal control of discrete-time systems, including the treatment of the intricate measure-theoretic issues.

Book

Lectures on the Coupling Method

Torgny Lindvall

TL;DR: The book covers discrete theory, continuous theory, inequalities, intensity-governed processes, and diffusions, and closes with an appendix, a list of frequently used notation, references, and an index.

Journal Article

Bisimulation through probabilistic testing

Kim Guldstrand Larsen, +1 more

- 01 Sep 1991 -

Information & Computation

TL;DR: By using probabilistic transition systems as the underlying semantic model, it is shown how a testing algorithm can distinguish, with a probability arbitrarily close to one, between processes that are not bisimulation equivalent.

Dissertation

Modeling and verification of randomized distributed real-time systems

Roberto Segala

TL;DR: Theoretical work builds a new mathematical model of randomized distributed computation and develops techniques to prove the correctness of some property by reducing the problem to the verification of properties of non-randomized systems.

Journal Article

Probabilistic reachability and safety for controlled discrete time stochastic hybrid systems

Alessandro Abate, +3 more

- 01 Nov 2008 -

Automatica

TL;DR: In this work, probabilistic reachability over a finite horizon is investigated for a class of discrete time stochastic hybrid systems with control inputs and it is revealed that it is amenable to two complementary interpretations, leading to dual algorithms for reachability computations.
