Threshold constraints with guarantees for parity objectives in markov decision processes

doi:10.4230/LIPICS.ICALP.2017.121

Open AccessDOI

Threshold constraints with guarantees for parity objectives in markov decision processes

- Vol. 80

TLDR

Mayr et al. as mentioned in this paper extend the framework of [BFRR14] and follow-up papers, which focused on quantitative objectives, by addressing the case of parity objectives, a natural way to represent functional requirements of systems.

Abstract:

The beyond worst-case synthesis problem was introduced recently by Bruyere et al. [BFRR14]: it aims at building system controllers that provide strict worst-case performance guarantees against an antagonistic environment while ensuring higher expected performance against a stochastic model of the environment. Our work extends the framework of [BFRR14] and follow-up papers, which focused on quantitative objectives, by addressing the case of $\omega$-regular conditions encoded as parity objectives, a natural way to represent functional requirements of systems. We build strategies that satisfy a main parity objective on all plays, while ensuring a secondary one with sufficient probability. This setting raises new challenges in comparison to quantitative objectives, as one cannot easily mix different strategies without endangering the functional properties of the system. We establish that, for all variants of this problem, deciding the existence of a strategy lies in ${\sf NP} \cap {\sf coNP}$, the same complexity class as classical parity games. Hence, our framework provides additional modeling power while staying in the same complexity class. [BFRR14] Veronique Bruyere, Emmanuel Filiot, Mickael Randour, and Jean-Francois Raskin. Meet your expectations with guarantees: Beyond worst-case synthesis in quantitative games. In Ernst W. Mayr and Natacha Portier, editors, 31st International Symposium on Theoretical Aspects of Computer Science, STACS 2014, March 5-8, 2014, Lyon, France, volume 25 of LIPIcs, pages 199-213. Schloss Dagstuhl - Leibniz - Zentrum fuer Informatik, 2014.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

UAV Autonomous Target Search Based on Deep Reinforcement Learning in Complex Disaster Scene

Chunxue Wu, +7 more

- 05 Aug 2019 -

IEEE Access

TL;DR: This paper proves that deep reinforcement learning can be successfully applied to an ancient puzzle game Nokia Snake after further processing, and learns to find the target path autonomously, and the average score on the Snake Game exceeds theaverage score on human level.

...read moreread less

Book ChapterDOI

Simple Strategies in Multi-Objective MDPs

Florent Delgrange, +3 more

TL;DR: It is shown that checking whether a point is achievable by a pure stationary strategy is NP-complete, even for two objectives, and the author provides an MILP encoding to solve the corresponding problem.

...read moreread less

Book ChapterDOI

Multi-cost Bounded Reachability in MDP

Arnd Hartmanns, +3 more

TL;DR: The need for output beyond Pareto curves is discussed and the available information from the algorithm is exploited to support decision makers and show the algorithm’s scalability.

...read moreread less

Posted Content

Combinations of Qualitative Winning for Stochastic Parity Games

Krishnendu Chatterjee, +1 more

- 10 Apr 2018 -

arXiv: Logic in Computer Science

TL;DR: The problem of parity games with parity conditions was studied in this paper, where it was shown that the problem is coNP-complete for both finite-memory and infinite-memory strategies, both for MDPs and turn-based stochastic games.

...read moreread less

Proceedings ArticleDOI