scispace - formally typeset
Open AccessDOI

Threshold constraints with guarantees for parity objectives in markov decision processes

TLDR
Mayr et al. as mentioned in this paper extend the framework of [BFRR14] and follow-up papers, which focused on quantitative objectives, by addressing the case of parity objectives, a natural way to represent functional requirements of systems.
Abstract
The beyond worst-case synthesis problem was introduced recently by Bruyere et al. [BFRR14]: it aims at building system controllers that provide strict worst-case performance guarantees against an antagonistic environment while ensuring higher expected performance against a stochastic model of the environment. Our work extends the framework of [BFRR14] and follow-up papers, which focused on quantitative objectives, by addressing the case of $\omega$-regular conditions encoded as parity objectives, a natural way to represent functional requirements of systems. We build strategies that satisfy a main parity objective on all plays, while ensuring a secondary one with sufficient probability. This setting raises new challenges in comparison to quantitative objectives, as one cannot easily mix different strategies without endangering the functional properties of the system. We establish that, for all variants of this problem, deciding the existence of a strategy lies in ${\sf NP} \cap {\sf coNP}$, the same complexity class as classical parity games. Hence, our framework provides additional modeling power while staying in the same complexity class. [BFRR14] Veronique Bruyere, Emmanuel Filiot, Mickael Randour, and Jean-Francois Raskin. Meet your expectations with guarantees: Beyond worst-case synthesis in quantitative games. In Ernst W. Mayr and Natacha Portier, editors, 31st International Symposium on Theoretical Aspects of Computer Science, STACS 2014, March 5-8, 2014, Lyon, France, volume 25 of LIPIcs, pages 199-213. Schloss Dagstuhl - Leibniz - Zentrum fuer Informatik, 2014.

read more

Citations
More filters
Journal ArticleDOI

UAV Autonomous Target Search Based on Deep Reinforcement Learning in Complex Disaster Scene

TL;DR: This paper proves that deep reinforcement learning can be successfully applied to an ancient puzzle game Nokia Snake after further processing, and learns to find the target path autonomously, and the average score on the Snake Game exceeds theaverage score on human level.
Book ChapterDOI

Simple Strategies in Multi-Objective MDPs

TL;DR: It is shown that checking whether a point is achievable by a pure stationary strategy is NP-complete, even for two objectives, and the author provides an MILP encoding to solve the corresponding problem.
Book ChapterDOI

Multi-cost Bounded Reachability in MDP

TL;DR: The need for output beyond Pareto curves is discussed and the available information from the algorithm is exploited to support decision makers and show the algorithm’s scalability.
Posted Content

Combinations of Qualitative Winning for Stochastic Parity Games

TL;DR: The problem of parity games with parity conditions was studied in this paper, where it was shown that the problem is coNP-complete for both finite-memory and infinite-memory strategies, both for MDPs and turn-based stochastic games.
Proceedings ArticleDOI

MSO+∇ is undecidable

TL;DR: It is shown that it is undecidable to check if a given sentence of $\mathrm{MSO+ abla$ is true in the full binary tree.
Related Papers (5)