FI MU
Faculty of Informatics
Masaryk University Brno
Stochastic Real-Time Games with
Qualitative Timed Automata Objectives
by
Tomáš Brázdil
Jan Krčál
Jan Křetínský
Antonín Kučera
Vojtěch Řehák
FI MU Report Series FIMU-RS-2010-05
Copyright © 2010, FI MU. August 2010

Copyright © 2010, Faculty of Informatics, Masaryk University.
All rights reserved.
Reproduction of all or part of this work
is permitted for educational or research use
on condition that this copyright notice is
included in any copy.
Publications in the FI MU Report Series are in general accessible
via WWW:
http://www.fi.muni.cz/reports/
Further information can be obtained by contacting:
Faculty of Informatics
Masaryk University
Botanická 68a
602 00 Brno
Czech Republic

Stochastic Real-Time Games with Qualitative
Timed Automata Objectives
Tomáš Brázdil    Jan Krčál    Jan Křetínský    Antonín Kučera    Vojtěch Řehák
Faculty of Informatics, Masaryk University,
Botanická 68a, 60200 Brno,
Czech Republic
{brazdil, krcal, kucera, rehak}@fi.muni.cz
jan.kretinsky@in.tum.de
December 13, 2010
Abstract
We consider two-player stochastic games over real-time probabilistic processes where the
winning objective is specified by a timed automaton. The goal of player □ is to play in such a
way that the play (a timed word) is accepted by the timed automaton with probability one.
Player ⋄ aims at the opposite. We prove that whenever player □ has a winning strategy, then
she also has a strategy that can be specified by a timed automaton. The strategy automaton
reads the history of a play, and the decisions taken by the strategy depend only on the region
of the resulting configuration. We also give an exponential-time algorithm which computes
a winning timed automaton strategy if it exists.

* The authors are supported by the Alexander von Humboldt Foundation (T. Brázdil), the Institute for
Theoretical Computer Science, project No. 1M0545 (J. Krčál), Brno Municipality (J. Křetínský), and the
Czech Science Foundation, grants No. P202/10/1469 (A. Kučera), No. 201/08/P459 (V. Řehák), and
No. 102/09/H042 (J. Krčál).
** On leave at TU München, Boltzmannstr. 3, Garching, Germany.

1 Introduction

In this paper, we study stochastic real-time games (SRTGs) which are obtained as a natural
game-theoretic extension of generalized semi-Markov processes (GSMP) [1, 2, 3] or real-time
probabilistic processes (RTP) [4]. Intuitively, all of these formalisms model systems which react
to certain events, such as message receipts, subsystem failures, timeouts, etc. A common
characteristic of all events is that they are delayed (it takes some time before an initiated event
actually occurs) and concurrent (there can be several previously initiated events that are currently
awaited). For example, if two messages e and e′ are sent, it takes some (random) time before
they arrive, and one can specify, or approximate, the densities $f_e$, $f_{e'}$ of their arrival times. When
e arrives (say, after 20 time units), the system reacts to this event by changing its state, and awaits
e′ in a new state. The arrival time of e′ in the new state is measured from zero again, and its
density $f_{e'|20}$ is obtained from $f_{e'}$ by incorporating the condition that e′ is delayed for at least
20 time units. That is, $f_{e'|20}(x) = f_{e'}(x + 20) / \int_{20}^{\infty} f_{e'}(y)\,dy$. Note that if the delays of all
events are exponentially distributed, then $f_e = f_{e|b}$ for every $b \in \mathbb{R}_{\geq 0}$, and thus we obtain
continuous-time Markov chains (see, e.g., [5]) and continuous-time stochastic games [6, 7] as
restricted forms of RTPs and SRTGs, respectively.
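For concreteness, the following short derivation (a sketch; the rate $\lambda > 0$ is a generic parameter introduced only here, not a quantity from the paper) spells out why exponentially distributed delays satisfy $f_e = f_{e|b}$ for every $b \geq 0$.

```latex
% Memorylessness of exponential delays (sketch; \lambda > 0 is an assumed generic rate).
% The conditioning rule from the text, generalized from 20 to an arbitrary b, is
%   f_{e|b}(x) = f_e(x + b) / \int_b^\infty f_e(y)\,dy .
% For f_e(x) = \lambda e^{-\lambda x} we get
\begin{align*}
  f_{e|b}(x)
    = \frac{\lambda e^{-\lambda (x+b)}}{\int_b^{\infty} \lambda e^{-\lambda y}\,dy}
    = \frac{\lambda e^{-\lambda x}\, e^{-\lambda b}}{e^{-\lambda b}}
    = \lambda e^{-\lambda x}
    = f_e(x),
\end{align*}
% so the residual delay does not depend on the time b already elapsed, which is
% exactly the restriction under which RTPs and SRTGs collapse to continuous-time
% Markov chains and continuous-time stochastic games.
```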
Intuitively, a SRTG is a finite graph (see Fig. 1) with three types of nodes: states (drawn as
large circles), controls, where each control can be either internal or adversarial (drawn as boxes
and diamonds, respectively), and actions (drawn as small filled circles). In each state s, there
is a finite subset E(s) of events scheduled in s (the events scheduled in s are those which are
“awaited” in a given state; the other events are disabled). Each state s can react to every event of
E(s) by entering a designated control c, where player □ or player ⋄ chooses some of the available
actions. Each action is associated with a fixed probability distribution over states. In general,
both players can use randomized strategies, which means that they do not necessarily select just
a single action but a probability distribution over the available actions, which is multiplied with
the distributions associated to actions. Then, the next state is chosen randomly according to the
constructed probability distribution, and the play goes on. Whenever a new state s′ is entered
from a previous state s along a play, each event scheduled in s′ is assigned a new delay which is
chosen randomly according to the corresponding (conditional) density. The state s′ then “reacts”
to the event with the least delay (under the assumptions adopted in this paper, the probability of
assigning the same delay to different events is zero).
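To make the round-based dynamics concrete, here is a minimal Python sketch of one round of a play. It is our own illustration rather than code from the paper; in particular, the data layout (`scheduled`, `react`, `owner`, `actions`, `action_dist`), the strategy interface, and the exponential placeholder in `sample_delay` are all assumptions made for the sketch.

```python
import random

# Hypothetical encoding of one SRTG round (illustration only).
#   scheduled[s]        : set of events awaited in state s, i.e. E(s)
#   react[(s, e)]       : control entered when state s reacts to event e
#   owner[c]            : 'box' (internal) or 'diamond' (adversarial)
#   actions[c]          : actions available in control c
#   action_dist[a]      : fixed probability distribution over successor states
#   strategy_*(c, acts) : returns a probability distribution over acts
#   elapsed[e]          : time event e has already been awaited

def sample_delay(event, elapsed, rate=1.0):
    # Placeholder for sampling from the conditional density f_{e|elapsed}.
    # With exponential delays the elapsed time is irrelevant (memorylessness).
    return random.expovariate(rate)

def srtg_round(state, scheduled, react, owner, actions, action_dist,
               strategy_box, strategy_diamond, elapsed):
    # 1. Every event scheduled in `state` gets a (conditional) random delay.
    delays = {e: sample_delay(e, elapsed.get(e, 0.0)) for e in scheduled[state]}
    # 2. The event with the least delay occurs (ties have probability zero
    #    under the paper's assumptions).
    event = min(delays, key=delays.get)
    control = react[(state, event)]
    # 3. The owner of the control picks a (possibly randomized) distribution
    #    over the available actions.
    strategy = strategy_box if owner[control] == 'box' else strategy_diamond
    action_probs = strategy(control, actions[control])
    # 4. The chosen distribution over actions is combined ("multiplied") with
    #    the fixed distributions over states attached to the actions.
    successor_probs = {}
    for a, pa in action_probs.items():
        for s_next, ps in action_dist[a].items():
            successor_probs[s_next] = successor_probs.get(s_next, 0.0) + pa * ps
    # 5. The next state is drawn from the combined distribution.
    states, weights = zip(*successor_probs.items())
    next_state = random.choices(states, weights=weights)[0]
    return next_state, event, delays[event]
```

A strategy here is any function mapping a control and its available actions to a probability distribution over those actions, which matches the informal description of randomized strategies above.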
Our contribution. In this work we consider SRTGs with deterministic timed automata
(DTA) objectives. Intuitively, a timed automaton “observes” a play of a given SRTG and checks
that certain timing constraints are satisfied. A simple example of a property that can be en-
coded by a DTA is “whenever a new request is generated, it is either serviced within the next
10 time units, or the system eventually enters a safe state”. In this case, we want to set up the
internal controls so that the above property holds for almost all plays, no matter what decisions
are taken in adversarial controls.

[Figure 1: An example of a stochastic real-time game.]

Hence, the aim of player □ is to maximize the probability that a play is accepted by a given
timed automaton, while player ⋄ aims at the opposite.
By applying the result of [8], we obtain that SRTGs with DTA objectives have a value, i.e.,
$\sup_{\sigma} \inf_{\pi} P^{\sigma,\pi} = \inf_{\pi} \sup_{\sigma} P^{\sigma,\pi}$, where σ and π range over all strategies of player □ and player ⋄,
and $P^{\sigma,\pi}$ is the probability of all plays satisfying a given DTA objective. This immediately raises
the question whether the players have optimal strategies which guarantee the equilibrium value
against every strategy of the opponent. We show that the answer is negative. Then, we con-
centrate on the qualitative variant of the problem, which is perhaps most interesting from the
practical point of view. An almost-sure winning strategy for player □ is a strategy such that
for every strategy of player ⋄, the probability of all plays satisfying a given DTA objective is
equal to one. The main result of this paper is the following: We show that if player □ has some
almost-sure winning strategy, then she also has a DTA almost-sure winning strategy, which can
be encoded by a deterministic timed automaton A constructible in exponential time. The au-
tomaton A reads the history of a play, and the decision taken by the corresponding DTA strategy
depends only on the region of the resulting configuration entered by A.
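To give a feel for the request/service property used as an example above, the following Python sketch implements a one-clock monitor for it. This is our own illustrative encoding, not the DTA construction of the paper; the event names `request`, `service`, and `safe`, the single-pending-request simplification, and the finite-prefix semantics are assumptions made for the sketch.

```python
IDLE, PENDING, AWAIT_SAFE = "idle", "pending", "await_safe"

def monitor(timed_word, deadline=10.0):
    """Hypothetical one-clock monitor for: every request is serviced within
    `deadline` time units, or the system eventually enters a safe state.
    `timed_word` is a list of (event, absolute_time) pairs with non-decreasing
    times; for simplicity only one outstanding request is tracked at a time."""
    location, request_time = IDLE, None
    for event, now in timed_word:
        # Deadline missed: only a future 'safe' event can discharge the request.
        if location == PENDING and now - request_time > deadline:
            location = AWAIT_SAFE
        if event == "request" and location == IDLE:
            location, request_time = PENDING, now      # start (reset) the clock
        elif event == "service" and location == PENDING:
            location, request_time = IDLE, None        # serviced in time
        elif event == "safe" and location in (PENDING, AWAIT_SAFE):
            location, request_time = IDLE, None        # discharged by safety
    # On a finite prefix nothing is definitively violated: an open obligation
    # could still be discharged by a later 'safe' event.
    return location

# The request at time 1 is not serviced by time 11, so after this prefix the
# monitor still waits for an eventual 'safe' event.
print(monitor([("request", 1.0), ("service", 12.5)]))  # -> await_safe
```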
Our constructions and proofs are combinations of standard techniques (used for timed au-
tomata and finite-state games) and some new non-trivial observations that are specific for the
considered model of SRTGs. We also adapt some ideas presented in [4] (in particular, we use the
concept of δ-separation).
Related work. Continuous-time (semi)Markov chains are a classical and deeply studied
model with a mature mathematical theory (see, e.g., [5, 9]). Continuous-time Markov decision
processes (CTMDPs) [10, 11, 12] combine probabilistic and non-deterministic choice, but all
events are required to be exponentially distributed. Two player games over continuous-time
Markov chains were considered only recently [6, 7]. Timed automata [13] were originally in-
troduced as a non-stochastic model with time. Probabilistic semantics of timed automata was

References

Dynamic Programming and Optimal Control (book)
A theory of timed automata, Alur et al. (journal article)
Probability and Measure (book)
Stochastic Processes (book)

Frequently Asked Questions (9)
Q1. What contributions have the authors mentioned in the paper "Stochastic real-time games with qualitative timed automata objectives" ?

The authors consider two-player stochastic games over real-time probabilistic processes where the winning objective is specified by a timed automaton. The authors prove that whenever player □ has a winning strategy, then she also has a strategy that can be specified by a timed automaton.

The operator “$+_s t$” adds t to all clocks stored in ξ and to all events scheduled in s, and $(e \cup X) := \vec{0}$ resets all clocks of X to zero and assigns zero delay to e.

A probability measure over a measurable space $(\Omega, \mathcal{F})$ is a function $\mathcal{P} : \mathcal{F} \to \mathbb{R}_{\geq 0}$ such that, for each countable collection $\{X_i\}_{i \in I}$ of pairwise disjoint elements of $\mathcal{F}$, $\mathcal{P}(\bigcup_{i \in I} X_i) = \sum_{i \in I} \mathcal{P}(X_i)$, and moreover $\mathcal{P}(\Omega) = 1$.

Being away from the boundary by a fixed δ then intuitively guarantees that any region that is reachable in one step is reachable with a probability bounded from below.

A common characteristic of all events is that they are delayed (it takes some time before an initiated event actually occurs) and concurrent (there can be several previously initiated events that are currently awaited).

For every E′ ⊆ E, the conditional probability of delaying all events in E′ for at least b + t, under the condition that all events in E′ are delayed for at least b, is equal to $\prod_{e \in E'} \int_t^{\infty} f_{e|b}(x)\,dx$.
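A small numerical sanity check of this identity (our own illustration, assuming scipy is available; the exponential and Weibull densities and their parameters are arbitrary choices, not taken from the paper):

```python
import math
from scipy.integrate import quad

def f_exp(x, rate=2.0):             # exponential delay density
    return rate * math.exp(-rate * x)

def f_weibull(x, k=1.5, lam=3.0):   # a non-memoryless delay density
    return (k / lam) * (x / lam) ** (k - 1) * math.exp(-(x / lam) ** k)

def conditional_survival(f, b, t):
    """Computes  ∫_t^∞ f_{e|b}(x) dx  with  f_{e|b}(x) = f(x + b) / ∫_b^∞ f(y) dy."""
    tail_b, _ = quad(f, b, math.inf)
    num, _ = quad(lambda x: f(x + b), t, math.inf)
    return num / tail_b

b, t = 1.0, 0.7
densities = [f_exp, f_weibull]

# Product form from the quoted identity: each scheduled event independently
# survives t further time units after having already waited b.
product = math.prod(conditional_survival(f, b, t) for f in densities)

# Direct computation of P(all delays > b + t) / P(all delays > b), using the
# independence of the individual delays.
num = math.prod(quad(f, b + t, math.inf)[0] for f in densities)
den = math.prod(quad(f, b, math.inf)[0] for f in densities)

print(product, num / den)   # the two values agree up to numerical error
```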

The probability that e is assigned a delay of at most 1 − ε in $s_1$ is 1 − ε, and hence the constructed DFA accepts a play with probability 1 − ε.

The possible waiting time that leads us to that region lies in an interval that has length at least δ, and the probability that an event happens during an interval of this minimal size is bounded from below.

A simple example of a property that can be encoded by a DTA is “whenever a new request is generated, it is either serviced within the next 10 time units, or the system eventually enters a safe state”.