What are the contributions mentioned in the paper "The theory of stabilisation monoids and regular cost functions" ?

Q: What are the contributions mentioned in the paper "The theory of stabilisation monoids and regular cost functions" ?

The authors introduce the notion of regular cost functions: a quantitative extension to the standard theory of regular languages. The authors provide equivalent characterisations of this notion by means of automata ( extending the nested distance desert automata of Kirsten ), of history-deterministic automata ( history-determinism is a weakening of the standard notion of determinism, that replaces it in this context ), and a suitable notion of recognisability by stabilisation monoids. The authors also provide closure and decidability results.

(Open Access) The Theory of Stabilisation Monoids and Regular Cost Functions (2009) | Thomas Colcombet

The theory of stabilisation monoids

and regular cost functions

Thomas Colcombet

Liafa/Cnrs/Universit´e Paris 7, Denis Diderot, France

Abstract. We introduce the notion of regular cost functions: a quanti-

tative extension to the standard theory of regular languages.

We provide equivalent characterisations of this notion by means of au-

tomata (extending the nested distance desert automata of Kirsten), of

history-deterministic automata (history-determinism is a weakening of

the standard notion of determinism, that replaces it in this context),

and a suitable notion of recognisability by stabilisation monoids. We

also provide closure and decidability results.

1 Introduction

When considering standard regular languages (say on ﬁnite words), some re-

sults appear as cornerstones on which the whole theory is constructed. The ﬁrst

such kind of results are the equivalences between many diﬀerent formalisms: non-

deterministic automata, deterministic automata, recognisability by monoids, reg-

ular expressions, etc. The second one consists in the numerous closure properties

that regular languages enjoy: union, intersection, projection (mapping under a

length-preserving morphism), complementation, etc. From these facts one can

derive a third kind of results: the equivalence with logical formalisms such as

monadic (second-order) logic. Finally, all these properties do not come at an un-

aﬀordable price: emptiness is decidable, and hence the satisfaction of the logic

is also decidable.

In this paper, we present a quantitative extension to the standard notion of

regularity in which those cornerstone results still hold. We consider a quantitative

notion of regularity which allows to attach non-negative integer values to words,

such as the number of occurrences of a pattern, the length of segments, etc. One

also possess some freedom for combining those values, e.g., using minimum or

maximum. One can for instance describe the maximum number of occurrences of

letter a that are not separated by a letter b. Those integer values are considered

modulo an equivalence which preserves the existence of bounds, but does not

preserve exact values – as opposed to the usual way one considers quantitative

forms of automata. This is the price to pay for keeping all equivalences and

closure properties.

Originally, this work aimed at unifying and reinterpreting some recent results

from the literature. Let us review them.

Supported by the Anr project Jade: ‘Jeux et Automates, D´ecidabilit´e et Extensions’

First, in [9] Kirsten gives a new proof to the decidability of the (restricted)

star-height problem

. This problem is known to be decidable from Hashiguchi

[7], but with a very diﬃcult proof. The ﬁrst part in Kirsten’s proof consists in

reducing the star-height problem to a problem of limitedness: decide the exis-

tence of a bound for some function deﬁned by means of a nested distance desert

automata, a new form of automata introduced for this purpose. The second part

consists in proving the decidability of this limitedness problem. This is done by

turning this automata-related question into an algebraic one: the automaton is

translated into a monoid equipped with a stabilisation operator ]. The limit-

edness problem becomes easy to decide in this presentation. Kirsten’s paper is

itself the continuation of a long line of research concerning distance automata,

tropical semiring, desert automata, etc. [6, 8, 11–17].

Second, the paper [3] provides a study of an extension of the monadic second-

order logic over inﬁnite words with new ‘bound’ quantiﬁers such as: ‘there exists

a set of arbitrary large size satisfying some property’. The goal being diﬀerent,

the presentation is also signiﬁcantly diﬀerent, and getting results comparable to

the ones in the present paper requires a translation that we cannot detail here.

However, two new forms of automata are introduced in [3] as intermediate objects

in the proofs, namely B-automata and S-automata. The class of B-automata

corresponds essentially to the non-nested variant of the nested desert distance

automata, while the class of S-automata is a new dual variant. The decidability

of limitedness can be derived from this work but with a bad complexity (non-

elementary, as opposed to [9]). Independently, B-automata were also introduced

in [1] under the name of R-automata, and the decidability of the limitedness

problem established using another technique, yielding better complexity.

Other applications of the technique have also been described. Still in this

framework, the restricted star-height problem for trees has been shown decidable

[4], and the Mostowski hierarchy problem

has been reduced to the corresponding

limitedness problem over inﬁnite trees [5], which remains open. The existence

of a bound on the number of iterations necessary for reaching the ﬁxpoint of a

monadic second-order formula over words has been also shown decidable using

distance automata [2].

Contribution. Our contribution can be roughly described as 1) a uniﬁcation

of the ideas in [9] and [3], and 2) the development of a suitable mathematical

background and the establishment of new results in order to make this theory a

complete extension of the standard theory of regular languages. Let us be more

precise.

The ﬁrst contribution lies in the deﬁnition of a cost function: cost functions

are mappings from words (or from any set in general) to ω + 1 quotiented by a

suitable equivalence that preserves the notion of bound (≈ in the paper). In our

Problem: given a regular language L of words and an integer k, is it possible to

describe L with a regular expression using at most k nesting of Kleene stars?

The hierarchy induced by the number of priorities used by a non-deterministic parity

automaton running on inﬁnite trees.

framework, cost functions can be seen as a reﬁnement of the notion of language

(each language can be seen as a cost function, while the converse is not true).

We then introduce B- and S-automata, automata that accept cost functions

rather than languages. Those are slight extensions of the automata in [3]. We

establish the equivalence of the two forms of automata, via an elementary con-

struction, as well as the equivalence with their history-deterministic form. The

new notion of history-determinism is a weakening of the classical notion of de-

terminism (deterministic automata are strictly weaker in this framework). It is

needed for the further extension of the theory to trees. Quiet naturally, we call

regular the cost functions described by one of these formalisms.

The second aspect of the theory that we develop is the algebraic formalism.

We introduce the notion of stabilisation monoids: ﬁnite monoids equipped with a

stabilisation operator, inspired from [9]. We develop a mathematical framework

– new to the knowledge of the author – in order to deﬁne the semantics of

stabilisation monoids. The key result here is the existence of unique semantics

(that we call compatible mappings) for each stabilisation monoid

. Building on

these notions, we introduce the notion of recognisable cost functions. As we may

expect, these happen to be exactly the regular cost functions.

While describing the above objects, we prove the closure of regular cost func-

tions under operations which correspond to union, intersection, projection and

dual of projection in the world of languages. We also provide decision procedures

subsuming the limitedness results from [9].

Structure of the paper. We present in Section 2 cost functions and the au-

tomata part of the theory. We present in Section 3 the algebraic framework, and

the equivalent notion of recognisability.

Some notations

As usual, we denote by ω the set of non-negative integers and ω + 1 the set

ω ∪ {ω}. Those are ordered by 0 < 1 < · · · < ω. The identity mapping over ω is

id. Given a set E, E

is the set of sequences of ω-length of elements in E. Such

sequences will be denoted by bold letters (a, b,. . . ). We ﬁx a ﬁnite alphabet A

consisting of letters. The set of words over A is A

∗

. The empty word is ε. The

concatenation of a word u and word v is uv. The length of word u is |u|. The

number of occurrences of a letter a in u is |u|

2 Regular cost functions

We introduce in Section 2.1 the notion of cost function. We present B and S-

automata in Section 2.2, and their history-deterministic form in Section 2.3. The

key duality result is the subject of Section 2.4.

This result is reminiscent of the one for inﬁnite words stating that each ﬁnite Wilke

algebra can be uniquely extended into an ω-semigroup.

2.1 Cost functions

A correction function is a mapping from ω to ω. From now, the symbols α, α

, . . .

implicitly designate correction functions. Given x, y in ω + 1, x 4

y holds

if x ≤ α(y) in which α is the extension of α with α(ω) = ω. For every set E,

is extended to (ω + 1)

in a natural way by f 4

g if f (x) 4

g(x) for

all x ∈ E, or equivalently f ≤ α ◦ g. Intuitively, f is dominated by g after it has

been ‘stretched’ by α. One also writes f ≈

g if f 4

g and g 4

Some elementary properties of 4

are:

Fact 1 If α ≤ α

and f 4

g, then f 4

g. If f 4

g 4

h, then f 4

α◦α

Example 1. Over ω × ω, maximum and sum are equivalent for the doubling cor-

rection function (for short, (max) ≈

×2

(+)). Proof: for all x, y ∈ ω, max(x, y) ≤

x + y ≤ 2 × max(x, y).

Our second example concerns mappings from sequence of words to ω. Given

words u

, . . . , u

∈ {a, b}

∗

, we have |u

. . . u

≈

max(|K|, max

i=1...n

)

in which K is the set of indices i such that |u

≥ 1 and α(θ) = θ

. Proof:

max(|K|, max

i=1...n

) ≤ |u|

≤ Σ

i∈K

≤ (max(|K|, max

i=1...n

))

One also deﬁnes f 4 g (resp. f ≈ g) to hold if f 4

g (resp. f ≈

g) for

some α. A cost function (over a set E) is an equivalence class of ≈ (i.e., a set of

mappings from E to ω + 1). The relation 4 has other characterisations:

Proposition 1. For all f, g from E to ω + 1, the following items are equivalent:

– f 4 g,

– ∀n ∈ ω.∃m ∈ ω.∀x ∈ E.g(x) ≤ n → f (x) ≤ m , and;

– for all X ⊆ E, g|

is bounded implies f|

is bounded.

The last characterisation shows that the relation ≈ is an equivalence relation

that preserves the existence of bounds. Indeed, all this theory can be seen as an

automata theoretic method for proving the existence/non-existence of bounds.

Cost functions over some set E ordered by 4 form a lattice. Given a sub-

set X ⊆ E, one denotes by χ

its characteristic mapping deﬁned by χ

(x) = 0

if x ∈ X, ω otherwise. It is easy to see that for all X, Y ⊆ E, χ

4 χ

iﬀ

Y ⊆ X. To this respect, the lattice of cost functions is a reﬁnement of the lat-

tice of subsets of E equipped with the superset ordering. Keeping this in mind,

the notion of regular cost function developed in the paper is an extension of

the standard notion of regular language. This extension is strict as soon as E

is inﬁnite: there are cost functions that are not equivalent to any characteristic

mapping. Consider for instance the size mapping over words, or the number of

occurrences of some letter.

2.2 Automata

We present here the automata model we use. A cost automaton (that can be

either a B-automaton or an S-automaton) is a tuple hQ, A, In, Fin, Γ, ∆i in

which Q is a ﬁnite set of states, A is the alphabet, In and Fin are respectively

the set of initial and ﬁnal states, Γ is a ﬁnite set of counters, and ∆ ⊆ Q × A ×

{, i, r, c}

× Q is the set of transitions. The idea behind the letter in {, i, r, c}

(called an action) is that each counter (the value of which ranges over ω) can

either be left unchanged (), be incremented by one (i), be reset to 0 (r), or

be checked (c). A run σ of an automaton over a word a

. . . a

is deﬁned as a

sequence q

, a

, c

, q

, . . . , q

n−1

, a

, c

, q

such that q

is initial, q

is ﬁnal and for

all i = 1 . . . n, (q

i−1

, a

, c

, q

) ∈ ∆. Given a run σ, each counter ι ∈ Γ is initialized

with value 0 and evolves from left to right according to c

(ι): if c

(ι) is  or c, the

value is left unchanged, if it is i, it is incremented by 1, if it is r, the counter is

reset. The set C(σ) ⊆ ω is the set of values taken by the counters when checked

(i.e., the value of counter ι when c

(ι) = c). The diﬀerence between B-automata

and S-automata comes from their dual semantics, [[·]]

and [[·]]

respectively:

for all u ∈ A

∗

, [[A]]

(u) = inf{sup C(σ) : σ run over u} ,

and, [[A]]

(u) = sup{inf C(σ) : σ run over u} ,

in which we use the standard convention that inf ∅ = ω and sup ∅ = 0. Remark

that if A is a non-deterministic ﬁnite automaton in the standard sense, accepting

the language L, then it can be seen as a cost automaton without counters. Seen as

a B-automaton, [[A]]

(u) = χ

, while seen as an S-automaton [[A]]

(u) = χ

∗

Remark 1 (variants). The other similar automata known from the literature can

essentially be seen as special instances of the above formalism. The B-automata

and S-automata in [3] use only actions in {, i, cr} in which cr is an atomic

operation that checks the counter and immediately resets it. The models are

equivalent but the history-determinism (see below) cannot be achieved for S-

automata in this restricted form. The hierarchical automata correspond to the

case when Γ = {1, . . . , n} and for all transitions (p, a, c, q), if for all i ∈ Γ ,

if c(i) 6=  implies c(j) = r for all j < i. The nested distance desert automata of

Kirsten corresponds to hierarchical B-automata that use actions in {, ic, r} in

which ic is an atomic operation which increments the counter and immediately

checks it. The R-automata in [1] use also actions in {, ic, r}, but without the

hierarchical constraint. All those models are equivalent, up to ≈.

We conclude the section by showing some easy closure properties. Given a

mapping f from A

∗

to ω + 1 and a length-preserving morphism h from A

∗

to B

∗

(B is another alphabet) the inf-projection and sup-projection of f with respect

to h are the mappings f

inf,h

and f

sup,h

from B

∗

to ω + 1 deﬁned for v ∈ B

∗

by:

inf,h

(v) = inf{f (u) : h(u) = v} and f

sup,h

(v) = sup{f(u) : h(u) = v}.

By simply adapting the standard constructions for intersection, union, and pro-

jection of non-deterministic automata, we get:

Proposition 2. The mappings accepted by B-automata (resp. S-automata) are

closed under min and max. The mappings accepted by B-automata (resp. S-

automata) are closed under inf-projection (resp. sup-projection).

The Theory of Stabilisation Monoids and Regular Cost Functions

Citations

CONCUR 2008 - Concurrency Theory: 19th International Conference, CONCUR 2008, Toronto, Canada, August 19-22, 2008, Proceedings

Regular Functions and Cost Register Automata

Regular Cost Functions over Finite Trees

Quantitative reactive modeling and verification

Regular Cost Functions, Part I: Logic and Algebra over Words

References

On Observing Nondeterminism and Concurrency

Disjunctive Tautologies as Synchronisation Schemes

Computational Adequacy in an Elementary Topos

Limitedness theorem on finite automata with distance functions

Recognizable Sets with Multiplicities in the Tropical Semiring

Related Papers (5)

Distance desert automata and the star height problem

Solving games without determinization

Limitedness theorem on finite automata with distance functions

On Observing Nondeterminism and Concurrency

Decidability of second-order theories and automata on infinite trees.

Frequently Asked Questions (1)

Q1. What are the contributions mentioned in the paper "The theory of stabilisation monoids and regular cost functions" ?