
(Open Access) Similarity relations and cover automata (2005) | Jean-Marc Champarnaud

RAIRO-Inf. Theor. Appl. 39 (2005) 115-123

DOI: 10.1051/ita:2005006

SIMILARITY RELATIONS AND COVER AUTOMATA

Jean-Marc Champarnaud (1), Franck Guingne (1, 2) and Georges Hansel (1)

Abstract. Cover automata for finite languages were much studied a few years ago. It turns out that a simple mathematical structure, namely similarity relations over a finite set of words, underlies these studies. In the present work, we investigate the properties of these relations in detail and in their own right, beyond the scope of finite languages. New results with straightforward proofs are obtained in this generalized framework, and previous results concerning cover automata are obtained as immediate consequences.

Mathematics Subject Classification. 68Q25, 68Q45, 68W01, 68W10.

1. Introduction

Let $\Sigma$ be an alphabet and $\Sigma^{\le l}$ be the subset of words of $\Sigma^{*}$ whose length is not greater than the integer $l$. A relation $\sim$ over $\Sigma^{\le l}$ is semi-transitive if, given three words $x, y, z \in \Sigma^{\le l}$ such that $|x| \le |y| \le |z|$, transitivity holds when $x \sim y \wedge y \sim z$ or $y \sim x \wedge x \sim z$. In this paper, we present a general study of similarity relations over $\Sigma^{\le l}$, i.e. relations that are reflexive, symmetrical and semi-transitive. We show in particular that right invariant similarity relations are recognized by semiautomata and we characterize minimal semiautomata recognizing a given relation. We use these general properties to study cover automata for a finite language.

Keywords and phrases. Finite automata, cover automaton for a finite language, similarity relation.

(1) LIFAR, Université de Rouen, France; jean-marc.champarnaud@univ-rouen.fr & franck.guingne@univ-rouen.fr & georges.hansel@univ-rouen.fr

(2) XRCE, Xerox, 38240 Meylan, France; franck.guingne@xrce.xerox.com

© EDP Sciences 2005

Cover automata were introduced by Câmpeanu, Sântean and Yu in [1]. A finite language $L$ is said to be of order $l$ if the length of a longest word in $L$ is equal to $l$. A cover automaton for a language $L$ of order $l$ is a deterministic automaton $A$ such that $L(A) \cap \Sigma^{\le l} = L$. Checking word membership in $L$ on a cover automaton for $L$ only requires an additional test on the length of the word. Since covering generally reduces the size of an automaton [7], it is of practical interest to be able to compute a minimal cover automaton for $L$, that is, a cover automaton with a minimal number of states. It is shown in [1] that a minimal cover automaton can be obtained from any cover automaton for $L$ by merging states according to a state relation involving the right languages of the states. Minimality with respect to $L$ comes from the properties of the similarity relation over $\Sigma^{\le l}$ that underlies the state relation. This word relation, called $L$-similarity, was introduced by Kaneps and Freivalds [5] and Dwork and Stockmeyer [3].
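The membership test on a cover automaton can be sketched as follows. This is a minimal illustration, not code from the paper: the dictionary encoding of the automaton, the names `accepts` and `in_language`, and the example language are all assumptions made for the sketch.

```python
def accepts(delta, start, finals, word):
    """Run a deterministic automaton stored as {(state, letter): state}."""
    state = start
    for letter in word:
        state = delta[(state, letter)]
    return state in finals


def in_language(delta, start, finals, order, word):
    """Membership in L via a cover automaton: length test, then a normal run."""
    return len(word) <= order and accepts(delta, start, finals, word)


# Hypothetical example: L = {"a", "aaa"} over the alphabet {a}, of order 3.
# The two-state automaton below accepts every word of odd length, so it
# accepts exactly the words of L among those of length at most 3.
DELTA = {(0, "a"): 1, (1, "a"): 0}
START, FINALS, ORDER = 0, {1}, 3
```

The automaton alone accepts words such as "aaaaa" that are not in the language; the length test is what rejects them.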

In this paper, we show how a semiautomaton recognizing the $L$-similarity relation can be equipped with final states to yield a cover automaton for $L$. This leads to a characterization of minimal cover automata for a finite language.

Notice that several efficient algorithms have been designed for computing a minimal cover automaton, either from a deterministic automaton recognizing $L$, or from an arbitrary cover automaton for $L$. In [1], Câmpeanu, Sântean and Yu present an $O(n^4)$ time and space algorithm to minimize an $n$-state cover automaton for $L$. In [2], Câmpeanu, Păun and Yu provide an $O(n^2)$ time and space algorithm whose input is an $n$-state deterministic automaton recognizing $L$. In [6], Körner describes a Hopcroft-like algorithm with $O(n \log n)$ time and $O(n)$ space complexity that works on both types of input.

Section 2 is devoted to a general study of similarity relations over $\Sigma^{\le l}$ and Section 3 addresses the right invariance property. The connection between similarity relations and semiautomata is investigated in Section 4. The application of the study of similarity relations to the computation of a minimal cover automaton for a finite language is developed in Section 5.

2. Similarity relations over $\Sigma^{\le l}$

Let $l$ be an integer. In the following, $\Sigma^{\le l}$ denotes the subset of $\Sigma^{*}$ of words having a length not greater than $l$.

A relation $\sim$ over $\Sigma^{\le l}$ is semi-transitive iff for all $x, y, z$ in $\Sigma^{\le l}$ such that $|x| \le |y| \le |z|$, the following implications hold:

(i) $x \sim y$ and $y \sim z \Rightarrow x \sim z$,

(ii) $x \sim y$ and $x \sim z \Rightarrow y \sim z$.
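Conditions (i) and (ii), together with reflexivity and symmetry, can be checked by brute force on a small example. The sketch below assumes the usual definition of $L$-similarity from the cover automata literature (two words are similar when appending any suffix that keeps both within length $l$ gives the same membership in $L$); that definition is not stated in this section, and the function names and the example language are illustrative only.

```python
from itertools import product


def words_up_to(alphabet, l):
    """All words over `alphabet` of length at most l, shortest first."""
    return ["".join(p) for n in range(l + 1) for p in product(alphabet, repeat=n)]


def l_similar(x, y, language, alphabet, l):
    """Assumed L-similarity: xz and yz agree on membership in L
    for every z such that |xz| <= l and |yz| <= l."""
    room = l - max(len(x), len(y))
    return all((x + z in language) == (y + z in language)
               for z in words_up_to(alphabet, room))


def is_similarity_relation(sim, universe):
    """Check reflexivity, symmetry and semi-transitivity (conditions (i), (ii))."""
    if not all(sim(x, x) for x in universe):
        return False
    if not all(sim(x, y) == sim(y, x) for x in universe for y in universe):
        return False
    for x, y, z in product(universe, repeat=3):
        if len(x) <= len(y) <= len(z):
            if sim(x, y) and sim(y, z) and not sim(x, z):  # condition (i)
                return False
            if sim(x, y) and sim(x, z) and not sim(y, z):  # condition (ii)
                return False
    return True


ALPHABET, ORDER = "a", 3
LANGUAGE = {"", "a", "aaa"}  # hypothetical finite language of order 3
UNIVERSE = words_up_to(ALPHABET, ORDER)


def sim(x, y):
    return l_similar(x, y, LANGUAGE, ALPHABET, ORDER)
```

In this example the empty word and "a" are both similar to "aaa" but not to each other, so the relation is semi-transitive without being transitive: the shared neighbour is the longest of the three words, a case that neither (i) nor (ii) covers.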

A reflexive, symmetrical and semi-transitive relation is a similarity relation. In the following, the relation $\sim$ is supposed to be a similarity relation over $\Sigma^{\le l}$.

Two words $x$ and $y$ are similar (resp. dissimilar) if $x \sim y$ (resp. $x \not\sim y$). A similarity set (resp. a dissimilarity set) is a subset of pairwise similar (resp. pairwise dissimilar) elements of $\Sigma^{\le l}$. A dissimilarity set is maximal if its cardinality is maximal among dissimilarity sets. A partition of $\Sigma^{\le l}$ all of whose classes are similarity sets is called a similarity partition. A similarity partition is minimal if its cardinality is minimal among similarity partitions. Two similarity sets $S$ and $T$ are said to be mergeable if $S \cup T$ is a similarity set. Hence the partition resulting from merging two mergeable classes of a similarity partition is again a similarity partition.

An element $x \in \Sigma^{\le l}$ is minimal if for all $y \in \Sigma^{\le l}$, we have

$$y \sim x \Rightarrow |y| \ge |x|.$$

We denote by $M$ the set of all minimal elements of $\Sigma^{\le l}$.

Proposition 2.1.

1) The restriction of the relation $\sim$ to $M$ is an equivalence relation.

2) For all $x \in \Sigma^{\le l}$, there exists at least one minimal element similar to $x$.

Proof.

1) It follows from the very definition of minimal elements that two similar minimal elements have the same length. Consequently, by Condition (i), when restricted to $M$, the relation $\sim$ is transitive.

2) Let $x \in \Sigma^{\le l}$. Let $y$ be an element of smallest length among all elements similar to $x$. It follows from Condition (i) that $y$ is a minimal element. □
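As an illustration of minimal elements and of Proposition 2.1, the sketch below computes the set of minimal elements and groups them by similarity. The relation is a hypothetical one on the words over {a} of length at most 3, given by an explicit list of similar pairs; the helper names are assumptions of the sketch.

```python
# Hypothetical similarity relation on the words over {a} of length <= 3:
# besides x ~ x, the only similar pairs are "" ~ "aaa" and "a" ~ "aaa".
UNIVERSE = ["", "a", "aa", "aaa"]
SIMILAR_PAIRS = {frozenset(["", "aaa"]), frozenset(["a", "aaa"])}


def sim(x, y):
    return x == y or frozenset([x, y]) in SIMILAR_PAIRS


def minimal_elements(universe):
    """Words x such that every word similar to x is at least as long as x."""
    return [x for x in universe
            if all(len(y) >= len(x) for y in universe if sim(y, x))]


def classes(minimal):
    """Group minimal elements by similarity, relying on Proposition 2.1(1)."""
    result = []
    for x in minimal:
        for cls in result:
            if sim(x, cls[0]):
                cls.append(x)
                break
        else:
            result.append([x])
    return result


M = minimal_elements(UNIVERSE)
PI_M = classes(M)
```

Here "aaa" is not minimal because it is similar to the shorter words "" and "a", and the three minimal elements fall into three singleton classes.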



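Let us fix some notation. We denote by $\pi_M = \{M_1, \ldots, M_k\}$ the partition of $M$ in equivalence classes and by $C = \{c_1, \ldots, c_k\}$ a cross-section of $\pi_M$, i.e. $c_i \in M_i$ for all $i = 1, \ldots, k$. For all $x \in M$, let us denote by $S_x$ the similarity set of all the elements similar to $x$. Finally, for all $i = 1, \ldots, k$, let us set

$$T_i = S_{c_i} \setminus \bigcup_{j=1}^{i-1} S_{c_j} \quad \text{and} \quad T'_i = S_{c_i} \setminus \bigcup_{j \ne i} S_{c_j}.$$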

Remark 2.2. It follows from Condition (i) that if $x$ and $x'$ are similar minimal elements, then $S_x = S_{x'}$. Moreover it follows from Proposition 2.1(2) that

$$\bigcup_{x \in M} S_x = \Sigma^{\le l}.$$

Proposition 2.3.

1) The set $C$ is a maximal dissimilarity set.

2) Any minimal similarity partition has $k$ elements and $\{T_1, \ldots, T_k\}$ is such a minimal similarity partition.

Proof.

1) Being a cross-section of $M$, the set $C$ is a dissimilarity set. Let $D$ be any dissimilarity set. Suppose that $|D| > |C|$. Since every element of $D$ is similar to some element of $C$ (Proposition 2.1(2) and Remark 2.2), it follows by the pigeonhole principle that there exist two elements $y$ and $z$ in $D$ similar to the same element $c$ of $C$. Since $c$ is a minimal element, $|y| \ge |c|$ and $|z| \ge |c|$ and therefore, by Condition (ii), we get that $y$ and $z$ are similar, a contradiction.

2) Let $\pi$ be a similarity partition of $\Sigma^{\le l}$. Different elements of $C$ belong to different classes of $\pi$. Hence $\pi$ has at least $k$ elements. It remains only to observe that $\{T_1, \ldots, T_k\}$ is a similarity partition (cf. Rem. 2.2). □
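The partition used in Proposition 2.3(2) can be computed directly on a small example. The sketch below takes each class to be the set of words similar to $c_i$ that are not similar to any earlier $c_j$, which is the reading of the definition assumed here; the relation, the cross-section and the function names are hypothetical.

```python
# Hypothetical similarity relation on the words over {a} of length <= 3:
# besides x ~ x, the only similar pairs are "" ~ "aaa" and "a" ~ "aaa".
UNIVERSE = ["", "a", "aa", "aaa"]
SIMILAR_PAIRS = {frozenset(["", "aaa"]), frozenset(["a", "aaa"])}
C = ["", "a", "aa"]  # cross-section of the minimal elements of this relation


def sim(x, y):
    return x == y or frozenset([x, y]) in SIMILAR_PAIRS


def similar_to(c):
    """The similarity set of all words similar to the minimal element c."""
    return {x for x in UNIVERSE if sim(x, c)}


def t_partition(cross_section):
    """Words similar to c_i, minus those already similar to an earlier c_j."""
    seen, parts = set(), []
    for c in cross_section:
        parts.append(similar_to(c) - seen)
        seen |= similar_to(c)
    return parts


def is_similarity_set(s):
    return all(sim(x, y) for x in s for y in s)


T = t_partition(C)
```

On this example the partition has three classes, as many as there are elements in the cross-section, and the word "aaa", which is similar to two elements of the cross-section, is assigned to the first of them.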



The following proposition gives a complete characterization of maximal dissimilarity sets.

Proposition 2.4. Let $D$ be a subset of $\Sigma^{\le l}$. The following conditions are equivalent:

1) $D$ is a maximal dissimilarity set.

2) $|D| = k$ and, for all $i = 1, \ldots, k$, there exists one and only one element $d_i \in D$ such that $d_i \in T'_i$.

Proof. 1) $\Rightarrow$ 2) Since $D$ is a maximal dissimilarity set, it follows from Proposition 2.3(1) that $|D| = |C| = k$. By Proposition 2.1(2), we can choose for all $d \in D$ a minimal element $f(d) \in C$ such that $f(d) \sim d$. Let $d, d'$ be two elements of $D$ and suppose that $f(d) = f(d')$. It follows from Condition (ii) that $d \sim d'$ and, since $D$ is a dissimilarity set, we get that $d = d'$. Hence the mapping $d \mapsto f(d)$ is one-to-one and onto. Let $d_i = f^{-1}(c_i)$, $i = 1, \ldots, k$. Then $d_i \sim c_i$ and $d_i \not\sim c_j$ for $j \ne i$ (otherwise we would get $d_i \sim d_j$). Hence $d_i \in T'_i$, $i = 1, \ldots, k$, and 2) is satisfied.

2) $\Rightarrow$ 1) It suffices to observe that according to the definition of the sets $T'_i$, $i = 1, \ldots, k$, we get that $D = \{d_1, \ldots, d_k\}$ is a dissimilarity set. □
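The characterization of Proposition 2.4 can be checked by exhaustive search on a small example. In this sketch, each primed set is taken to be the words similar to $c_i$ and to no other element of the cross-section, which is the reading assumed here; the relation and the function names are hypothetical.

```python
from itertools import combinations

# Hypothetical similarity relation on the words over {a} of length <= 3:
# besides x ~ x, the only similar pairs are "" ~ "aaa" and "a" ~ "aaa".
UNIVERSE = ["", "a", "aa", "aaa"]
SIMILAR_PAIRS = {frozenset(["", "aaa"]), frozenset(["a", "aaa"])}
C = ["", "a", "aa"]  # cross-section of the minimal elements of this relation


def sim(x, y):
    return x == y or frozenset([x, y]) in SIMILAR_PAIRS


def is_dissimilarity_set(d):
    return all(not sim(x, y) for x, y in combinations(d, 2))


def maximal_dissimilarity_sets(universe):
    """All dissimilarity sets of the largest possible cardinality."""
    for size in range(len(universe), 0, -1):
        found = [set(d) for d in combinations(universe, size)
                 if is_dissimilarity_set(d)]
        if found:
            return found
    return []


def t_prime(cross_section):
    """Words similar to c_i and dissimilar to every other c_j."""
    return [{x for x in UNIVERSE
             if sim(x, c) and all(not sim(x, o) for o in cross_section if o != c)}
            for c in cross_section]


MAXIMAL = maximal_dissimilarity_sets(UNIVERSE)
T_PRIME = t_prime(C)
```

On this example each primed set is a singleton, so the cross-section itself is the only maximal dissimilarity set.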

Corollary 2.5. Let $D$ be a dissimilarity set. The following conditions are equivalent:

1) $D$ is a maximal dissimilarity set.

2) $D$ is a cross-section of a similarity partition of $\Sigma^{\le l}$.

Proof. 1) $\Rightarrow$ 2) According to Proposition 2.4, $D = \{d_1, \ldots, d_k\}$, with $d_i \in T'_i$ for all $i = 1, \ldots, k$. Hence $D$ is a cross-section of the similarity partition $\{T_1, \ldots, T_k\}$.

2) $\Rightarrow$ 1) Let $\pi = \{U_1, \ldots, U_p\}$ be a similarity partition of $\Sigma^{\le l}$ of which $D$ is a cross-section. Since $\pi$ is a similarity partition, we have $p \ge k$ and since $D$ is a dissimilarity set, we have $p \le k$. Hence $|D| = p = k$. We can assume that $c_i \in U_i$ for all $i = 1, \ldots, k$ and denote by $d_i$ the unique element of $D \cap U_i$. Since $D$ is a dissimilarity set, we get that $d_i \sim c_j$ if and only if $i = j$. Hence $d_i \in T'_i$ for all $i = 1, \ldots, k$ and $D$ is a maximal dissimilarity set (cf. Prop. 2.4). □

Lemma 2.6. Let $S$ and $T$ be two similarity sets. Let $s$ (resp. $t$) be one of the smallest elements of $S$ (resp. $T$). The following conditions are equivalent:

1) $S$ and $T$ are mergeable;

2) $s$ and $t$ are similar.

Proof. 1) $\Rightarrow$ 2) is obvious. Let us prove that 2) $\Rightarrow$ 1). Suppose that $|s| \le |t|$. Let $y$ be an element of $S$ and $z$ be an element of $T$. Since $|s| \le |t| \le |z|$, by Condition (i) we get that $s \sim z$. Consequently, since $|s| \le |y|$ and $|s| \le |z|$, by Condition (ii) we get that $y \sim z$. Hence $S$ and $T$ are mergeable. □
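Lemma 2.6 reduces the mergeability test to a single comparison. The sketch below compares that shortcut with the definition on a hypothetical relation; the function names are illustrative.

```python
# Hypothetical similarity relation on the words over {a} of length <= 3:
# besides x ~ x, the only similar pairs are "" ~ "aaa" and "a" ~ "aaa".
SIMILAR_PAIRS = {frozenset(["", "aaa"]), frozenset(["a", "aaa"])}


def sim(x, y):
    return x == y or frozenset([x, y]) in SIMILAR_PAIRS


def mergeable_by_definition(s, t):
    """S and T are mergeable when their union is pairwise similar."""
    union = s | t
    return all(sim(x, y) for x in union for y in union)


def mergeable_by_lemma(s, t):
    """Lemma 2.6: compare one smallest element of each similarity set."""
    return sim(min(s, key=len), min(t, key=len))


# All non-empty similarity sets of this relation.
SIMILARITY_SETS = [{""}, {"a"}, {"aa"}, {"aaa"}, {"", "aaa"}, {"a", "aaa"}]
```

The shortcut is only valid when both arguments are similarity sets, as the lemma requires.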


Theorem 2.7. Let $\pi$ be a similarity partition of $\Sigma^{\le l}$. The following conditions are equivalent:

1) $\pi$ is a minimal similarity partition.

2) $\pi$ admits a maximal dissimilarity cross-section.

3) $\pi$ admits a dissimilarity cross-section.

4) $\pi$ cannot be reduced by merging elements.

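Proof. 1) $\Rightarrow$ 2) Since $\pi$ is minimal, it has $k$ elements (cf. Prop. 2.3) and consequently the set $C$ is a maximal dissimilarity cross-section of $\pi$.

2) $\Rightarrow$ 3) is obvious.

3) $\Rightarrow$ 1) Let $D$ be a dissimilarity cross-section of $\pi$. It follows from Corollary 2.5 that $D$ is a maximal dissimilarity set. Hence $D$ has $k$ elements and $\pi$ is minimal.

Thus we have already shown that 1) $\Leftrightarrow$ 2) $\Leftrightarrow$ 3). The implication 1) $\Rightarrow$ 4) is obvious and it follows from Lemma 2.6 that 4) $\Rightarrow$ 3): if no two classes of $\pi$ are mergeable, then choosing a smallest element in each class yields a dissimilarity cross-section. The proof is complete. □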
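The equivalence between conditions 1) and 4) suggests a simple way to reach a minimal similarity partition: start from any similarity partition and merge classes until no two are mergeable. The sketch below is a naive quadratic illustration on a hypothetical relation, not one of the algorithms cited in the introduction; the function names are assumptions.

```python
# Hypothetical similarity relation on the words over {a} of length <= 3:
# besides x ~ x, the only similar pairs are "" ~ "aaa" and "a" ~ "aaa".
SIMILAR_PAIRS = {frozenset(["", "aaa"]), frozenset(["a", "aaa"])}


def sim(x, y):
    return x == y or frozenset([x, y]) in SIMILAR_PAIRS


def mergeable(s, t):
    union = s | t
    return all(sim(x, y) for x in union for y in union)


def merge_until_stuck(partition):
    """Repeatedly merge two mergeable classes until none remain."""
    parts = [set(p) for p in partition]
    merged = True
    while merged:
        merged = False
        for i in range(len(parts)):
            for j in range(i + 1, len(parts)):
                if mergeable(parts[i], parts[j]):
                    parts[i] = parts[i] | parts[j]
                    del parts[j]
                    merged = True
                    break
            if merged:
                break
    return parts


SINGLETONS = [{""}, {"a"}, {"aa"}, {"aaa"}]
RESULT = merge_until_stuck(SINGLETONS)
```

Starting from the partition into singletons, the procedure stops with three classes, which is the number of classes of minimal elements for this relation.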

3. Right invariant similarity relations

A similarity relation $\sim$ over $\Sigma^{\le l}$ is right invariant if $x \sim y \Rightarrow xz \sim yz$, for all $z \in \Sigma^{*}$ such that $|xz|, |yz| \le l$. A similarity partition $(U_i)$ is right invariant with respect to $\sim$ if the conditions $x, y \in U_i$, $|xz| \le l$, $|yz| \le l$, and $xz \in U_j$ imply $yz \in U_j$.
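Right invariance of a relation can be tested by brute force on a small universe. The sketch below uses two hypothetical relations over {a} with $l = 3$, one right invariant and one not; the names `make_relation` and `is_right_invariant` are assumptions of the sketch.

```python
from itertools import product

ALPHABET, L_MAX = "a", 3
UNIVERSE = ["".join(p) for n in range(L_MAX + 1)
            for p in product(ALPHABET, repeat=n)]


def make_relation(pairs):
    """Reflexive, symmetric relation generated by the given similar pairs."""
    similar = {frozenset(p) for p in pairs}
    return lambda x, y: x == y or frozenset([x, y]) in similar


def is_right_invariant(sim):
    """x ~ y must imply xz ~ yz whenever xz and yz have length at most L_MAX."""
    for x, y in product(UNIVERSE, repeat=2):
        if not sim(x, y):
            continue
        room = L_MAX - max(len(x), len(y))
        for z in UNIVERSE:
            if len(z) <= room and not sim(x + z, y + z):
                return False
    return True


INVARIANT = make_relation([("", "aaa"), ("a", "aaa")])
NOT_INVARIANT = make_relation([("", "a")])
```

The second relation fails because the empty word is similar to "a" while the extensions "a" and "aa" are not similar.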

Proposition 3.1. Let $\sim$ be a right invariant similarity relation over $\Sigma^{\le l}$. Then there exists a minimal right invariant similarity partition.

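Proof. First we define a mapping $(c, a) \mapsto c \cdot a$ from $C \times \Sigma$ to $C$ by defining $c \cdot a$ as any element $c' \in C$ that is similar to the word $ca$. This mapping is then inductively extended to a mapping $(c, x) \mapsto c \cdot x$ from $C \times \Sigma^{*}$ to $C$ by setting

$$c \cdot x = \begin{cases} c & \text{if } x = \varepsilon, \\ (c \cdot y) \cdot a & \text{if } x = ya. \end{cases}$$

Now we construct a partition of $\Sigma^{\le l}$ denoted $\{U_1, \ldots, U_k\}$, with $k = |C|$, by fixing, for all $x \in \Sigma^{\le l}$, to which set $U_i$ it belongs. Remark that the empty word $\varepsilon$ belongs to $C$ and we can suppose that $\varepsilon = c_1$. Then we set

$$x \in U_i \Leftrightarrow \varepsilon \cdot x = c_i.$$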
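The construction of the sets $U_i$ can be sketched on a small example. The proof allows any element of $C$ similar to $ca$ to be chosen as $c \cdot a$; the sketch below picks the first one in the order of the list `C`, which is an arbitrary choice made for the illustration. The relation and the function names are hypothetical.

```python
# Hypothetical right invariant similarity relation on the words over {a}
# of length <= 3: besides x ~ x, only "" ~ "aaa" and "a" ~ "aaa" hold.
ALPHABET = "a"
UNIVERSE = ["", "a", "aa", "aaa"]
SIMILAR_PAIRS = {frozenset(["", "aaa"]), frozenset(["a", "aaa"])}
C = ["", "a", "aa"]  # cross-section of the minimal elements, with c_1 = ""


def sim(x, y):
    return x == y or frozenset([x, y]) in SIMILAR_PAIRS


def dot_letter(c, a):
    """c . a: here the first element of C similar to the word ca."""
    return next(d for d in C if sim(d, c + a))


def dot(c, x):
    """Inductive extension: c . "" = c and c . (ya) = (c . y) . a."""
    for a in x:
        c = dot_letter(c, a)
    return c


# x belongs to the class indexed by c exactly when "" . x = c.
U = {c: {x for x in UNIVERSE if dot("", x) == c} for c in C}
```

Read as a transition table, `dot_letter` sends "" to "a", "a" to "aa" and "aa" back to "", so every word of length at most 3 is classified by running it from the empty word.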

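Let us first inductively check that $x \in U_i \Rightarrow x \sim c_i$ and hence that $(U_i)$ is a similarity partition. By definition $\varepsilon \in U_1$ and trivially $\varepsilon = c_1 \sim c_1$. Suppose that $x = ya$ with $y \in U_i$ and $x \in U_j$. By the induction hypothesis, $y \sim c_i$. Since the relation $\sim$ is right invariant we get that $x \sim c_i a$. On the other hand

$$\varepsilon \cdot x = (\varepsilon \cdot y) \cdot a = c_i \cdot a = c_j.$$

By definition of $c_i \cdot a$, we have $c_i a \sim c_i \cdot a$. Thus we have $x \sim c_i a$ and $c_i a \sim c_i \cdot a$. But $|x| \ge |c_i a|$ and $|c_i a| \ge |c_i \cdot a|$. Hence, by Condition (i), $x \sim c_i \cdot a$, i.e. $x \sim c_j$.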


References

J.E. Hopcroft and J.D. Ullman, "Introduction to Automata Theory, Languages, and Computation", Addison-Wesley, 1979

A time complexity gap for two-way probabilistic finite-state automata

Minimal cover-automata for finite languages

Running Time to Recognize Nonregular Languages by 2-Way Probabilistic Automata

A time and space efficient algorithm for minimizing cover automata for finite languages
