Q: What is the prefix w of the left-hand side of the sentential form?

The derivation proceeds from right to left; the nonterminals [w] in the left-hand side of sentential forms memorize the prefix w of large enough length to control the derivation in G in the prefix mode.

Q: What is the simplest way to deduce a string?

Because the authors work with one-letter strings, a string x can be derived using a component (Ri, Ci) if (and only if) the shortest string in Ri is contained in x, hence is of length at most |x|.

Q: What is the case for a derivation in G′?

If i ≤ n, then the authors have x ∈ Ri, otherwise the derivation is not allowed: for x = x1x2x3, x2 ∈ Ri, x1x2 ≠ λ, the authors have x1x2x3 ∈ Rn+1; hence the derivation in (Ri, Ci) is not allowed.

Q: What is the simplest way to derive c(ab)^s y?

As u contains occurrences of both a and b, the authors can derive c(ab)^s(ab)^p with arbitrary s (all such strings are in Lmax(G)) in such a way to obtain c(ab)^s y with y containing a substring aa or a substring bb.

Q: What is the simplest way to obtain a string of the form of w′?

To a string of the form of w′ above only the fifth component of G can be applied and again either the derivation must be finished, or it continues only in (R6, C6).

(Open Access) Grammars Based on the Shuffle Operation (1996) | Gheorghe Paun

Q: What are the contributions in this paper?

The authors consider generative mechanisms producing languages by starting from a finite set of words and shuffling the current words with words in given sets, depending on certain conditions.

Q: What are the types of grammars used in formal language theory?

In formal language theory, besides the basic two types of grammars, the Chomsky grammars and the Lindenmayer systems, there are many "exotic" classes of generative devices, based not on the process of rewriting symbols (or strings) by strings, but using various operations of adjoining strings.

Q: What are the types of grammars that are used in formal language theory?

Depending on the place of the string in Ri in the current string generated by their grammar, the authors can distinguish several types of grammars: prefix (the string in Ri appears in the left-hand of the processed string, as a prefix of it), leftmost (we look for the leftmost possible occurrence of a string in Ri), arbitrary (no condition on the place where the string in Ri appears), global (the whole current string is in Ri), and parallel (the current string is portioned into strings in Ri).

Q: Why is shuffle incomparable with Chomsky languages?

As expected, the languages generated by shuffle grammars are mostly incomparable with Chomsky languages (due to the fact that the authors do not use nonterminals).

Journal of Universal Computer Science, vol. 1, no. 1 (1995), 67-82

submitted: 8/11/94, accepted: 10/1/95, appeared: 28/1/95 © Springer Pub. Co.

Grammars Based on the Shuffle Operation

Gheorghe Paun

Institute of Mathematics of the Romanian Academy of Sciences

PO Box 1-764, Bucuresti, Romania

Grzegorz Rozenberg

University of Leiden, Department of Computer Science

Niels Bohrweg 1, 2333 CA Leiden, The Netherlands

Arto Salomaa

Academy of Finland and University of Turku

Department of Mathematics, 20500 Turku, Finland

Abstract:

We consider generative mechanisms producing languages by starting from a finite set of words and shuffling the current words with words in given sets, depending on certain conditions. Namely, regular and finite sets are given for controlling the shuffling: strings are shuffled only to strings in associated sets. Six classes of such grammars are considered, with the shuffling being done on a leftmost position, on a prefix, arbitrarily, globally, in parallel, or using a maximal selector. Most of the corresponding six families of languages, obtained for finite, respectively for regular selection, are found to be incomparable. The relations of these families with Chomsky language families are briefly investigated.

Key Words:

Shuffle operation, Chomsky grammars, L Systems

Categories:

F4.2 [Mathematical Logic and Formal Languages]: Grammars and other Rewriting Systems: Grammar types, F4.3 [Mathematical Logic and Formal Languages]: Formal Languages: Operations on languages

1 Introduction

In formal language theory, besides the basic two types of grammars, the Chomsky grammars and the Lindenmayer systems, there are many "exotic" classes of generative devices, based not on the process of rewriting symbols (or strings) by strings, but using various operations of adjoining strings. We quote here the string adjunct grammars in [7], the semi-contextual grammars in [3], the -grammars in [10], the pattern grammars in [2], and the contextual grammars [8]. The starting point of the present paper is this last class of grammars, via the paper [9], where a variant of contextual grammars was introduced based on the shuffle operation. Basically, one gives two finite sets of strings, $B$ and $C$, over some alphabet $V$, and one considers the set of strings obtained by starting from $B$ and iteratively shuffling strings from $C$, without any restriction. This corresponds to the simple contextual grammars in [8].

We consider here a sort of counterpart of the contextual grammars with choice, where the adjoining is controlled by a selection mapping. In fact, we proceed in a way similar to that used in conditional contextual grammars [12], [13], [14] and in modular contextual grammars [15]: we start with several pairs of the form $(R_i, C_i)$, $R_i$ a language (we consider here only the case when $R_i$ is regular or finite) and $C_i$ a finite set of strings, and allow the strings in $C_i$ to be shuffled only to strings in $R_i$. Depending on the place of the string in $R_i$ in the current string generated by our grammar, we can distinguish several types of grammars: prefix (the string in $R_i$ appears in the left-hand of the processed string, as a prefix of it), leftmost (we look for the leftmost possible occurrence of a string in $R_i$), arbitrary (no condition on the place where the string in $R_i$ appears), global (the whole current string is in $R_i$), and parallel (the current string is portioned into strings in $R_i$). An interesting variant is to use a substring as base for shuffling only when it is maximal. Twelve families of languages are obtained in this way. Their study is the subject of this paper.

It is worth noting that the shuffle operation appears in various contexts in algebra and in formal language theory; we quote only [1], [4], [5], [6] (and [11], for applications). In [1], [6] the operation is used in a generative-like way, for identifying families of languages of the form $FAM(\cup, \cdot, Shuf, FIN)$, the smallest family of languages containing the finite languages and closed under union, concatenation and shuffle. (From this point of view, the family investigated in [9] is a particular case, $FAM(Shuf, FIN)$.)

The shuffle of the symbols of two words is also related to the concurrent execution of two processes described by these words, hence our models can be interpreted in terms of concurrent processes, too. For instance, the prefix mode of work corresponds to the concurrent execution of a process strictly at the beginning of another process, the "beginning" being defined modulo a regular language; in the global case the elementary actions of the two processes can be freely intercalated.

As expected, the languages generated by shuffle grammars are mostly incomparable with Chomsky languages (due to the fact that we do not use nonterminals). Somewhat surprising is the fact that in the regular selection case the five modes of work described above, with only one exception, cannot simulate each other (the corresponding families of languages are incomparable).

2 Classes of shuffle grammars

As usual, $V^*$ denotes the set of all words over the alphabet $V$, the empty word is denoted by $\lambda$, the length of $x \in V^*$ by $|x|$, and the set of non-empty words over $V$ is identified by $V^+$. The number of occurrences of a symbol $a$ in a string $x$ will be denoted by $|x|_a$. For basic notions in formal language theory (Chomsky grammars and L systems) we refer to [16], [17]. We only mention that $FIN, REG, CF, CS, 0L$ are the families of finite, regular, context-free, context-sensitive, and of 0L languages, respectively.

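For $x, y \in V^*$ we define the shuffle (product) of $x, y$, denoted $x \sqcup\!\sqcup y$, as

$$x \sqcup\!\sqcup y = \{x_1 y_1 x_2 y_2 \ldots x_n y_n \mid x = x_1 x_2 \ldots x_n,\ y = y_1 y_2 \ldots y_n,\ x_i, y_i \in V^*,\ 1 \le i \le n,\ n \ge 1\}.$$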
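As a concrete illustration of the definition above, the following minimal Python sketch enumerates the shuffle of two words by deciding, letter by letter, whether the next symbol is taken from the first or from the second word. The function name `shuffle` is an assumption made for this example, not notation from the paper.

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def shuffle(x: str, y: str) -> frozenset:
    """All interleavings of x and y that keep the letter order of each word."""
    if not x or not y:
        # Shuffling with the empty word leaves the other word unchanged.
        return frozenset({x + y})
    take_from_x = {x[0] + w for w in shuffle(x[1:], y)}
    take_from_y = {y[0] + w for w in shuffle(x, y[1:])}
    return frozenset(take_from_x | take_from_y)


print(sorted(shuffle("ab", "c")))  # ['abc', 'acb', 'cab']
```

The recursion makes commutativity visible: swapping the two arguments only swaps the roles of the two branches.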



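Various properties of this operation, such as commutativity and associativity, will be implicitly used in the sequel. The operation $\sqcup\!\sqcup$ is extended in the natural way to languages,

$$L_1 \sqcup\!\sqcup L_2 = \{w \mid w \in x \sqcup\!\sqcup y,\ x \in L_1,\ y \in L_2\},$$

and iterated,

$$\sqcup\!\sqcup^{(0)}(L) = \{\lambda\}, \qquad \sqcup\!\sqcup^{(i+1)}(L) = \sqcup\!\sqcup^{(i)}(L) \sqcup\!\sqcup L,\ i \ge 0, \qquad \sqcup\!\sqcup^{*}(L) = \bigcup_{i \ge 0} \sqcup\!\sqcup^{(i)}(L).$$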
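The extension to languages and the iteration can be sketched in the same way. The code below is an illustrative approximation: since the full iterated shuffle is usually infinite, `iterated_shuffle` only collects the levels $0, \ldots, \texttt{max\_i}$. All function names are assumptions of this example.

```python
def shuffle(x: str, y: str) -> set:
    """Shuffle of two words: all interleavings preserving each word's order."""
    if not x or not y:
        return {x + y}
    return ({x[0] + w for w in shuffle(x[1:], y)}
            | {y[0] + w for w in shuffle(x, y[1:])})


def shuffle_lang(L1: set, L2: set) -> set:
    """Shuffle of two finite languages: union of the shuffles of all pairs."""
    return {w for x in L1 for y in L2 for w in shuffle(x, y)}


def iterated_shuffle(L: set, max_i: int) -> set:
    """Union of the i-fold shuffles of L for 0 <= i <= max_i."""
    level = {""}              # level 0 contains only the empty word
    result = set(level)
    for _ in range(max_i):
        level = shuffle_lang(level, L)   # level i+1 = level i shuffled with L
        result |= level
    return result


print(sorted(iterated_shuffle({"ab"}, 2)))  # ['', 'aabb', 'ab', 'abab']
```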

The grammars considered in [9] are triples of the form $G = (V, B, C)$, where $V$ is an alphabet, $B$ and $C$ are finite languages over $V$. The language generated by $G$ is defined as the smallest language $L$ over $V$ containing $B$ and having the property that if $x \in L$ and $u \in C$, then $x \sqcup\!\sqcup u \subseteq L$. (Therefore, this language is equal to $B \sqcup\!\sqcup \sqcup\!\sqcup^{*}(C)$.)
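A grammar of this simple kind can be explored mechanically. The sketch below is an illustration, not code from [9]: it computes all strings of length at most `max_len` in the smallest language containing `B` and closed under shuffling with the strings of `C`. The function names and the tiny example grammar are assumptions.

```python
def shuffle(x: str, y: str) -> set:
    """Shuffle of two words: all interleavings preserving each word's order."""
    if not x or not y:
        return {x + y}
    return ({x[0] + w for w in shuffle(x[1:], y)}
            | {y[0] + w for w in shuffle(x, y[1:])})


def simple_shuffle_language(B: set, C: set, max_len: int) -> set:
    """Strings of length <= max_len generated from B by shuffling with C."""
    seen = {w for w in B if len(w) <= max_len}
    frontier = list(seen)
    while frontier:
        x = frontier.pop()
        for u in C:
            # The empty word adds nothing; longer results are cut off.
            if u == "" or len(x) + len(u) > max_len:
                continue
            for y in shuffle(x, u):
                if y not in seen:
                    seen.add(y)
                    frontier.append(y)
    return seen


words = simple_shuffle_language({"a"}, {"b"}, 3)
print(sorted(words, key=lambda w: (len(w), w)))
# ['a', 'ab', 'ba', 'abb', 'bab', 'bba']
```

Because a shuffle never shortens a string, cutting the search at `max_len` loses no string below the bound.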

There is no restriction in [9] about the shuffling of elements of $C$ to current strings. Such a control of the grammar work can be done in various ways of introducing a context-dependency. We use here the following natural idea:

Definition 1. A shuffle grammar is a construct

$$G = (V, B, (R_1, C_1), \ldots, (R_n, C_n)),$$

where $V$ is an alphabet, $B$ is a finite language over $V$, $R_i$ are languages over $V$ and $C_i$ are finite languages over $V$, $1 \le i \le n$.

The parameter $n \ge 1$ is called the degree of $G$. If $R_i$ are languages in a given family $F$, then we say that $G$ is with $F$ choice. Here we consider only the cases $F = FIN$ and $F = REG$.

The idea is to allow the strings in $C_i$ to be shuffled only to strings in the corresponding set $R_i$. The sets $R_i$ are called selectors.

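Definition 2. For a shuffle grammar $G$ as above, a constant $i$, $1 \le i \le n$, and two strings $x, y \in V^*$ we define the following derivation relations:

$$x \Rightarrow_i^{arb} y \ \text{iff}\ x = x_1 x_2 x_3,\ y = x_1 x'_2 x_3,\ \text{for some } x_2 \in R_i,\ x'_2 \in x_2 \sqcup\!\sqcup u,\ u \in C_i,$$

$$x \Rightarrow_i^{pr} y \ \text{iff}\ x = x_1 x_2,\ y = x'_1 x_2,\ \text{for some } x_1 \in R_i,\ x'_1 \in x_1 \sqcup\!\sqcup u,\ u \in C_i,$$

$$x \Rightarrow_i^{lm} y \ \text{iff}\ x = x_1 x_2 x_3,\ y = x_1 x'_2 x_3,\ \text{for some } x_2 \in R_i,\ x'_2 \in x_2 \sqcup\!\sqcup u,\ u \in C_i,\ \text{and there is no } j,\ 1 \le j \le n,\ \text{such that } x = z_1 z_2 z_3,\ |z_1| < |x_1|,\ z_2 \in R_j,$$

$$x \Rightarrow_i^{gl} y \ \text{iff}\ x \in R_i,\ y \in x \sqcup\!\sqcup u,\ \text{for some } u \in C_i.$$

These derivation relations are called arbitrary, prefix, leftmost, and global derivations, respectively. Moreover, we define the parallel derivation as

$$x \Rightarrow^{pl} y \ \text{iff}\ x = x_1 x_2 \ldots x_k,\ y = x'_1 x'_2 \ldots x'_k,\ \text{for } x_j \in R_{i_j},\ x'_j \in x_j \sqcup\!\sqcup u_j,\ u_j \in C_{i_j},\ 1 \le i_j \le n,\ 1 \le j \le k,\ k \ge 1.$$

We denote $D = \{arb, pr, lm, gl, pl\}$.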
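The four sequential derivation relations can be sketched for finite selectors as follows. This is a minimal illustration under stated assumptions: selectors and contexts are given as Python sets of strings, the function names are invented for the example, and the leftmost mode is read as "no string of any selector occurs starting further to the left".

```python
def shuffle(x: str, y: str) -> set:
    """Shuffle of two words: all interleavings preserving each word's order."""
    if not x or not y:
        return {x + y}
    return ({x[0] + w for w in shuffle(x[1:], y)}
            | {y[0] + w for w in shuffle(x, y[1:])})


def occurrences(x: str, R: set) -> list:
    """All (start, end) positions such that x[start:end] belongs to R."""
    n = len(x)
    return [(i, j) for i in range(n + 1) for j in range(i, n + 1) if x[i:j] in R]


def rewrite(x: str, i: int, j: int, C: set) -> set:
    """Shuffle the factor x[i:j] with every u in C; the rest of x is untouched."""
    return {x[:i] + z + x[j:] for u in C for z in shuffle(x[i:j], u)}


def step_arb(x, R, C):
    """Arbitrary mode: any occurrence of a selector string may be used."""
    return {y for (i, j) in occurrences(x, R) for y in rewrite(x, i, j, C)}


def step_pr(x, R, C):
    """Prefix mode: the selector string must be a prefix of x."""
    return {y for (i, j) in occurrences(x, R) if i == 0 for y in rewrite(x, i, j, C)}


def step_lm(x, R, C, selectors):
    """Leftmost mode (assumed reading): use only occurrences starting at the
    leftmost position where a string of any selector occurs."""
    starts = [i for S in selectors for (i, _) in occurrences(x, S)]
    if not starts:
        return set()
    first = min(starts)
    return {y for (i, j) in occurrences(x, R) if i == first for y in rewrite(x, i, j, C)}


def step_gl(x, R, C):
    """Global mode: the whole current string must belong to the selector."""
    return rewrite(x, 0, len(x), C) if x in R else set()


R, C, x = {"b"}, {"c"}, "abab"
print(sorted(step_arb(x, R, C)))       # ['ababc', 'abacb', 'abcab', 'acbab']
print(sorted(step_pr(x, R, C)))        # []
print(sorted(step_lm(x, R, C, [R])))   # ['abcab', 'acbab']
print(sorted(step_gl("b", R, C)))      # ['bc', 'cb']
```

The example string is chosen so that the modes differ: no prefix of `abab` is in the selector, the leftmost occurrence of `b` is the first one, and the arbitrary mode may use either occurrence.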

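Definition 3. The language generated by a shuffle grammar $G$ in the mode $f \in \{arb, pr, lm, gl\}$ is defined as follows:

$$L_f(G) = \{x \in V^* \mid w \Rightarrow_{i_1}^{f} w_1 \Rightarrow_{i_2}^{f} \ldots \Rightarrow_{i_m}^{f} w_m = x,\ w \in B,\ 1 \le i_j \le n,\ 1 \le j \le m,\ m \ge 0\}.$$

For the parallel mode of derivation we define

$$L_{pl}(G) = \{x \in V^* \mid w \Rightarrow^{pl} w_1 \Rightarrow^{pl} \ldots \Rightarrow^{pl} w_m = x,\ w \in B,\ m \ge 0\}.$$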
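The generated language can be approximated by a closure computation over the axioms. The sketch below illustrates this for the global mode with regular selectors, which are represented here as Python regular expressions matched with `re.fullmatch`. The grammar used in the example and all function names are hypothetical choices made for the illustration.

```python
import re


def shuffle(x: str, y: str) -> set:
    """Shuffle of two words: all interleavings preserving each word's order."""
    if not x or not y:
        return {x + y}
    return ({x[0] + w for w in shuffle(x[1:], y)}
            | {y[0] + w for w in shuffle(x, y[1:])})


def step_gl(x: str, selector_regex: str, C: set) -> set:
    """Global step: x as a whole must match the regular selector."""
    if re.fullmatch(selector_regex, x) is None:
        return set()
    return {y for u in C for y in shuffle(x, u)}


def language(B: set, components: list, step, max_len: int) -> set:
    """All strings of length <= max_len derivable from the axioms in B."""
    seen = {w for w in B if len(w) <= max_len}
    frontier = list(seen)
    while frontier:
        x = frontier.pop()
        for R, C in components:
            for y in step(x, R, C):
                if len(y) <= max_len and y not in seen:
                    seen.add(y)
                    frontier.append(y)
    return seen


# Hypothetical grammar: axioms {a, b}, components (a+, {a}) and (b+, {b}).
components = [(r"a+", {"a"}), (r"b+", {"b"})]
print(sorted(language({"a", "b"}, components, step_gl, 3)))
# ['a', 'aa', 'aaa', 'b', 'bb', 'bbb']
```

Passing the step function as a parameter keeps the closure loop independent of the derivation mode, so another sequential mode can be plugged in without changing `language`.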



The corresponding families of languages generated by shuffle grammars with $F$ choice, $F \in \{FIN, REG\}$, are denoted by $ARB(F), PR(F), LM(F), GL(F), PL(F)$, respectively; by $SL$ we denote the family of languages generated by grammars as in [9], without choice. Subscripts $n$ can be added to $ARB, PR, LM, GL, PL$ when only languages generated by grammars of degree at most $n$, $n \ge 1$, are considered.

3 The generative capacity of shuffle grammars

From the definitions we have the inclusions $X_n(F) \subseteq X_{n+1}(F)$, $X(FIN) \subseteq X(REG)$, for all $X \in \{ARB, PR, LM, GL, PL\}$, $n \ge 1$, and $F \in \{FIN, REG\}$.

Every family $ARB(FIN), PR(FIN), LM(FIN), GL(FIN), PL(FIN)$ contains each finite language.

This is obvious, because for $G = (V, B, (\emptyset, \emptyset))$ we have $L_f(G) = B$ for all $f \in D$. In fact, we have

Theorem 1.

(i) Every family $ARB(REG), PR(REG), LM(REG), GL(REG), PL(REG)$ includes strictly the family $SL$.

(ii) The family $SL$ is incomparable with each $X(FIN)$, $X \in \{ARB, PR, LM, PL\}$.

(iii) $GL_1(FIN) = GL(FIN) = FIN$.

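Proof. (i) If $G = (V, B, C)$ is a simple shuffle grammar, then $L(G) = L_f(G')$ for each $f \in D$ and $G' = (V, B, (V^*, C))$. (The only point which needs some discussion is the fact that a parallel derivation $x \Rightarrow^{pl} y$ in $G'$, with $x = x_1 \ldots x_k$ and $y = x'_1 \ldots x'_k$, $k \ge 2$, can be simulated by a $k$-step derivation in $G$, because the strings shuffled into $x_1, \ldots, x_k$ do not overlap each other.) Consequently, $SL \subseteq X(REG)$, for all $X$.

The inclusion is proper in view of the following necessary condition for a language to be in $SL$ (Lemma 10 in [9]): if $L \in SL$, $L \subseteq V^*$, and for each $a \in V$ and for every $n \ge 1$ there is $x \in L$ with $|x|_a \ge n$, then each string in $V^*$ is a subword of a string in $L$. This condition rejects languages such as $L = a^+ \cup b^+$; this language can be generated by a shuffle grammar with finite choice of the form $G = (\{a, b\}, \{a, b\}, (R_1, C_1), (R_2, C_2))$, with one component for each of the letters $a$ and $b$, in all modes of derivation excepting the global one. For the global mode we take a grammar of the same form, $G' = (\{a, b\}, \{a, b\}, (R'_1, C'_1), (R'_2, C'_2))$, with regular selectors.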
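The parallel mode can be checked on a small example of this kind. The sketch below uses one possible choice of finite components, $(\{a\}, \{\lambda, a\})$ and $(\{b\}, \{\lambda, b\})$. This choice is an assumption of the example, not necessarily the grammar intended in the text. It verifies, up to a length bound, that the parallel mode generates exactly the non-empty powers of $a$ and of $b$. Portions are assumed to be non-empty.

```python
def shuffle(x: str, y: str) -> set:
    """Shuffle of two words: all interleavings preserving each word's order."""
    if not x or not y:
        return {x + y}
    return ({x[0] + w for w in shuffle(x[1:], y)}
            | {y[0] + w for w in shuffle(x, y[1:])})


def step_pl(x: str, components: list) -> set:
    """One parallel step: cut x into non-empty portions, each belonging to some
    selector R_k, and shuffle every portion with a string of the matching C_k."""
    n = len(x)
    prefix_results = [set() for _ in range(n + 1)]
    prefix_results[0] = {""}
    for j in range(1, n + 1):
        for i in range(j):
            if not prefix_results[i]:
                continue                      # x[:i] cannot be portioned
            for R, C in components:
                if x[i:j] in R:
                    tails = {z for u in C for z in shuffle(x[i:j], u)}
                    prefix_results[j] |= {p + z for p in prefix_results[i] for z in tails}
    return prefix_results[n]


def language_pl(B: set, components: list, max_len: int) -> set:
    """Strings of length <= max_len derivable from B in the parallel mode."""
    seen = {w for w in B if len(w) <= max_len}
    frontier = list(seen)
    while frontier:
        x = frontier.pop()
        for y in step_pl(x, components):
            if len(y) <= max_len and y not in seen:
                seen.add(y)
                frontier.append(y)
    return seen


# Assumed components: every letter may stay as it is or be doubled.
components = [({"a"}, {"", "a"}), ({"b"}, {"", "b"})]
generated = language_pl({"a", "b"}, components, 6)
expected = {letter * k for letter in "ab" for k in range(1, 7)}
print(generated == expected)  # True
```

Including the empty word in each context set matters here: with contexts $\{a\}$ and $\{b\}$ alone, every parallel step would double the string, and only lengths that are powers of two would be reached.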

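(ii) We have already seen that $X(FIN) - SL \neq \emptyset$, for $X \in \{ARB, PR, LM, PL\}$. Consider now the simple shuffle grammar $G = (\{a, b, c\}, \{abc\}, \{abc\})$. We have $L(G) \cap a^* b^* c^* = \{a^n b^n c^n \mid n \ge 1\}$.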
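The shape of the strings of this simple grammar that lie in $a^* b^* c^*$ can be confirmed experimentally for small lengths. The following sketch (function names are assumptions of the example) enumerates all strings of length at most 9 obtained from the axiom `abc` by repeatedly shuffling in `abc`, and keeps those of the form $a^* b^* c^*$.

```python
import re


def shuffle(x: str, y: str) -> set:
    """Shuffle of two words: all interleavings preserving each word's order."""
    if not x or not y:
        return {x + y}
    return ({x[0] + w for w in shuffle(x[1:], y)}
            | {y[0] + w for w in shuffle(x, y[1:])})


def simple_shuffle_language(B: set, C: set, max_len: int) -> set:
    """Strings of length <= max_len generated from B by shuffling with C."""
    seen = {w for w in B if len(w) <= max_len}
    frontier = list(seen)
    while frontier:
        x = frontier.pop()
        for u in C:
            if u == "" or len(x) + len(u) > max_len:
                continue
            for y in shuffle(x, u):
                if y not in seen:
                    seen.add(y)
                    frontier.append(y)
    return seen


L = simple_shuffle_language({"abc"}, {"abc"}, 9)
ordered = sorted((w for w in L if re.fullmatch(r"a*b*c*", w)), key=len)
print(ordered)  # ['abc', 'aabbcc', 'aaabbbccc']
```

Every shuffle step adds exactly one $a$, one $b$ and one $c$, which is why only strings with equal numbers of the three letters survive the filter.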



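Assume that $L(G) = L_f(G')$ for some shuffle grammar $G' = (\{a, b, c\}, B, (R_1, C_1), \ldots, (R_n, C_n))$ with finite $R_1, \ldots, R_n$ and $f \in \{arb, pr, lm\}$. Take a string $a^m b^m c^m \in L(G)$ with arbitrarily large $m$. A derivation $a^p b^p c^p \Rightarrow_i^f a^m b^m c^m$ must be possible in $G'$, with $p < m$, for some $1 \le i \le n$. We must have $a^p b^p c^p = x_1 x_2 x_3$, and $a^m b^m c^m = x_1 x'_2 x_3$ for $x_2 \in R_i$, $x'_2 \in x_2 \sqcup\!\sqcup u$, $u \in C_i$. If $u = \lambda$, this derivation step can be omitted, hence we may assume that $u \neq \lambda$.

The sets $R_i, C_i$ are finite. Denote

$$r = \max\{|x| \mid x \in R_i,\ 1 \le i \le n\}, \qquad q = \max\{|u| \mid u \in C_i,\ 1 \le i \le n\}.$$

Therefore $|a^m b^m c^m| - |a^p b^p c^p| \le q$, that is $m - p \le q$. For $m > r + q$ we have $p \ge m - q > r$, hence either $x_2 \in sub(a^p b^p)$ or $x_2 \in sub(b^p c^p)$. In both cases, the shuffling with $u$ modifies at most two of the three subwords $a^p, b^p, c^p$, hence a parasitic string is obtained. (In the prefix derivation case we precisely know that only $a^p$ is modified.)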

Consider now the parallel case. Take $G' = (\{a, b, c\}, B, (R_1, C_1), \ldots, (R_n, C_n))$ such that $L_{pl}(G') = L(G)$, with $R_i$ finite sets. For obtaining a string $a^m b^m c^m$ with large enough $m$ we need a derivation $a^p b^p c^p \Rightarrow^{pl} a^m b^m c^m$, $p < m$. This implies we have sets $R_i$ containing strings $a^j$, $j \ge 1$, and with the corresponding sets $C_i$ containing strings $a^k$, $k \ge 1$ (similarly for $b$ and $c$).

Assume that we find in the sets $R_i$ two strings $a^{r_1}, a^{r_2}$ and in the associated sets $C_i$ we find two strings $a^{s_1}, a^{s_2}$. Similarly, we have pairs $(b^{r_3}, b^{s_3})$ and $(c^{r_4}, c^{s_4})$. The string $a^t b^t c^t$ for $t = r_1 r_2 r_3 r_4$ is in $L(G)$ and it can be rewritten using the same pairs of strings containing the symbols $b$ and $c$ and different pairs for $a$:

$$a^t b^t c^t \Rightarrow^{pl} a^{(t/r_1)(r_1 + s_1)} b^s c^s, \qquad a^t b^t c^t \Rightarrow^{pl} a^{(t/r_2)(r_2 + s_2)} b^s c^s, \qquad \text{for some } s.$$

Consequently, we must have $(t/r_1)(r_1 + s_1) = (t/r_2)(r_2 + s_2)$, which implies $s_1 / r_1 = s_2 / r_2$.

For all such pairs $(a^r, a^s)$ we obtain the same value for $s/r$. Denote it by $\alpha$. Rewriting some $a^m b^m c^m$ using such pairs we obtain $m(1 + \alpha)$ occurrences of $a$. Continuing $k \ge 1$ steps, we get $m(1 + \alpha)^k$ occurrences of $a$, hence a geometrical progression, starting at $m$.

Symbols $a$ can be also introduced in a derivation $a^p b^p c^p \Rightarrow^{pl} a^m b^m c^m$ by using pairs $(z, u)$ with $z$ containing occurrences of both $a$ and $b$. At most one such pair can be used in a derivation step. Let $h$ be the largest number of symbols $a$ in strings $u$ as above (the number of such strings is finite). Thus, in a derivation $a^p b^p c^p \Rightarrow^{pl} a^m b^m c^m$, the number $m$ can be modified by such pairs in an interval $[m_1, m_2]$ with $m_2 - m_1 \le h$.

We start from at most $g = card(B)$ strings of the form $a^m b^m c^m$, hence we have at most $g$ geometrical progressions $m(1 + \alpha)^k$, $k \ge 1$. The difference $m(1 + \alpha)^{k+1} - m(1 + \alpha)^k$ can be arbitrarily large. When it exceeds $gh$, the at most $g$ progressions can have an element between $m(1 + \alpha)^k$ and $m(1 + \alpha)^{k+1}$; each such element can have at most $h$ values. Consequently, at least one natural number $t$ between $m(1 + \alpha)^k$ and $m(1 + \alpha)^{k+1}$ is not reached, the corresponding string $a^t b^t c^t$, although in $L(G)$, is not in $L_{pl}(G')$. This contradiction concludes the proof of point (ii).

(iii) As only strings in the sets $R_i$, $1 \le i \le n$, of a grammar $G = (V, B, (R_1, C_1), \ldots, (R_n, C_n))$ can be derived, we have $GL(FIN) \subseteq FIN$. The inclusion $FIN \subseteq GL_1(FIN)$ has been pointed out at the beginning of this section. □

Corollary. The families $X(REG)$, $X \in \{ARB, LM, PR, PL, GL\}$, contain non-context-free languages.

For $PL(FIN)$, $PL(REG)$ and $GL(REG)$ this assertion can be strengthened.

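Theorem 2. Every propagating unary 0L language belongs to the family $PL(FIN)$.

Proof. For a unary 0L system $G = (\{a\}, w, P)$ with $w \in a^*$ and $P = \{a \to a^{i_1}, \ldots, a \to a^{i_r}\}$, with $i_j \ge 1$, $1 \le j \le r$, we construct the shuffle grammar

$$G' = (\{a\}, \{w\}, (\{a\}, \{a^{i_1 - 1}, \ldots, a^{i_r - 1}\})).$$

It is easy to see that $L(G) = L_{pl}(G')$. □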
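The construction can be checked numerically on small instances. In the sketch below a string $a^n$ is represented by its length $n$; `ol_step` performs one step of the 0L system, and `pl_step` performs one parallel shuffle step for a single component given by the lengths of its selector and context strings. The representation by lengths, the bound, and all names are assumptions of this illustration.

```python
def ol_step(n: int, rhs_lengths: set, bound: int) -> set:
    """Lengths reachable in one 0L step from a^n: each a becomes some a^i."""
    sums = {0}
    for _ in range(n):
        sums = {s + i for s in sums for i in rhs_lengths if s + i <= bound}
    return sums


def pl_step(n: int, selector_lengths: set, context_lengths: set, bound: int) -> set:
    """Lengths reachable in one parallel shuffle step from a^n: a^n is cut into
    non-empty portions a^r (r a selector length), each extended by some a^c."""
    reach = [set() for _ in range(n + 1)]
    reach[0] = {0}
    for j in range(1, n + 1):
        for r in selector_lengths:
            if 1 <= r <= j:
                reach[j] |= {s + r + c for s in reach[j - r]
                             for c in context_lengths if s + r + c <= bound}
    return reach[n]


def closure(start: int, step) -> set:
    """All lengths reachable from the axiom length by iterating `step`."""
    seen, frontier = {start}, [start]
    while frontier:
        n = frontier.pop()
        for m in step(n):
            if m not in seen:
                seen.add(m)
                frontier.append(m)
    return seen


def compare(axiom_len: int, rhs_lengths: set, bound: int):
    """0L language versus the language of the constructed shuffle grammar."""
    ol = closure(axiom_len, lambda n: ol_step(n, rhs_lengths, bound))
    contexts = {i - 1 for i in rhs_lengths}        # a -> a^i becomes context a^(i-1)
    pl = closure(axiom_len, lambda n: pl_step(n, {1}, contexts, bound))
    return ol, pl


ol, pl = compare(1, {2}, 40)
print(sorted(ol), ol == pl)  # [1, 2, 4, 8, 16, 32] True
```

With the single production $a \to aa$ the lengths are the powers of two, a one-letter language that is not regular.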

This implies that $PL(FIN)$, $PL(REG)$ contain one-letter non-regular languages. This is not true for the other modes of derivation.

Theorem 3. A one-letter language is in $ARB(F)$, $PR(F)$, $LM(F)$, $F \in \{FIN, REG\}$, or in $GL(REG)$ if and only if it is regular.

Proof. Take a shuffle grammar $G = (\{a\}, B, (R_1, C_1), \ldots, (R_n, C_n))$ with regular sets $R_i$, $1 \le i \le n$. Clearly, $L_{pr}(G) = L_{lm}(G) = L_{arb}(G)$. Because we work with one-letter strings, a string $x$ can be derived using a component $(R_i, C_i)$ if (and only if) the shortest string in $R_i$ is contained in $x$, hence is of length at most $|x|$. Therefore we can replace each $R_i$ by $\{a^{k_i}\}$, where $k_i = \min\{|x| \mid x \in R_i\}$, $1 \le i \le n$, without modifying the generated language. Denote

= max
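The observation that, over a one-letter alphabet, only the shortest string of each selector matters can be illustrated by a small computation. In this sketch a string $a^n$ is represented by its length, selectors are given as finite sets of lengths (a simplifying assumption, since regular selectors may be infinite), and the arbitrary mode is used. All names and the sample components are hypothetical.

```python
def unary_language(axiom_lengths: set, components: list, bound: int) -> set:
    """Lengths (up to `bound`) of the one-letter strings derivable in the
    arbitrary mode; components are pairs (selector_lengths, context_lengths)."""
    seen = {n for n in axiom_lengths if n <= bound}
    frontier = list(seen)
    while frontier:
        n = frontier.pop()
        for R, C in components:
            # a^n contains a string of the selector iff some length in R is <= n.
            if any(r <= n for r in R):
                for c in C:
                    m = n + c
                    if m <= bound and m not in seen:
                        seen.add(m)
                        frontier.append(m)
    return seen


def shortest_only(components: list) -> list:
    """Replace every selector by its shortest string."""
    return [({min(R)}, C) for R, C in components]


components = [({3, 5, 8}, {2}), ({6, 7}, {5})]
full = unary_language({1, 4}, components, 60)
reduced = unary_language({1, 4}, shortest_only(components), 60)
print(full == reduced)  # True
```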




References

Formal Languages

Ordering by Divisibility in Abstract Algebras

Mathematical Theory of L Systems

The mathematical theory of L systems

Developments in Language Theory II
