What is the known complexity for the resulting logic?

The best known complexity for the resulting logic is obtained through reduction to the emptiness problem of alternating tree automata, which is in $2^{O(n^4 \cdot \log n)}$, where n is the length of a formula [18].

What are the contributions in "Efficient static analysis of xml paths and types" ?

The authors present an algorithm to solve XPath decision problems under regular tree type constraints and show its use to statically typecheck XPath queries. To this end, the authors prove the decidability of a logic with converse for finite ordered trees whose time complexity is a simple exponential of the size of a formula. Building on these results, the authors describe a practical, effective system for solving the satisfiability of a formula.

What have the authors stated for future works in "Efficient static analysis of xml paths and types" ?

Finally, there are a number of interesting directions for further research that build on ideas developed here: extending XPath to restricted data value comparisons that preserve this complexity, for instance data values on a finite domain, and integrating related work on counting [8] to their logic. The authors also plan on continuing to improve the performance of their implementation.

What is the expressive decidable logic used in XHTML?

The weak monadic second-order logic of two successors (WS2S) [9] is one of the most expressive decidable logics used when both regular types and queries [2] are under consideration.

What is the reason why the lemma holds?

As the tree and the number of subformulas are finite, the satisfaction derivation is finite, hence only a finite number of unfoldings is necessary to prove that the tree satisfies the formula, which is what the lemma states.

What is the function that stops the selection of a node?

To preserve semantics, the translation of p[q] stops the “selecting navigation” to those nodes reached by p, then filters them depending on whether q holds or not.

What is the meaning of cycle free formulas?

Given a tree, if a formula ϕ is cycle free, then every node of the tree will be tested a finite number of times against any given subformula of ϕ.

What is the main result of the paper?

The main result of their paper is a sound and complete algorithm for the satisfiability of decision problems involving regular tree types and XPath queries with a tighter $2^{O(n)}$ complexity in the length of a formula.

What is the translation function for XPath?

The translation function, noted “$A^{\to}[\![a]\!]_{\chi}$”, takes an XPath axis a as input and returns its Lµ translation, parameterized by the Lµ formula χ.

What formalisms exist for describing types of XML documents?

The following hold for an XPath expression e and an Lµ formula ϕ denoting a set of focused trees, with $\psi = E^{\to}[\![e]\!]_{\varphi}$: (1) $[\![\psi]\!]_{\emptyset} = S_e[\![e]\!]_{[\![\varphi]\!]_{\emptyset}}$; (2) ψ is cycle-free; (3) the size of ψ is linear in the size of e and ϕ.

Several formalisms exist for describing types of XML documents (e.g. DTD, XML Schema, Relax NG).

How do the authors add a triple to a type?

The algorithm proceeds in a bottom-up approach, repeatedly adding new triples until a satisfying model is found (i.e. a triple whose first component is a type implying the formula), or until no more triples can be added.

What is the function that checks whether a formula is produced?

To check a formula ϕ, their algorithm builds satisfiable formulas out of some subformulas (and their negation) of ϕ, then checks whether ϕ was produced.

What are the main directions for further research?

there are a number of interesting directions for further research that build on ideas developed here: extending XPath to restricted data values comparisons that preserves this complexity, for instance data values on a finite domain, and integrating related work on counting [8] to their logic.

What is the important factor in the analysis of XHTML?

For the XHTML case, the authors observe that the time needed is more important, but it remains practically relevant, especially for static analysis operations performed only at compile-time.

What did previous work show about the difficulty of XPath decision problems?

Previous works [28, 3] showed that including general comparisons of data values from an infinite domain may lead to undecidability.

What type of tests can be used to check the XPath equival?

The tests use the XPath expressions shown in Fig. 12 (where “//” is used as a shorthand for “/desc-or-self::*/”) and the XML types shown in Table 1.

What is the main difference between the two approaches?

The approach only deals with emptiness of XPath expressions without reverse axes, whereas their approach solves the more general problem of containment, including reverse axes.

(Open Access) Efficient static analysis of XML paths and types (2007) | Pierre Genevès

Q: What is the essence of the results?

The essence of their results lives in a sub-logic of the alternation free modal µ-calculus (AFMC) with converse, some syntactic restrictions on formulas, without greatest fixpoint, and whose models are finite trees.

HAL Id: hal-00189123

https://hal.archives-ouvertes.fr/hal-00189123

Submitted on 20 Nov 2007


Ecient Static Analysis of XML Paths and Types

Pierre Genevès, Nabil Layaïda, Alan Schmitt

To cite this version:

Pierre Genevès, Nabil Layaïda, Alan Schmitt. Ecient Static Analysis of XML Paths and Types.

Proceedings of the 2007 ACM SIGPLAN conference on Programming language design and implemen-

tation, Jun 2007, San Diego, United States. pp.342–351, �10.1145/1250734.1250773�. �hal-00189123�

Efficient Static Analysis of XML Paths and Types

Pierre Genevès
École Polytechnique Fédérale de Lausanne ∗
pierre.geneves@epfl.ch

Nabil Layaïda, Alan Schmitt
INRIA Rhône-Alpes
{nabil.layaida, alan.schmitt}@inria.fr

Abstract

We present an algorithm to solve XPath decision problems under regular tree type constraints and show its use to statically type-check XPath queries. To this end, we prove the decidability of a logic with converse for finite ordered trees whose time complexity is a simple exponential of the size of a formula. The logic corresponds to the alternation free modal µ-calculus without greatest fixpoint, restricted to finite trees, and where formulas are cycle-free.

Our proof method is based on two auxiliary results. First, XML regular tree types and XPath expressions have a linear translation to cycle-free formulas. Second, the least and greatest fixpoints are equivalent for finite trees, hence the logic is closed under negation.

Building on these results, we describe a practical, effective system for solving the satisfiability of a formula. The system has been experimented with some decision problems such as XPath emptiness, containment, overlap, and coverage, with or without type constraints. The benefit of the approach is that our system can be effectively used in static analyzers for programming languages manipulating both XPath expressions and XML type annotations (as input and output types).

Categories and Subject Descriptors E.1 [Data Structures]: Trees; F.4.1 [Mathematical Logic and Formal Languages]: Mathematical Logic - modal logic; F.4.3 [Mathematical Logic and Formal Languages]: Formal Languages - decision problems; H.2.1 [Database Management]: Logical Design; H.2.3 [Database Management]: Languages - Query Languages

General Terms Algorithms, languages, theory, verification

Keywords Modal logic, satisfiability, type checking, XPath

∗ Major part of this work was done when the author was at INRIA Rhône-Alpes.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee.

PLDI’07 June 11–13, 2007, San Diego, California, USA.

Copyright © 2007 ACM 978-1-59593-633-2/07/0006...$5.00

1. Introduction

This work is motivated by the need for efficient type checkers for XML-based programming languages where XML types and XPath queries are used as first-class language constructs. In such settings, XPath decision problems in the presence of XML types such as DTDs or XML Schemas arise naturally. Examples of such decision problems include the emptiness test (whether an expression ever selects nodes), containment (whether the results of an expression are always included in the results of another one), overlap (whether two expressions select common nodes), and coverage (whether nodes selected by an expression are always contained in the union of the results selected by several other expressions).

XPath decision problems are not trivial in that they need to be checked on a possibly infinite quantification over a set of trees. Another difficulty arises from the combination of upward and downward navigation on trees with recursion [31].

The most basic decision problem for XPath is the emptiness test of an expression [3]. This test is important for the optimization of host language implementations: for instance, if one can decide at compile time that a query result is empty, then subsequent bound computations can be ignored. Another basic decision problem is the XPath equivalence problem: whether or not two queries always return the same result. It is important for reformulation and optimization of an expression [17], which aim at enforcing operational properties while preserving semantic equivalence [23]. The most essential problem for type-checking is XPath containment. It is required for the control-flow analysis of XSLT [25], for checking integrity constraints, and for XML security [12].

The complexity of XPath decision problems heavily depends on the language features. Previous works [28, 3] showed that including general comparisons of data values from an infinite domain may lead to undecidability. Therefore, we focus on an XPath fragment which covers all features except counting [8] and data values.

In our approach to solving XPath decision problems, two issues need to be addressed. First, we identify the most appropriate logic with sufficient expressiveness to capture both regular tree types and our XPath fragment. Second, we efficiently solve the satisfiability problem, which allows testing whether a given formula of the logic admits a satisfying finite tree.

The essence of our results lives in a sub-logic of the alternation free modal µ-calculus (AFMC) with converse, some syntactic restrictions on formulas, without greatest fixpoint, and whose models are finite trees. We prove that XPath expressions and regular tree type formulas conform to these syntactic restrictions. Boolean closure is the key property for solving the containment (a logical implication). In order to obtain closure under negation, we prove that the least and greatest fixpoint operators collapse in a single fixpoint operator. Surprisingly, the translations of XML regular tree types and a large XPath fragment do not increase complexity since they are linear in the size of the corresponding formulas in the logic. The combination of these ingredients leads to our main result: a satisfiability algorithm for a logic for finite trees whose time complexity is a simple exponential of the size of a formula.

The decision procedure has been implemented in a system for solving XML decision problems such as XPath emptiness, containment, overlap, and coverage, with or without XML type constraints. The system can be used as a component of static analyzers for programming languages manipulating XPath expressions and XML type annotations for both input and output.

2. Outline

The paper is organized as follows. We first present our data model, trees with focus, and our logic in §3 and §4. We next present XPath and its translation in our logic in §5. Our satisfiability algorithm is introduced and proven correct in §6, and a few details of the implementation are discussed in §7. Applications for type checking and some experimental results are described in §8. We study related work in §9 and conclude in §10.

Detailed proofs and implementation techniques can be found in a long version of this paper [16].

3. Trees with Focus

In order to represent XML trees that are easy to navigate we use focused trees, inspired by Huet’s Zipper data structure [20]. Focused trees not only describe a tree but also its context: its previous siblings and its parent, including its parent context recursively. Exploring such a structure has the advantage of preserving all information, which is quite useful when considering languages such as XPath that allow forward and backward axes of navigation.

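Formally, we assume an alphabet Σ of labels, ranged over by σ.

\[
\begin{array}{rcll}
t & ::= & \sigma[tl] & \text{tree} \\
tl & ::= & & \text{list of trees} \\
 & & \epsilon & \text{empty list} \\
 & \mid & t :: tl & \text{cons cell} \\
c & ::= & & \text{context} \\
 & & (tl, \mathit{Top}, tl) & \text{root of the tree} \\
 & \mid & (tl, c[\sigma], tl) & \text{context node} \\
f & ::= & (t, c) & \text{focused tree}
\end{array}
\]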

In order to deal with decision problems such as containment, we need to represent in a focused tree the place where the evaluation was started using a start mark, often simply called “mark” in the following. To do so, we consider focused trees where a single tree or a single context node is marked, as in $\sigma^{s}[tl]$ or $(tl, c[\sigma^{s}], tl)$. When the presence of the mark is unknown, we write it as $\sigma^{\circ}[tl]$. We write $\mathcal{F}$ for the set of finite focused trees with a single mark. The name of a focused tree is defined as $nm(\sigma^{\circ}[tl], c) = \sigma$.

We now describe how to navigate focused trees, in binary style. There are four directions that can be followed: for a focused tree f, $f\langle 1 \rangle$ changes the focus to the first child of the current tree, $f\langle 2 \rangle$ changes the focus to the next sibling of the current tree, $f\langle \bar{1} \rangle$ changes the focus to the parent of the tree if the current tree is a leftmost sibling, and $f\langle \bar{2} \rangle$ changes the focus to the previous sibling.

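Formally, we have:

\[
\begin{aligned}
(\sigma^{\circ}[t :: tl], c)\,\langle 1 \rangle &\overset{\text{def}}{=} (t, (\epsilon, c[\sigma^{\circ}], tl)) \\
(t, (tl_l, c[\sigma^{\circ}], t' :: tl_r))\,\langle 2 \rangle &\overset{\text{def}}{=} (t', (t :: tl_l, c[\sigma^{\circ}], tl_r)) \\
(t, (\epsilon, c[\sigma^{\circ}], tl))\,\langle \bar{1} \rangle &\overset{\text{def}}{=} (\sigma^{\circ}[t :: tl], c) \\
(t', (t :: tl_l, c[\sigma^{\circ}], tl_r))\,\langle \bar{2} \rangle &\overset{\text{def}}{=} (t, (tl_l, c[\sigma^{\circ}], t' :: tl_r))
\end{aligned}
\]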

When the focused tree does not have the required shape, these operations are not defined.
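The four navigation rules above can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation: the names `Tree`, `Context`, `Focused`, and `move_*` are hypothetical, the start mark is omitted, and an undefined move is represented by `None`. Following the four rules as written, sideways and upward moves are only defined under a context node, so they return `None` at `Top`.

```python
from typing import NamedTuple, Optional, Tuple


class Tree(NamedTuple):
    label: str
    children: Tuple["Tree", ...] = ()


class Context(NamedTuple):
    left: Tuple[Tree, ...]                  # previous siblings, nearest first
    above: Optional[Tuple[str, "Context"]]  # None stands for Top
    right: Tuple[Tree, ...]                 # following siblings, nearest first


class Focused(NamedTuple):
    tree: Tree
    ctx: Context


def move_1(f):
    """f<1>: focus on the first child; None when undefined."""
    t, c = f
    if not t.children:
        return None
    return Focused(t.children[0], Context((), (t.label, c), t.children[1:]))


def move_2(f):
    """f<2>: focus on the next sibling; None when undefined."""
    t, c = f
    if c.above is None or not c.right:
        return None
    return Focused(c.right[0], Context((t,) + c.left, c.above, c.right[1:]))


def move_1bar(f):
    """f<1bar>: focus on the parent, only from a leftmost sibling."""
    t, c = f
    if c.above is None or c.left:
        return None
    label, parent_ctx = c.above
    return Focused(Tree(label, (t,) + c.right), parent_ctx)


def move_2bar(f):
    """f<2bar>: focus on the previous sibling; None when undefined."""
    t, c = f
    if c.above is None or not c.left:
        return None
    return Focused(c.left[0], Context(c.left[1:], c.above, (t,) + c.right))


# Example document a[b, c[d]], focused on its root.
doc = Tree("a", (Tree("b"), Tree("c", (Tree("d"),))))
root = Focused(doc, Context((), None, ()))
b = move_1(root)
c = move_2(b)
```

Because every move rebuilds the surrounding context, going down and back up (`move_1bar(move_1(root))`) returns a focused tree equal to the original one, which is the information-preserving property the section relies on.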

4. The Logic

We introduce in this section the logic to which XPath expressions and XML regular tree types are going to be translated, a sub-logic of the alternation free modal µ-calculus with converse. We also introduce a restriction on the formulas we consider and give an interpretation of formulas as sets of finite focused trees. We finally show that the logic has a single fixpoint for these models and that it is closed under negation.

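\[
\begin{array}{rcll}
\mathcal{L}_{\mu} \ni \varphi, \psi & ::= & & \text{formula} \\
 & & \top & \text{true} \\
 & \mid & \sigma \mid \neg\sigma & \text{atomic prop (negated)} \\
 & \mid & s \mid \neg s & \text{start prop (negated)} \\
 & \mid & X & \text{variable} \\
 & \mid & \varphi \vee \psi & \text{disjunction} \\
 & \mid & \varphi \wedge \psi & \text{conjunction} \\
 & \mid & \langle a \rangle \varphi \mid \neg\langle a \rangle \top & \text{existential (negated)} \\
 & \mid & \mu \overline{X_i.\varphi_i} \text{ in } \psi & \text{least n-ary fixpoint} \\
 & \mid & \nu \overline{X_i.\varphi_i} \text{ in } \psi & \text{greatest n-ary fixpoint}
\end{array}
\]

Figure 1. Logic formulas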

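\[
\begin{aligned}
[\![\top]\!]_V &\overset{\text{def}}{=} \mathcal{F} \\
[\![X]\!]_V &\overset{\text{def}}{=} V(X) \\
[\![\sigma]\!]_V &\overset{\text{def}}{=} \{f \mid nm(f) = \sigma\} \\
[\![\neg\sigma]\!]_V &\overset{\text{def}}{=} \{f \mid nm(f) \neq \sigma\} \\
[\![s]\!]_V &\overset{\text{def}}{=} \{f \mid f = (\sigma^{s}[tl], c)\} \\
[\![\neg s]\!]_V &\overset{\text{def}}{=} \{f \mid f = (\sigma[tl], c)\} \\
[\![\varphi \vee \psi]\!]_V &\overset{\text{def}}{=} [\![\varphi]\!]_V \cup [\![\psi]\!]_V \\
[\![\varphi \wedge \psi]\!]_V &\overset{\text{def}}{=} [\![\varphi]\!]_V \cap [\![\psi]\!]_V \\
[\![\langle a \rangle \varphi]\!]_V &\overset{\text{def}}{=} \{f\langle \bar{a} \rangle \mid f \in [\![\varphi]\!]_V \wedge f\langle \bar{a} \rangle \text{ defined}\} \\
[\![\neg\langle a \rangle \top]\!]_V &\overset{\text{def}}{=} \{f \mid f\langle a \rangle \text{ undefined}\} \\
[\![\mu \overline{X_i.\varphi_i} \text{ in } \psi]\!]_V &\overset{\text{def}}{=} \text{let } T_i = \Big(\bigcap \big\{T_i \subseteq \mathcal{F} \mid [\![\varphi_i]\!]_{V[\overline{T_i/X_i}]} \subseteq T_i\big\}\Big)_i \text{ in } [\![\psi]\!]_{V[\overline{T_i/X_i}]} \\
[\![\nu \overline{X_i.\varphi_i} \text{ in } \psi]\!]_V &\overset{\text{def}}{=} \text{let } T_i = \Big(\bigcup \big\{T_i \subseteq \mathcal{F} \mid T_i \subseteq [\![\varphi_i]\!]_{V[\overline{T_i/X_i}]}\big\}\Big)_i \text{ in } [\![\psi]\!]_{V[\overline{T_i/X_i}]}
\end{aligned}
\]

Figure 2. Interpretation of formulas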

In the following definitions, $a \in \{1, 2, \bar{1}, \bar{2}\}$ are programs and atomic propositions σ correspond to labels from Σ. We also assume that $\bar{\bar{a}} = a$. Formulas defined in Fig. 1 include the truth predicate, atomic propositions (denoting the name of the tree in focus), start propositions (denoting the presence of the start mark), disjunction and conjunction of formulas, formulas under an existential (denoting the existence of a subtree satisfying the sub-formula), and least and greatest n-ary fixpoints. We chose to include an n-ary version of fixpoints because regular types are often defined as a set of mutually recursive definitions, making their translation in our logic more succinct. In the following we write “µX.ϕ” for “µX.ϕ in ϕ”.

We define in Fig. 2 an interpretation of our formulas as sets of finite focused trees with a single start mark. The interpretation of the n-ary fixpoints first computes the smallest or largest interpretation for each $\varphi_i$, then returns the interpretation of ψ.

We now restrict the set of valid formulas to cycle-free formulas, i.e. formulas that have a bound on the number of modality cycles independently of the number of unfoldings of their fixpoints. A modality cycle is a subformula of the form $\langle a \rangle \varphi$ where ϕ contains a top-level existential of the form $\langle \bar{a} \rangle \psi$. (By “top-level” we mean under an arbitrary number of conjunctions or disjunctions, but not under any other construct.) For instance, the formula “$\mu X.\langle 1 \rangle(\varphi \vee \langle \bar{1} \rangle X)$ in X” is not cycle free: for any integer n, there is an unfolding of the formula with n modality cycles. On the other hand, the formula “$\mu X.\langle 1 \rangle(X \vee Y),\ Y.\langle \bar{1} \rangle(Y \vee \top)$ in X” is cycle free: there is at most one modality cycle.
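The two example formulas can be compared with a small Python sketch. It rests on assumptions not stated in the text: formulas are encoded as nested tuples (a hypothetical encoding), fixpoints are unfolded a fixed number of times starting from a `false` leaf, and the number of modality cycles is read as the largest number of cycles met along a single branch of the unfolded formula, which is one reading under which the second formula has at most one cycle.

```python
# Programs 1, 2 and their converses, written "1", "2", "-1", "-2".
CONVERSE = {"1": "-1", "-1": "1", "2": "-2", "-2": "2"}

FALSE = ("false",)
TOP = ("top",)
P = ("atom", "p")


def max_cycles(phi, enclosing=None):
    """Largest number of modality cycles along one branch of a
    fixpoint-free formula. `enclosing` is the nearest enclosing program,
    with only conjunctions and disjunctions in between."""
    kind = phi[0]
    if kind == "dia":
        a, body = phi[1], phi[2]
        hit = 1 if enclosing is not None and CONVERSE[enclosing] == a else 0
        return hit + max_cycles(body, a)
    if kind in ("and", "or"):
        return max(max_cycles(phi[1], enclosing),
                   max_cycles(phi[2], enclosing))
    return 0


def unfold_not_cycle_free(n):
    """n unfoldings of  mu X. <1>(p or <-1>X)."""
    x = FALSE
    for _ in range(n):
        x = ("dia", "1", ("or", P, ("dia", "-1", x)))
    return x


def unfold_cycle_free(n):
    """n unfoldings of  mu X. <1>(X or Y), Y. <-1>(Y or top)  in X."""
    x = y = FALSE
    for _ in range(n):
        x, y = ("dia", "1", ("or", x, y)), ("dia", "-1", ("or", y, TOP))
    return x
```

Under this reading the count for the first formula grows with every unfolding, while the count for the second one stays bounded by one however many times it is unfolded.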

Cycle-free formulas have a very interesting property, which we now describe. To test whether a tree satisfies a formula, one may define a straightforward inductive relation between trees and formulas that only holds when the root of the tree satisfies the formula, unfolding fixpoints if necessary. Given a tree, if a formula ϕ is cycle free, then every node of the tree will be tested a finite number of times against any given subformula of ϕ. The intuition behind this property, which holds a central role in the proof of Lemma 4.2, is the following. If a tree node is tested an infinite number of times against a subformula, then there must be a cycle in the navigation in the tree, corresponding to some modalities occurring in the subformula, between one occurrence of the test and the next one. As we consider trees, the cycle implies there is a modality cycle in the formula (as cycles of the form $\langle 1 \rangle \langle 2 \rangle \langle \bar{1} \rangle \langle \bar{2} \rangle$ cannot occur). Hence the number of modality cycles in any expansion of ϕ is unbounded, thus the formula is not cycle free.

We are now ready to show a first result: in the finite focused-tree interpretation, the least and greatest fixpoints coincide for cycle-free formulas. To this end, we prove a stronger result that states that a given focused tree is in the interpretation of a formula if it is in a finite unfolding of the formula. In the base case, we use the formula σ ∧ ¬σ as “false”.

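DEFINITION 4.1 (Finite unfolding). The finite unfolding of a formula ϕ is the set unf(ϕ) inductively defined as

\[
\begin{aligned}
\mathit{unf}(\varphi) &\overset{\text{def}}{=} \{\varphi\} \quad \text{for } \varphi = \top, \sigma, \neg\sigma, s, \neg s, X, \neg\langle a \rangle \top \\
\mathit{unf}(\varphi \vee \psi) &\overset{\text{def}}{=} \{\varphi' \vee \psi' \mid \varphi' \in \mathit{unf}(\varphi),\ \psi' \in \mathit{unf}(\psi)\} \\
\mathit{unf}(\varphi \wedge \psi) &\overset{\text{def}}{=} \{\varphi' \wedge \psi' \mid \varphi' \in \mathit{unf}(\varphi),\ \psi' \in \mathit{unf}(\psi)\} \\
\mathit{unf}(\langle a \rangle \varphi) &\overset{\text{def}}{=} \{\langle a \rangle \varphi' \mid \varphi' \in \mathit{unf}(\varphi)\} \\
\mathit{unf}(\mu \overline{X_i.\varphi_i} \text{ in } \psi) &\overset{\text{def}}{=} \mathit{unf}\big(\psi\{\overline{\mu \overline{X_i.\varphi_i} \text{ in } X_i \,/\, X_i}\}\big) \cup \{\sigma \wedge \neg\sigma\} \\
\mathit{unf}(\nu \overline{X_i.\varphi_i} \text{ in } \psi) &\overset{\text{def}}{=} \mathit{unf}\big(\psi\{\overline{\nu \overline{X_i.\varphi_i} \text{ in } X_i \,/\, X_i}\}\big) \cup \{\sigma \wedge \neg\sigma\}
\end{aligned}
\]

LEMMA 4.2. Let ϕ be a cycle-free formula, then $[\![\varphi]\!]_V = [\![\mathit{unf}(\varphi)]\!]_V$.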

The reason why this lemma holds is the following. Given a tree satisfying ϕ, we deduce from the hypothesis that ϕ is cycle free the fact that every node of the tree will be tested a finite number of times against every subformula of ϕ. As the tree and the number of subformulas are finite, the satisfaction derivation is finite, hence only a finite number of unfoldings is necessary to prove that the tree satisfies the formula, which is what the lemma states. As least and greatest fixpoints coincide when only a finite number of unfoldings is required, this is sufficient to show that they collapse. Note that this would not hold if infinite trees were allowed: the formula $\mu X.\langle 1 \rangle X$ is cycle free, but its interpretation is empty, whereas the interpretation of $\nu X.\langle 1 \rangle X$ includes every tree with an infinite branch of $\langle 1 \rangle$ children.

We now illustrate why formulas need to be cycle free for the fixpoints to collapse. Consider the formula $\mu X.\langle 1 \rangle \langle \bar{1} \rangle X$. Its interpretation is empty. The interpretation of $\nu X.\langle 1 \rangle \langle \bar{1} \rangle X$ however contains every focused tree that has one $\langle 1 \rangle$ child.
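This contrast can be reproduced on a small finite model. The sketch below is an illustration under stated assumptions: node names stand in for the focused trees of one fixed finite document, the `first_child` table is a made-up example, and both fixpoints are computed by plain iteration, which terminates because the model is finite and the operators are monotone.

```python
# Hypothetical finite model: "a" has first child "b", "c" has first child "d".
first_child = {"a": "b", "c": "d"}
universe = frozenset({"a", "b", "c", "d"})


def dia_1(S):
    """<1>phi: nodes whose first child satisfies phi."""
    return frozenset(n for n, ch in first_child.items() if ch in S)


def dia_1bar(S):
    """<1bar>phi: first children whose parent satisfies phi."""
    return frozenset(ch for n, ch in first_child.items() if n in S)


def iterate(op, start):
    """Apply op repeatedly until the set no longer changes."""
    current = start
    while True:
        following = op(current)
        if following == current:
            return current
        current = following


def lfp(op):
    return iterate(op, frozenset())   # least fixpoint: grow from the empty set


def gfp(op):
    return iterate(op, universe)      # greatest fixpoint: shrink from everything


def with_cycle(T):
    return dia_1(dia_1bar(T))         # X -> <1><1bar>X, not cycle free


def without_cycle(T):
    return dia_1(T)                   # X -> <1>X, cycle free
```

On this model `lfp(with_cycle)` is empty while `gfp(with_cycle)` contains the two nodes that have a child, whereas both fixpoints of `without_cycle` are empty, matching the collapse stated for cycle-free formulas on finite trees.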

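In the rest of the paper, we only consider least fixpoints. An important consequence of Lemma 4.2 is that the logic restricted in this way is closed under negation using De Morgan’s dualities, extended to eventualities and fixpoints as follows:

\[
\begin{aligned}
\neg\langle a \rangle \varphi &\overset{\text{def}}{=} \neg\langle a \rangle \top \vee \langle a \rangle \neg\varphi \\
\neg\mu \overline{X_i.\varphi_i} \text{ in } \psi &\overset{\text{def}}{=} \mu \overline{X_i.\neg\varphi_i\{\overline{X_i / \neg X_i}\}} \text{ in } \neg\psi\{\overline{X_i / \neg X_i}\}
\end{aligned}
\]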
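The dualities can be read as a recursive procedure that pushes negation down to the atoms. The following Python sketch assumes a hypothetical tuple encoding of formulas and closed input formulas; the negation of ⊤ is rendered as σ ∧ ¬σ, the formula used as “false” earlier in this section, and a bound variable is returned unchanged, which is the effect of negating it and then substituting $X_i$ for $\neg X_i$.

```python
# "false" written as sigma and not-sigma, for an arbitrary fixed label.
FALSE = ("and", ("atom", "sigma"), ("natom", "sigma"))


def neg(phi):
    """Negation of a closed formula, pushed down to the atoms."""
    kind = phi[0]
    if kind == "top":
        return FALSE
    if kind == "atom":                      # sigma
        return ("natom", phi[1])
    if kind == "natom":                     # not sigma
        return ("atom", phi[1])
    if kind == "start":                     # s
        return ("nstart",)
    if kind == "nstart":                    # not s
        return ("start",)
    if kind == "var":                       # not X, then X substituted for not X
        return phi
    if kind == "or":
        return ("and", neg(phi[1]), neg(phi[2]))
    if kind == "and":
        return ("or", neg(phi[1]), neg(phi[2]))
    if kind == "dia":                       # not <a>phi = not <a>top or <a>(not phi)
        return ("or", ("nodia", phi[1]), ("dia", phi[1], neg(phi[2])))
    if kind == "nodia":                     # not (not <a>top) = <a>top
        return ("dia", phi[1], ("top",))
    if kind == "mu":                        # ("mu", ((X1, phi1), ...), psi)
        bindings = tuple((x, neg(d)) for x, d in phi[1])
        return ("mu", bindings, neg(phi[2]))
    raise ValueError(f"unknown formula: {phi!r}")


# mu X. (a or <1>X) in X
reach_a = ("mu", (("X", ("or", ("atom", "a"), ("dia", "1", ("var", "X")))),),
           ("var", "X"))
not_reach_a = neg(reach_a)
```

The result stays inside the syntax of Fig. 1: it contains only least fixpoints and negation applied to atoms, start propositions, and $\langle a \rangle \top$.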

5. XPath and Regular Tree Languages

XPath [6] is a powerful language for navigating in XML documents and selecting sets of nodes matching a predicate. In their simplest form, XPath expressions look like “directory navigation paths”. For example, the XPath expression

/child::book/child::chapter/child::section

navigates from the root of a document (designated by the leading “/”) through the top-level “book” node to its “chapter” child nodes and on to its child nodes named “section”. The result of the evaluation of the entire expression is the set of all the “section” nodes that can be reached in this manner. The situation becomes more interesting when combined with XPath’s capability of searching along “axes” other than “child”. For instance, one may use the “preceding-sibling” axis for navigating backward through nodes of the same parent, or the “ancestor” axis for navigating upward recursively. Furthermore, at each step in the navigation the selected nodes can be filtered using qualifiers: boolean expressions between brackets that can test the existence or absence of paths.

\[
\begin{array}{rcll}
\mathcal{L}_{\text{XPath}} \ni e & ::= & & \text{XPath expression} \\
 & & /p & \text{absolute path} \\
 & \mid & p & \text{relative path} \\
 & \mid & e_1 \mid e_2 & \text{union} \\
 & \mid & e_1 \cap e_2 & \text{intersection} \\
\text{Path} \quad p & ::= & & \text{path} \\
 & & p_1/p_2 & \text{path composition} \\
 & \mid & p[q] & \text{qualified path} \\
 & \mid & a{::}\sigma & \text{step with node test} \\
 & \mid & a{::}{*} & \text{step} \\
\text{Qualif} \quad q & ::= & & \text{qualifier} \\
 & & q_1 \text{ and } q_2 & \text{conjunction} \\
 & \mid & q_1 \text{ or } q_2 & \text{disjunction} \\
 & \mid & \text{not } q & \text{negation} \\
 & \mid & p & \text{path} \\
\text{Axis} \quad a & ::= & & \text{tree navigation axis} \\
 & & \text{child} \mid \text{self} \mid \text{parent} & \\
 & \mid & \text{descendant} \mid \text{desc-or-self} & \\
 & \mid & \text{ancestor} \mid \text{anc-or-self} & \\
 & \mid & \text{foll-sibling} \mid \text{prec-sibling} & \\
 & \mid & \text{following} \mid \text{preceding} &
\end{array}
\]

Figure 3. XPath Abstract Syntax.

We consider a large XPath fragment covering all major features of the XPath recommendation [6] except counting and comparisons between data values.

Fig. 3 gives the syntax of XPath expressions. Fig. 4 gives an interpretation of XPath expressions as functions between sets of focused trees.
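The set-based reading of axes can be sketched in Python on a small example. This is only an illustration under assumptions: node identifiers stand in for the focused trees of one fixed document, the tables `first_child`, `next_sibling`, and `name` describe a made-up document `book[chapter[section, section], chapter[section]]`, and only a few axes are covered. The recursive definitions terminate because the document is finite.

```python
# Hypothetical document: book[c1[s1, s2], c2[s3]]
first_child = {"book": "c1", "c1": "s1", "c2": "s3"}
next_sibling = {"c1": "c2", "s1": "s2"}
name = {"book": "book", "c1": "chapter", "c2": "chapter",
        "s1": "section", "s2": "section", "s3": "section"}


def fchild(F):
    """Nodes reached by <1> from a node of F."""
    return {first_child[n] for n in F if n in first_child}


def nsibling(F):
    """Nodes reached by <2> from a node of F."""
    return {next_sibling[n] for n in F if n in next_sibling}


def foll_sibling(F):
    nxt = nsibling(F)
    if not nxt:
        return set()
    return nxt | foll_sibling(nxt)


def child(F):
    first = fchild(F)
    return first | foll_sibling(first)


def descendant(F):
    kids = child(F)
    if not kids:
        return set()
    return kids | descendant(kids)


def step(axis, label, F):
    """a::label, with "*" accepting any name."""
    return {n for n in axis(F) if label == "*" or name[n] == label}


# child::chapter/child::section evaluated from the book node
sections = step(child, "section", step(child, "chapter", {"book"}))
```

Path composition is function composition on sets: the result of the first step is the input of the second one.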

5.1 XPath Embedding

We now explain how an XPath expression can be translated into an equivalent Lµ formula that performs navigation in focused trees in binary style.

Logical Interpretation of Axes. The translation of navigational primitives (namely XPath axes) is formally specified in Fig. 5. The translation function, noted “$A^{\to}[\![a]\!]_{\chi}$”, takes an XPath axis a as input, and returns its Lµ translation, parameterized by the Lµ formula χ given as parameter. This parameter represents the context in which the axis occurs and is needed for formula composition in order to translate path composition. More precisely, the formula $A^{\to}[\![a]\!]_{\chi}$ holds for all nodes that can be accessed through the axis a from some node verifying χ.

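Let us consider an example. The formula $A^{\to}[\![\text{child}]\!]_{\chi}$, translated as $\mu Z.\langle \bar{1} \rangle \chi \vee \langle \bar{2} \rangle Z$, is satisfied by children of the context χ. These nodes consist of the first child and the remaining children. From the first child, the context must be reached immediately by going once upward via $\bar{1}$. From the remaining children, the context is reached by going upward (any number of times) via $\bar{2}$ and then finally once via $\bar{1}$.

\[
\begin{aligned}
S_e[\![\cdot]\!]_{\cdot} &: \mathcal{L}_{\text{XPath}} \to 2^{\mathcal{F}} \to 2^{\mathcal{F}} \\
S_e[\![/p]\!]_F &\overset{\text{def}}{=} S_p[\![p]\!]_{root(F)} \\
S_e[\![p]\!]_F &\overset{\text{def}}{=} S_p[\![p]\!]_{\{(\sigma^{s}[tl], c) \in F\}} \\
S_e[\![e_1 \mid e_2]\!]_F &\overset{\text{def}}{=} S_e[\![e_1]\!]_F \cup S_e[\![e_2]\!]_F \\
S_e[\![e_1 \cap e_2]\!]_F &\overset{\text{def}}{=} S_e[\![e_1]\!]_F \cap S_e[\![e_2]\!]_F \\[1ex]
S_p[\![\cdot]\!]_{\cdot} &: \text{Path} \to 2^{\mathcal{F}} \to 2^{\mathcal{F}} \\
S_p[\![p_1/p_2]\!]_F &\overset{\text{def}}{=} \{f' \mid f' \in S_p[\![p_2]\!]_{(S_p[\![p_1]\!]_F)}\} \\
S_p[\![p[q]]\!]_F &\overset{\text{def}}{=} \{f \mid f \in S_p[\![p]\!]_F \wedge S_q[\![q]\!]_f\} \\
S_p[\![a{::}\sigma]\!]_F &\overset{\text{def}}{=} \{f \mid f \in S_a[\![a]\!]_F \wedge nm(f) = \sigma\} \\
S_p[\![a{::}{*}]\!]_F &\overset{\text{def}}{=} \{f \mid f \in S_a[\![a]\!]_F\} \\[1ex]
S_q[\![\cdot]\!]_{\cdot} &: \text{Qualif} \to \mathcal{F} \to \{\text{true}, \text{false}\} \\
S_q[\![q_1 \text{ and } q_2]\!]_f &\overset{\text{def}}{=} S_q[\![q_1]\!]_f \wedge S_q[\![q_2]\!]_f \\
S_q[\![q_1 \text{ or } q_2]\!]_f &\overset{\text{def}}{=} S_q[\![q_1]\!]_f \vee S_q[\![q_2]\!]_f \\
S_q[\![\text{not } q]\!]_f &\overset{\text{def}}{=} \neg S_q[\![q]\!]_f \\
S_q[\![p]\!]_f &\overset{\text{def}}{=} S_p[\![p]\!]_{\{f\}} \neq \emptyset \\[1ex]
S_a[\![\cdot]\!]_{\cdot} &: \text{Axis} \to 2^{\mathcal{F}} \to 2^{\mathcal{F}} \\
S_a[\![\text{self}]\!]_F &\overset{\text{def}}{=} F \\
S_a[\![\text{child}]\!]_F &\overset{\text{def}}{=} fchild(F) \cup S_a[\![\text{foll-sibling}]\!]_{fchild(F)} \\
S_a[\![\text{foll-sibling}]\!]_F &\overset{\text{def}}{=} nsibling(F) \cup S_a[\![\text{foll-sibling}]\!]_{nsibling(F)} \\
S_a[\![\text{prec-sibling}]\!]_F &\overset{\text{def}}{=} psibling(F) \cup S_a[\![\text{prec-sibling}]\!]_{psibling(F)} \\
S_a[\![\text{parent}]\!]_F &\overset{\text{def}}{=} parent(F) \\
S_a[\![\text{descendant}]\!]_F &\overset{\text{def}}{=} S_a[\![\text{child}]\!]_F \cup S_a[\![\text{descendant}]\!]_{(S_a[\![\text{child}]\!]_F)} \\
S_a[\![\text{desc-or-self}]\!]_F &\overset{\text{def}}{=} F \cup S_a[\![\text{descendant}]\!]_F \\
S_a[\![\text{ancestor}]\!]_F &\overset{\text{def}}{=} S_a[\![\text{parent}]\!]_F \cup S_a[\![\text{ancestor}]\!]_{(S_a[\![\text{parent}]\!]_F)} \\
S_a[\![\text{anc-or-self}]\!]_F &\overset{\text{def}}{=} F \cup S_a[\![\text{ancestor}]\!]_F \\
S_a[\![\text{following}]\!]_F &\overset{\text{def}}{=} S_a[\![\text{desc-or-self}]\!]_{(S_a[\![\text{foll-sibling}]\!]_{(S_a[\![\text{anc-or-self}]\!]_F)})} \\
S_a[\![\text{preceding}]\!]_F &\overset{\text{def}}{=} S_a[\![\text{desc-or-self}]\!]_{(S_a[\![\text{prec-sibling}]\!]_{(S_a[\![\text{anc-or-self}]\!]_F)})} \\[1ex]
fchild(F) &\overset{\text{def}}{=} \{f\langle 1 \rangle \mid f \in F \wedge f\langle 1 \rangle \text{ defined}\} \\
nsibling(F) &\overset{\text{def}}{=} \{f\langle 2 \rangle \mid f \in F \wedge f\langle 2 \rangle \text{ defined}\} \\
psibling(F) &\overset{\text{def}}{=} \{f\langle \bar{2} \rangle \mid f \in F \wedge f\langle \bar{2} \rangle \text{ defined}\} \\
parent(F) &\overset{\text{def}}{=} \{(\sigma^{\circ}[rev\_a(tl_l, t :: tl_r)], c) \mid (t, (tl_l, c[\sigma^{\circ}], tl_r)) \in F\} \\
rev\_a(\epsilon, tl_r) &\overset{\text{def}}{=} tl_r \\
rev\_a(t :: tl_l, tl_r) &\overset{\text{def}}{=} rev\_a(tl_l, t :: tl_r) \\
root(F) &\overset{\text{def}}{=} \{(\sigma^{s}[tl], (tl, \mathit{Top}, tl)) \in F\} \cup root(parent(F))
\end{aligned}
\]

Figure 4. Interpretation of XPath in terms of Focused Trees.

\[
\begin{aligned}
A^{\to}[\![\cdot]\!]_{\cdot} &: \text{Axis} \to \mathcal{L}_{\mu} \to \mathcal{L}_{\mu} \\
A^{\to}[\![\text{self}]\!]_{\chi} &\overset{\text{def}}{=} \chi \\
A^{\to}[\![\text{child}]\!]_{\chi} &\overset{\text{def}}{=} \mu Z.\langle \bar{1} \rangle \chi \vee \langle \bar{2} \rangle Z \\
A^{\to}[\![\text{foll-sibling}]\!]_{\chi} &\overset{\text{def}}{=} \mu Z.\langle \bar{2} \rangle \chi \vee \langle \bar{2} \rangle Z \\
A^{\to}[\![\text{prec-sibling}]\!]_{\chi} &\overset{\text{def}}{=} \mu Z.\langle 2 \rangle \chi \vee \langle 2 \rangle Z \\
A^{\to}[\![\text{parent}]\!]_{\chi} &\overset{\text{def}}{=} \langle 1 \rangle \mu Z.\chi \vee \langle 2 \rangle Z \\
A^{\to}[\![\text{descendant}]\!]_{\chi} &\overset{\text{def}}{=} \mu Z.\langle \bar{1} \rangle (\chi \vee Z) \vee \langle \bar{2} \rangle Z \\
A^{\to}[\![\text{desc-or-self}]\!]_{\chi} &\overset{\text{def}}{=} \mu Z.\chi \vee \mu Y.\langle \bar{1} \rangle (Y \vee Z) \vee \langle \bar{2} \rangle Y \\
A^{\to}[\![\text{ancestor}]\!]_{\chi} &\overset{\text{def}}{=} \langle 1 \rangle \mu Z.\chi \vee \langle 1 \rangle Z \vee \langle 2 \rangle Z \\
A^{\to}[\![\text{anc-or-self}]\!]_{\chi} &\overset{\text{def}}{=} \mu Z.\chi \vee \langle 1 \rangle \mu Y.Z \vee \langle 2 \rangle Y \\
A^{\to}[\![\text{following}]\!]_{\chi} &\overset{\text{def}}{=} A^{\to}[\![\text{desc-or-self}]\!]_{(A^{\to}[\![\text{foll-sibling}]\!]_{(A^{\to}[\![\text{anc-or-self}]\!]_{\chi})})} \\
A^{\to}[\![\text{preceding}]\!]_{\chi} &\overset{\text{def}}{=} A^{\to}[\![\text{desc-or-self}]\!]_{(A^{\to}[\![\text{prec-sibling}]\!]_{(A^{\to}[\![\text{anc-or-self}]\!]_{\chi})})}
\end{aligned}
\]

Figure 5. Translation of XPath Axes.

\[
\begin{aligned}
E^{\to}[\![\cdot]\!]_{\cdot} &: \mathcal{L}_{\text{XPath}} \to \mathcal{L}_{\mu} \to \mathcal{L}_{\mu} \\
E^{\to}[\![/p]\!]_{\chi} &\overset{\text{def}}{=} P^{\to}[\![p]\!]_{((\mu Z.\neg\langle \bar{1} \rangle \top \vee \langle \bar{2} \rangle Z) \wedge (\mu Y.\chi \wedge s \vee \langle 1 \rangle Y \vee \langle 2 \rangle Y))} \\
E^{\to}[\![p]\!]_{\chi} &\overset{\text{def}}{=} P^{\to}[\![p]\!]_{(\chi \wedge s)} \\
E^{\to}[\![e_1 \mid e_2]\!]_{\chi} &\overset{\text{def}}{=} E^{\to}[\![e_1]\!]_{\chi} \vee E^{\to}[\![e_2]\!]_{\chi} \\
E^{\to}[\![e_1 \cap e_2]\!]_{\chi} &\overset{\text{def}}{=} E^{\to}[\![e_1]\!]_{\chi} \wedge E^{\to}[\![e_2]\!]_{\chi} \\[1ex]
P^{\to}[\![\cdot]\!]_{\cdot} &: \text{Path} \to \mathcal{L}_{\mu} \to \mathcal{L}_{\mu} \\
P^{\to}[\![p_1/p_2]\!]_{\chi} &\overset{\text{def}}{=} P^{\to}[\![p_2]\!]_{(P^{\to}[\![p_1]\!]_{\chi})} \\
P^{\to}[\![p[q]]\!]_{\chi} &\overset{\text{def}}{=} P^{\to}[\![p]\!]_{\chi} \wedge Q^{\leftarrow}[\![q]\!]_{\top} \\
P^{\to}[\![a{::}\sigma]\!]_{\chi} &\overset{\text{def}}{=} \sigma \wedge A^{\to}[\![a]\!]_{\chi} \\
P^{\to}[\![a{::}{*}]\!]_{\chi} &\overset{\text{def}}{=} A^{\to}[\![a]\!]_{\chi}
\end{aligned}
\]

Figure 6. Translation of Expressions and Paths.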
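As a rough illustration of how the axis translation composes along a path, the Python sketch below builds formulas as plain strings. It is not the authors' implementation: the function names are hypothetical, only five axes are covered, and the ASCII notation `<1>`, `<2>`, `<-1>`, `<-2>` stands for the four programs, with `<-1>` and `<-2>` denoting the converse programs.

```python
def axis_formula(axis, chi):
    """String form of the translation of `axis` applied to context `chi`."""
    c = f"({chi})"
    table = {
        "self": chi,
        "child": f"µZ.<-1>{c} ∨ <-2>Z",
        "prec-sibling": f"µZ.<2>{c} ∨ <2>Z",
        "parent": f"<1>µZ.{c} ∨ <2>Z",
        "ancestor": f"<1>µZ.{c} ∨ <1>Z ∨ <2>Z",
    }
    return table[axis]


def step_formula(axis, label, chi):
    """a::label becomes `label ∧ A[a]chi`; a::* keeps only the axis part."""
    formula = axis_formula(axis, chi)
    return formula if label == "*" else f"{label} ∧ ({formula})"


def path_formula(steps, chi):
    """p1/p2 is translated by feeding the formula for p1 as context of p2."""
    for axis, label in steps:
        chi = step_formula(axis, label, chi)
    return chi


# child::a/child::b from a start context "s"
example = path_formula([("child", "a"), ("child", "b")], "s")
```

Each step wraps the formula built so far, so the size of the result grows by a constant amount per step, in line with the linear translation stated in the paper.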

Logical Interpretation of Expressions. Fig. 6 gives the translation of XPath expressions into Lµ. The translation function “$E^{\to}[\![e]\!]_{\chi}$” takes an XPath expression e and an Lµ formula χ as input, and returns the corresponding Lµ translation. The translation of a relative XPath expression marks the initial context with s. The translation of an absolute XPath expression navigates to the root which is taken as the initial context.

Figure 7 illustrates the translation of the XPath expression “child::a[child::b]”. This expression selects all “a” child nodes of a given context which have at least one “b” child. The translated Lµ formula holds for “a” nodes which are selected by the expression. The first part of the translated formula, ϕ, corresponds to the step
