Journal Article•DOI•

Exploiting resolution proofs to speed up LTL vacuity detection for BMC

Q: What contributions have the authors mentioned in the paper "Exploiting resolution proofs to speed up ltl vacuity detection for bmc" ?

The authors address the problem of efficient vacuity detection for Bounded Model Checking ( BMC ) of LTL properties, presenting three partial vacuity detection methods based on the efficient analysis of the resolution proof produced by a successful BMC run. In particular, the authors define a characteristic of resolution proofs – peripherality – and prove that if a variable is a source of vacuity, then there exists a resolution proof in which this variable is peripheral.

Q: What are the future works in "Exploiting resolution proofs to speed up ltl vacuity detection for bmc" ?

The authors plan to investigate this further in the future. The authors plan to enhance their methods by developing a heuristic based on the clause/variable ratio and proof size that indicates when naive detection should be applied instead. Thus, the authors believe that both local irrelevance and peripherality can be used to speed up naive detection.

Q: Why do the authors think that naive detection is more effective than local irrelevance?

The authors conjecture that the poor performance is due to a low clause/variable ratio [22] which favours naive detection in cases where vacuity is not present.

Q: What is the way to determine the vacuity of a model?

Since the authors are interested in replacing expensive model-checking runs by inexpensive partial vacuity detection methods, the authors limit ourselves to considering the output of the original model-checking run on BMCk(K, ϕ), i.e., CLK ∪ CLe.

Q: What is the procedure for converting the path and error constraints?

After the boolean formulas for the path and error constraints are calculated, they are converted to Conjunctive Normal Form (CNF) before being passed to a SAT solver.

Q: What is the labeling function used to represent a clause?

If Π is a resolution proof whose root clauses are divided into two disjoint sets,A∪B, then the labeling functionL is defined recursively as shown in Figure 4, where c is used to represent a clause.

Q: How did Armoni et al. generalize the definition of vacuity?

Armoni et al. [1] generalized the above syntactic definition of vacuity by introducing universal quantification, i.e., ∀x · ϕ[ψ ← x].

Q: How can the authors make the SAT solver more effective?

It might be possible to make them more effective by modifying the SAT solver to guide it to a particular kind of a proof (e.g., by changing the decision order heuristic), or to output multiple proofs (if possible).

Jocelyn Simmonds¹, Jessica Davies¹, Arie Gurfinkel², Marsha Chechik¹•Institutions (2)

University of Toronto¹, Software Engineering Institute²

01 Sep 2010-International Journal on Software Tools for Technology Transfer (Springer-Verlag)-Vol. 12, Iss: 5, pp 319-335

TL;DR: The vacuity detection tool, VaqTree, uses a characteristic of resolution proofs— peripherality—and proves that if a variable is a source of vacuity, then there exists a resolution proof in which this variable is peripheral.

read less

Abstract: When model-checking reports that a property holds on a model, vacuity detection increases user confidence in this result by checking that the property is satisfied in the intended way. While vacuity detection is effective, it is a relatively expensive technique requiring many additional model-checking runs. We address the problem of efficient vacuity detection for Bounded Model Checking (BMC) of linear temporal logic properties, presenting three partial vacuity detection methods based on the efficient analysis of the resolution proof produced by a successful BMC run. In particular, we define a characteristic of resolution proofs— peripherality—and prove that if a variable is a source of vacuity, then there exists a resolution proof in which this variable is peripheral. Our vacuity detection tool, VaqTree, uses these methods to detect vacuous variables, decreasing the total number of model-checking runs required to detect all sources of vacuity.

...read moreread less

Summary (3 min read)

Jump to: [1 Introduction] – [2.1 Bounded Model-Checking] – [2.2 Resolution Proofs] – [3 Defining Vacuity] – [4 Exploiting Resolution Proofs] – [4.1 Examining UNSAT cores] – [4.2 Peripherality] – [6 Practical Experience] – [6.1 Results obtained with SA] – [6.2 Results obtained with SB] and [6.3 Conclusions]

1 Introduction

Model-checking [7] is a widely-used automated technique for verification of both hardware and software artifacts that checks whether a temporal logic property is satisfied by a finite-state model of the artifact.
If the model does not satisfy the property, a counterexample, which can aid in debugging, is produced.
Vacuity detection [2,18,21,1] is an automatic sanity check that can be applied after a positive model-checking run in order to gain confidence that the model and the property capture the desired behaviours.
The peripherality algorithm examines the structure of the resolution proof, identifying as vacuous those variables that are not necessary or central to the derivation of false.

2.1 Bounded Model-Checking

Bounded model-checking (BMC) [4] is a method for determining whether a linear temporal logic (LTL) formulaϕ holds on a finite state system represented by a Kripke structure K up to a finite number of steps.
Below, the authors give an informal overview of Kripke structures, LTL formulas and BMC.
A run of K is a sequence of states starting with s0 that obeys R. Each run has an associated trace π, where πi is simply the set of propositional formulas that label the ith state in the run.
The error constraint CLe is encoded according to a recursive procedure which removes the temporal and logical operators from the property [4], e.g., the algorithm encodes ϕ =.
When a DPLL-based SAT solver processes an unsatisfiable theory, a resolution derivation of false (or the empty clause) is implicitly constructed [10, 27].

2.2 Resolution Proofs

Resolution is an inference rule that is applied to propositional clauses to produce logical consequences.
Π represents a tree of resolutions between the clauses labeling its nodes.
Its roots are the nodes with no parents; otherwise, all nodes have exactly two parents.
Given a non-root node labeled by the clause c, and the labels of its parents, c1 and c2, c is the resolvent since it has been produced by resolving c1 and c2 on some variable v.
Figure 2 shows a resolution proof of the unsatisfiability of Roots(Π).

3 Defining Vacuity

This article uses the following definition of vacuity.
Therefore, their techniques aim to find the k-step vacuous variables of ϕ.
This follows from the fact that a subformula is vacuous iff it is mutually vacuous in all of its atomic propositions [13, Th. 9], and that the definitions can be easily extended to mutual vacuity.
The authors now review some of the alternative definitions of vacuity and their algorithms.
In comparison, the authors use an “existential” definition of vacuity: a formula is vacuous if there exists a proof that does not use a subformula.

4 Exploiting Resolution Proofs

In Section 3, the authors discussed the existence of a sound and complete vacuity detection algorithm for BMC, which requires as many model-checking runs as there are propositional variables in the property being checked.
The authors propose a new vacuity detection strategy: first detect partial vacuity using inexpensive techniques and then complete the analysis using extra model-checking runs.
Consider a model that is composed of two completely disjoint sub-models, running in parallel, i.e.,K = K1 ‖ K2.
Suppose that K1 satisfies Gp, K2 satisfies Gq, and that both do so non-vacuously.
Any method based on examining only one resolution proof cannot prove the absence of vacuity, since another resolution proof, showing the property to be vacuous, might exist.

4.1 Examining UNSAT cores

Given a resolution proof that BMCk(K, ϕ) is unsatisfiable, the authors can sometimes cheaply determine that the similar theory BMCk(K, ϕ[p← x]) is also unsatisfiable, and therefore, that the property is p-vacuous.
Definition 2 provides an algorithm to detect some vacuous variables.
1.2 Local Irrelevance Variables which do not appear in the UNSAT core are vacuous.
By looking at the UNSAT core, it is possible to anticipate whether a variable will not be involved in resolutions between CLK and CLe using the following definition.
Assume that p is locally irrelevant in BMCk(K, ϕ).

4.2 Peripherality

In Section 4.1, two vacuity detection methods based on examining the variables in the UNSAT core were found to fall short of completeness.
By applying this labeling function to the proof shown in Figure 2, the authors can determine that variable p from EXAMPLE 2 is peripheral.
The authors defined three methods of detecting vacuity based on examining the UNSAT core and the resolution proof produced by BMC.
Proof-outputting SAT solver generates the resolution proof (Π) for CLK ∪ CLe.
This is a new component, written in Java (around 1.3k lines of code).

6 Practical Experience

The authors have run VaqTree on two benchmark suites.
To evaluate the overall performance of the tool and the effectiveness of their partial vacuity detection methods, the authors have created a benchmark suite SA using various models and properties from the NUSMV distribution.
To evaluate the scalability of the tool to industrial models, the authors have created a benchmark suite SB from the models in the IBM Formal Verification Benchmarks Library [14].
In SA, this corresponds to a test case from the asynchronous abp4 model (roughly 30 boolean variables, with k = 19).
The authors full results are available in Table 1.

6.1 Results obtained with SA

This benchmark suite consists of 5 models: abp4, msi wtrans, pci, and prod-cell from the NUSMV distribution (107 properties) and toyFGS04 from [15] (14 properties).
Execution times for naive detection include CNF theory generation and satisfiability testing for each variable of the property.
The numbers (see Table 1) show that local irrelevance is faster than peripherality in 96% of the cases.
VaqTree with local irrelevance was faster than naive detection in 70 (58%) of the test cases, out of which 30 cases were twice as fast, and 20 cases were faster by an order of magnitude.
Points below the diagonal indicate where the X-axis method detects more vacuous variables than the Y-axis method.

6.2 Results obtained with SB

This benchmark suite consists of 13 models from the IBM Formal Verification Benchmarks Library [26] (18 properties).
At this k, some of the models where too large to analyze using VaqTree, and some of the properties did not hold.
Naive vacuity detection required eight model-checking runs, taking 115.68 seconds to generate the corresponding CNF theories and 2.36 seconds to test their satisfiability, requiring a total of 118.04 seconds.
Irrelevance took 0.36 seconds to find one of the vacuous variables during the partial pass.
Graphs in Figure 9 show that their techniques do in fact detect vacuity, as indicated by the points that appear below the diagonal.

6.3 Conclusions

In summary, the authors observed that local irrelevance performs best out of their proposed partial methods, finding most vacuity in the least amount of time.
On the industrial benchmark SB , the overhead produced by peripherality was negligible.
Thus, the authors believe that both local irrelevance and peripherality can be used to speed up naive detection.
The authors plan to enhance their methods by developing a heuristic based on the clause/variable ratio and proof size that indicates when naive detection should be applied instead.

Did you find this useful? Give us your feedback

Figures (10)

Fig. 4: Labeling function for the peripherality algorithm.

Fig. 2: A resolution proof for EXAMPLE 2.

Table 1: Statistics for vacuity detection experiments on NuSMV distribution and other examples.

Fig. 8:SA : Comparison of the number of vacuous variables detected by partial pass. Larger points represent more test cases than the smaller points.

Fig. 7:SA : Comparison of execution times. Where applicable, all times include times for both the partial and model-checking passes.

Fig. 9:SB : Comparison of execution times. Where applicable, all times include times for both the partial and model-checking passes.

Fig. 3: A resolution proof for EXAMPLE 1.

Content maybe subject to copyright Report

Software Tools for Technology Transfer manuscript No.

(will be inserted by the editor)

Exploiting Resolution Proofs to Speed Up LTL Vacuity Detection for

BMC

Jocelyn Simmonds

, Jessica Davies

, Arie Gurﬁnkel

, Marsha Chechik

Department of Computer Science, University of Toronto

Software Engineering Institute, Carnegie Mellon University

The date of receipt and acceptance will be inserted by the editor

Abstract. When model-checking reportsthat a propertyholds

on a model, vacuity detection increases user conﬁdence in

this result by checking that the property is satisﬁed in the in-

tended way. While vacuity detection is effective, it is a rela-

tively expensive technique requiring many additional model-

checking runs. We address the problem of efﬁcient vacuity

detection for Bounded Model Checking (BMC) of LTL prop-

erties, presenting three partial vacuity detection methodsbased

on the efﬁcient analysis of the resolution proof produced by

a successful BMC run. In particular, we deﬁne a character-

istic of resolution proofs – peripherality – and prove that if

a variable is a source of vacuity, then there exists a resolu-

tion proof in which this variable is peripheral. Our vacuity

detection tool, VaqTree, uses these methods to detect vacu-

ous variables, decreasing the total number of model-checking

runs required to detect all sources of vacuity.

1 Introduction

Model-checking [7] is a widely-used automated technique

for veriﬁcation of both hardware and software artifacts that

checks whether a temporal logic property is satisﬁed by a

ﬁnite-state model of the artifact. If the model does not satisfy

the property, a counterexample, which can aid in debugging,

is produced. If the model does satisfy the property, no infor-

mation about why it does so is provided by the model-checker

alone. A positive answer without any additional information

can be misleading, since a property may be satisﬁed in a way

that was not intended. For instance, a property “every request

is eventually acknowledged” is satisﬁed in an environment

that never generates requests.

Vacuity detection [2,18,21,1]is an automatic sanity check

that can be applied after a positive model-checking run in or-

der to gain conﬁdence that the model and the property cap-

ture the desired behaviours. Informally, a property is said to

be vacuous if it has a subformula which is not relevant to

its satisfaction, or if the property itself is a tautology. Con-

versely, a property is satisﬁed non-vacuously if every part of

the formula is important – even a slight change to the formula

affects its satisfaction.

In this article, we focus on vacuity detection for SAT-

based Bounded Model Checking (BMC). Given a BMC prob-

lem with a particular bound, we wish to determine if the

property holds vacuously on the model up to this bound. In

this context, a naive method for detecting vacuity is to re-

place subformulas of the temporal logic property with un-

constrained boolean variables and run BMC for each such

substitution. If the property with some substitution still holds

on the model, the property is vacuous. This naive approach is

expensive, since in the worst case it requires as many model-

checking runs as there are subformulas in the property. Our

goal is to reduce the number of model-checking runs required

to detect vacuity. We do this by detecting some vacuity through

novel and inexpensive techniques reported in this article, and

complete the method by running the naive algorithm on the

remaining atomic subformulas. The key to our technique is

that SAT-based BMC can automatically provide useful infor-

mation (a resolution proof) beyond a decision whether the

property holds on the model; we exploit such proofs for par-

tial vacuity detection.

In SAT-based BMC, the property and the behavior of the

model are encoded in a propositional theory, such that the

theory is satisﬁable if and only if the formula does not hold.

When the property does hold, a DPLL-based SAT solver can

produce a resolution proof that derives false from a subset of

the clauses in the theory called the UNSAT core. Intuitively,

the resolution proof provides an explanation of why the prop-

erty is not falsiﬁed by the model, and the UNSAT core deter-

mines the relevant parts of the model and the property [19].

In this article, we develop three methods of increasing

precision (irrelevance, local irrelevance, and peripherality)

to analyze the resolution proof to achieve partial vacuity de-

tection. These algorithms are used by our vacuity detection

tool, VaqTree,in order to reduce the number of model-checking

runs required to ﬁnd all sources of vacuity, thus reducing exe-

cution times. Irrelevance and local irrelevance detect vacuity

based on which variables appear in the UNSAT core, and in

which locations. However, as these methods only examine

the UNSAT core, their precision is limited. The periphera-

lity algorithm examines the structure of the resolution proof,

identifying as vacuous those variables that are not necessary

or central to the derivation of false. This method is as pre-

cise as can be achieved through analyzing a single resolution

proof, and its running time is linear in the size of the resolu-

tion proof and the number of variables in the property. Our

experience shows that local irrelevance is the ideal candidate

for speeding up naive vacuity detection.

The remainder of the article is organized as follows. Sec-

tion 2 presents some required background, followed, in Sec-

tion 3 by our deﬁnition of vacuity, the naive algorithm for

LTL vacuity detection using BMC, and an overview of work

in the vacuity detection ﬁeld. Section 4 presents the three al-

gorithms that detect vacuity by analyzing a resolution proof.

Tool support for our approach is described in Section 5. Our

experimental results are presented in Section 6. We conclude

with a summary, additional related work, and suggestions for

future work in Section 7.

2 Background

In this section, we review bounded model-checking and res-

olution proofs.

2.1 Bounded Model-Checking

Bounded model-checking (BMC) [4] is a method for deter-

mining whether a linear temporal logic (LTL) formula ϕ holds

on a ﬁnite state system represented by a Kripke structure K

up to a ﬁnite number of steps. To solve an instance of the

BMC problem, denoted by BMC

(K, ϕ), it is required to de-

termine whether K |=

ϕ, where |=

is the k-depth satisfac-

tion relation. Below, we give an informal overview of Kripke

structures, LTL formulas and BMC. More detailed deﬁnitions

can be found in [7,4].

A Kripke structure K has a ﬁnite set of states S, one of

which is considered to be the initial state s

. A transition re-

lation R ⊆ S × S relates states to states. Each state is labeled

by the set of propositional formulas (or variables) that hold in

that state. A run of K is a sequence of states starting with s

that obeys R. Each run has an associated trace π, where π

is simply the set of propositional formulas that label the i

state in the run. We write π

to denote the sufﬁx of the trace

beginning at i.

LTL formulas are built from propositional variables, the

usual boolean operators (∨, ∧, ¬), and the temporal op-

erators G (“always”), F (“eventually”), U (“until”), and X

(“next”). Their semantics are deﬁned on linear traces, such as

those produced by runs of a Kripke structure. π |= ϕ means

{p} {q}

Fig. 1: A Kripke structure.

that the trace π satisﬁes the LTL formula ϕ. For example,

π |= Fϕ if and only if there exists some i such that ϕ holds

on π

. The satisfaction relation |= is deﬁned inductively in

a similar way for all operators and propositional variables in

LTL. We refer the reader to [7] for a detailed description of

the semantics of LTL.

A Kripke structure K satisﬁes an LTL formula ϕ if and

only if π |= ϕ for all traces π of K. The BMC problem

BMC

(K, ϕ) is to determine whether K satisﬁes ϕ for up to

k steps, i.e., whether K |=

ϕ. The k-depth satisfaction rela-

tion |=

is deﬁned inductively; for example, π |=

Gϕ if and

only if π

k−1

ϕ for all i ≤ k.

To determine whether K |=

ϕ, the problem is converted

to a propositional formula Φ (see [4,6,5]) which is satisﬁ-

able if and only if there exists a length-k counterexample to

K |=

ϕ. Φ is then given to a SAT solver which decides its

satisﬁability. The propositional encoding represents the be-

havior of K up to k steps with a path constraint CL

, and

encodes all counterexamples to ϕ of length k in an error con-

straint CL

. Therefore, if the theory CL

∪ CL

is satis-

ﬁable, there is a path through K which obeys the transition

relation and falsiﬁes ϕ. The value of each variable v of K at

each time step is represented using new boolean variables v

(0 ≤ i ≤ k), called timed variables.

The transition relation of a Kripke structure can be repre-

sented symbolically by a propositional formula over the vari-

ables V and primed variables V

′

(which represent the vari-

ables in the next state). For example, in the model in Fig-

ure 1, the transition relation is represented by the formula

R = (p ∧ ¬q ∧ ¬p

′

∧ q

′

) ∨ (¬p ∧ q ∧ ¬p

′

∧ q

′

). The path

constraint is obtained by substituting the timed variables V

for V in R, and replacing V

′

by the timed variables for the

next step, V

i+1

. This is repeated for each 0 ≤ i < k, and the

resulting propositional formulas are conjoined along with a

formula representing the initial state [4]. In Figure 1, if k = 1,

= (p

∧ ¬q

) ∧ ((p

∧ ¬q

∧ ¬p

∧ q

)

∨(¬p

∧ q

∧ ¬p

∧ q

)).

The error constraint CL

is encoded according to a recursive

procedure which removes the temporal and logical operators

from the property [4], e.g., the algorithm encodes ϕ = Gp,

where p is a propositional variable, expanded up to k = 2, by

the formula ¬p

∨ ¬p

After the boolean formulas for the path and error con-

straints are calculated, they are converted to Conjunctive Nor-

mal Form (CNF) before being passed to a SAT solver. If

the solver reports that CL

∪ CL

is unsatisﬁable, it means

that there is no length-k counterexample to ϕ; otherwise, a

(¬r

) (r

∨ p

) (¬p

∨ q

) (¬p

∨ ¬q

) (p

)

(¬p

)

()

Fig. 2: A resolution proof for EXAMPLE 2.

satisfying assignment is returned. When a DPLL-based SAT

solver processes an unsatisﬁable theory, a resolution deriva-

tion of false (or the empty clause) is implicitly constructed [10,

27]. This resolution proof is used to verify that false can in-

deed be derived from CL

∪ CL

[28].

2.2 Resolution Proofs

Resolution is an inference rule that is applied to propositional

clauses to produce logical consequences. A clause is a dis-

junction of literals (boolean variables or their negations). For

example, (v

∨ ¬v

∨ v

) is a clause stating that at least one

of v

, ¬v

or v

must be true. The resolution rule takes two

clauses, where one contains a literal v and the other – its nega-

tion ¬v, and produces a clause containing the union of the

two clauses’ literals minus v and ¬v. For example, resolv-

ing (v

∨ ¬v

∨ v

) and (v

∨ v

) produces the resolvent

∨ v

A resolution proof Π is a directed acyclic graph whose

nodes are labeled by propositional clauses. Π represents a

tree of resolutions between the clauses labeling its nodes.

Its roots are the nodes with no parents; otherwise, all nodes

have exactly two parents. The nodes with no children are

called the leaves. For example, the roots of resolution proof

Π in Figure 2 are Roots(Π) = {(¬r

), (r

∨ p

), (¬p

∨

), (¬p

∨¬q

), (p

)}, and the leaf of Π is the empty clause,

i.e., Leaf (Π) = false. Given a non-root node labeled by the

clause c, and the labels of its parents, c

and c

, c is the re-

solvent since it has been produced by resolving c

and c

some variable v. A resolution proof Π is a proof of unsat-

isﬁability of a set of clauses A if and only if all roots of Π

belong to A, and one of the leaves of Π is the empty clause.

For example, Figure 2 shows a resolution proof of the unsat-

isﬁability of Roots(Π). If a propositional theory in CNF is

unsatisﬁable, an UNSAT core is an unsatisﬁable subset of its

clauses.

Given two disjoint sets of clauses A and B, a variable v is

said to be local to A if and only if v appears in A but does not

appear in B, and v is said to be global if it appears in both

A and B. In Figure 2, if Roots(Π) = A ∪ B, where A =

{(¬r

), (r

∨ p

), (¬p

∨ q

)} and B = {(¬p

∨ ¬q

), (p

)},

then r

is local to A, and the rest of the variables are global.

3 Deﬁning Vacuity

This article uses the following deﬁnition of vacuity.

Deﬁnition 1. Let K be a Kripke structure, ϕ be a formula

s.t. K |=

ϕ, and p be a variable. ϕ is k-step p-vacuous iff

K |=

ϕ[p ← x], where x is a variable not occurring in K or

in ϕ.

If ϕ is k-step p-vacuous, we call p a k-step vacuous vari-

able. A property ϕ is k-step vacuous if and only if ϕ contains

a k-step vacuous variable. Therefore, our techniques aim to

ﬁnd the k-step vacuous variables of ϕ. The qualiﬁer “k-step”

is omitted in the remainder of the article but should be under-

stood implicitly in the BMC context.

Deﬁnition 1 can be generalized to vacuity in arbitrary (not

necessarily atomic) subformulas. This follows from the fact

that a subformulais vacuous iff it is mutually vacuous in all of

its atomic propositions [13, Th. 9], and that the deﬁnitions can

be easily extended to mutual vacuity. A set of atomic propo-

sitions {p

, ..., p

} is mutually vacuous if K |=

ϕ[p

←

, ..., p

← x

], where {x

, ..., x

} are new variables. For

example, if ϕ contains subformula θ = p ∧ q, and p and q

are mutually vacuous, then we can deduce that θ is vacuous

as well.

Naive Vacuity Detection. Deﬁnition 1 suggests a sound and

complete algorithm for vacuity detection: for each proposi-

tional variable p in ϕ, run BMC on ϕ[p ← x], where x is a

variable that does not appear in K and ϕ. If K |=

ϕ[p ← x]

for some p, then ϕ is k-step vacuous. We refer to this algo-

rithm as naive. Its drawback is that it may require as many

model-checking runs as there are propositional variables in

ϕ.

We now review some of the alternativedeﬁnitions of vacu-

ity and their algorithms. The ﬁrst attempt to formulate and

automate vacuity detection is due to Beer et al. [2]. They

consider a property ϕ to be vacuous if ϕ contains a sub-

formula ψ such that replacing ψ by any other formula does

not affect the satisfaction of ϕ. Applying this deﬁnition di-

rectly would require an inﬁnite number of subformula re-

placements, precluding a practical implementation. However,

Beer et al. show that to detect vacuity w.r.t. a single occur-

rence of a subformula ψ in w-ACTL, it is sufﬁcient to replace

ψ with only true and false. This was later extended to CTL*

by Kupferman and Vardi [18], and to the modal µ−calculus

by Dong et al. [9]. Purandare and Somenzi [21] showed how

to speed up subformula vacuity by analyzing the parse tree of

a CTL property.

Armoni et al. [1] generalized the above syntactic deﬁni-

tion of vacuity by introducing universal quantiﬁcation, i.e.,

∀x · ϕ[ψ ← x]. Based on the domain of x, three notions of

vacuity are obtained, the most robust of which being trace

vacuity. Gurﬁnkel and Chechik [12] extended Armoni’s deﬁ-

nition of vacuity to CTL*, thus uniformly capturing CTL and

LTL. Armoni et al. also analyzed the syntactic structure of

the property in order to avoid checking the operands of sub-

formulas that are known to be vacuous. Such optimizations

complement our techniques, which focus on detecting vacu-

ous atomic subformulas.

In [20], Namjoshi has introduced a proof-based variant

of vacuity. Although it is called proof vacuity in the original

paper, we refer to it as forall-proof vacuity. This deﬁnition is

based on the semantic proofs of K |= ϕ for a Kripke structure

K and a formula ϕ. Informally, a formula ϕ is forall-proof

vacuous in a subformula ψ if ψ is not used in any proof of

K |= ϕ. Of course, a formal deﬁnition depends on the exact

interpretation of the notion of “proof”. In comparison, we use

an “existential” deﬁnition of vacuity: a formula is vacuous

if there exists a proof that does not use a subformula. Inter-

estingly, we rely on syntactic (and not semantic) resolution

proofs that may include “semantically-useless” resolutions.

As a result, it is possible that a formula ϕ is vacuous in ψ in

a model K, yet there is no resolution proof of bounded satis-

faction of K |= ϕ that does not use ψ. More importantly, our

goal is to develop a method to efﬁciently detect vacuity for

LTL as it was deﬁned by [2,3,1,12], whereas Namjoshi was

looking for an alternative deﬁnition of vacuity for branching

time logic.

Our deﬁnition of vacuity is syntactic, and in this respect,

it is similar to the original deﬁnition of Beer et al. [2]. How-

ever, Deﬁnition 1 is stronger, and is equivalent to the seman-

tic deﬁnition of Armoni et al. [1], as shown by Gurﬁnkel and

Chechik [12].

4 Exploiting Resolution Proofs

In Section 3, we discussed the existence of a sound and com-

plete vacuity detection algorithm for BMC, which requires

as many model-checking runs as there are propositional vari-

ables in the property being checked. We propose a new vacu-

ity detection strategy: ﬁrst detect partial vacuity using inex-

pensive techniques and then complete the analysis using ex-

tra model-checking runs. Since we are interested in replacing

expensive model-checking runs by inexpensive partial vacu-

ity detection methods, we limit ourselves to considering the

output of the original model-checking run on BMC

(K, ϕ),

i.e., CL

∪ CL

. This run provides us with a single reso-

lution proof to analyze. Of course, in general, there may be

many ways to derive the empty clause from different sub-

sets of BMC

(K, ϕ). Any method that only examines one of

these derivations is inherently incomplete, in the sense that

a property may be p-vacuous but there is no way of deter-

mining this based on a given resolution proof. For example,

consider a model that is composed of two completely disjoint

sub-models, running in parallel, i.e., K = K

k K

. Suppose

that K

satisﬁes Gp, K

satisﬁes Gq, and that both do so

non-vacuously. Then the property ϕ = Gp ∨ Gq holds on K

p-vacuously and q-vacuously. However, one of the possible

resolution proofs showing that ϕ holds proves that Gp holds

non-vacuously on K

. Thus, it is impossible to determine that

ϕ is vacuous in p from this proof. Any method based on ex-

amining only one resolution proof cannot prove the absence

of vacuity, since another resolution proof, showing the prop-

erty to be vacuous, might exist.

In this section, we introduce three algorithms of increas-

ing precision for partial vacuity detection, based on examin-

ing the UNSAT core (irrelevance and local irrelevance) and

the resolution proof produced by BMC (peripherality).

4.1 Examining UNSAT cores

Given a resolution proof that BMC

(K, ϕ) is unsatisﬁable,

we can sometimes cheaply determine that the similar theory

BMC

(K, ϕ[p ← x]) is also unsatisﬁable, and therefore, that

the property is p-vacuous. In this section, we consider how

to determine that BMC

(K, ϕ[p ← x]) is unsatisﬁable given

that BMC

(K, ϕ) is unsatisﬁable, using only an UNSAT core.

4.1.1 Irrelevance

Intuitively, any variable that does not appear in the UNSAT

core does not contribute to the reason why ϕ holds on K, so

it can be considered irrelevant.

Deﬁnition 2. Let K be a model, and ϕ an LTL formula. As-

sume that Π is an UNSAT core of BMC

(K, ϕ) witnessing

that K |=

ϕ. Then, p is irrelevant with respect to

BMC

(K, ϕ) and Π iff p

does not appear in Π for any time

instance i.

If a variable is irrelevant, it is also vacuous, as shown by

the following theorem.

Theorem 1. If p is irrelevant with respect to BMC

(K, ϕ)

and Π, then ϕ is k-step p-vacuous.

Proof: Let BMC

(K, ϕ) = CL

∪ CL

and U be the UNSAT

core returned by the SAT solver for BMC

(K, ϕ). Assume that

p is irrelevant in BMC

(K, ϕ). So U does not contain any p

Deﬁnition 2. Therefore, U ⊆ CL

∪ CL

implies U ⊆ CL

∪

← x

| 0 ≤ i < k]. U is also an UNSAT core of

BMC

(K, ϕ[p ← x]) so ϕ[p ← x] holds on K. Thus, ϕ is p-

vacuous.

Deﬁnition 2 provides an algorithm to detect some vacu-

ous variables. However, a variable can appear in the UNSAT

core and still be vacuous, as demonstrated by the following

example.

EXAMPLE 1. Consider a Kripke structure K with variables p

and q given by the constraints Init = p ∧ q, R = p ⇒ q

′

which mean that the initial state is labeled by {p, q}, and

the transition relation is expressed by the propositional for-

mula p ⇒ q

′

over unprimed and primed variables. Let ϕ =

X(p ∨ q) be the property to check. ϕ is p-vacuous since it is

satisﬁed simply because q is true in any successor of the ini-

tial state. The CNF encoding of the one-step BMC problem

is CL

= {(p

∧ q

), (p

⇒ q

)} = {(p

), (q

), (¬p

, q

)},

= {(¬p

), (p

, ¬q

)}. In this case, the unique minimal

UNSAT core contains all of the clauses of the problem except

for (q

). Thus, all p

appear in the UNSAT core, and p cannot

be determined vacuous using irrelevance.



) (¬p

, q

) (x

, ¬q

) (¬x

)

¬q

()

Fig. 3: A resolution proof for EXAMPLE 1.

This example shows that even if we are to look at every

UNSAT core of a BMC problem, irrelevance is still unable to

detect existing vacuity.

4.1.2 Local Irrelevance

Variables which do not appear in the UNSAT core are vac-

uous. The converse is not true: vacuous variables may also

appear in the UNSAT core. Intuitively, these variables are

not the central reason why ϕ holds on K. For example, the

clauses of CL

may resolve against each other, representing

some simpliﬁcation and uniﬁcation of parts of the model, be-

fore resolutions with CL

clauses are performed. If a variable

is resolved upon using only the CL

clauses or only the CL

clauses, it is potentially vacuous. By looking at the UNSAT

core, it is possible to anticipate whether a variable will not

be involved in resolutions between CL

and CL

using the

following deﬁnition.

Deﬁnition 3. Let K be a model, and ϕ an LTL formula. As-

sume that Π is an UNSAT core of BMC

(K, ϕ) witness-

ing K |=

ϕ. Then, p is locally irrelevant with respect to

BMC

(K, ϕ) and Π iff for each time instance i, either p

does

not appear in Π or p

is local to either CL

∩Π or CL

∩Π.

In EXAMPLE 1, p is locally irrelevant since p

only oc-

curs in the clauses of U taken from CL

, while p

only ap-

pears in U within CL

clauses. Moreover, the UNSAT core

of the original problem can be convertedto an UNSAT core of

the new theory, thus proving that p is vacuous. Speciﬁcally,

U = {(p

), (¬p

, q

), (¬p

), (p

, ¬q

)} is the UNSAT core

of the original problem, so substituting x for p in the clauses

of U that came from CL

gives

′

= {(p

), (¬p

, q

), (¬x

), (x

, ¬q

)}.

This is a subset of

BMC

(K, ϕ[p ← x]) = {(p

), (q

), (¬p

, q

), (¬x

, ¬q

)},

so it is a candidate for the new UNSAT core. The substitution

may have prevented the resolutions necessary to derive the

empty clause. However, Figure 3 shows a proof that U

′

also unsatisﬁable. In this case, it was possible to substitute x

for p

in the clauses coming from CL

in the original UNSAT

core and create an UNSAT core for BMC

(K, ϕ[p ← x]). In

fact, this observation applies to all cases of local irrelevance

by Theorem 2. Therefore, Deﬁnition 3 speciﬁes an algorithm

to detect some vacuous variables.

Theorem 2. If p is locally irrelevant with respect to

BMC

(K, ϕ) and Π, then ϕ is k-step p-vacuous.

Proof: Let BMC

(K, ϕ) = CL

∪ CL

and U be the UN-

SAT core returned by the SAT solver for BMC

(K, ϕ). Assume

that p is locally irrelevant in BMC

(K, ϕ). So for all p

, either

does not appear in U , or p

is local to CL

∩ U = U

to CL

∩ U = U

by Deﬁnition 3. Let U

′

be U

with each

occurence of p

replaced by x

. Since each p

that has been

replaced is local to U

, and U

∪ U

= U is unsatisﬁable, then

∪ U

′

is also unsatisﬁable. Since U

′

⊆ CL

← x

| 0 ≤

i < k], the set of clauses CL

∪ CL

← x

| 0 ≤ i < k] is

unsatisﬁable as well. Therefore, K |=

ϕ[p ← x] holds, so ϕ

is p-vacuous.

Unfortunately, if a variable p is not locally irrelevant in

an UNSAT core, the formula can still be p-vacuous, as shown

by the following example.

EXAMPLE 2. Consider a Kripke structure with atomic propo-

sitions r, p and q whose initial state is given by the constraint:

Init = ¬r ∧ p ∧ q. The formula ϕ = ¬p ∨ q is p-vacuous in

the initial state. Let us assume that the zero-step BMC prob-

lem is encoded in CNF as follows:

= (¬r

)(r

∨ p

)(¬p

∨ q

)

= (p

)(¬p

∨ ¬q

)

There are several resolution proofs that can establish un-

satisﬁability of CL

∪ CL

; one such proof is shown in Fig-

ure 2. In none of the proofs is p locally irrelevant with respect

to CL

and CL

The problem with local irrelevance is that it is impossible

to tell if a variable is going to be used in a resolution joining

and CL

clauses based on the UNSAT core alone.



4.2 Peripherality

In Section 4.1, two vacuity detection methods based on ex-

amining the variables in the UNSAT core were found to fall

short of completeness. It was seen that even if every possible

resolution proof could be analyzed, irrelevance and local ir-

relevance still might fail to detect existing vacuity. Here, we

extend the analysis to the resolution proof’s structure. The

resulting peripherality algorithm is superior, since it guaran-

tees vacuity will be found if all possible resolution proofs are

considered.

The limitations of detecting vacuity based only on the

UNSAT core were demonstrated in EXAMPLE 2. By exam-

ining the resolution proof in Figure 2, we see that although

appears both in CL

clauses and in CL

clauses, it is

always resolved “locally”. That is, if we resolve two clauses

= (..., p

, ...) and c

= (..., ¬p

, ...), p

and ¬p

must have

been preserved from their original source in some set of root

clauses. If all the originating root clauses belong to CL

all belong to CL

, then p

is being resolved on locally. In this

case, we can replace p

in either set of clauses without af-

fecting their unsatisﬁability. For example, in Figure 2, p

can

HTML Viewer

Frequently Asked Questions (11)

Q1. What contributions have the authors mentioned in the paper "Exploiting resolution proofs to speed up ltl vacuity detection for bmc" ?

The authors address the problem of efficient vacuity detection for Bounded Model Checking ( BMC ) of LTL properties, presenting three partial vacuity detection methods based on the efficient analysis of the resolution proof produced by a successful BMC run. In particular, the authors define a characteristic of resolution proofs – peripherality – and prove that if a variable is a source of vacuity, then there exists a resolution proof in which this variable is peripheral.

Q2. What are the future works in "Exploiting resolution proofs to speed up ltl vacuity detection for bmc" ?

The authors plan to investigate this further in the future. The authors plan to enhance their methods by developing a heuristic based on the clause/variable ratio and proof size that indicates when naive detection should be applied instead. Thus, the authors believe that both local irrelevance and peripherality can be used to speed up naive detection.

Q3. How long did it take to generate the CNF theories?

Naive vacuity detection required eight model-checking runs, taking 115.68 seconds to generate the corresponding CNF theories and 2.36 seconds to test their satisfiability, requiring a total of 118.04 seconds.

Q4. Why do the authors think that naive detection is more effective than local irrelevance?

The authors conjecture that the poor performance is due to a low clause/variable ratio [22] which favours naive detection in cases where vacuity is not present.

Q5. What is the recursive procedure for encoding the error constraint CLe?

The error constraint CLe is encoded according to a recursive procedure which removes the temporal and logical operators from the property [4], e.g., the algorithm encodes ϕ =

Q6. What is the way to determine the vacuity of a model?

Since the authors are interested in replacing expensive model-checking runs by inexpensive partial vacuity detection methods, the authors limit ourselves to considering the output of the original model-checking run on BMCk(K, ϕ), i.e., CLK ∪ CLe.

Q7. What is the procedure for converting the path and error constraints?

After the boolean formulas for the path and error constraints are calculated, they are converted to Conjunctive Normal Form (CNF) before being passed to a SAT solver.

Q8. What is the labeling function used to represent a clause?

If Π is a resolution proof whose root clauses are divided into two disjoint sets,A∪B, then the labeling functionL is defined recursively as shown in Figure 4, where c is used to represent a clause.

Q9. How did Armoni et al. generalize the definition of vacuity?

Armoni et al. [1] generalized the above syntactic definition of vacuity by introducing universal quantification, i.e., ∀x · ϕ[ψ ← x].

Q10. How can the authors make the SAT solver more effective?

It might be possible to make them more effective by modifying the SAT solver to guide it to a particular kind of a proof (e.g., by changing the decision order heuristic), or to output multiple proofs (if possible).

Q11. What is the way to detect vacuity?

In this context, a naive method for detecting vacuity is to replace subformulas of the temporal logic property with unconstrained boolean variables and run BMC for each such substitution.

Exploiting resolution proofs to speed up LTL vacuity detection for BMC

Summary (3 min read)

1 Introduction

2.1 Bounded Model-Checking

2.2 Resolution Proofs

3 Defining Vacuity

4 Exploiting Resolution Proofs

4.1 Examining UNSAT cores

4.2 Peripherality

6 Practical Experience

6.1 Results obtained with SA

6.2 Results obtained with SB

6.3 Conclusions

Figures (10)

Citations

Cites background from "Exploiting resolution proofs to spe..."

Cites background from "Exploiting resolution proofs to spe..."

References

"Exploiting resolution proofs to spe..." refers background or methods in this paper

"Exploiting resolution proofs to spe..." refers background in this paper

Related Papers (5)

Frequently Asked Questions (11)

Q1. What contributions have the authors mentioned in the paper "Exploiting resolution proofs to speed up ltl vacuity detection for bmc" ?

Q2. What are the future works in "Exploiting resolution proofs to speed up ltl vacuity detection for bmc" ?

Q3. How long did it take to generate the CNF theories?

Q4. Why do the authors think that naive detection is more effective than local irrelevance?

Q5. What is the recursive procedure for encoding the error constraint CLe?

Q6. What is the way to determine the vacuity of a model?

Q7. What is the procedure for converting the path and error constraints?

Q8. What is the labeling function used to represent a clause?

Q9. How did Armoni et al. generalize the definition of vacuity?

Q10. How can the authors make the SAT solver more effective?

Q11. What is the way to detect vacuity?