What future works have the authors mentioned in the paper "Exploiting inductive logic programming techniques for declarative process mining" ?

In the future, the authors plan to apply DecMiner to university students ’ careers, where positive traces are careers of students that graduated on time, and negative ones are careers of students who did not finish their studies in the prescribed time. Moreover, the authors plan to investigate the development of a mining-checking cycle, in which learning is interleaved with classification of traces into positive or negative either manually by the user or automatically using the SCIFF Checker plug-in with a user specified model.

What is the generality order used for a clause?

The generality order that is used is θ-subsumption [19], a relationships between two clauses that can be checked syntactically and is stronger than implications.

What is the function that performs the covering loop?

In it, a function named Inductive-Constraint-Logic performs a covering loop in which negative interpretations are progressively ruled out and removed from the set N .

What is the advantage of mining ConDec constraints through SCIFF?

An advantage of mining ConDec constraints through SCIFF is that the approach can be extended to induce constraints involving more than two activities, for example constraints having a conjunction of preconditions or a disjunction of postconditions, and constraints with conditions over data.

What are the activities that are represented by the billings for each service?

Activities room service, laundry service, and massage service log which services have been accessed to by the client, while billings for each service are represented by corresponding activities.

What is the approach for learning process models of [9]?

The approach for learning process models of [9] involves iterating planning and operator refinement: given the current definition of the pre-conditions and post-conditions of the activities, a plan for achieving the business goal is generated and presented to the user which has to specify whether each activity of the plan can be executed.

What is the effect of the learning of ConDec models?

They influence the accuracy of the learned model because an activity relation discriminating between compliant and non-compliant execution traces cannot be learned if the appropriate template and/or activities were not chosen.

What is the importance of a declarative style of modeling?

The importance of adopting a declarative style of modeling has been recently pointed out by van der Aalst and Pesic [18]: the authors agree with their claim that declarative languages fit better complex, unpredictable processes, where a good balance between support and flexibility is of key importance.

What are the constraints that the authors learn from them?

From them the authors learn a set of declarative constraints expressed as SCIFF rules able to accurately classify a new trace, and corresponding to a ConDec model.

What is the purpose of the proposed approach?

In order to avoid asking the user to classify activities, [10] proposed an approach for automatically generating negative events, i.e., events that are used as negative examples.

What is the main purpose of DecMiner?

DecMiner implements all the data preparation and learning phases of the mining process described above and guides the user by means of its graphical user interface.

What is the first phase of the learning approach?

In the third phase, named “Templates”, the user uses the graphical interface shown in Figure 4 to choose the set of existence, relation and negation ConDec templates to be used in the mining phase.

How can the authors apply an algorithm similar to ICL for learning ICs?

If the authors define a generality order and a generalization operator for ICs, the authors can apply an algorithm similar to ICL for learning ICs.

Why do the authors differ from these works?

The authors differ from these works because the authors use a representation that is declarative rather than procedural, without sacrificing expressiveness.

What is the relation between BPM and the field of planning in artificial intelligence?

[9] related BPM to the field of planning in artificial intelligence: activities in business process are seen as planning operators with pre-conditions and postconditions.

How did the authors study the robustness of DecMiner to noise?

The authors also investigated the robustness of DecMiner to noise in the classification of traces: the authors repeated the experiments by considering training sets with an increasing portion of misclassified examples.

(Open Access) Exploiting Inductive Logic Programming Techniques for Declarative Process Mining (2009) | Federico Chesani

Q: What contributions have the authors mentioned in the paper "Exploiting inductive logic programming techniques for declarative process mining" ?

In this paper, the authors present a logic-based approach for tackling this problem. The authors investigate how, by properly tuning the learning algorithm, the approach can be adopted to mine models expressed in the ConDec notation, a graphical language for the declarative specification of business processes. The authors finally discuss the effectiveness of the approach by means of an example which shows the ability of the language to model concurrent activities and of DecMiner to learn such a

Q: What are the main constraints in the SCIFF?

They are mainly organized into three basic groups: (i) existence constraints, unary relationships constraining the cardinality of activity executions; (ii) relation constraints, positive relationships between two activities used to specify what should be executed when a given situation holds; (iii) negation constraints, the negated version of relation ones, imposed to forbid the execution of a certain activity when a given situation holds.

Exploiting Inductive Logic Programming

Techniques for Declarative Process Mining

Federico Chesani

, Evelina Lamma

, Paola Mello

Marco Montali

, Fabrizio Riguzzi

, and Sergio Storari

DEIS – Universit`a di Bologna

viale Risorgimento, 2 – 40136 – Bologna, Italy

{federico.chesani,paola.mello,marco.montali}@unibo.it

ENDIF – Universit`a di Ferrara

Via Saragat, 1 – 44100 – Ferrara, Italy

{evelina.lamma,fabrizio.riguzzi,sergio.storari}@unife.it

Abstract. In the last few years, there has been a growing interest in

the adoption of declarative paradigms for modeling and verifying pro-

cess models. These paradigms provide an abstract and human under-

standable way of specifying constraints that must hold among activities

executions rather than focusing on a speciﬁc procedural solution. Min-

ing such declarative descriptions is still an open challenge. In this paper,

we present a logic-based approach for tackling this problem. It relies on

Inductive Logic Programming techniques and, in particular, on a modi-

ﬁed version of the Inductive Constraint Logic algorithm. We investigate

how, by properly tuning the learning algorithm, the approach can be

adopted to mine models expressed in the ConDec notation, a graphical

language for the declarative speciﬁcation of business processes. Then, we

sketch how such a mining framework has been concretely implemented

as a ProM plug-in called DecMiner. We ﬁnally discuss the eﬀectiveness

of the approach by means of an example which shows the ability of the

language to model concurrent activities and of DecMiner to learn such a

model.

1 Introduction

When facing the problem of deﬁning and developing a Business Process (BP), we

can mainly identify two diﬀerent and complementary roles: the business analyst,

a domain expert aiming at improving the performances of her company, and

the IT-expert, who has the responsibility of bringing business-level models to an

eﬀective underlying implementation. The complementarity of these roles leads

to diﬀerent perspectives about the process to be developed: while the IT-expert

typically adopts a procedural style of modeling, dealing with implementation

aspects and trying to obtain an executable process, the business analyst follows

a more declarative approach (see Figure 1). Indeed, at a business level it is very

important to represent in an intuitive and concise way the domain and problem

under study, rather than focusing on a speciﬁc solution. In this respect, the

K. Jensen and W. van der Aalst (Eds.): ToPNoC II, LNCS 5460, pp. 278–295, 2009.

 Springer-Verlag Berlin Heidelberg 2009

Exploiting Inductive Logic Programming Techniques 279

execution

modeling

procedural

model

declarative

model

policies

regulations

business

rules

execution traces

Declarative

Process Mining

mining

Fig. 1. Declarative and procedural perspectives when modeling Business Processes

model will typically involve business rules, covering best practices and internal

constraints as well as internal/external regulations and compliance requirements.

The importance of adopting a declarative style of modeling has been recently

pointed out by van der Aalst and Pesic [18]: we agree with their claim that

declarative languages ﬁt better complex, unpredictable processes, where a good

balance between support and ﬂexibility is of key importance. To this end, in [18]

they propose a new graphical language for specifying process ﬂows in a declara-

tive manner. The language, called ConDec, does not completely ﬁx the control

ﬂow among activities, but rather envisages a set of constraints expressing poli-

cies/business rules for specifying either what is forbidden as well as mandatory

in the process. Therefore, the approach is inherently open and ﬂexible, because

workers can perform actions if they are not explicitly forbidden. ConDec adopts

an underlying semantics by means of Linear Temporal Logics (LTL), and can

also be mapped onto a logic programming-based framework called SCIFF (So-

cial Constrained IFF) [2,4], which was originally developed for the speciﬁcation

and veriﬁcation of global interaction protocols in open Multi-Agent Systems but

has recently been applied in the context of BPs and SOA (Service-Oriented

Architecture) Choreographies. SCIFF provides a declarative language based on

Computational Logic, where constraints are imposed on activities in terms of re-

active rules (namely Integrity Constraints). Such reactive rules mention in their

body occurring activities, i.e., events, and additional constraints on their vari-

ables in the style of Constraint Logic Programming (CLP) [12]. SCIFF rules

contain in their head expectations over the course of events. Such expectations

can be positive, when a certain activity is required to happen, or negative, when

a certain activity is forbidden to happen.

An important topic related to declarative process speciﬁcation, which is still

an open challenge, concerns their discovery starting from execution traces, i.e.,

declarative process mining. Indeed, up to now, the goal of process mining has

been the discovery of procedural process models (such as Petri Nets or Event-

driven Process Chains [21,24]). We claim the necessity of mining also declarative

models, to enable the possibility of inferring essential process constraints, easily

understandable by business analysts and not aﬀected by procedural details.

In this paper, we present a logic-based approach to address this issue. It

relies on Inductive Logic Programming (ILP) techniques and, in particular, on

a modiﬁed version of the Inductive Constraint Logic (ICL) algorithm [15]. The

280 F. Chesani et al.

algorithm takes as input a set of process execution traces, previously labeled

as compliant or not, and produces a set of SCIFF rules which correctly classify

them. This algorithm has been further modiﬁed, by properly tuning it and relying

on the mapping presented in [4], for learning ConDec models. Then, we describe

how the whole approach has been implemented as a plug-in of the ProM [23]

process mining framework. The plug-in, called DecMiner, is capable of mining

ConDec models starting from a set of process execution traces. The plug-in

envisages diﬀerent phases, ranging from the classiﬁcation of traces into compliant

and non-compliant subsets to the choice of which ConDec constraints have to be

considered and ﬁnally to the presentation of the mined model. The eﬀectiveness

of the approach is illustrated by considering an example inspired by the one

presented in [17] that involves the management of a hotel and spa.

Our previous papers on process mining [14,13] focused on the algorithm for

learning SCIFF rules and presented only a sketch of the technique for the trans-

lation into ConDec. In this work we describe how we automated this process and

implemented it into the DecMiner ProM plug-in.

The paper is organized as follows. Section 2 describes the declarative languages

we consider, namely SCIFF and ConDec, and the mapping between ConDec

and a subset of SCIFF rules. Section 3 presents the learning process and the

DecMiner plug-in. Section 4 discusses the experiments performed for validating

the approach. Section 5 presents related works and, ﬁnally, Section 6 concludes

the paper and discusses future work.

2 Declarative Speciﬁcation of Business Processes

In this section, we ﬁrst brieﬂy introduce the SCIFF language, a logic-based

language originally developed for specifying and verifying interaction protocols in

open Multi-Agent Systems [2]. We then brieﬂy describe ConDec [18], a graphical

language supporting the intuitive modeling of declarative constraints on the ﬂow

of activities. Finally, we sketch how SCIFF can be exploited to formalize ConDec

models as well as to extend its expressiveness, relying on the results presented

in [4].

2.1 An Overview of the SCIFF Framework

The SCIFF framework [2] is based on abduction, a reasoning paradigm which

allows to formulate hypotheses (called abducibles) accounting for observations.

In most abductive frameworks, integrity constraints are imposed over possible

hypotheses in order to prevent inconsistent explanations. SCIFF considers a

set of interacting peers as an open society, formalizing interaction protocols by

means of a set of global rules (constraints) which constrain the external and

observable behavior of participants.

To represent that an event ev happened (i.e., an atomic activity has been

executed) at a certain time T , SCIFF uses the symbol H(ev, T ), where ev is a

term and T is a variable or a number indicating the time. Hence, an execution

Exploiting Inductive Logic Programming Techniques 281

trace is modeled as a set of executed (happened) events. For example, we could

formalize that bob has performed activity a at time 5 as follows: H(a(bob), 5). Fur-

thermore, SCIFF introduces the concept of expectation, which plays a key role

when deﬁning global interaction protocols, choreographies, and more in general

event-driven processes. It is quite natural, in fact, to think of a process in terms

of rules of the form: “if ev

happened, then ev

is expected to happen.” Positive

expectations are denoted by E(ev, T ) meaning that ev is expected to happen

at time T . To satisfy a positive expectation, an execution trace must contain

a matching happened event. Negative expectations are denoted by EN(ev, T )

meaning that ev is expected not to happen at time T . To satisfy a negative

expectation an execution trace must not contain a matching happened event.

SCIFF Integrity Constraints (ICs for short) are forward rules of the form

body → head,wherebody can contain literals (i.e. a logical atom or its negation)

and happened events, and head contains a disjunction of conjunctions of expec-

tations and literals. In this paper, we consider a syntax of ICs that is a subset of

the one in [2]. In this simpliﬁed syntax, an IC C is a logical formula of the form

Body → DisjE

∨ ...∨ DisjE

∨ DisjEN

∨ ...∨ DisjEN

(1)

We will use Body(C) to indicate Body and Head(C) to indicate DisjE

∨ ...∨

DisjE

∨DisjEN

∨...∨ DisjEN

of a rule C. Body is of the form b

∧...∧b

where the b

s are literals. Some of the literals may be of the form H(ev, T )

meaning that event ev has happened at time T . DisjE

is a formula of the

form E(ev, T ) ∧ d

∧ ...∧ d

where ev is an event and the d

s are literals. All

the formulas DisjE

in Head(C) will be called positive disjuncts. DisjEN

is a

formula of the form EN(ev, T )∧d

∧...∧d

where ev is an event and the d

sare

literals. All the formulas DisjEN

in Head(C) will be called negative disjuncts.

The event ev can be a term. The literals b

sandd

s refer to predicates deﬁned

in a SCIFF knowledge base. Variables in common to Body(C)andHead(C)are

universally quantiﬁed (∀) with scope the whole IC. Variables occurring only in

positive disjuncts are existentially quantiﬁed (∃) with scope the disjunct itself.

Variables occurring only in negative disjuncts are universally quantiﬁed (∀)with

scope the disjunct itself. An example of an IC is

(IC.1) H(a(bob),T) ∧ T<10

→ E(b(alice),T1) ∧ T<T1 ∨

EN (c(mary),T2) ∧ T<T2 ∧ T 2 <T+10

The meaning of the IC.1 is the following: if bob has executed action a at a time

T<10, then we expect alice to execute action b at some time T 1 later than T

(∃T 1) or we expect that mary does not execute action c at any time T 2(∀T 2)

within 9 time units after T .

The interpretation of an IC is the following: if there exists a substitution of

variables such that the body is true in an interpretation representing a trace,

then one of the disjuncts in the head must be true. A positive disjunct means

that we expect event ev to happen with T and its variables satisfying d

∧...∧d

Therefore the disjunct is true if there exist a substitution of variables occurring

282 F. Chesani et al.

in it such that ev is present in the trace and the d

s are satisﬁed. A negative

disjunct means that we expect event ev not to happen with T and its variables

satisfying d

∧ ...∧ d

. Therefore the disjunct is true if for all substitutions of

variables occurring in it and not appearing in Body either ev does not happen

or, if it happens, its properties violate d

∧ ...∧ d

The main and original application of the SCIFF framework and its proof pro-

cedure is to verify whether an execution of the process concretely adheres to

the speciﬁcation, i.e., to perform compliance checking. SCIFF is seamlessly able

to check compliance both at run-time, by dynamically collecting and reason-

ing upon occurring events, or a posteriori, by analyzing the log of an observed

execution trace.

Roughly speaking, SCIFF combines occurred events with the speciﬁed rules,

to suitably generate the corresponding expectations; then expectations are veri-

ﬁed against the execution trace: a positive expectation must have a correspond-

ing matching event, whereas a negative expectation forbids the presence of a

matching event. If such conditions are not met (i.e., a positive/negative expec-

tation is not/is matched by a corresponding event), then the expectations are

violated, and the execution trace is evaluated as non-compliant.

A posteriori compliance checking has been wrapped into a ProM plug-in called

SCIFFChecker [3], which can be exploited to classify MXML execution traces

as compliant or non-compliant w.r.t. a high-level declarative criterion. Such a

criterion is speciﬁed by conﬁguring reactive business rules expressed in a natural

language-like manner and by automatically mapping them onto the underlying

formalism.

2.2 ConDec and Its SCIFF Mapping

ConDec [18,16] is a graphical language suitable for the declarative speciﬁcation

of ﬂexible Business Processes. Flexibility is provided since ConDec does not ﬁx

a completely speciﬁed process ﬂow, but rather imposes only the (minimal) set

of constraints that must be satisﬁed when executing the process activities. Con-

straints are policies/business rules which can be exploited to describe both what

is mandatory and what is forbidden in the process. They are mainly organized

into three basic groups: (i) existence constraints, unary relationships constraining

the cardinality of activity executions; (ii) relation constraints, positive relation-

ships between two activities used to specify what should be executed when a

given situation holds; (iii) negation constraints, the negated version of relation

ones, imposed to forbid the execution of a certain activity when a given situation

holds.

We have provided a complete mapping of ConDec relationships to SCIFF [4].

Table 1 shows some basic ConDec constraints, together with their corresponding

formalization. For example, the existence constraint speciﬁes that the involved

activity must be executed at least once; this can be expressed in SCIFF by simply

stating that the activity is expected to happen.Theresponded existence between

A and B imposes the existence of B only if activity A is executed, without

putting any temporal condition between the two executions. Temporizing such

Exploiting Inductive Logic Programming Techniques for Declarative Process Mining

Figures

Citations

Declarative specification and verification of service choreographiess

User-guided discovery of declarative process models

On the Discovery of Declarative Control Flows for Artful Processes

Discovering data-aware declarative process models from event logs

Online Discovery of Declarative Process Models from Event Streams

References

A Machine-Oriented Logic Based on the Resolution Principle

Negation as failure

Workflow mining: discovering process models from event logs

Inductive Logic Programming : Theory and Methods

Constraint logic programming : A survey

Related Papers (5)

DECLARE: Full Support for Loosely-Structured Processes

Declarative workflows: Balancing between flexibility and support

Efficient discovery of understandable declarative process models from event logs

Workflow mining: discovering process models from event logs

Verifiable agent interaction in abductive logic programming: The SCIFF framework

Frequently Asked Questions (18)

Q1. What contributions have the authors mentioned in the paper "Exploiting inductive logic programming techniques for declarative process mining" ?

Q2. What future works have the authors mentioned in the paper "Exploiting inductive logic programming techniques for declarative process mining" ?

Q3. What is the generality order used for a clause?

Q4. What is the function that performs the covering loop?

Q5. What are the main constraints in the SCIFF?

Q6. What is the advantage of mining ConDec constraints through SCIFF?

Q7. What are the activities that are represented by the billings for each service?

Q8. What is the approach for learning process models of [9]?

Q9. What is the effect of the learning of ConDec models?

Q10. What is the importance of a declarative style of modeling?

Q11. What are the constraints that the authors learn from them?

Q12. What is the purpose of the proposed approach?

Q13. What is the main purpose of DecMiner?

Q14. What is the first phase of the learning approach?

Q15. How can the authors apply an algorithm similar to ICL for learning ICs?

Q16. Why do the authors differ from these works?

Q17. What is the relation between BPM and the field of planning in artificial intelligence?

Q18. How did the authors study the robustness of DecMiner to noise?