Discovering workflow nets using integer linear programming

doi:10.1007/S00607-017-0582-5

Citations

Journal Article

Automated Discovery of Process Models from Event Logs: Review and Benchmark

Adriano Augusto¹, Raffaele Conforti², Marlon Dumas¹, Marcello La Rosa², Fabrizio Maria Maggi¹, Andrea Marrella³, Massimo Mecella³, Allar Soo¹

University of Tartu¹, University of Melbourne², Sapienza University of Rome³

01 Apr 2019-IEEE Transactions on Knowledge and Data Engineering

TL;DR: The results highlight gaps and unexplored tradeoffs in the field, including the lack of scalability of some methods and a strong divergence in their performance with respect to the different quality metrics used.

Abstract: Process mining allows analysts to exploit logs of historical executions of business processes to extract insights regarding the actual performance of these processes. One of the most widely studied process mining operations is automated process discovery. An automated process discovery method takes as input an event log, and produces as output a business process model that captures the control-flow relations between tasks that are observed in or implied by the event log. Various automated process discovery methods have been proposed in the past two decades, striking different tradeoffs between scalability, accuracy, and complexity of the resulting models. However, these methods have been evaluated in an ad-hoc manner, employing different datasets, experimental setups, evaluation measures, and baselines, often leading to incomparable conclusions and sometimes unreproducible results due to the use of closed datasets. This article provides a systematic review and comparative evaluation of automated process discovery methods, using an open-source benchmark and covering 12 publicly-available real-life event logs, 12 proprietary real-life event logs, and nine quality metrics. The results highlight gaps and unexplored tradeoffs in the field, including the lack of scalability of some methods and a strong divergence in their performance with respect to the different quality metrics used.

225 citations

Cites background or methods from "Discovering workflow nets using int..."

...[75]), while [26], [90] were tested on artificial logs and [51] on synthetic logs only....
[...]
...[26], [104], [105] propose an improvement of the ILP miner implemented in [25]....
[...]
...ILP Miner [25] Hybrid ILP Miner [26]...
[...]

Posted Content

Automated Discovery of Process Models from Event Logs: Review and Benchmark

Adriano Augusto¹, Raffaele Conforti², Marlon Dumas¹, Marcello La Rosa², Fabrizio Maria Maggi¹, Andrea Marrella³, Massimo Mecella³, Allar Soo¹

University of Tartu¹, University of Melbourne², Sapienza University of Rome³

05 May 2017-arXiv: Software Engineering

TL;DR: In this paper, a systematic review and comparative evaluation of automated process discovery methods, using an open-source benchmark and covering twelve publicly-available real-life event logs, twelve proprietary real life event logs and nine quality metrics, is presented.

Abstract: Process mining allows analysts to exploit logs of historical executions of business processes to extract insights regarding the actual performance of these processes. One of the most widely studied process mining operations is automated process discovery. An automated process discovery method takes as input an event log, and produces as output a business process model that captures the control-flow relations between tasks that are observed in or implied by the event log. Various automated process discovery methods have been proposed in the past two decades, striking different tradeoffs between scalability, accuracy and complexity of the resulting models. However, these methods have been evaluated in an ad-hoc manner, employing different datasets, experimental setups, evaluation measures and baselines, often leading to incomparable conclusions and sometimes unreproducible results due to the use of closed datasets. This article provides a systematic review and comparative evaluation of automated process discovery methods, using an open-source benchmark and covering twelve publicly-available real-life event logs, twelve proprietary real-life event logs, and nine quality metrics. The results highlight gaps and unexplored tradeoffs in the field, including the lack of scalability of some methods and a strong divergence in their performance with respect to the different quality metrics used.

127 citations

Replaying history on process models for conformance checking and performance analysis

W.M.P. van der Aalst, A. Adriansyah, B.F. van Dongen

01 Jan 2011

TL;DR: This work argues that maintaining a proper alignment between event log and process model is important, and elaborates on how such alignments are realized and applied to conformance checking and performance analysis.

Abstract: Process mining techniques use event data to discover process models, to check the conformance of predefined process models, and to extend such models with information about bottlenecks, decisions, and resource usage. These techniques are driven by observed events rather than hand-made models. Event logs are used to learn and enrich process models. By replaying history on the model, it is possible to establish a precise relationship between events and model elements. This relationship can be used to check conformance and to analyze performance. For example, it is possible to diagnose deviations from the modeled behavior. The severity of each deviation can be quantified. Moreover, the relationship established during replay and the timestamps in the event log can be combined to show bottlenecks. These examples illustrate the importance of maintaining a proper alignment between event log and process model. Therefore, we elaborate on the realization of such alignments and their application to conformance checking and performance analysis.

95 citations

Book Chapter

Improving process discovery results by filtering outliers using conditional behavioural probabilities

Mohammadreza Fani Sani¹, Sebastiaan J. van Zelst¹, Wil M. P. van der Aalst¹

Eindhoven University of Technology¹

01 Jan 2018

TL;DR: Proposes a general-purpose filtering method that exploits observed conditional probabilities between sequences of activities; evaluation on real and synthetic event data shows that it accurately removes irrelevant behaviour and improves process discovery results.

Abstract: Process discovery, one of the key challenges in process mining, aims at discovering process models from process execution data stored in event logs. Most discovery algorithms assume that all data in an event log conform to correct execution of the process, and hence, incorporate all behaviour in their resulting process model. However, in real event logs, noise and irrelevant infrequent behaviour are often present. Incorporating such behaviour results in complex, incomprehensible process models concealing the correct and/or relevant behaviour of the underlying process. In this paper, we propose a novel general purpose filtering method that exploits observed conditional probabilities between sequences of activities. The method has been implemented in both the ProM toolkit and the RapidProM framework. We evaluate our approach using real and synthetic event data. The results show that the proposed method accurately removes irrelevant behaviour and, indeed, improves process discovery results.
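The idea of filtering on conditional probabilities can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes the simplest case, where the conditioning "sequence" is a single preceding activity, and it drops any trace containing a directly-follows step whose estimated probability falls below a threshold. The function name `filter_log`, the `threshold` parameter, and the toy log are all hypothetical.

```python
from collections import Counter


def filter_log(log, threshold=0.2):
    """Keep only traces whose every directly-follows step (a, b)
    has estimated P(b follows a | a) >= threshold.

    Simplifying assumption: conditioning is on one preceding activity,
    not on longer activity sequences as described in the abstract.
    """
    pair_counts = Counter()    # how often b directly follows a
    prefix_counts = Counter()  # how often a is followed by anything
    for trace in log:
        for a, b in zip(trace, trace[1:]):
            pair_counts[(a, b)] += 1
            prefix_counts[a] += 1

    def is_regular(trace):
        return all(
            pair_counts[(a, b)] / prefix_counts[a] >= threshold
            for a, b in zip(trace, trace[1:])
        )

    return [trace for trace in log if is_regular(trace)]


# Toy event log: nine regular traces and one infrequent variant.
log = [("a", "b", "c")] * 9 + [("a", "c", "b")]
filtered = filter_log(log, threshold=0.2)
```

In the toy log, "c" follows "a" in only one of ten cases, so the variant trace is removed at a threshold of 0.2 but retained at 0.05. The threshold therefore controls how aggressively infrequent behaviour is treated as noise.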

61 citations

Process mining with streaming data

S.J. van Zelst

14 Mar 2019

TL;DR: This thesis explores, develops and analyses process mining techniques that are able to handle streaming event data, and identifies three main types of process mining analysis: process discovery, conformance checking and process enhancement.

Abstract: Modern information systems allow us to track, often in great detail, the execution of processes within companies. Consider for example luggage handling in airports, manufacturing processes of products and goods, or processes related to service provision: all of these processes generate traces of valuable event data. Such event data are typically stored in a company’s information system and describe the execution of the process at hand. In recent years, the field of process mining has emerged. Process mining techniques aim to translate the data captured during the process execution, i.e. the event data, into actionable insights. As such, we identify three main process mining types of analysis, i.e. process discovery, conformance checking and process enhancement. In process discovery, we aim to discover a process model, i.e. a formal behavioural description, which describes the process as captured by the event data. In conformance checking, we aim to assess to what degree the event data is in correspondence with a given reference model, i.e. a model describing how the process ought to be executed. Finally, within process enhancement, the main goal is to improve the view of the process, i.e. by enhancing process models on the basis of facts derived from event data. Recent developments in information technology allow us to capture data at increasing rates, yielding enormous volumes of data, both in terms of size and velocity. In the context of process mining, this relates to the advent of real-time, online, streams of events that result in data sets that are no longer efficiently analysable by commodity hardware. Such types of data pose both opportunities and challenges. On the one hand, it allows us to get actionable insights into the process, at the moment it is being executed. On the other hand, conventional process mining techniques do not allow us to gain these insights, as they are not designed to cope with such a new type of data.
As a consequence, new methods, techniques and tools are needed to allow us to apply process mining techniques and analyses on streams of event data of arbitrary size. In this thesis, we explore, develop and analyse process mining techniques that are able to handle streaming event data. The premise of streaming event data is that we assume the stream of events under consideration to be of infinite size. As such, efficient techniques to temporarily store and use relevant recent subsets of event data

60 citations
