Active Trace Clustering for Improved Process Discovery

doi:10.1109/TKDE.2013.64

Citations

PDF

Open Access

More filters

Journal Article•DOI•

A general process mining framework for correlating, predicting and clustering dynamic behavior based on event logs

[...]

Massimiliano de Leoni¹, Wil M. P. van der Aalst¹, Marcus Dees•Institutions (1)

Eindhoven University of Technology¹

01 Mar 2016-Information Systems

TL;DR: The proposed framework unifies a number of approaches for correlation analysis proposed in literature, proposing a general solution that can perform those analyses and many more and has been implemented in ProM and combines process and data mining techniques.

...read moreread less

212 citations

Journal Article•DOI•

Process mining techniques and applications – A systematic mapping study

[...]

Cleiton dos Santos Garcia¹, Alex Meincheim¹, Elio Ribeiro Faria Junior¹, Marcelo Rosano Dallagassa¹, Denise Maria Vecino Sato¹, Denise Maria Vecino Sato², Deborah Ribeiro Carvalho¹, Eduardo Alves Portela Santos¹, Edson Emílio Scalabrin¹ - Show less +5 more•Institutions (2)

Pontifícia Universidade Católica do Paraná¹, Paraná Federal Institute of Education, Science and Technology²

01 Nov 2019-Expert Systems With Applications

TL;DR: It is possible to observe that the most active research topics are associated with the process discovery algorithms, followed by conformance checking, and architecture and tools improvements, and finally application domains among different business segments are reported on.

...read moreread less

Abstract: Process mining is a growing and promising study area focused on understanding processes and to help capture the more significant findings during real execution rather than, those methods that, only observed idealized process model. The objective of this article is to map the active research topics of process mining and their main publishers by country, periodicals, and conferences. We also extract the reported application studies and classify these by exploration domains or industry segments that are taking advantage of this technique. The applied research method was systematic mapping, which began with 3713 articles. After applying the exclusion criteria, 1278 articles were selected for review. In this article, an overview regarding process mining is presented, the main research topics are identified, followed by identification of the most applied process mining algorithms, and finally application domains among different business segments are reported on. It is possible to observe that the most active research topics are associated with the process discovery algorithms, followed by conformance checking, and architecture and tools improvements. In application domains, the segments with major case studies are healthcare followed by information and communication technology, manufacturing, education, finance, and logistics.

...read moreread less

183 citations

Journal Article•DOI•

Discovering the Effects of Metacognitive Prompts on the Sequential Structure of SRL-Processes Using Process Mining Techniques

[...]

Christoph Sonnenberg¹, Maria Bannert¹•Institutions (1)

University of Würzburg¹

27 May 2015-Journal of learning Analytics

TL;DR: In this paper, the effects of metacognitive prompts on learning processes and outcomes during a computer-based learning task were analyzed using concurrent think-aloud protocols and process mining techniques were used to analyze sequential patterns.

...read moreread less

Abstract: According to research examining self-regulated learning (SRL), we regard individual regulation as a specific sequence of regulatory activities. Ideally, students perform various learning activities, such as analyzing, monitoring, and evaluating cognitive and motivational aspects during learning. Metacognitive prompts can foster SRL by inducing regulatory activities, which, in turn, improve the learning outcome. However, the specific effects of metacognitive support on the dynamic characteristics of SRL are not understood. Therefore, the aim of our study was to analyze the effects of metacognitive prompts on learning processes and outcomes during a computer-based learning task. Participants of the experimental group (EG, n = 35) were supported by metacognitive prompts, whereas participants of the control group (CG, n = 35) received no support. Data regarding learning processes were obtained by concurrent think-aloud protocols. The EG exhibited significantly more metacognitive learning events than did the CG. Furthermore, these regulatory activities correspond positively with learning outcomes. Process mining techniques were used to analyze sequential patterns. Our findings indicate differences in the process models of the EG and CG and demonstrate the added value of taking the order of learning activities into account by discovering regulatory patterns.

...read moreread less

73 citations

Journal Article•DOI•

A Co-Training Strategy for Multiple View Clustering in Process Mining

[...]

Annalisa Appice, Donato Malerba

01 Nov 2016-IEEE Transactions on Services Computing

TL;DR: This paper investigates a multiple view aware approach to trace clustering, based on a co-training strategy, and shows that the presented algorithm is able to discover a clustering pattern of the log, such that related traces result appropriately clustered.

...read moreread less

Abstract: Process mining refers to the discovery, conformance, and enhancement of process models from event logs currently produced by several information systems (e.g. workflow management systems). By tightly coupling event logs and process models, process mining makes it possible to detect deviations, predict delays, support decision making, and recommend process redesigns.Event logs are data sets containing the executions (called traces) of a business process. Several process mining algorithms have been defined to mine event logs and deliver valuable models (e.g. Petri nets) of how logged processes are being executed. However, they often generate spaghetti-like process models, which can be hard to understand. This is caused by the inherent complexity of real-life processes, which tend to be less structured and more flexible than what the stakeholders typically expect. In particular, spaghetti-like process models are discovered when all possible behaviors are shown in a single model as a result of considering the set of traces in the event log all at once.To minimize this problem, trace clustering can be used as a preprocessing step. It splits up an event log into clusters of similar traces, so as to handle variability in the recorded behavior and facilitate process model discovery. In this paper, we investigate a multiple view aware approach to trace clustering, based on a co-training strategy. In an assessment, using benchmark event logs, we show that the presented algorithm is able to discover a clustering pattern of the log, such that related traces result appropriately clustered. We evaluate the significance of the formed clusters using established machine learning and process mining metrics.

...read moreread less

65 citations

Cites background from "Active Trace Clustering for Improve..."

...[8] show how this approach may suffer from scalability problems....
[...]
...This category of algorithms, which determine clusters by optimizing a distance-based criterion function, is frequently employed in process mining [3], [4], [5], [6], [7], [8], as the distance is, in...
[...]

Book Chapter•DOI•

Act2vec, trace2vec, log2vec, and model2vec: Representation learning for business processes

[...]

Pieter De Koninck¹, Seppe vanden Broucke¹, Jochen De Weerdt¹•Institutions (1)

Katholieke Universiteit Leuven¹

09 Sep 2018

TL;DR: The main contribution of this paper is the proposal of representation learning architectures at the level of activities, traces, logs, and models that can produce a distributed representation of these objects and a thorough analysis of potential applications.

...read moreread less

Abstract: In process mining, the challenge is typically to turn raw event data into meaningful models, insights, or actions. One of the key problems of a data-driven analysis of processes, is the high dimensionality of the data. In this paper, we address this problem by developing representation learning techniques for business processes. More specifically, the representation learning paradigm is applied to activities, traces, logs, and models in order to learn highly informative but low-dimensional vectors, often referred to as embeddings, based on a neural network architecture. Subsequently, these vectors can be used for automated inference tasks such as trace clustering, process comparison, predictive process monitoring, anomaly detection, etc. Accordingly, the main contribution of this paper is the proposal of representation learning architectures at the level of activities, traces, logs, and models that can produce a distributed representation of these objects and a thorough analysis of potential applications. In an experimental evaluation, we show the power of such derived representations in the context of trace clustering and process model comparison.

...read moreread less

58 citations

Collapse

Active Trace Clustering for Improved Process Discovery

Citations

Cites background from "Active Trace Clustering for Improve..."

References

"Active Trace Clustering for Improve..." refers background in this paper

"Active Trace Clustering for Improve..." refers background in this paper

"Active Trace Clustering for Improve..." refers background in this paper

Related Papers (5)