Home
/
Authors
/
Jan Vanthienen

Author

Jan Vanthienen

Other affiliations: The Catholic University of America

Bio: Jan Vanthienen is an academic researcher from Katholieke Universiteit Leuven. The author has contributed to research in topics: Process mining & Decision table. The author has an hindex of 48, co-authored 291 publications receiving 10299 citations. Previous affiliations of Jan Vanthienen include The Catholic University of America.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1988
1981

Papers

PDF

Open Access

More filters

Book Chapter•DOI•

Process Mining Manifesto

[...]

Wil M. P. van der Aalst¹, Wil M. P. van der Aalst², A Arya Adriansyah¹, Ana Karla Alves de Medeiros³, Franco Arcieri⁴, Thomas Baier⁵, Tobias Blickle⁶, Jagadeesh Chandra Bose¹, Peter van den Brand, Ronald Brandtjen, Joos C. A. M. Buijs¹, Andrea Burattin⁷, Josep Carmona⁸, Malu Castellanos⁹, Jan Claes¹⁰, Jonathan Cook¹¹, Nicola Costantini, Francisco Curbera¹², Ernesto Damiani¹³, Massimiliano de Leoni¹, Pavlos Delias, Boudewijn F. van Dongen¹, Marlon Dumas¹⁴, Schahram Dustdar¹⁵, Dirk Fahland¹, Diogo R. Ferreira¹⁶, Walid Gaaloul¹⁷, Frank van Geffen¹⁸, Sukriti Goel¹⁹, CW Christian Günther, Antonella Guzzo²⁰, Paul Harmon, Arthur H. M. ter Hofstede², Arthur H. M. ter Hofstede¹, John Hoogland, Jon Espen Ingvaldsen, Koki Kato²¹, Rudolf Kuhn, Akhil Kumar²², Marcello La Rosa², Fabrizio Maria Maggi¹, Donato Malerba²³, RS Ronny Mans¹, Alberto Manuel, Martin McCreesh, Paola Mello²⁴, Jan Mendling²⁵, Marco Montali²⁶, Hamid Reza Motahari-Nezhad⁹, Michael zur Muehlen²⁷, Jorge Munoz-Gama⁸, Luigi Pontieri²⁸, Joel Ribeiro¹, A Anne Rozinat, Hugo Seguel Pérez, Ricardo Seguel Pérez, Marcos Sepúlveda²⁹, Jim Sinur, Pnina Soffer³⁰, Minseok Song³¹, Alessandro Sperduti⁷, Giovanni Stilo⁴, Casper Stoel, Keith D. Swenson²¹, Maurizio Talamo⁴, Wei Tan¹², Christopher Turner³², Jan Vanthienen³³, George Varvaressos, Eric Verbeek¹, Marc Verdonk³⁴, Roberto Vigo, Jianmin Wang³⁵, Barbara Weber³⁶, Matthias Weidlich³⁷, Ton Weijters¹, Lijie Wen³⁵, Michael Westergaard¹, Moe Thandar Wynn² - Show less +75 more•Institutions (37)

Eindhoven University of Technology¹, Queensland University of Technology², Capgemini³, University of Rome Tor Vergata⁴, Humboldt University of Berlin⁵, Software AG⁶, University of Padua⁷, Polytechnic University of Catalonia⁸, Hewlett-Packard⁹, Ghent University¹⁰, New Mexico State University¹¹, IBM¹², University of Milan¹³, University of Tartu¹⁴, University of Vienna¹⁵, Technical University of Lisbon¹⁶, Telecom SudParis¹⁷, Rabobank¹⁸, Infosys¹⁹, University of Calabria²⁰, Fujitsu²¹, Pennsylvania State University²², University of Bari²³, University of Bologna²⁴, Vienna University of Economics and Business²⁵, Free University of Bozen-Bolzano²⁶, Stevens Institute of Technology²⁷, Indian Council of Agricultural Research²⁸, Pontifical Catholic University of Chile²⁹, University of Haifa³⁰, Ulsan National Institute of Science and Technology³¹, Cranfield University³², Katholieke Universiteit Leuven³³, Deloitte³⁴, Tsinghua University³⁵, University of Innsbruck³⁶, Hasso Plattner Institute³⁷

01 Jan 2012

TL;DR: This manifesto hopes to serve as a guide for software developers, scientists, consultants, business managers, and end-users to increase the maturity of process mining as a new tool to improve the design, control, and support of operational business processes.

...read moreread less

Abstract: Process mining techniques are able to extract knowledge from event logs commonly available in today’s information systems. These techniques provide new means to discover, monitor, and improve processes in a variety of application domains. There are two main drivers for the growing interest in process mining. On the one hand, more and more events are being recorded, thus, providing detailed information about the history of processes. On the other hand, there is a need to improve and support business processes in competitive and rapidly changing environments. This manifesto is created by the IEEE Task Force on Process Mining and aims to promote the topic of process mining. Moreover, by defining a set of guiding principles and listing important challenges, this manifesto hopes to serve as a guide for software developers, scientists, consultants, business managers, and end-users. The goal is to increase the maturity of process mining as a new tool to improve the (re)design, control, and support of operational business processes.

...read moreread less

1,135 citations

Journal Article•DOI•

Benchmarking state-of-the-art classification algorithms for credit scoring

[...]

Bart Baesens¹, T. Van Gestel¹, Stijn Viaene¹, M Stepanova², Johan A. K. Suykens¹, Jan Vanthienen¹ - Show less +2 more•Institutions (2)

Katholieke Universiteit Leuven¹, UBS²

09 Jun 2003-Journal of the Operational Research Society

TL;DR: It is found that both the LS-SVM and neural network classifiers yield a very good performance, but also simple classifiers such as logistic regression and linear discriminant analysis perform very well for credit scoring.

...read moreread less

Abstract: In this paper, we study the performance of various state-of-the-art classification algorithms applied to eight real-life credit scoring data sets. Some of the data sets originate from major Benelux and UK financial institutions. Different types of classifiers are evaluated and compared. Besides the well-known classification algorithms (eg logistic regression, discriminant analysis, k-nearest neighbour, neural networks and decision trees), this study also investigates the suitability and performance of some recently proposed, advanced kernel-based classification algorithms such as support vector machines and least-squares support vector machines (LS-SVMs). The performance is assessed using the classification accuracy and the area under the receiver operating characteristic curve. Statistically significant performance differences are identified using the appropriate test statistics. It is found that both the LS-SVM and neural network classifiers yield a very good performance, but also simple classifiers such as logistic regression and linear discriminant analysis perform very well for credit scoring.

...read moreread less

906 citations

Journal Article•DOI•

Benchmarking Least Squares Support Vector Machine Classifiers

[...]

Tony Van Gestel¹, Johan A. K. Suykens¹, Bart Baesens¹, Stijn Viaene¹, Jan Vanthienen¹, Guido Dedene¹, Bart De Moor¹, Joos Vandewalle¹ - Show less +4 more•Institutions (1)

Katholieke Universiteit Leuven¹

01 Jan 2004-Machine Learning

TL;DR: Both the SVM and LS-SVM classifier with RBF kernel in combination with standard cross-validation procedures for hyperparameter selection achieve comparable test set performances, consistently very good when compared to a variety of methods described in the literature.

...read moreread less

Abstract: In Support Vector Machines (SVMs), the solution of the classification problem is characterized by a (convex) quadratic programming (QP) problem. In a modified version of SVMs, called Least Squares SVM classifiers (LS-SVMs), a least squares cost function is proposed so as to obtain a linear set of equations in the dual space. While the SVM classifier has a large margin interpretation, the LS-SVM formulation is related in this paper to a ridge regression approach for classification with binary targets and to Fisher's linear discriminant analysis in the feature space. Multiclass categorization problems are represented by a set of binary classifiers using different output coding schemes. While regularization is used to control the effective number of parameters of the LS-SVM classifier, the sparseness property of SVMs is lost due to the choice of the 2-norm. Sparseness can be imposed in a second stage by gradually pruning the support value spectrum and optimizing the hyperparameters during the sparse approximation procedure. In this paper, twenty public domain benchmark datasets are used to evaluate the test set performance of LS-SVM classifiers with linear, polynomial and radial basis function (RBF) kernels. Both the SVM and LS-SVM classifier with RBF kernel in combination with standard cross-validation procedures for hyperparameter selection achieve comparable test set performances. These SVM and LS-SVM performances are consistently very good when compared to a variety of methods described in the literature including decision tree based algorithms, statistical algorithms and instance based learning methods. We show on ten UCI datasets that the LS-SVM sparse approximation procedure can be successfully applied.

...read moreread less

698 citations

Journal Article•DOI•

Using Neural Network Rule Extraction and Decision Tables for Credit-Risk Evaluation

[...]

Bart Baesens, Rudy Setiono, Christophe Mues, Jan Vanthienen

01 Mar 2003-Management Science

TL;DR: It is concluded that neural network rule extraction and decision tables are powerful management tools that allow us to build advanced and userfriendly decision-support systems for credit-risk evaluation.

...read moreread less

Abstract: Credit-risk evaluation is a very challenging and important management science problem in the domain of financial analysis. Many classification methods have been suggested in the literature to tackle this problem. Neural networks, especially, have received a lot of attention because of their universal approximation property. However, a major drawback associated with the use of neural networks for decision making is their lack of explanation capability. While they can achieve a high predictive accuracy rate, the reasoning behind how they reach their decisions is not readily available. In this paper, we present the results from analysing three real-life credit-risk data sets using neural network rule extraction techniques. Clarifying the neural network decisions by explanatory rules that capture the learned knowledge embedded in the networks can help the credit-risk manager in explaining why a particular applicant is classified as either bad or good. Furthermore, we also discuss how these rules can be visualized as a decision table in a compact and intuitive graphical format that facilitates easy consultation. It is concluded that neural network rule extraction and decision tables are powerful management tools that allow us to build advanced and userfriendly decision-support systems for credit-risk evaluation.

...read moreread less

504 citations

Journal Article•DOI•

Classification With Ant Colony Optimization

[...]

David Martens¹, M. de Backer¹, Raf Haesen¹, Jan Vanthienen¹, Monique Snoeck¹, Bart Baesens¹ - Show less +2 more•Institutions (1)

Katholieke Universiteit Leuven¹

01 Oct 2007-IEEE Transactions on Evolutionary Computation

TL;DR: This paper provides an overview of previous ant-based approaches to the classification task and compares them with state-of-the-art classification techniques, such as C4.5, RIPPER, and support vector machines in a benchmark study, and proposes a new AntMiner+.

...read moreread less

Abstract: Ant colony optimization (ACO) can be applied to the data mining field to extract rule-based classifiers. The aim of this paper is twofold. On the one hand, we provide an overview of previous ant-based approaches to the classification task and compare them with state-of-the-art classification techniques, such as C4.5, RIPPER, and support vector machines in a benchmark study. On the other hand, a new ant-based classification technique is proposed, named AntMiner+. The key differences between the proposed AntMiner+ and previous AntMiner versions are the usage of the better performing MAX-MIN ant system, a clearly defined and augmented environment for the ants to walk through, with the inclusion of the class variable to handle multiclass problems, and the ability to include interval rules in the rule list. Furthermore, the commonly encountered problem in ACO of setting system parameters is dealt with in an automated, dynamic manner. Our benchmarking experiments show an AntMiner+ accuracy that is superior to that obtained by the other AntMiner versions, and competitive or better than the results achieved by the compared classification techniques.

...read moreread less

427 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Machine learning

[...]

Thomas G. Dietterich¹•Institutions (1)

Oregon State University¹

01 Dec 1996-ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

...read moreread less

Abstract: Machine Learning is the study of methods for programming computers to learn. Computers are applied to a wide range of tasks, and for most of these it is relatively easy for programmers to design and implement the necessary software. However, there are many tasks for which this is difficult or impossible. These can be divided into four general categories. First, there are problems for which there exist no human experts. For example, in modern automated manufacturing facilities, there is a need to predict machine failures before they occur by analyzing sensor readings. Because the machines are new, there are no human experts who can be interviewed by a programmer to provide the knowledge necessary to build a computer system. A machine learning system can study recorded data and subsequent machine failures and learn prediction rules. Second, there are problems where human experts exist, but where they are unable to explain their expertise. This is the case in many perceptual tasks, such as speech recognition, hand-writing recognition, and natural language understanding. Virtually all humans exhibit expert-level abilities on these tasks, but none of them can describe the detailed steps that they follow as they perform them. Fortunately, humans can provide machines with examples of the inputs and correct outputs for these tasks, so machine learning algorithms can learn to map the inputs to the outputs. Third, there are problems where phenomena are changing rapidly. In finance, for example, people would like to predict the future behavior of the stock market, of consumer purchases, or of exchange rates. These behaviors change frequently, so that even if a programmer could construct a good predictive computer program, it would need to be rewritten frequently. A learning program can relieve the programmer of this burden by constantly modifying and tuning a set of learned prediction rules. Fourth, there are applications that need to be customized for each computer user separately. Consider, for example, a program to filter unwanted electronic mail messages. Different users will need different filters. It is unreasonable to expect each user to program his or her own rules, and it is infeasible to provide every user with a software engineer to keep the rules up-to-date. A machine learning system can learn which mail messages the user rejects and maintain the filtering rules automatically. Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis. Statistics focuses on understanding the phenomena that have generated the data, often with the goal of testing different hypotheses about those phenomena. Data mining seeks to find patterns in the data that are understandable by people. Psychological studies of human learning aspire to understand the mechanisms underlying the various learning behaviors exhibited by people (concept learning, skill acquisition, strategy change, etc.).

...read moreread less

13,246 citations

Data Mining - Concepts and Techniques.

[...]

Petra Perner

01 Jan 2002

9,314 citations

Journal Article•DOI•

Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead

[...]

Cynthia Rudin¹•Institutions (1)

Duke University¹

01 May 2019-Nature Machine Intelligence

TL;DR: This Perspective clarifies the chasm between explaining black boxes and using inherently interpretable models, outlines several key reasons why explainable black boxes should be avoided in high-stakes decisions, identifies challenges to interpretable machine learning, and provides several example applications whereinterpretable models could potentially replace black box models in criminal justice, healthcare and computer vision.

...read moreread less

Abstract: Black box machine learning models are currently being used for high-stakes decision making throughout society, causing problems in healthcare, criminal justice and other domains. Some people hope that creating methods for explaining these black box models will alleviate some of the problems, but trying to explain black box models, rather than creating models that are interpretable in the first place, is likely to perpetuate bad practice and can potentially cause great harm to society. The way forward is to design models that are inherently interpretable. This Perspective clarifies the chasm between explaining black boxes and using inherently interpretable models, outlines several key reasons why explainable black boxes should be avoided in high-stakes decisions, identifies challenges to interpretable machine learning, and provides several example applications where interpretable models could potentially replace black box models in criminal justice, healthcare and computer vision. There has been a recent rise of interest in developing methods for ‘explainable AI’, where models are created to explain how a first ‘black box’ machine learning model arrives at a specific decision. It can be argued that instead efforts should be directed at building inherently interpretable models in the first place, in particular where they are applied in applications that directly affect human lives, such as in healthcare and criminal justice.

...read moreread less

3,609 citations

An Introduction to MultiAgent Systems.

[...]

Barbara Messing

01 Jan 2003

3,093 citations

Book•

Least Squares Support Vector Machines

[...]

Johan A. K. Suykens¹, Tony Van Gestel, Jos De Brabanter, Bart De Moor, Joos Vandewalle - Show less +1 more•Institutions (1)

Katholieke Universiteit Leuven¹

12 Nov 2002

TL;DR: Support Vector Machines Basic Methods of Least Squares Support Vector Machines Bayesian Inference for LS-SVM Models Robustness Large Scale Problems LS- sVM for Unsupervised Learning LS- SVM for Recurrent Networks and Control.

...read moreread less

Abstract: Support Vector Machines Basic Methods of Least Squares Support Vector Machines Bayesian Inference for LS-SVM Models Robustness Large Scale Problems LS-SVM for Unsupervised Learning LS-SVM for Recurrent Networks and Control.

...read moreread less

2,983 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse