Home
/
Authors
/
Thilo Stadelmann

Author

Thilo Stadelmann

Zurich University of Applied Sciences/ZHAW

Other affiliations: Zürcher Fachhochschule, University of Siegen, University of Marburg ...read more

Bio: Thilo Stadelmann is an academic researcher from Zurich University of Applied Sciences/ZHAW. The author has contributed to research in topics: Deep learning & Cluster analysis. The author has an hindex of 13, co-authored 63 publications receiving 456 citations. Previous affiliations of Thilo Stadelmann include Zürcher Fachhochschule & University of Siegen.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2010
2009
2008
2007
2006
2005

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Speaker identification and clustering using convolutional neural networks

[...]

Yanick Xavier Lukic¹, Carlo Vogt¹, Oliver Dürr¹, Thilo Stadelmann¹•Institutions (1)

Zürcher Fachhochschule¹

01 Sep 2016

TL;DR: This paper uses simple spectrograms as input to a CNN and study the optimal design of those networks for speaker identification and clustering, and demonstrates the approach on the well known TIMIT dataset, achieving results comparable with the state of the art-without the need for handcrafted features.

...read moreread less

Abstract: Deep learning, especially in the form of convolutional neural networks (CNNs), has triggered substantial improvements in computer vision and related fields in recent years. This progress is attributed to the shift from designing features and subsequent individual sub-systems towards learning features and recognition systems end to end from nearly unprocessed data. For speaker clustering, however, it is still common to use handcrafted processing chains such as MFCC features and GMM-based models. In this paper, we use simple spectrograms as input to a CNN and study the optimal design of those networks for speaker identification and clustering. Furthermore, we elaborate on the question how to transfer a network, trained for speaker identification, to speaker clustering. We demonstrate our approach on the well known TIMIT dataset, achieving results comparable with the state of the art-without the need for handcrafted features.

...read moreread less

73 citations

Proceedings Article•DOI•

Automated Machine Learning in Practice: State of the Art and Recent Results

[...]

Lukas Tuggener, Mohammadreza Amirian¹, Katharina Rombach, Stefan Lörwald², Anastasia Varlet², Christian Westermann², Thilo Stadelmann - Show less +3 more•Institutions (2)

University of Ulm¹, PricewaterhouseCoopers²

14 Jun 2019

TL;DR: An overview of the state of the art in AutoML with a focus on practical applicability in a business context, and recent benchmark results of the most important AutoML algorithms are provided in this article.

...read moreread less

Abstract: A main driver behind the digitization of industry and society is the belief that data-driven model building and decision making can contribute to higher degrees of automation and more informed decisions. Building such models from data often involves the application of some form of machine learning. Thus, there is an ever growing demand in work force with the necessary skill set to do so. This demand has given rise to a new research topic concerned with fitting machine learning models fully automatically—AutoML. This paper gives an overview of the state of the art in AutoML with a focus on practical applicability in a business context, and provides recent benchmark results of the most important AutoML algorithms.

...read moreread less

42 citations

Book Chapter•DOI•

Deep Learning in the Wild

[...]

Thilo Stadelmann, Mohammadreza Amirian¹, Ismail Arabaci, Marek Arnold, Gilbert François Duivesteijn, Ismail Elezi², Melanie Geiger, Stefan Lörwald³, Benjamin Bruno Meier, Katharina Rombach, Lukas Tuggener⁴ - Show less +7 more•Institutions (4)

University of Ulm¹, Ca' Foscari University of Venice², PricewaterhouseCoopers³, Dalle Molle Institute for Artificial Intelligence Research⁴

19 Sep 2018

TL;DR: This paper explored the specific challenges arising in the realm of real world tasks, based on case studies from research & development in conjunction with industry, and extracts lessons learned from them. But they did not provide any guidance on how to make them work in practice.

...read moreread less

Abstract: Deep learning with neural networks is applied by an increasing number of people outside of classic research environments, due to the vast success of the methodology on a wide range of machine perception tasks. While this interest is fueled by beautiful success stories, practical work in deep learning on novel tasks without existing baselines remains challenging. This paper explores the specific challenges arising in the realm of real world tasks, based on case studies from research & development in conjunction with industry, and extracts lessons learned from them. It thus fills a gap between the publication of latest algorithmic and methodical developments, and the usually omitted nitty-gritty of how to make them work. Specifically, we give insight into deep learning projects on face matching, print media monitoring, industrial quality control, music scanning, strategy game playing, and automated machine learning, thereby providing best practices for deep learning in practice.

...read moreread less

32 citations

Proceedings Article•DOI•

Deep watershed detector for music object recognition

[...]

Lukas Tuggener¹, Ismail Elezi², Jürgen Schmidhuber¹, Thilo Stadelmann³•Institutions (3)

Dalle Molle Institute for Artificial Intelligence Research¹, Ca' Foscari University of Venice², Winterthur Museum, Garden and Library³

26 May 2018

TL;DR: Deep Watershed Detector (DWD) as discussed by the authors is a novel object detection method based on synthetic energy maps and watershed transform, which is specifically tailored to deal with high resolution images that contain a large number of very small objects and is therefore able to process full pages of written music.

...read moreread less

Abstract: Optical Music Recognition (OMR) is an important and challenging area within music information retrieval, the accurate detection of music symbols in digital images is a core functionality of any OMR pipeline. In this paper, we introduce a novel object detection method, based on synthetic energy maps and the watershed transform, called Deep Watershed Detector (DWD). Our method is specifically tailored to deal with high resolution images that contain a large number of very small objects and is therefore able to process full pages of written music. We present state-of-the-art detection results of common music symbols and show DWD's ability to work with synthetic scores equally well as on handwritten music.

...read moreread less

30 citations

Proceedings Article•DOI•

Fully Convolutional Neural Networks for Newspaper Article Segmentation

[...]

Benjamin Bruno Meier, Thilo Stadelmann¹, Jan Stampfli², Marek Arnold, Mark Cieliebak² - Show less +1 more•Institutions (2)

Zürcher Fachhochschule¹, Zurich University of Applied Sciences/ZHAW²

01 Nov 2017

TL;DR: A fully convolutional neural network (FCN) is applied that is trained in an end-to-end fashion to transform the input image into a segmentation mask in one pass and outperforms a deep learning-based commercial solution by a large margin in terms of segmentation quality while in addition being computationally two orders of magnitude more efficient.

...read moreread less

Abstract: Segmenting newspaper pages into articles that semantically belong together is a necessary prerequisite for article-based information retrieval on print media collections like e.g. archives and libraries. It is challenging due to vastly differing layouts of papers, various content types and different languages, but commercially very relevant for e.g. media monitoring. We present a semantic segmentation approach based on the visual appearance of each page. We apply a fully convolutional neural network (FCN) that we train in an end-to-end fashion to transform the input image into a segmentation mask in one pass. We show experimentally that the FCN performs very well: it outperforms a deep learning-based commercial solution by a large margin in terms of segmentation quality while in addition being computationally two orders of magnitude more efficient.

...read moreread less

24 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•

The Design and Analysis of Experiments

[...]

Margaret J. Robertson

01 Jun 1953-Yale Journal of Biology and Medicine

TL;DR: This book by a teacher of statistics (as well as a consultant for "experimenters") is a comprehensive study of the philosophical background for the statistical design of experiment.

...read moreread less

Abstract: THE DESIGN AND ANALYSIS OF EXPERIMENTS. By Oscar Kempthorne. New York, John Wiley and Sons, Inc., 1952. 631 pp. $8.50. This book by a teacher of statistics (as well as a consultant for \"experimenters\") is a comprehensive study of the philosophical background for the statistical design of experiment. It is necessary to have some facility with algebraic notation and manipulation to be able to use the volume intelligently. The problems are presented from the theoretical point of view, without such practical examples as would be helpful for those not acquainted with mathematics. The mathematical justification for the techniques is given. As a somewhat advanced treatment of the design and analysis of experiments, this volume will be interesting and helpful for many who approach statistics theoretically as well as practically. With emphasis on the \"why,\" and with description given broadly, the author relates the subject matter to the general theory of statistics and to the general problem of experimental inference. MARGARET J. ROBERTSON

...read moreread less

13,333 citations

Journal Article•DOI•

Machine learning

[...]

Thomas G. Dietterich¹•Institutions (1)

Oregon State University¹

01 Dec 1996-ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

...read moreread less

Abstract: Machine Learning is the study of methods for programming computers to learn. Computers are applied to a wide range of tasks, and for most of these it is relatively easy for programmers to design and implement the necessary software. However, there are many tasks for which this is difficult or impossible. These can be divided into four general categories. First, there are problems for which there exist no human experts. For example, in modern automated manufacturing facilities, there is a need to predict machine failures before they occur by analyzing sensor readings. Because the machines are new, there are no human experts who can be interviewed by a programmer to provide the knowledge necessary to build a computer system. A machine learning system can study recorded data and subsequent machine failures and learn prediction rules. Second, there are problems where human experts exist, but where they are unable to explain their expertise. This is the case in many perceptual tasks, such as speech recognition, hand-writing recognition, and natural language understanding. Virtually all humans exhibit expert-level abilities on these tasks, but none of them can describe the detailed steps that they follow as they perform them. Fortunately, humans can provide machines with examples of the inputs and correct outputs for these tasks, so machine learning algorithms can learn to map the inputs to the outputs. Third, there are problems where phenomena are changing rapidly. In finance, for example, people would like to predict the future behavior of the stock market, of consumer purchases, or of exchange rates. These behaviors change frequently, so that even if a programmer could construct a good predictive computer program, it would need to be rewritten frequently. A learning program can relieve the programmer of this burden by constantly modifying and tuning a set of learned prediction rules. Fourth, there are applications that need to be customized for each computer user separately. Consider, for example, a program to filter unwanted electronic mail messages. Different users will need different filters. It is unreasonable to expect each user to program his or her own rules, and it is infeasible to provide every user with a software engineer to keep the rules up-to-date. A machine learning system can learn which mail messages the user rejects and maintain the filtering rules automatically. Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis. Statistics focuses on understanding the phenomena that have generated the data, often with the goal of testing different hypotheses about those phenomena. Data mining seeks to find patterns in the data that are understandable by people. Psychological studies of human learning aspire to understand the mechanisms underlying the various learning behaviors exhibited by people (concept learning, skill acquisition, strategy change, etc.).

...read moreread less

13,246 citations

Pattern Recognition and Machine Learning

[...]

Christopher M. Bishop¹•Institutions (1)

Microsoft¹

01 Jan 2006

TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.

...read moreread less

Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

...read moreread less

10,141 citations

The PASCAL Visual Object Classes Challenge

[...]

Jianguo Zhang

01 Jan 2006

3,012 citations

Journal Article•DOI•

Digital processing of speech signals

[...]

M.G. Bellanger

01 Oct 1980