scispace - formally typeset
S

Simon Wiesler

Researcher at Amazon.com

Publications -  37
Citations -  681

Simon Wiesler is an academic researcher from Amazon.com. The author has contributed to research in topics: Hidden Markov model & Acoustic model. The author has an hindex of 14, co-authored 34 publications receiving 620 citations. Previous affiliations of Simon Wiesler include RWTH Aachen University.

Papers
More filters

RASR - The RWTH Aachen University Open Source Speech Recognition Toolkit

Abstract: RASR is the open source version of the well-proven speech recognition toolkit developed and used at RWTH Aachen University. The current version of the package includes state of the art speech recognition technology for acoustic model training and decoding. Speaker adaptation, speaker adaptive training, unsupervised training, discriminative training, lattice processing tools, flexible signal analysis, a finite state automata library, and an efficient dynamic network decoder are notable components. Comprehensive documentation, example setups for training and recognition, and tutorials are provided to support newcomers.
Proceedings ArticleDOI

Mean-normalized stochastic gradient for large-scale deep learning

TL;DR: This work proposes a novel second-order stochastic optimization algorithm based on analytic results showing that a non-zero mean of features is harmful for the optimization, and proves convergence of the algorithm in a convex setting.
Proceedings Article

A Convergence Analysis of Log-Linear Training

TL;DR: It is shown that the training of log-linear models can be highly ill-conditioned and good results are obtained on a large-scale continuous handwriting recognition task with a simple and generic approach.
Proceedings ArticleDOI

RASR/NN: The RWTH neural network toolkit for speech recognition

TL;DR: The results show that RASR achieves a state-of-the-art performance on a real-world large vocabulary task, while offering a complete pipeline for building and applying large scale speech recognition systems.
Proceedings ArticleDOI

The RWTH 2010 Quaero ASR evaluation system for English, French, and German

TL;DR: Recognizing Broadcast Conversational speech data is a difficult task, which can be regarded as one of the major challenges beyond the recognition of Broadcast News.