scispace - formally typeset
J

Jahn Heymann

Researcher at Amazon.com

Publications -  37
Citations -  2857

Jahn Heymann is an academic researcher from Amazon.com. The author has contributed to research in topics: Beamforming & Artificial neural network. The author has an hindex of 18, co-authored 34 publications receiving 1815 citations. Previous affiliations of Jahn Heymann include University of Paderborn.

Papers
More filters
Proceedings ArticleDOI

ESPNet: End-to-end speech processing toolkit

TL;DR: In this article, a new open source platform for end-to-end speech processing named ESPnet is introduced, which mainly focuses on automatic speech recognition (ASR), and adopts widely used dynamic neural network toolkits, Chainer and PyTorch, as a main deep learning engine.
Posted Content

ESPnet: End-to-End Speech Processing Toolkit

TL;DR: A major architecture of this software platform, several important functionalities, which differentiate ESPnet from other open source ASR toolkits, and experimental results with major ASR benchmarks are explained.
Proceedings ArticleDOI

Neural network based spectral mask estimation for acoustic beamforming

TL;DR: A neural network based approach to acoustic beamforming is presented, used to estimate spectral masks from which the Cross-Power Spectral Density matrices of speech and noise are estimated, which are used to compute the beamformer coefficients.
Proceedings ArticleDOI

BLSTM supported GEV beamformer front-end for the 3RD CHiME challenge

TL;DR: A new beamformer front-end for Automatic Speech Recognition that leverages the power of a bi-directional Long Short-Term Memory network to robustly estimate soft masks for a subsequent beamforming step and achieves a 53% relative reduction of the word error rate over the best baseline enhancement system for the relevant test data set.
Proceedings ArticleDOI

Beamnet: End-to-end training of a beamformer-supported multi-channel ASR system

TL;DR: This paper presents an end-to-end training approach for a beamformer-supported multi-channel ASR system, where a neural network which estimates masks for a statistically optimum beamformer is jointly trained with a network for acoustic modeling.