The reverb challenge: Acommon evaluation framework for dereverberation and recognition of reverberant speech

doi:10.1109/WASPAA.2013.6701894

Proceedings ArticleDOI

The reverb challenge: Acommon evaluation framework for dereverberation and recognition of reverberant speech

Keisuke Kinoshita, +6 more

- pp 1-4

Chats0

TLDR

A common evaluation framework including datasets, tasks, and evaluation metrics for both speech enhancement and ASR techniques is proposed, which will be used as a common basis for the REVERB (REverberant Voice Enhancement and Recognition Benchmark) challenge.

Abstract:

Recently, substantial progress has been made in the field of reverberant speech signal processing, including both single- and multichannel dereverberation techniques, and automatic speech recognition (ASR) techniques robust to reverberation. To evaluate state-of-the-art algorithms and obtain new insights regarding potential future research directions, we propose a common evaluation framework including datasets, tasks, and evaluation metrics for both speech enhancement and ASR techniques. The proposed framework will be used as a common basis for the REVERB (REverberant Voice Enhancement and Recognition Benchmark) challenge. This paper describes the rationale behind the challenge, and provides a detailed description of the evaluation framework and benchmark results.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

A study on data augmentation of reverberant speech for robust speech recognition

Tom Ko, +4 more

TL;DR: It is found that the performance gap between using simulated and real RIRs can be eliminated when point-source noises are added, and the trained acoustic models not only perform well in the distant- talking scenario but also provide better results in the close-talking scenario.

...read moreread less

Journal ArticleDOI

A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research

Keisuke Kinoshita, +11 more

- 18 Jan 2016 -

EURASIP Journal on Advances in Signal Pr...

TL;DR: The REVERB challenge is described, which is an evaluation campaign that was designed to evaluate such speech enhancement and ASR techniques to reveal the state-of-the-art techniques and obtain new insights regarding potential future research directions.

...read moreread less

Journal ArticleDOI

An analysis of environment, microphone and data simulation mismatches in robust speech recognition

Emmanuel Vincent, +4 more

- 01 Nov 2017 -

Computer Speech & Language

TL;DR: It is found that training on different noise environments and different microphones barely affects the ASR performance, especially when several environments are present in the training data: only the number of microphones has a significant impact.

...read moreread less

Proceedings ArticleDOI

Improved MVDR beamforming using single-channel mask prediction networks

Hakan Erdogan, +4 more

TL;DR: It is shown that using a single mask across microphones for covariance prediction with minima-limited post-masking yields the best result in terms of signal-level quality measures and speech recognition word error rates in a mismatched training condition.

...read moreread less

Proceedings ArticleDOI

A learning-based approach to direction of arrival estimation in noisy and reverberant environments

Xiong Xiao, +5 more

TL;DR: A learning-based approach that can learn from a large amount of simulated noisy and reverberant microphone array inputs for robust DOA estimation and uses a multilayer perceptron neural network to learn the nonlinear mapping from such features to the DOA.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions

David Pearce, +1 more

TL;DR: A database designed to evaluate the performance of speech recognition algorithms in noisy conditions and recognition results are presented for the first standard DSR feature extraction scheme that is based on a cepstral analysis.

...read moreread less

Book

Spoken Language Processing: A Guide to Theory, Algorithm, and System Development

Xuedong Huang, +3 more

TL;DR: Spoken Language Processing draws on the latest advances and techniques from multiple fields: computer science, electrical engineering, acoustics, linguistics, mathematics, psychology, and beyond to create the state of the art in spoken language technology.

...read moreread less

Journal ArticleDOI

Evaluation of Objective Quality Measures for Speech Enhancement

Yi Hu, +1 more

- 01 Jan 2008 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: The evaluation of correlations of several objective measures with these three subjective rating scales is reported on and several new composite objective measures are also proposed by combining the individual objective measures using nonparametric and parametric regression analysis techniques.

...read moreread less

The HTK book version 3.4

Steve Young, +9 more

Book

Speech Dereverberation

Patrick A. Naylor, +1 more

TL;DR: Speech Dereverberation presents the most important current approaches to the problem of reverberation and defines the current state of the art and encourages further work on this topic by offering open research questions to exercise the curiosity of the reader.

...read moreread less

Related Papers (5)

The third ‘CHiME’ speech separation and recognition challenge: Dataset, task and baselines

Jon Barker, +3 more

Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups

Geoffrey E. Hinton, +10 more

- 18 Oct 2012 -

IEEE Signal Processing Magazine

An investigation of deep neural networks for noise robust speech recognition

Michael L. Seltzer, +2 more

Image method for efficiently simulating small‐room acoustics

Jont B. Allen, +1 more

- 01 Nov 1976 -

Journal of the Acoustical Society of Ame...

The reverb challenge: Acommon evaluation framework for dereverberation and recognition of reverberant speech

Citations

A study on data augmentation of reverberant speech for robust speech recognition

A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research

An analysis of environment, microphone and data simulation mismatches in robust speech recognition

Improved MVDR beamforming using single-channel mask prediction networks

A learning-based approach to direction of arrival estimation in noisy and reverberant environments

References

The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions

Spoken Language Processing: A Guide to Theory, Algorithm, and System Development

Evaluation of Objective Quality Measures for Speech Enhancement

The HTK book version 3.4

Speech Dereverberation

Related Papers (5)

The Kaldi Speech Recognition Toolkit

The third ‘CHiME’ speech separation and recognition challenge: Dataset, task and baselines

Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups

An investigation of deep neural networks for noise robust speech recognition

Image method for efficiently simulating small‐room acoustics