Roland Maas

Researcher at Amazon.com

Publications - 74

Citations - 1964

Roland Maas is an academic researcher from Amazon.com. The author has contributed to research in topics: Hidden Markov model & Word error rate. The author has an hindex of 18, co-authored 68 publications receiving 1619 citations. Previous affiliations of Roland Maas include University of Erlangen-Nuremberg.

Papers

PDF

Open Access

More filters

Proceedings ArticleDOI

The reverb challenge: Acommon evaluation framework for dereverberation and recognition of reverberant speech

Keisuke Kinoshita, +6 more

TL;DR: A common evaluation framework including datasets, tasks, and evaluation metrics for both speech enhancement and ASR techniques is proposed, which will be used as a common basis for the REVERB (REverberant Voice Enhancement and Recognition Benchmark) challenge.

...read moreread less

Journal ArticleDOI

A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research

Keisuke Kinoshita, +11 more

- 18 Jan 2016 -

EURASIP Journal on Advances in Signal Pr...

TL;DR: The REVERB challenge is described, which is an evaluation campaign that was designed to evaluate such speech enhancement and ASR techniques to reveal the state-of-the-art techniques and obtain new insights regarding potential future research directions.

...read moreread less

Journal ArticleDOI

Making Machines Understand Us in Reverberant Rooms: Robustness Against Reverberation for Automatic Speech Recognition

Takuya Yoshioka, +6 more

- 18 Oct 2012 -

IEEE Signal Processing Magazine

TL;DR: For a number of unexplored but important applications, distant microphones are a prerequisite for extending the availability of speech recognizers as well as enhancing the convenience of existing speech recognition applications.

...read moreread less

Patent

Anchored speech detection and speech recognition

Sree Hari Krishnan Parthasarathi, +3 more

TL;DR: In this article, a system configured to process speech commands may classify incoming audio as desired speech, undesired speech, or non-speech, where desired speech is speech that is from a same speaker as reference speech.

...read moreread less

Journal ArticleDOI

Reverberation Model-Based Decoding in the Logmelspec Domain for Robust Distant-Talking Speech Recognition

Armin Sehr, +2 more

- 01 Sep 2010 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: A novel reformulation of the constraint, which allows for an efficient solution by nonlinear optimization algorithms, is derived in this paper so that a practicable implementation of REMOS for logmelspec features becomes possible.

...read moreread less

Collapse