Marc Delcroix

Researcher at Nippon Telegraph and Telephone

Publications - 189

Citations - 5075

Marc Delcroix is an academic researcher from Nippon Telegraph and Telephone. The author has contributed to research in topics: Speech enhancement & Artificial neural network. The author has an hindex of 31, co-authored 189 publications receiving 3679 citations. Previous affiliations of Marc Delcroix include NTT Communications Corp & Hokkaido University.

Papers

PDF

Open Access

More filters

Journal ArticleDOI

A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research

Keisuke Kinoshita, +11 more

- 18 Jan 2016 -

EURASIP Journal on Advances in Signal Pr...

TL;DR: The REVERB challenge is described, which is an evaluation campaign that was designed to evaluate such speech enhancement and ASR techniques to reveal the state-of-the-art techniques and obtain new insights regarding potential future research directions.

...read moreread less

Proceedings ArticleDOI

The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices

Takuya Yoshioka, +11 more

TL;DR: NTT's CHiME-3 system is described, which integrates advanced speech enhancement and recognition techniques, which achieves a 3.45% development error rate and a 5.83% evaluation error rate.

...read moreread less

Journal ArticleDOI

Making Machines Understand Us in Reverberant Rooms: Robustness Against Reverberation for Automatic Speech Recognition

Takuya Yoshioka, +6 more

- 18 Oct 2012 -

IEEE Signal Processing Magazine

TL;DR: For a number of unexplored but important applications, distant microphones are a prerequisite for extending the availability of speech recognizers as well as enhancing the convenience of existing speech recognition applications.

...read moreread less

Proceedings ArticleDOI

Improving transformer-based end-to-end speech recognition with connectionist temporal classification and language model integration

Shigeki Karita, +5 more

TL;DR: This work integrates connectionist temporal classification (CTC) with Transformer for joint training and decoding of automatic speech recognition (ASR) tasks and makes training faster than with RNNs and assists LM integration.

...read moreread less

Journal ArticleDOI

Suppression of Late Reverberation Effect on Speech Signal Using Long-Term Multiple-step Linear Prediction

Keisuke Kinoshita, +3 more

- 01 May 2009 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: A room impulse response is assumed to consist of three parts: a direct-path response, early reflections and late reverberations, which is known to be a major cause of ASR performance degradation.

...read moreread less

Collapse