Jean-Luc Gauvain

Researcher at Centre national de la recherche scientifique

Publications -  222
Citations -  11110

Jean-Luc Gauvain is an academic researcher from Centre national de la recherche scientifique. The author has contributed to research in topics including language models and word error rate. The author has an h-index of 48 and has co-authored 222 publications receiving 10838 citations. Previous affiliations of Jean-Luc Gauvain include Vocapia Research and Bell Labs.

Papers
Journal Article

Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains

TL;DR: A framework for maximum a posteriori (MAP) estimation of hidden Markov models (HMMs) is presented, and Bayesian learning is shown to serve as a unified approach for a wide range of speech recognition applications.
Book Chapter

Neural Probabilistic Language Models

TL;DR: This work proposes to fight the curse of dimensionality by learning a distributed representation for words which allows each training sentence to inform the model about an exponential number of semantically neighboring sentences, and incorporates this new language model into a state-of-the-art speech recognizer of conversational speech.
Journal Article

The LIMSI Broadcast News transcription system

TL;DR: Development work in moving from laboratory read speech data to real-world or 'found' speech data, in preparation for the DARPA evaluations on this task from 1996 to 1999, is described.
Journal Article

Lightly supervised and unsupervised acoustic model training

TL;DR: Experiments providing supervision only via the language model training materials show that including texts which are contemporaneous with the audio data is not crucial for success of the approach, and that the acoustic models can be initialized with as little as 10 min of manually annotated data.
Proceedings Article

BREF, a large vocabulary spoken corpus for French.

TL;DR: This paper presents some of the design considerations of BREF, a large read-speech corpus for French designed to provide continuous speech data for the development of dictation machines, for the evaluation of continuous speech recognition systems, and for the study of phonological variations.