Towards increasing speech recognition error rates

doi:10.1016/0167-6393(96)00003-9

Journal ArticleDOI

Towards increasing speech recognition error rates

Hervé Bourlard, +5 more

- 01 May 1996 -

Speech Communication

- Vol. 18, Iss: 3, pp 205-231

Chats0

TLDR

In this article, the authors discuss some research directions for ASR that may not always yield an immediate and guaranteed decrease in error rate but which hold some promise for ultimately improving performance in the end applications, including discrimination between rival utterance models, the role of prior information in speech recognition, merging the language and acoustic models, feature extraction and temporal information, and decoding procedures reflecting human perceptual properties.

About:

This article is published in Speech Communication.The article was published on 1996-05-01. It has received 182 citations till now. The article focuses on the topics: Word error rate.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Speech recognition by machines and humans

Richard P. Lippmann

- 01 Jul 1997 -

Speech Communication

TL;DR: Comparisons suggest that the human-machine performance gap can be reduced by basic research on improving low-level acoustic-phonetic modeling, on improving robustness with noise and channel variability, and on more accurately modeling spontaneous speech.

...read moreread less

MonographDOI

Text-to-Speech Synthesis

Paul Taylor

TL;DR: Text-to-Speech Synthesis provides an in-depth explanation of all aspects of current speech synthesis technology, and is designed for graduate students in electrical engineering, computer science, and linguistics.

...read moreread less

Journal ArticleDOI

Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition

Jort F. Gemmeke, +2 more

- 01 Sep 2011 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: The results show that the hybrid system performed substantially better than source separation or missing data mask estimation at lower signal-to-noise ratios (SNRs), achieving up to 57.1% accuracy at SNR = -5 dB.

...read moreread less

Journal ArticleDOI

Invited paper: Automatic speech recognition: History, methods and challenges

Douglas D. O'Shaughnessy

- 01 Oct 2008 -

Pattern Recognition

TL;DR: This tutorial examines the problem area, its methods, successes and failures, focusing on the nature of the speech signal and techniques to accomplish useful data reduction, and compares it with other areas of PR.

...read moreread less

Journal ArticleDOI

Interacting with computers by voice: automatic speech recognition and synthesis

Douglas O'Shaughnessy

TL;DR: This paper examines how people communicate with computers using speech, and the popular mathematical model called the hidden Markov model (HMM) is examined; first-order HMMs are efficient but ignore long-range correlations in actual speech.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Maximum likelihood from incomplete data via the EM algorithm

Arthur P. Dempster, +2 more

- 01 Sep 1977 -

Journal of the royal statistical society...

Journal ArticleDOI

A tutorial on hidden Markov models and selected applications in speech recognition

Lawrence R. Rabiner

TL;DR: In this paper, the authors provide an overview of the basic theory of hidden Markov models (HMMs) as originated by L.E. Baum and T. Petrie (1966) and give practical details on methods of implementation of the theory along with a description of selected applications of HMMs to distinct problems in speech recognition.

...read moreread less

Journal ArticleDOI

Handbook of Sensory Physiology

M. D Sanders

- 01 Feb 1975 -

British Journal of Ophthalmology

Journal ArticleDOI

Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences

S. Davis, +1 more

- 01 Aug 1980 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: In this article, several parametric representations of the acoustic signal were compared with regard to word recognition performance in a syllable-oriented continuous speech recognition system, and the emphasis was on the ability to retain phonetically significant acoustic information in the face of syntactic and duration variations.

...read moreread less

Journal Article

Maximum likelihood estimation from incomplete data via the EM algorithm

A. Dempster

- 01 Jan 1977 -

Journal of the Royal Statistical Society

Collapse

Towards increasing speech recognition error rates

Citations

Speech recognition by machines and humans

Text-to-Speech Synthesis

Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition

Invited paper: Automatic speech recognition: History, methods and challenges

Interacting with computers by voice: automatic speech recognition and synthesis

References

Maximum likelihood from incomplete data via the EM algorithm

A tutorial on hidden Markov models and selected applications in speech recognition

Handbook of Sensory Physiology

Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences

Maximum likelihood estimation from incomplete data via the EM algorithm

Related Papers (5)

Fundamentals of speech recognition

Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences

RASTA processing of speech

Perceptual linear predictive (PLP) analysis of speech

Statistical methods for speech recognition