In search of better pronunciation models for speech recognition

doi:10.1016/S0167-6393(99)00034-5

Journal ArticleDOI

In search of better pronunciation models for speech recognition

Nick Cremelie, +1 more

- 01 Nov 1999 -

Speech Communication

- Vol. 29, Iss: 2, pp 115-136

TLDR

A method for upgrading initially simple pronunciation models to new models that can explain several pronunciation variants of each word, and the introduction of such variants in a segment-based recognizer significantly improves the recognition accuracy.

About:

This article is published in Speech Communication.The article was published on 1999-11-01. It has received 63 citations till now. The article focuses on the topics: Pronunciation & Word error rate.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Modeling pronunciation variation for ASR

Helmer Strik, +1 more

- 01 Nov 1999 -

Speech Communication

TL;DR: This contribution provides an overview of the publications on pronunciation variation modeling in automatic speech recognition, paying particular attention to the papers in this special issue and the papers presented at 'the Rolduc workshop'.

...read moreread less

Patent

Method and apparatus for constructing and using syllable-like unit language models

Mei-Yuh Hwang, +2 more

TL;DR: In this paper, a method and computer-readable medium use syllable-like units (SLUs) to decode a pronunciation into a phonetic description, which are generally larger than a single phoneme but smaller than a word.

...read moreread less

Journal ArticleDOI

Pronunciation modeling for ASR – knowledge-based and data-derived methods

Mirjam Wester

- 01 Jan 2003 -

Computer Speech & Language

TL;DR: A comparison between the knowledge-based and data-derived methods showed that 17% of variants generated by the phonological rules were also found using phone recognition, and this increases to 46% when the phone recognition output is smoothed by using D-trees.

...read moreread less

PatentDOI

Method for adding phonetic descriptions to a speech recognition lexicon

Mei-Yuh Hwang, +2 more

- 26 Dec 2000 -

Journal of the Acoustical Society of Ame...

TL;DR: In this paper, a method and computer-readable medium convert the text of a word and a user's pronunciation of the word into a phonetic description to be added to a speech recognition lexicon.

...read moreread less

Journal ArticleDOI

A data-driven method for modeling pronunciation variation

Judith M. Kessens, +2 more

- 01 Jun 2003 -

Speech Communication

TL;DR: This analysis shows that although modeling pronunciation variation brings about improvements, deteriorations are also introduced and it is not possible to improve ASR performance by excluding the rules that cause deteriorations, because these rules also produce a considerable number of improvements.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Speaker-independent phone recognition using hidden Markov models

Kai-Fu Lee, +1 more

- 01 Nov 1989 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: The authors introduce the co-occurrence smoothing algorithm, which enables accurate recognition even with very limited training data, and can be used as benchmarks to evaluate future systems.

...read moreread less

Journal ArticleDOI

An application of recurrent nets to phone probability estimation

A.J. Robinson

- 01 Mar 1994 -

IEEE Transactions on Neural Networks

TL;DR: Recognition results are presented for the DARPA TIMIT and Resource Management tasks, and it is concluded that recurrent nets are competitive with traditional means for performing phone probability estimation.

...read moreread less

Proceedings ArticleDOI

A probabilistic framework for feature-based speech recognition

James Glass, +2 more

TL;DR: This paper examines a maximum a-posteriori decoding strategy for feature-based recognizers and develops a normalization criterion that is useful for a segment-based Viterbi or A* search.

...read moreread less

Proceedings Article

High performance speaker-independent phone recognition using CDHMM.

Lori Lamel, +1 more

TL;DR: It is shown that it is worthwhile to perform phone recognition experiments as opposed to only focusing attention on word recognition results, and high phone accuracies on three corpora: WSJ0, BREF and TIMIT are reported.

...read moreread less