Modeling pronunciation variation for ASR

doi:10.1016/S0167-6393(99)00038-2

Journal Article

Helmer Strik, +1 more

- 01 Nov 1999 -

Speech Communication

- Vol. 29, Iss: 2, pp 225-246

TL;DR: This contribution provides an overview of the publications on pronunciation variation modeling in automatic speech recognition, paying particular attention to the papers in this special issue and the papers presented at 'the Rolduc workshop'.

About:

This article was published in Speech Communication on 1999-11-01. It has received 259 citations to date.

Citations

Journal Article

Automatic speech recognition and speech variability: A review

Mohamed Faouzi BenZeghiba, +12 more

- 01 Oct 2007 -

Speech Communication

TL;DR: Outlines current advances in automatic speech recognition (ASR) and spoken language systems, along with their deficiencies in dealing with the variation naturally present in speech.

Massive reduction in conversational American English

Keith A. Johnson

TL;DR: The English are a lazy lot, and will not speak a word as it should be spoken when they can slide through it. Why be bothered to say extraordinary when you can get away with strawdiny?... Many of the Oxford Cockneys are weaklings too languid or emasculated to speak their noble language with any vigor.

Journal Article

The Buckeye corpus of conversational speech: labeling conventions and a test of transcriber reliability

Mark A. Pitt, +4 more

- 01 Jan 2005 -

Speech Communication

TL;DR: The method used to elicit and record the speech is described, followed by a description of the protocol that was developed to phonemically label what talkers said, and the results of a test of labeling consistency are presented.

Patent

Method and apparatus for assembling a prediction list of name pronunciation variations for use during speech recognition

George Anton Kiraz, +2 more

TL;DR: A method and apparatus are provided for generating a plurality of plausible pronunciations for a proper name, for use in performing speech recognition of utterances comprising the proper name by individuals within a given population of speakers.

Journal Article

Pronunciation modeling by sharing Gaussian densities across phonetic models

Murat Saraclar, +2 more

- 01 Apr 2000 -

Computer Speech & Language

TL;DR: In this paper, instead of allowing a phoneme in the canonical pronunciation to be realized as one of a few distinct alternate phones, the hidden Markov model (HMM) states of the phoneme's model are allowed to share Gaussian mixture components with the HMM states of the model(s) of the alternate realization(s).

References

Journal Article

Language Style as Audience Design

Allan Bell

- 01 Jun 1984 -

Language in Society

TL;DR: The basic principle of language style is that an individual speaker does not always talk in the same way on all occasions.

Journal Article

Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion

Iain R. Murray, +1 more

- 01 Feb 1993 -

Journal of the Acoustical Society of Ame...

TL;DR: The voice parameters affected by emotion are found to be of three main types: voice quality, utterance timing, and utterance pitch contour.

Book

Autosegmental and Metrical Phonology

John Goldsmith

TL;DR: Topics covered: autosegmental representation, the skeletal tier, the syllable, metrical phonology, lexical phonology, and further issues.

Book