Journal Article

Modeling pronunciation variation for ASR

Helmer Strik, +1 more
- 01 Nov 1999 - 
- Vol. 29, Iss: 2, pp 225-246
TLDR
This contribution provides an overview of the publications on pronunciation variation modeling in automatic speech recognition, paying particular attention to the papers in this special issue and the papers presented at 'the Rolduc workshop'.
About
This article was published in Speech Communication on 1999-11-01. It has received 259 citations to date.


Citations
Journal Article

Automatic speech recognition and speech variability: A review

TL;DR: Current advances related to automatic speech recognition (ASR) and spoken language systems and deficiencies in dealing with variation naturally present in speech are outlined.

Massive reduction in conversational American English

TL;DR: The English are a lazy lot, and will not speak a word as it should be spoken when they can slide through it. Why be bothered to say extraordinary when you can get away with strawdiny?... Many of the Oxford Cockneys are weaklings too languid or emasculated to speak their noble language with any vigor.
Journal Article

The Buckeye corpus of conversational speech: labeling conventions and a test of transcriber reliability

TL;DR: The method used to elicit and record the speech is described, followed by a description of the protocol that was developed to phonemically label what talkers said, and the results of a test of labeling consistency are presented.
Patent

Method and apparatus for assembling a prediction list of name pronunciation variations for use during speech recognition

TL;DR: A method and apparatus are provided for generating a plurality of plausible pronunciations for a proper name, for use in performing speech recognition of utterances comprising the proper name by individuals within a given population of speakers.
Journal Article

Pronunciation modeling by sharing Gaussian densities across phonetic models

TL;DR: In this paper, instead of allowing a phoneme in the canonical pronunciation to be realized as one of a few distinct alternate phones, the hidden Markov model (HMM) states of the phoneme's model are allowed to share Gaussian mixture components with the HMM states of the model(s) of the alternate realization(s).
References
Journal Article

Language Style as Audience Design

TL;DR: The basic principle of language style is that an individual speaker does not always talk in the same way on all occasions, which is one of the most challenging aspects of sociolinguistic variation.
Journal Article

Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion

TL;DR: The voice parameters affected by emotion are found to be of three main types: voice quality, utterance timing, and utterance pitch contour.
Book

Autosegmental and Metrical Phonology

TL;DR: Topics covered: autosegmental representation, the skeletal tier, the syllable, metrical phonology, lexical phonology, and further issues.
Journal Article

Principles of Phonetics

Frances Ingemann, +1 more
- 01 Mar 1997 - 