Journal ArticleDOI
Modeling pronunciation variation for ASR
Helmer Strik,Catia Cucchiarini +1 more
Reads0
Chats0
TLDR
This contribution provides an overview of the publications on pronunciation variation modeling in automatic speech recognition, paying particular attention to the papers in this special issue and the papers presented at 'the Rolduc workshop'.About:
This article is published in Speech Communication.The article was published on 1999-11-01. It has received 259 citations till now.read more
Citations
More filters
Journal ArticleDOI
Automatic speech recognition and speech variability: A review
Mohamed Faouzi BenZeghiba,R. De Mori,Olivier Deroo,Stéphane Dupont,T. Erbes,D. Jouvet,Luciano Fissore,Pietro Laface,Alfred Mertins,Christophe Ris,Richard Rose,Vivek Tyagi,Christian Wellekens +12 more
TL;DR: Current advances related to automatic speech recognition (ASR) and spoken language systems and deficiencies in dealing with variation naturally present in speech are outlined.
Massive reduction in conversational American English
TL;DR: The English are a lazy lot, and will not speak a word as it should be spoken when they can slide through it as discussed by the authors. Why be bothered to say extraordinary when you can get away with strawdiny?... Many of the Oxford Cockneys are weaklings too languid or emasculated to speak their noble language with any vigor.
Journal ArticleDOI
The Buckeye corpus of conversational speech: labeling conventions and a test of transcriber reliability
TL;DR: The method used to elicit and record the speech is described, followed by a description of the protocol that was developed to phonemically label what talkers said, and the results of a test of labeling consistency are presented.
Patent
Method and apparatus for assembling a prediction list of name pronunciation variations for use during speech recognition
TL;DR: In this article, a method and apparatus is provided for generating a plurality of plausible pronunciations for a proper name, the method or apparatus for use in performing speech recognition of utterances comprising the proper name by individuals within a given population of speakers.
Journal ArticleDOI
Pronunciation modeling by sharing Gaussian densities across phonetic models
TL;DR: In this paper, instead of allowing a phoneme in the canonical pronunciation to be realized as one of a few distinct alternate phones, the hidden Markov model (HMM) states of the phoneme?s model are instead allowed to share Gaussian mixture components with the HMM states of model(s) of the alternate realization(s).
References
More filters
Journal ArticleDOI
Language Style as Audience Design
TL;DR: The basic principle of language style is that an individual speaker does not always talk in the same way on all occasions as discussed by the authors, which is one of the most challenging aspects of sociolinguistic variation.
Journal ArticleDOI
Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion
Iain R. Murray,John L. Arnott +1 more
TL;DR: The voice parameters affected by emotion are found to be of three main types: voice quality, utterance timing, and utterance pitch contour.
Book
Autosegmental and Metrical Phonology
TL;DR: Autosegmental representation the skeletal tier the syllable metrical phonology lexical phonology further issues as discussed by the authors, which is not the case in this paper, are discussed.