Journal ArticleDOI
Frequency warping for VTLN and speaker adaptation by linear transformation of standard MFCC
TLDR
The performance of the new LT was comparable to that of regular VTLN implemented by warping the Mel filterbank, when the MLS criterion was used for FW estimation, and it is shown that the approximations involved do not lead to any performance degradation.About:
This article is published in Computer Speech & Language.The article was published on 2009-01-01. It has received 46 citations till now.read more
Citations
More filters
Journal ArticleDOI
Acoustic Analysis for Automatic Speech Recognition
TL;DR: While the main focus in ASR is to obtain spectral envelope measures, human speech communication efficiently exploits the manipulation of one's vocal-cord vibration rate, and so F0 extraction and its integration into ASR are also reviewed.
Dissertation
Addressing pitch Mismatch for Children's Automatic Speech Recognition
TL;DR: In this paper, the effect of variations in each of these acoustic correlates across speech signals is studied on Mel frequency cepstral coefficient (MFCC) features and ASR models.
Journal ArticleDOI
Golden speaker builder – An interactive tool for pronunciation training
Shaojin Ding,Christopher Liberatore,Sinem Sonsaat,Ivana Lucic,Alif Silpachai,Guanlong Zhao,Evgeny Chukharev-Hudilainen,John M. Levis,Ricardo Gutierrez-Osuna +8 more
TL;DR: Golden Speaker Builder is presented, a tool that allows learners to generate a personalized “golden-speaker” voice: one that mirrors their own voice but with a native accent.
Journal ArticleDOI
A study of acoustic-to-articulatory inversion of speech by analysis-by-synthesis using chain matrices and the Maeda articulatory model
TL;DR: A quantitative study of acoustic-to-articulatory inversion for vowel speech sounds by analysis-by-synthesis using the Maeda articulatory model is performed and good agreement between estimated midsagittal VT outlines and measured XRMB tongue pellet positions was achieved.
Journal ArticleDOI
Vocal Tract Length Normalization for Statistical Parametric Speech Synthesis
TL;DR: This paper presents an efficient implementation of VTLN using expectation maximization and addresses the key challenges faced in implementing V TLN for synthesis.
References
More filters
Journal ArticleDOI
Maximum likelihood from incomplete data via the EM algorithm
Journal ArticleDOI
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
S. Davis,Paul Mermelstein +1 more
TL;DR: In this article, several parametric representations of the acoustic signal were compared with regard to word recognition performance in a syllable-oriented continuous speech recognition system, and the emphasis was on the ability to retain phonetically significant acoustic information in the face of syntactic and duration variations.
Journal ArticleDOI
Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
TL;DR: An important feature of the method is that arbitrary adaptation data can be used—no special enrolment sentences are needed and that as more data is used the adaptation performance improves.
Journal ArticleDOI
Maximum likelihood linear transformations for HMM-based speech recognition
TL;DR: The paper compares the two possible forms of model-based transforms: unconstrained, where any combination of mean and variance transform may be used, and constrained, which requires the variance transform to have the same form as the mean transform.
Journal ArticleDOI
Minimum classification error rate methods for speech recognition
TL;DR: The issue of speech recognizer training from a broad perspective with root in the classical Bayes decision theory is discussed, and the superiority of the minimum classification error (MCE) method over the distribution estimation method is shown by providing the results of several key speech recognition experiments.
Related Papers (5)
Vocal tract normalization equals linear transformation in cepstral space
Michael Pitz,Hermann Ney +1 more