Robust speech recognition through selection of speaker and environment transforms
References
1,755 citations
"Robust speech recognition through s..." refers methods in this paper
...Two broad approaches to speaker-normalization are speaker-adaptation based approaches such as Maximum Likelihood Linear Regres sion (MLLR) or Constrained-MLLR (CMLLR) [1] and Vocal-tract Length Normalization (VTLN) [2]....
[...]
...The pre-computed environment transforms were obtained from training data using CMLLR framework while the speaker transform are a set of Linear-VTLN matrices corresponding to the range of warp factors....
[...]
...(1)(c) also shows the results for the best case (which is upper bound) where in CMLLR transform is used instead of VTLN as speaker transform....
[...]
...Once the speaker variability is removed from the features we can use all the train utterances collected in a specific noise environment (e.g. car noise, restaurant etc.) at different noise levels (e.g very noisy, noisy, less noisy, clean) and estimate environment noise specific CMLLR transforms....
[...]
...In [10] cascade of CMLLR transforms are used which enables the use of transform estimated in one environment to be used with same speaker in another environment....
[...]
480 citations
"Robust speech recognition through s..." refers methods in this paper
...Combination of VTS with VTLN [7] and VTS with MLLR are studied in [8]....
[...]
...In the histogram based approaches, adequate speech data is required to get robust estimates of the quantiles, while in the VTS based approach the noise models are obtained from the first few and last few frames of the utterance....
[...]
...Two com monly used noise-compensation approaches are those based on his togram equalization (HEQ) [3] and those based on Vector Taylor Se ries (VTS) [4]....
[...]
...Two commonly used noise-compensation approaches are those based on histogram equalization (HEQ) [3] and those based on Vector Taylor Series (VTS) [4]....
[...]
...nonlinear compensation techniques like HEQ and VTS with VTLN [5][7]....
[...]
338 citations
"Robust speech recognition through s..." refers methods in this paper
...Two broad approaches to speaker-normalization are speaker-adaptation based approaches such as Maximum Likelihood Linear Regres sion (MLLR) or Constrained-MLLR (CMLLR) [1] and Vocal-tract Length Normalization (VTLN) [2]....
[...]
332 citations
"Robust speech recognition through s..." refers methods in this paper
...Two commonly used noise-compensation approaches are those based on histogram equalization (HEQ) [3] and those based on Vector Taylor Series (VTS) [4]....
[...]
...Two com monly used noise-compensation approaches are those based on his togram equalization (HEQ) [3] and those based on Vector Taylor Se ries (VTS) [4]....
[...]
...nonlinear compensation techniques like HEQ and VTS with VTLN [5][7]....
[...]
...Combination of VTLN with HEQ is studied in [5][6]....
[...]
41 citations
"Robust speech recognition through s..." refers methods in this paper
...Further, in [9][10], the noise and environments transforms have to be estimated using test utterances as adaptation data....
[...]
...Gales [9] proposed the acoustic-factorization approach to separate the noise and speaker effects and uses cluster-adaptive (CAT) approach for environment transform estimation and MLLR for speaker-transform estimation....
[...]