FMLLR Speaker Normalization With i-Vector: In Pseudo-FMLLR and Distillation Framework
Citations
10 citations
4 citations
Cites background from "FMLLR Speaker Normalization With i-..."
...In [17], the authors introduced an auxiliary network to generate psuedo-fMLLR features from filterbank...
[...]
4 citations
Cites methods from "FMLLR Speaker Normalization With i-..."
...Generalized distillation is extended and also applied to speech normalization tasks [25], [26]....
[...]
2 citations
Cites background from "FMLLR Speaker Normalization With i-..."
...Where S denotes the super-vector associated with the speaker and the channel; m denotes the super-vector independent of the speaker and the channel; the subspace matrix T of overall variation completes the mapping from the high dimensional space to the low dimensional space, thereby it makes the vector after dimensionality reduction is more conducive to further classifying and recognizing; ω represents the vector associated with the speaker channel, which is a full-variable spatial difference factor containing speaker information and channel information [14]....
[...]
Cites background or methods from "FMLLR Speaker Normalization With i-..."
...Recently these issues were addressed in [9], where the authors proposed to learn a DNN based mapping from filterbank (speaker independent space) to fMLLR-normalized features in a regression framework, and then use these normalized features predicted by the DNN to train acoustic model....
[...]
...To the best of our knowledge, the DA based mapping does not show consistent improvements [9] for speaker adaptation....
[...]
...From the results, it may be concluded that the generalization power of DA to unseen speakers [9] is rather poor....
[...]
...The third model is trained on 41-dimensional log-mel filterbank features with ±2 splicing concatenated with 40-dimensional i-vectors [9] for speaker aware training....
[...]
...Following [9], the baseline DA is trained to map filterbank features to fMLLRnormalized features....
[...]
References
12,857 citations
5,857 citations
"FMLLR Speaker Normalization With i-..." refers methods in this paper
...Kaldi toolkit [27] was used for feature extraction, GMMHMM training and DNN modeling....
[...]
1,250 citations
"FMLLR Speaker Normalization With i-..." refers methods in this paper
...The speech enhancing DNN proposed in [14]–[16] is the inspiration for the proposed method....
[...]
860 citations
"FMLLR Speaker Normalization With i-..." refers methods in this paper
...The speech enhancing DNN proposed in [14]–[16] is the inspiration for the proposed method....
[...]
714 citations
"FMLLR Speaker Normalization With i-..." refers background in this paper
...i-vectors are estimated for each train speaker and is then concatenated with all the filterbank feature frames belonging to that speaker [12]....
[...]
...Speaker codes [6], [9], eigenvectors in speaker space [10], speaker separation bottleneck features [11], and i-vectors [12], [13] are a few examples of auxiliary speaker-specific codes....
[...]