Frequency warping for VTLN and speaker adaptation by linear transformation of standard MFCC
Citations
101 citations
59 citations
48 citations
47 citations
Cites methods from "Frequency warping for VTLN and spea..."
...Following Panchapagesan and Alwan [22], we then learn a linear transform between the MFCCs of both speakers using ridge regression: ∗ = argmin ‖ − ‖ (6)...
[...]
...[22] S. Panchapagesan and A. Alwan, "Frequency warping for VTLN and speaker adaptation by linear transformation of standard MFCC," Computer Speech and Language, vol. 23, no. 1, pp. 42-64, Jan 2009....
[...]
...Following Panchapagesan and Alwan [22], we then learn a linear transform between the MFCCs of both speakers using ridge regression: ∗ = argmin ‖ − ‖ (6) where and are vectors of MFCCs from the native and L2 speakers, respectively, and ∗ is the VTLN transform....
[...]
41 citations
References
4,822 citations
2,504 citations
"Frequency warping for VTLN and spea..." refers methods in this paper
...After estimating the LT (see Section 5 below), a bias vector b and an unconstrained variance transform matrix H may be estimated according to the Maximum Likelihood Linear Regression (MLLR) technique (Leggetter and Woodland, 1995; Gales, 1996)....
[...]
1,755 citations
"Frequency warping for VTLN and spea..." refers methods in this paper
...This objective function is identical to the one used for MLLR and CMLLR (constrained MLLR, (Gales, 1998)), except the linear transformation to be estimated is constrained by the FW parametrization....
[...]
...Also, even if the Jacobian determinant term were neglected, the accumulator based approach (Gales, 1998) for efficient optimization of the EM auxiliary function with CLTFW cannot be used with regular VTLN....
[...]
...Different CLTFW transforms can also be estimated for different classes of distributions similar to CMLLR, without much increase in computations, since it is seen from Eq....
[...]
728 citations