A parametric approach to vocal tract length normalization
Citations
586 citations
Cites methods from "A parametric approach to vocal trac..."
...In [2] a parametric model of vocal tract length normalization reduces the inter-speaker variability of the acoustic space by appropriately warping the frequency axis for each training speaker prior to computing the cepstral coefficients....
[...]
507 citations
Cites background or methods from "A parametric approach to vocal trac..."
...A direct application of the tube resonator model of the vocal tract lead to the different vocal tract length normalization (VTLN) techniques: speaker-dependent formant mapping (Di Benedetto and Liénard, 1992; Wakita, 1977), transformation of the LPC pole modeling (Slifka and Anderson, 1995), frequency warping, either linear (Eide and Gish, 1996; Lee and Rose, 1996; Tuerk and Robinson, 1993; Zhan and Westphal, 1997) or non-linear (Ono et al....
[...]
...The estimation of the VTL factor can either be perform by a maximum likelihood approach (Lee and Rose, 1996; Zhan and Waibel, 1997) or from a direct estimation of the formant positions (Eide and Gish, 1996; Lincoln et al., 1997)....
[...]
...…Benedetto and Liénard, 1992; Wakita, 1977), transformation of the LPC pole modeling (Slifka and Anderson, 1995), frequency warping, either linear (Eide and Gish, 1996; Lee and Rose, 1996; Tuerk and Robinson, 1993; Zhan and Westphal, 1997) or non-linear (Ono et al., 1993), all consist of…...
[...]
...Note that VTLN is often combined with an adaptation of the acoustic model to the canonical speaker (Eide and Gish, 1996; Lee and Rose, 1996) (cf. Section 4.2.1)....
[...]
...Note that VTLN is often combined with an adaptation of the acoustic model to the canonical speaker (Eide and Gish, 1996; Lee and Rose, 1996) (cf....
[...]
338 citations
244 citations
Cites background from "A parametric approach to vocal trac..."
...As in the past, we expect that further research and development will enable us to create increasingly powerful systems, deployable on a worldwide basis....
[...]
221 citations
Cites background or methods from "A parametric approach to vocal trac..."
...In [36], a parametric approach is suggested which eliminates much of the computational overhead associated with the exhaustive search for the optimal scaling factor....
[...]
...In VTN [67, 36] a scaling factor } is applied to the preprocessed speech signal to achieve a linear frequency warping, ~ } ~ (2....
[...]
References
498 citations
103 citations