Joint Learning of Phonetic Units and Word Pronunciations for ASR
Citations
166 citations
Cites background from "Joint Learning of Phonetic Units an..."
...ples include grapheme-to-phoneme conversion [2], pronunciation learning [15, 10], and joint learning of phonetic units and word pronunciations [1, 9]....
158 citations
Cites background from "Joint Learning of Phonetic Units an..."
...Recent work has introduced models that do not require pronunciation lexicons, but train only on speech with text transcriptions (Lee et al., 2013; Maas et al., 2015; Graves et al., 2006)....
112 citations
Cites methods from "Joint Learning of Phonetic Units an..."
...2 and employ the backward message-passing and forward-sampling algorithm described in Lee et al. (2013), designed for aligning a letter sequence and speech signals, to propose samples for ~vi and zi....
References
6,081 citations
"Joint Learning of Phonetic Units an..." refers to methods in this paper
...We employ Gibbs sampling (Gelman et al., 2004) to approximate the posterior distribution of the latent variables in our model....
1,024 citations
"Joint Learning of Phonetic Units an..." refers to methods in this paper
...These K HMMs are used to model the phonetic units in the language (Jelinek, 1976)....
781 citations
Additional excerpts
...Conventionally, to train a context-dependent acoustic model, a list of questions based on the linguistic properties of phonetic units is required for growing decision tree classifiers (Young et al., 1994)....