Acoustic and lexical resource constrained ASR using language-independent acoustic model and language-dependent probabilistic lexical model

doi:10.1016/J.SPECOM.2014.12.006

Open AccessJournal ArticleDOI

Acoustic and lexical resource constrained ASR using language-independent acoustic model and language-dependent probabilistic lexical model

Ramya Rasipuram, +1 more

- 01 Apr 2015 -

Speech Communication

- Vol. 68, pp 23-40

TLDR

This paper shows that the relationship between lexical units and acoustic features can be factored into two parts through a latent variable, namely, an acoustic model and a lexical model and proposes an approach that addresses both acoustic and phonetic lexical resource constraints in ASR system development.

About:

This article is published in Speech Communication.The article was published on 2015-04-01 and is currently open access. It has received 23 citations till now. The article focuses on the topics: Acoustic model & Literature survey.

Citations

PDF

Open Access

More filters

IEEE transactions on pattern analysis and machine intelligence

Ieee Xplore

TL;DR: This special issue aims at gathering the recent advances in learning with shared information methods and their applications in computer vision and multimedia analysis and addressing interesting real-world computer Vision and multimedia applications.

...read moreread less

Journal ArticleDOI

Regularized Speaker Adaptation of KL-HMM for Dysarthric Speech Recognition

Myung Jong Kim, +4 more

TL;DR: A speaker adaptation method based on a combination of L2 regularization and confusion-reducing regularization, which can enhance discriminability between categorical distributions of the KL-HMM states while preserving speaker-specific information is proposed.

...read moreread less

Journal ArticleDOI

Articulatory feature based continuous speech recognition using probabilistic lexical modeling

Ramya Rasipuram, +1 more

- 01 Mar 2016 -

Computer Speech & Language

TL;DR: Analysis of the probabilistic relationship captured by the parameters has shown that the approach is capable of adapting the knowledge-based phoneme-to-AF representations using speech data; and allows different AFs to evolve asynchronously.

...read moreread less

Proceedings ArticleDOI

On modeling context-dependent clustered states: Comparing HMM/GMM, hybrid HMM/ANN and KL-HMM approaches

Marzieh Razavi, +2 more

TL;DR: It is shown that in KL-HMM framework the authors may not require as many clustered states as the best HMM/GMM system in the ANN output layer, which has broader implications on model complexity and data sparsity issues.

...read moreread less

Journal ArticleDOI

Acoustic data-driven grapheme-to-phoneme conversion in the probabilistic lexical modeling framework

Marzieh Razavi, +3 more

- 01 Jun 2016 -

Speech Communication

TL;DR: The recently proposed acoustic G2P approach in the Kullback Leibler divergence-based HMM (KL-HMM) framework is a particular case of this formalism, and experimental studies on English and French show that despite relatively poor performance at the pronunciation level, the performance of the proposed approach is not significantly different than the state-of-the-art G 2P methods at the ASR level.

...read moreread less

References

PDF

Open Access

More filters

Journal ArticleDOI

A tutorial on hidden Markov models and selected applications in speech recognition

Lawrence R. Rabiner

TL;DR: In this paper, the authors provide an overview of the basic theory of hidden Markov models (HMMs) as originated by L.E. Baum and T. Petrie (1966) and give practical details on methods of implementation of the theory along with a description of selected applications of HMMs to distinct problems in speech recognition.

...read moreread less

Journal ArticleDOI

Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups

Geoffrey E. Hinton, +10 more

- 18 Oct 2012 -

IEEE Signal Processing Magazine

TL;DR: This article provides an overview of progress and represents the shared views of four research groups that have had recent successes in using DNNs for acoustic modeling in speech recognition.

...read moreread less

Journal ArticleDOI

Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition

George E. Dahl, +3 more

- 01 Jan 2012 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: A pre-trained deep neural network hidden Markov model (DNN-HMM) hybrid architecture that trains the DNN to produce a distribution over senones (tied triphone states) as its output that can significantly outperform the conventional context-dependent Gaussian mixture model (GMM)-HMMs.

...read moreread less

Journal Article

Deep Neural Networks for Acoustic Modeling in Speech Recognition

Geoffrey E. Hinton, +10 more

- 01 Nov 2012 -

IEEE Signal Processing Magazine

TL;DR: This paper provides an overview of this progress and repres nts the shared views of four research groups who have had recent successes in using deep neural networks for a coustic modeling in speech recognition.

...read moreread less

IEEE transactions on pattern analysis and machine intelligence

Ieee Xplore

TL;DR: This special issue aims at gathering the recent advances in learning with shared information methods and their applications in computer vision and multimedia analysis and addressing interesting real-world computer Vision and multimedia applications.

...read moreread less

Collapse

Speech Communication

Probabilistic classification of HMM states for large vocabulary continuous speech recognition

Xiaoqiang Luo, +1 more

Acoustic and lexical resource constrained ASR using language-independent acoustic model and language-dependent probabilistic lexical model

Citations

IEEE transactions on pattern analysis and machine intelligence

Regularized Speaker Adaptation of KL-HMM for Dysarthric Speech Recognition

Articulatory feature based continuous speech recognition using probabilistic lexical modeling

On modeling context-dependent clustered states: Comparing HMM/GMM, hybrid HMM/ANN and KL-HMM approaches

Acoustic data-driven grapheme-to-phoneme conversion in the probabilistic lexical modeling framework

References

A tutorial on hidden Markov models and selected applications in speech recognition

Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups

Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition

Deep Neural Networks for Acoustic Modeling in Speech Recognition

IEEE transactions on pattern analysis and machine intelligence

Related Papers (5)

Using KL-based Acoustic Models in a Large Vocabulary Recognition Task

Comparing different acoustic modeling techniques for multilingual boosting

An Acoustic Model Based on Kullback-Leibler Divergence for Posterior Features

Joint-sequence models for grapheme-to-phoneme conversion

Probabilistic classification of HMM states for large vocabulary continuous speech recognition