Use of semi-Markov models for speaker-independent phoneme recognition

doi:10.1109/ICASSP.1992.225845

Proceedings Article

- Vol. 1, pp 565-568


Abstract:

Hidden Markov models (HMMs) have been used to model speech in many areas of speech processing. One characteristic of the HMM is that the time spent in a particular state, or state occupancy, is geometrically distributed. This becomes a serious limitation and results in inaccurate modeling when HMMs are used for phoneme recognition. The authors use hidden semi-Markov models (HSMMs) to overcome this limitation. Semi-Markov models are a more general class of Markov chains in which the state occupancy can be explicitly modeled by an arbitrary probability mass distribution. The authors use non-parametric distributions to describe the state occupancies instead of parametric distributions such as gamma, Poisson, or binomial, because analysis of actual data shows that the durations of some phonemes could not be approximated by any of these. Preliminary tests conducted using only the linear prediction coding (LPC) cepstrum as features have shown that the use of HSMMs increased the phoneme recognition accuracy to 53.7% from the 48.4% obtained using an HMM.
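
The contrast between the two duration models can be illustrated with a minimal sketch. It is not the authors' implementation. The function names and the sample durations below are hypothetical and chosen only for illustration. The sketch assumes a single HMM state with self-transition probability a, so that the implied occupancy is P(d) = a^(d-1) (1 - a), and compares it with a non-parametric occupancy distribution estimated as a normalised histogram of observed durations.

```python
from collections import Counter


def geometric_duration_pmf(self_loop_prob, max_duration):
    """Occupancy implied by an HMM state: P(d) = a**(d-1) * (1 - a), d = 1..max_duration."""
    return [self_loop_prob ** (d - 1) * (1.0 - self_loop_prob)
            for d in range(1, max_duration + 1)]


def empirical_duration_pmf(durations, max_duration):
    """Non-parametric occupancy: normalised histogram of observed durations (in frames)."""
    counts = Counter(d for d in durations if 1 <= d <= max_duration)
    total = sum(counts.values())
    return [counts[d] / total for d in range(1, max_duration + 1)]


def mode(pmf):
    """Most probable duration (1-based index of the largest PMF entry)."""
    return max(range(len(pmf)), key=pmf.__getitem__) + 1


# Hypothetical per-segment durations in frames (made-up values, not data from the paper).
observed = [4, 5, 5, 6, 6, 6, 7, 7, 8, 5, 6, 7]
max_d = 12

hsmm_pmf = empirical_duration_pmf(observed, max_d)

# Choose a so that the geometric mean duration 1 / (1 - a) matches the sample mean.
mean_duration = sum(observed) / len(observed)
a = 1.0 - 1.0 / mean_duration
hmm_pmf = geometric_duration_pmf(a, max_d)

hmm_mode = mode(hmm_pmf)
hsmm_mode = mode(hsmm_pmf)
print("most probable duration, geometric (HMM):", hmm_mode)     # 1
print("most probable duration, histogram (HSMM):", hsmm_mode)   # 6
```

Even with the mean duration matched, the geometric distribution always puts its largest mass on a one-frame stay and decays from there, while the histogram can peak at whatever duration the data favour. This is the mismatch the abstract describes.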

Citations

Journal Article

Hidden semi-Markov models

Shun-Zheng Yu

- 01 Feb 2010 -

Artificial Intelligence

TL;DR: An overview of HSMMs is presented, covering modelling, inference, estimation, implementation, and applications. HSMMs have been applied in thirty scientific and engineering areas, including speech recognition/synthesis, human activity recognition/prediction, handwriting recognition, functional MRI brain mapping, and network anomaly detection.

Book

Hidden Semi-Markov Models: Theory, Algorithms and Applications

Shun-Zheng Yu

TL;DR: The book shows how to master the basic techniques needed for using HSMMs and how to apply them, and describes applications in areas including human activity recognition, handwriting recognition, network traffic characterization and anomaly detection, and functional MRI brain mapping.

Dissertation

HMM-based Speech Synthesis Using an Acoustic Glottal Source Model

João P. Cabral

TL;DR: A new approach is proposed that uses an acoustic glottal source model in HMM-based synthesisers to improve speech quality and parametric flexibility, so that voice characteristics can be better modelled and transformed.

Journal Article

Sticky Hidden Markov Modeling of Comparative Genomic Hybridization

Lan Du, +3 more

- 01 Oct 2010 -

IEEE Transactions on Signal Processing

TL;DR: A sticky hidden Markov model with a Dirichlet distribution (DD) prior is developed, motivated by the problem of analyzing comparative genomic hybridization (CGH) data, and the form of the proposed hierarchical model allows efficient variational Bayesian (VB) inference.

Optimal Curve Fitting of Speech Signal for Disabled Children

Anandthirtha B. Gudi, +3 more

TL;DR: In this paper, the amplitude profile of sampled speech data was fitted with a sum of sine functions at a confidence level above 90%. An amplitude correlation technique was applied between the original speech signal samples of normal and pathological subjects, and also between the curve-fit constants of normal and pathological subjects.

References

Journal Article

Speaker-independent phone recognition using hidden Markov models

Kai-Fu Lee, +1 more

- 01 Nov 1989 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: The authors introduce the co-occurrence smoothing algorithm, which enables accurate recognition even with very limited training data, and report results that can be used as benchmarks to evaluate future systems.

Journal Article

Continuously variable duration hidden Markov models for automatic speech recognition

Stephen E. Levinson

- 01 Mar 1986 -

Computer Speech & Language

TL;DR: The solution proposed here is to replace the probability distributions of duration with continuous probability density functions to form a continuously variable duration hidden Markov model (CVDHMM) which is ideally suited to specification of the durational density.

Journal Article

On the application of vector quantization and hidden Markov models to speaker-independent, isolated word recognition

Lawrence R. Rabiner, +2 more

- 01 Apr 1983 -

Bell System Technical Journal

TL;DR: This paper presents an approach to speaker-independent, isolated word recognition in which the well-known techniques of vector quantization and hidden Markov modeling are combined with a linear predictive coding analysis front end in the framework of a standard statistical pattern recognition model.

Journal Article

Recognition of isolated digits using hidden Markov models with continuous mixture densities

Lawrence R. Rabiner, +3 more

- 08 Jul 1985 -

AT&T technical journal

TL;DR: This paper extends previous work on isolated-word recognition based on hidden Markov models by replacing the discrete symbol representation of the speech signal with a continuous Gaussian mixture density, thereby eliminating the inherent quantization error introduced by the discrete representation.

Proceedings Article

Explicit modelling of state occupancy in hidden Markov models for automatic speech recognition

Martin J. Russell, +1 more

TL;DR: Results have been presented which show that these semi-Markov models provide an appropriate framework for modelling durational structure and can lead to significant improvements in recognition accuracy.
