Proceedings ArticleDOI

Lexicon-building methods for an acoustic sub-word based speech recognizer

03 Apr 1990, Vol. 1990, pp. 729–732
TL;DR: The use of an acoustic subword unit (ASWU)-based speech recognition system for the recognition of isolated words is discussed and it is shown that the use of a modified k-means algorithm on the likelihoods derived through the Viterbi algorithm provides the best deterministic-type of word lexicon.
Abstract: The use of an acoustic subword unit (ASWU)-based speech recognition system for the recognition of isolated words is discussed. Some methods are proposed for generating the deterministic and the statistical types of word lexicon. It is shown that the use of a modified k-means algorithm on the likelihoods derived through the Viterbi algorithm provides the best deterministic-type of word lexicon. However, the ASWU-based speech recognizer leads to better performance with the statistical type of word lexicon than with the deterministic type. Improving the design of the word lexicon makes it possible to narrow the gap in the recognition performances of the whole word unit (WWU)-based and the ASWU-based speech recognizers considerably. Further improvements are expected by designing the word lexicon better.
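The paper's modified k-means operates on Viterbi-derived likelihoods and is not reproduced here; as a hedged stand-in, the sketch below clusters variable-length acoustic segments by first reducing each segment to its mean frame (the centroid-per-segment representation one citing dissertation attributes to this line of work) and then applying plain k-means with a deterministic farthest-point initialisation. All function names are illustrative.

```python
import numpy as np

def segment_centroids(segments):
    """Represent each variable-length segment (frames x dims) by its mean frame."""
    return np.array([seg.mean(axis=0) for seg in segments])

def kmeans(points, k, iters=20):
    """Lloyd-style k-means with deterministic farthest-point initialisation."""
    # Greedy farthest-point seeding: start from the first point, then
    # repeatedly pick the point farthest from all chosen centroids.
    centroids = [points[0]]
    for _ in range(1, k):
        d = np.min([np.linalg.norm(points - c, axis=1) for c in centroids], axis=0)
        centroids.append(points[int(d.argmax())])
    centroids = np.array(centroids, dtype=float)
    for _ in range(iters):
        # Assign each point to its nearest centroid, then recompute means.
        dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centroids[j] = points[labels == j].mean(axis=0)
    return centroids, labels
```

Representing a whole segment by one centroid frame discards temporal structure, which is one reason the paper's likelihood-based lexicon construction can do better than such a purely geometric clustering.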


Citations
18 Nov 2013

Cites methods from "Lexicon-building methods for an aco..."

  • ...Various methods for the development of word and phoneme lexicon using these ASM have been discussed in [14],[15] and [16]....


Posted Content
TL;DR: This work proposes the integration of a simple space management strategy into the iterative process, and shows experimentally that this leads to no loss in performance in terms of F-measure while guaranteeing that a threshold space complexity is not breached.
Abstract: Agglomerative hierarchical clustering (AHC) requires only the similarity between objects to be known. This is attractive when clustering signals of varying length, such as speech, which are not readily represented in fixed-dimensional vector space. However, AHC is characterised by $O(N^2)$ space and time complexity, making it infeasible for partitioning large datasets. This has recently been addressed by an approach based on the iterative re-clustering of independent subsets of the larger dataset. We show that, due to its iterative nature, this procedure can sometimes lead to unchecked growth of individual subsets, thereby compromising its effectiveness. We propose the integration of a simple space management strategy into the iterative process, and show experimentally that this leads to no loss in performance in terms of F-measure while guaranteeing that a threshold space complexity is not breached.
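The paper's full iterative re-clustering with space management is not reproduced here; the sketch below only illustrates the two ingredients the abstract names: a naive average-link AHC with its O(N^2) pairwise cost, and a simplified one-pass variant in which no subset handed to AHC ever exceeds a size threshold. All names are hypothetical.

```python
import numpy as np

def ahc(points, merge_thresh):
    """Naive average-link agglomerative clustering: O(N^2) space/time.
    Returns a list of clusters, each a list of point indices."""
    clusters = [[i] for i in range(len(points))]
    def avg_dist(a, b):
        return np.mean([np.linalg.norm(points[i] - points[j]) for i in a for j in b])
    while len(clusters) > 1:
        # Find the closest pair of clusters; stop once the best merge
        # would exceed the distance threshold.
        d, a, b = min((avg_dist(clusters[a], clusters[b]), a, b)
                      for a in range(len(clusters))
                      for b in range(a + 1, len(clusters)))
        if d > merge_thresh:
            break
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return clusters

def capped_subset_ahc(points, merge_thresh, max_subset):
    """Cluster independent subsets whose size never exceeds max_subset,
    bounding the quadratic cost of each AHC call."""
    result = []
    for start in range(0, len(points), max_subset):
        idx = np.arange(start, min(start + max_subset, len(points)))
        for cl in ahc(points[idx], merge_thresh):
            result.append([int(idx[i]) for i in cl])
    return result
```

The subset cap is exactly the kind of space-management guarantee the abstract argues for: quality may suffer at subset boundaries, but the working-set size is bounded by construction.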

Cites methods from "Lexicon-building methods for an aco..."

  • ...The clustering of acoustic speech segments was also investigated by Paliwal [13]....


Proceedings ArticleDOI
01 Dec 2016
TL;DR: It is demonstrated that a speech recognition system using these discovered resources can approach the performance of a speech recognizer trained using resources developed by experts.
Abstract: State of the art speech recognition systems use context-dependent phonemes as acoustic units. However, these approaches do not work well for low resourced languages where large amounts of training data or resources such as a lexicon are not available. For such languages, automatic discovery of acoustic units can be important. In this paper, we demonstrate the application of nonparametric Bayesian models to acoustic unit discovery. We show that the discovered units are linguistically meaningful. We also present a semi-supervised learning algorithm that uses a nonparametric Bayesian model to learn a mapping between words and acoustic units. We demonstrate that a speech recognition system using these discovered resources can approach the performance of a speech recognizer trained using resources developed by experts. We show that unsupervised discovery of acoustic units combined with semi-supervised discovery of the lexicon achieved performance (9.8% WER) comparable to other published high complexity systems. This nonparametric approach enables the rapid development of speech recognition systems in low resourced languages.

Cites methods from "Lexicon-building methods for an aco..."

  • ...Paliwal [5] also proposed several methods to discover a lexicon for isolated word speech recognition applications....


  • ...Most approaches to automatic discovery of acoustic units [3]-[5] do this in two steps: segmentation and clustering....


Proceedings Article
01 Jan 2000
TL;DR: In this work, new constraints on the units were introduced: 1) they should contain sufficient statistics of the features and 2) they should contain sufficient statistics of the vocabulary.
Abstract: The choice of units, sub-word or whole-word, is generally based on the size of the vocabulary and the amount of training data. In this work, we have introduced new constraints on the units: 1) they should contain sufficient statistics of the features and 2) they should contain sufficient statistics of the vocabulary. This led to minimization of two cost functions, the first based on the confusion between the features and the units and the second based on the confusion between the units and the words. We minimized the first cost function by forming broad phone classes that were less confusing among themselves than the phones. The second cost function was minimized by coding the word-specific phone sequences. On the continuous digit recognition task, the broad classes performed worse than the phones. The word-specific phone sequences however significantly improved the performance over both the phones and the whole-word units. In this paper we discuss the new constraints, our specific implementation of the cost functions, and the corresponding recognition performance.

Cites background from "Lexicon-building methods for an aco..."

  • ...In general, the choice of units is based on the size of the vocabulary and the amount of training data [2]....


Dissertation
01 Jan 2008
TL;DR: This thesis presents a method for the automatic derivation of a sub-word unit inventory, whose main components are an ergodic hidden Markov model whose complexity is controlled using the Bayesian Information Criterion and an automatic generation of probabilistic dictionaries using joint multigrams.
Abstract: Current automatic speech recognition (ASR) research is focused on recognition of continuous, spontaneous speech. Spontaneous speech contains a lot of variability in the way words are pronounced, and canonical pronunciations of each word are not true to the variation that is seen in real data. Two of the components of an ASR system are acoustic models and pronunciation models. The variation within spontaneous speech must be accounted for by these components. Phones, or context-dependent phones are typically used as the base sub-word unit, and one acoustic model is trained for each sub-word unit. Pronunciation modelling largely takes place in a dictionary, which relates words to sequences of phones. Acoustic modelling and pronunciation modelling overlap, and the two are not clearly separable in modelling pronunciation variation. Techniques that find pronunciation variants in the data and then reflect these in the dictionary have not provided expected gains in recognition. An alternative approach to modelling pronunciations in terms of phones is to derive units automatically: using data-driven methods to determine an inventory of sub-word units, their acoustic models, and their relationship to words. This thesis presents a method for the automatic derivation of a sub-word unit inventory, whose main components are:

  • automatic and simultaneous generation of a sub-word unit inventory and acoustic model set, using an ergodic hidden Markov model whose complexity is controlled using the Bayesian Information Criterion

  • automatic generation of probabilistic dictionaries using joint multigrams

The prerequisites of this approach are fewer than in previous work on unit derivation; notably, the timings of word boundaries are not required here. The approach is language independent since it is entirely data-driven and no linguistic information is required. The dictionary generation method outperforms a supervised method using phonetic data. The automatically derived units and dictionary perform reasonably on a small spontaneous speech task, although not yet outperforming phones.

Cites background or methods from "Lexicon-building methods for an aco..."

  • ...In Paliwal (1990), the lexicon generation is extended to a probabilistic form....


  • ...Paliwal (1990) used k-means to cluster his acoustic segments, by first finding the centroid of each segment....


  • ...This method is used in the segmentation step of Paliwal’s work (see Paliwal 1990), and also in Fukada et al. (1996)....


References
Journal ArticleDOI
Lawrence R. Rabiner
01 Feb 1989
TL;DR: In this paper, the authors provide an overview of the basic theory of hidden Markov models (HMMs) as originated by L.E. Baum and T. Petrie (1966) and give practical details on methods of implementation of the theory along with a description of selected applications of HMMs to distinct problems in speech recognition.
Abstract: This tutorial provides an overview of the basic theory of hidden Markov models (HMMs) as originated by L.E. Baum and T. Petrie (1966) and gives practical details on methods of implementation of the theory along with a description of selected applications of the theory to distinct problems in speech recognition. Results from a number of original sources are combined to provide a single source of acquiring the background required to pursue further this area of research. The author first reviews the theory of discrete Markov chains and shows how the concept of hidden states, where the observation is a probabilistic function of the state, can be used effectively. The theory is illustrated with two simple examples, namely coin-tossing, and the classic balls-in-urns system. Three fundamental problems of HMMs are noted and several practical techniques for solving these problems are given. The various types of HMMs that have been studied, including ergodic as well as left-right models, are described.
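One of the three fundamental HMM problems the tutorial treats is decoding: finding the single most likely state sequence for an observation sequence. A minimal log-domain Viterbi sketch for a discrete-observation HMM (variable names are our own, not the tutorial's notation):

```python
import numpy as np

def viterbi(obs, pi, A, B):
    """Most likely state path for a discrete-observation HMM, in log domain.
    pi: (S,) initial probs; A: (S,S) transition probs; B: (S,V) emission probs."""
    S, T = len(pi), len(obs)
    logd = np.log(pi) + np.log(B[:, obs[0]])      # best log-prob ending in each state
    back = np.zeros((T, S), dtype=int)            # back[t, j]: best predecessor of j
    for t in range(1, T):
        scores = logd[:, None] + np.log(A)        # scores[i, j]: reach j via i
        back[t] = scores.argmax(axis=0)
        logd = scores.max(axis=0) + np.log(B[:, obs[t]])
    # Backtrack from the best final state.
    path = [int(logd.argmax())]
    for t in range(T - 1, 0, -1):
        path.append(int(back[t, path[-1]]))
    return path[::-1], float(logd.max())
```

Working in log probabilities avoids the numerical underflow that the tutorial's scaling techniques address for the forward-backward recursions.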

21,819 citations

Journal ArticleDOI
TL;DR: An efficient and intuitive algorithm is presented for the design of vector quantizers based either on a known probabilistic model or on a long training sequence of data.
Abstract: An efficient and intuitive algorithm is presented for the design of vector quantizers based either on a known probabilistic model or on a long training sequence of data. The basic properties of the algorithm are discussed and demonstrated by examples. Quite general distortion measures and long blocklengths are allowed, as exemplified by the design of parameter vector quantizers of ten-dimensional vectors arising in Linear Predictive Coded (LPC) speech compression with a complicated distortion measure arising in LPC analysis that does not depend only on the error vector.
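This is the design procedure commonly known as the LBG algorithm. As a hedged sketch restricted to squared-error distortion (the paper's general distortion measures and the LPC-specific measure are not reproduced), the following shows the split-then-Lloyd structure:

```python
import numpy as np

def lbg(data, codebook_size, eps=0.01, iters=20):
    """Codebook design by centroid splitting plus Lloyd iterations
    (squared-error distortion only)."""
    # Start from the global centroid as a one-word codebook.
    codebook = data.mean(axis=0, keepdims=True)
    while len(codebook) < codebook_size:
        # Split each codeword into a perturbed pair, doubling the codebook.
        codebook = np.vstack([codebook * (1 + eps), codebook * (1 - eps)])
        for _ in range(iters):
            # Nearest-neighbour assignment, then centroid update.
            d = np.linalg.norm(data[:, None] - codebook[None], axis=2)
            labels = d.argmin(axis=1)
            for j in range(len(codebook)):
                if np.any(labels == j):
                    codebook[j] = data[labels == j].mean(axis=0)
    return codebook
```

The doubling-by-splitting schedule is what lets the method grow a codebook from a training sequence without a probabilistic model, which is the "long training sequence" mode the abstract describes.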

7,935 citations

Journal ArticleDOI
TL;DR: This paper describes a number of statistical models for use in speech recognition, with special attention to determining the parameters for such models from sparse data, and describes two decoding methods appropriate for constrained artificial languages and one appropriate for more realistic decoding tasks.
Abstract: Speech recognition is formulated as a problem of maximum likelihood decoding. This formulation requires statistical models of the speech production process. In this paper, we describe a number of statistical models for use in speech recognition. We give special attention to determining the parameters for such models from sparse data. We also describe two decoding methods, one appropriate for constrained artificial languages and one appropriate for more realistic decoding tasks. To illustrate the usefulness of the methods described, we review a number of decoding results that have been obtained with them.

1,637 citations

Proceedings ArticleDOI
11 Apr 1988
TL;DR: An automatic technique for constructing Markov word models is described and results are included of experiments with speaker-dependent and speaker-independent models on several isolated-word recognition tasks.
Abstract: The Speech Recognition Group at IBM Research has developed a real-time, isolated-word speech recognizer called Tangora, which accepts natural English sentences drawn from a vocabulary of 20000 words. Despite its large vocabulary, the Tangora recognizer requires only about 20 minutes of speech from each new user for training purposes. The accuracy of the system and its ease of training are largely attributable to the use of hidden Markov models in its acoustic match component. An automatic technique for constructing Markov word models is described and results are included of experiments with speaker-dependent and speaker-independent models on several isolated-word recognition tasks.

245 citations

Journal ArticleDOI
TL;DR: A clustering algorithm based on a standard K-means approach which requires no user parameter specification is presented and experimental data show that this new algorithm performs as well or better than the previously used clustering techniques when tested as part of a speaker-independent isolated word recognition system.
Abstract: Studies of isolated word recognition systems have shown that a set of carefully chosen templates can be used to bring the performance of speaker-independent systems up to that of systems trained to the individual speaker. The earliest work in this area used a sophisticated set of pattern recognition algorithms in a human-interactive mode to create the set of templates (multiple patterns) for each word in the vocabulary. Not only was this procedure time consuming but it was impossible to reproduce exactly because it was highly dependent on decisions made by the experimenter. Subsequent work led to an automatic clustering procedure which, given only a set of clustering parameters, clustered patterns with the same performance as the previously developed supervised algorithms. The one drawback of the automatic procedure was that the specification of the input parameter set was found to be somewhat dependent on the vocabulary type and size of population to be clustered. Since a naive user of such a statistical clustering algorithm could not be expected, in general, to know how to choose the word clustering parameters, even this automatic clustering algorithm was not appropriate for a completely general word recognition system. It is the purpose of this paper to present a clustering algorithm based on a standard K-means approach which requires no user parameter specification. Experimental data show that this new algorithm performs as well or better than the previously used clustering techniques when tested as part of a speaker-independent isolated word recognition system.

218 citations