Lexicon-building methods for an acoustic sub-word based speech recognizer

doi:10.1109/ICASSP.1990.115888

Proceedings ArticleDOI

Lexicon-building methods for an acoustic sub-word based speech recognizer

Kuldip K. Paliwal

- Vol. 1990, pp 729-732

Chats0

TLDR

The use of an acoustic subword unit (ASWU)-based speech recognition system for the recognition of isolated words is discussed and it is shown that the use of a modified k-means algorithm on the likelihoods derived through the Viterbi algorithm provides the best deterministic-type of word lexicon.

Abstract:

The use of an acoustic subword unit (ASWU)-based speech recognition system for the recognition of isolated words is discussed. Some methods are proposed for generating the deterministic and the statistical types of word lexicon. It is shown that the use of a modified k-means algorithm on the likelihoods derived through the Viterbi algorithm provides the best deterministic-type of word lexicon. However, the ASWU-based speech recognizer leads to better performance with the statistical type of word lexicon than with the deterministic type. Improving the design of the word lexicon makes it possible to narrow the gap in the recognition performances of the whole word unit (WWU)-based and the ASWU-based speech recognizers considerably. Further improvements are expected by designing the word lexicon better. >

Citations

PDF

Open Access

More filters

Book ChapterDOI

Toward ALISP: A proposal for Automatic Language Independent Speech Processing

Gérard Chollet, +5 more

TL;DR: This article exposes and develops the concept of ALISP (Automatic Language Independent Speech Processing), namely a general methodology which consists in inferring the intermediate representation between the acoustic and the linguistic levels, from speech and linguistic data rather than from a priori knowledge, with as little supervision as possible.

...read moreread less

Book

Statistical Pronunciation Modeling for Non-Native Speech Processing

Rainer Gruhn, +2 more

TL;DR: This work presents a fully statistical approach to model non--native speakers' pronunciation, based on a discrete hidden Markov model as a word pronunciation model, initialized on a standard pronunciation dictionary.

...read moreread less

Learning features and segments from waveforms : a statistical model of early phonological acquisition

Ying Lin

TL;DR: Of the Dissertation Learning Features and Segments from Waveforms: A Statistical Model of Early Phonological Acquisition and its Applications.

...read moreread less

Dissertation

Unsupervised pattern discovery in speech: applications to word acquisition and speaker segmentation

James Glass, +1 more

TL;DR: It is shown how pattern discovery can be used to automatically acquire lexical entities directly from an untranscribed audio stream, and two methods for automatically identifying sound clusters generated through pattern discovery are proposed and evaluated.

...read moreread less

Proceedings ArticleDOI

Design of a speech recognition system based on acoustically derived segmental units

Michiel Bacchiani, +3 more

TL;DR: An iterative unit design procedure is formulated which consistently uses a maximum likelihood (ML) objective in successive application of resegmentation and model re-estimation.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

A tutorial on hidden Markov models and selected applications in speech recognition

Lawrence R. Rabiner

TL;DR: In this paper, the authors provide an overview of the basic theory of hidden Markov models (HMMs) as originated by L.E. Baum and T. Petrie (1966) and give practical details on methods of implementation of the theory along with a description of selected applications of HMMs to distinct problems in speech recognition.

...read moreread less

Journal ArticleDOI

An Algorithm for Vector Quantizer Design

Y. Linde, +2 more

- 01 Jan 1980 -

IEEE Transactions on Communications

TL;DR: An efficient and intuitive algorithm is presented for the design of vector quantizers based either on a known probabilistic model or on a long training sequence of data.

...read moreread less

Journal ArticleDOI

A Maximum Likelihood Approach to Continuous Speech Recognition

Lalit R. Bahl, +2 more

- 01 Feb 1983 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This paper describes a number of statistical models for use in speech recognition, with special attention to determining the parameters for such models from sparse data, and describes two decoding methods appropriate for constrained artificial languages and one appropriate for more realistic decoding tasks.

...read moreread less

Proceedings ArticleDOI

Acoustic Markov models used in the Tangora speech recognition system

Lalit R. Bahl, +3 more

TL;DR: An automatic technique for constructing Markov word models is described and results are included of experiments with speaker-dependent and speaker-independent models on several isolated-word recognition tasks.

...read moreread less

Journal ArticleDOI

A modified K-means clustering algorithm for use in isolated work recognition

Jay G. Wilpon, +1 more

- 01 Jun 1985 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: A clustering algorithm based on a standard K-means approach which requires no user parameter specification is presented and experimental data show that this new algorithm performs as well or better than the previously used clustering techniques when tested as part of a speaker-independent isolated word recognition system.

...read moreread less

Lexicon-building methods for an acoustic sub-word based speech recognizer

Citations

Toward ALISP: A proposal for Automatic Language Independent Speech Processing

Statistical Pronunciation Modeling for Non-Native Speech Processing

Learning features and segments from waveforms : a statistical model of early phonological acquisition

Unsupervised pattern discovery in speech: applications to word acquisition and speaker segmentation

Design of a speech recognition system based on acoustically derived segmental units

References

A tutorial on hidden Markov models and selected applications in speech recognition

An Algorithm for Vector Quantizer Design

A Maximum Likelihood Approach to Continuous Speech Recognition

Acoustic Markov models used in the Tangora speech recognition system

A modified K-means clustering algorithm for use in isolated work recognition

Related Papers (5)

Joint lexicon, acoustic unit inventory and model design

An improved sub-word based speech recognizer

Word recognition using whole word and subword models

Automatic generation of subword units for speech recognition systems

On the automatic segmentation of speech signals