A modified K-means clustering algorithm for use in isolated work recognition

doi:10.1109/TASSP.1985.1164581

Open AccessJournal ArticleDOI

A modified K-means clustering algorithm for use in isolated work recognition

Jay G. Wilpon, +1 more

- 01 Jun 1985 -

IEEE Transactions on Acoustics, Speech, ...

- Vol. 33, Iss: 3, pp 587-594

Chats0

TLDR

A clustering algorithm based on a standard K-means approach which requires no user parameter specification is presented and experimental data show that this new algorithm performs as well or better than the previously used clustering techniques when tested as part of a speaker-independent isolated word recognition system.

Abstract:

Studies of isolated word recognition systems have shown that a set of carefully chosen templates can be used to bring the performance of speaker-independent systems up to that of systems trained to the individual speaker. The earliest work in this area used a sophisticated set of pattern recognition algorithms in a human-interactive mode to create the set of templates (multiple patterns) for each word in the vocabulary. Not only was this procedure time consuming but it was impossible to reproduce exactly because it was highly dependent on decisions made by the experimenter. Subsequent work led to an automatic clustering procedure which, given only a set of clustering parameters, clustered patterns with the same performance as the previously developed supervised algorithms. The one drawback of the automatic procedure was that the specification of the input parameter set was found to be somewhat dependent on the vocabulary type and size of population to be clustered. Since a naive user of such a statistical clustering algorithm could not be expected, in general, to know how to choose the word clustering parameters, even this automatic clustering algorithm was not appropriate for a completely general word recognition system. It is the purpose of this paper to present a clustering algorithm based on a standard K-means approach which requires no user parameter specification. Experimental data show that this new algorithm performs as well or better than the previously used clustering techniques when tested as part of a speaker-independent isolated word recognition system.

A modified K-means clustering algorithm for use in isolated work recognition

Citations

Clustering of time series data-a survey

Clustering: A neural network approach

Method for representing word models for use in speech recognition

A segmental k-means training procedure for connected word recognition

Design and construction of a binary-tree system for language modelling

References

Some methods for classification and analysis of multivariate observations

Least squares quantization in PCM

Least Squares Quantization in PCM

An Algorithm for Vector Quantizer Design

Minimum prediction residual principle applied to speech recognition

Related Papers (5)

Dynamic programming algorithm optimization for spoken word recognition

A tutorial on hidden Markov models and selected applications in speech recognition

An Algorithm for Vector Quantizer Design

Some methods for classification and analysis of multivariate observations

Minimum prediction residual principle applied to speech recognition