Speech recognition using variable frame rate coding

doi:10.1109/ICASSP.1983.1171933

Proceedings ArticleDOI

Speech recognition using variable frame rate coding

C. Chuang, +1 more

- Vol. 8, pp 1033-1036

Chats0

TLDR

Results found are: (1) limited time sequence compression does not impose any negative effect on DP or its alternatives and (2) variable threshold scheme performs better than the fixed threshold scheme.

Abstract:

This paper investigates the effect of LPC based time compression schemes on dynamic programming (DP) and its alternatives. Two compression schemes, one with fixed threshold and the other with variable threshold both incorporated with two control factors, the rate of frame overlap and the step of interframe interval, are investigated. The test speech is 40-word alpha-digit vocabulary pronounced by 10 males and 10 females. Results found are: (1) limited time sequence compression does not impose any negative effect on DP or its alternatives and (2) variable threshold scheme performs better than the fixed threshold scheme. More detailed discussion on the compression schemes and DP interaction are included.

Citations

PDF

Open Access

More filters

Patent

Speech recognition using preclassification and spectral normalization

Chiu-Kuang Chuang

TL;DR: In this article, a two-stage classification process is used in a speech recognition system, in which a slope vector template is generated from an extended LPC analysis using a universal bandwidth expansion technique.

...read moreread less

Journal ArticleDOI

An integrated-circuit-based speech recognition system

H. Murveit, +1 more

- 01 Dec 1986 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: A high-performance, flexible, and potentially inexpensive speech recognition system that can compare an input word with 1000-word templates and respond to a user within 1-4 s demonstrates that computational complexity need not be a major limiting factor in the design of speech recognition systems.

...read moreread less

Proceedings ArticleDOI

Toward a massively parallel system for word recognition

Maurice K. Wong, +1 more

TL;DR: A linguistic knowledge base is built into the network, allowing both data-driven processing and top-down prediction to cooperate or compete in working toward the correct lexical hypothesis.

...read moreread less

Journal ArticleDOI

Pattern compression in isolated word recognition

R. Pieraccini

- 01 Sep 1984 -

Signal Processing

TL;DR: In this work three different pattern compression techniques are compared on the basis of efficiency as well as recognition performance when applied to pattern matching by means of dynamic programming in a speaker dependent context.

...read moreread less

Proceedings ArticleDOI

Evaluation of time compression for connected word recognition

Jean-Luc Gauvain, +1 more

TL;DR: The results show that the variable length trace segmentation technique gives the best scores under all conditions, and that the uniform sampling approach can therefore be advantageously used in connected word recognition processes.

Michael Kuhn, +2 more

TL;DR: A fast nonlinear time alignment method is presented, which is based on a preprocessing of the normalized speech spectrogram by means of a segmentation of the trace in the spectral feature space, which offers savings in computing time by a factor of 10 or more as compared to conventional dynamic programming.

...read moreread less

Speech recognition using variable frame rate coding

Citations

Speech recognition using preclassification and spectral normalization

An integrated-circuit-based speech recognition system

Toward a massively parallel system for word recognition

Pattern compression in isolated word recognition

Evaluation of time compression for connected word recognition

References

Dynamic programming algorithm optimization for spoken word recognition

Minimum prediction residual principle applied to speech recognition

Fast sequential decoding algorithm using a stack

Maximum likelihood estimation for multivariate observations of Markov sources

Fast nonlinear time alignment for isolated word recognition

Related Papers (5)

New Technique to Reduce Bit Rate of LPC-10 Speech Coder

New speech enhancement techniques for low bit rate speech coding

Efficient algorithm for multi-pulse LPC analysis of speech

Regular excitation reduction for effective and efficient LP-coding of speech

Efficient coding of LPC parameters by temporal decomposition