Speech recognition using dynamic features of acoustic subword spectra

doi:10.1109/ICASSP.1991.150833

Proceedings ArticleDOI

Speech recognition using dynamic features of acoustic subword spectra

K.L. Brown, +1 more

- pp 293-296

Chats0

TLDR

A novel approach for speech signal analysis has been developed that incorporates both steady-state and dynamic spectral features into a unified model that has been successfully applied in automatic speech recognition contexts and does not require frame-based optimal search algorithms.

Abstract:

A novel approach for speech signal analysis has been developed that incorporates both steady-state and dynamic spectral features into a unified model. This model has been successfully applied in automatic speech recognition contexts and does not require frame-based optimal search algorithms. The model decomposes an utterance into a chain of acoustic subwords and simultaneously generates a mathematical description of instantaneous acoustic-phonetic features and dynamic transitions. The algorithm was tested using a speaker-dependent limited vocabulary recognition task and achieved higher recognition rates than both vector quantization and hidden Markov models. >

Citations

PDF

Open Access

More filters

Real-time recognition of spoken words

Louis C. W. Pols, +2 more

TL;DR: In this article, a real-time word recognition system using only a small computer (8K memory) and a few analog peripherals is described, where a spectral analysis is carried out by a bank of 17 1/3-octave bandpass filters.

...read moreread less

Proceedings ArticleDOI

Speech coding by the efficient transformation of the spectral envelope of subwords

V.R. Algazi, +5 more

TL;DR: A signal-dependent representation which captures, with a few KL vectors and transform coefficients, the perceptually and phonetically important structure of the spectral envelope has been applied to the analysis, synthesis and coding of speech with promising results in the 5-kb/s range.

...read moreread less

A Survey of Temporal Techniques Applied Toward Neural Network Based Continuous Speech Recognition

Chris D. Love

TL;DR: Neural network architectures for the recognition of continuous speech are reviewed and Hierarchic structures that recognize events of increasing temporal scale seem to provide the most promising path toward effective recognition ofContinuous speech.

...read moreread less

Proceedings ArticleDOI

Dynamic recognition of vowels by machine using trajectories in a two dimensional feature space

H.F.V. Boshoff

TL;DR: This article used a k-nearest neighbor rule with 2300 training vowels and as many test vowels, taken from continuous speech samples of the same group of 33 male speakers, achieved an average success rate of 72% in six way classification.

...read moreread less

References

PDF

Open Access

More filters

Book

Phoneme recognition using time-delay neural networks

Alex Waibel, +4 more

TL;DR: The authors present a time-delay neural network (TDNN) approach to phoneme recognition which is characterized by two important properties: using a three-layer arrangement of simple computing units, a hierarchy can be constructed that allows for the formation of arbitrary nonlinear decision surfaces, which the TDNN learns automatically using error backpropagation.

...read moreread less

Journal ArticleDOI

Phoneme recognition using time-delay neural networks

Alex Waibel, +4 more

- 01 Mar 1989 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: In this article, the authors presented a time-delay neural network (TDNN) approach to phoneme recognition, which is characterized by two important properties: (1) using a three-layer arrangement of simple computing units, a hierarchy can be constructed that allows for the formation of arbitrary nonlinear decision surfaces, which the TDNN learns automatically using error backpropagation; and (2) the time delay arrangement enables the network to discover acoustic-phonetic features and the temporal relationships between them independently of position in time and therefore not blurred by temporal shifts in the input

...read moreread less

Proceedings ArticleDOI

A database for speaker-independent digit recognition

R. Leonard

TL;DR: A large speech database has been collected for use in designing and evaluating algorithms for speaker independent recognition of connected digit sequences and formal human listening tests on this database provided certification of the labelling of the digit sequences.

...read moreread less

Journal ArticleDOI

Dynamic specification of coarticulated vowels

Winifred Strange, +2 more

- 01 Sep 1983 -

Journal of the Acoustical Society of Ame...

TL;DR: Experiments summarized herein support the view that the most important source of information for speaker-invariant vowel identity is carried in dynamic specification of vowel onset and offset spectral patterns, with vowel duration also playing a role.

...read moreread less

Journal ArticleDOI

Computers: Speech recognition: Turning theory to practice: New ICs have brought the requisite computer power to speech technology; an evaluation of equipment shows where it stands today

G. R. Doddington, +1 more

- 01 Sep 1981 -

IEEE Spectrum

TL;DR: An evaluation of the equipment now available for turning the theory of electronic speech recognition into practice and the fulfilment of this goal seems much closer than it did because of the pace of advance in IC technology.

...read moreread less

Related Papers (5)

Transform representation of the spectra of acoustic speech segments with applications. I. General approach and application to speech recognition

V.R. Algazi, +5 more

- 01 Apr 1993 -

IEEE Transactions on Speech and Audio Pr...

Etri Journal

Conversational speech recognition using acoustic and articulatory input

Katrin Kirchhoff, +2 more

Speech recognition using dynamic features of acoustic subword spectra

Citations

Real-time recognition of spoken words

Speech coding by the efficient transformation of the spectral envelope of subwords

A Survey of Temporal Techniques Applied Toward Neural Network Based Continuous Speech Recognition

Dynamic recognition of vowels by machine using trajectories in a two dimensional feature space

References

Phoneme recognition using time-delay neural networks

Phoneme recognition using time-delay neural networks

A database for speaker-independent digit recognition

Dynamic specification of coarticulated vowels

Computers: Speech recognition: Turning theory to practice: New ICs have brought the requisite computer power to speech technology; an evaluation of equipment shows where it stands today

Related Papers (5)

Transform representation of the spectra of acoustic speech segments with applications. I. General approach and application to speech recognition

Improving speech recognition performance by using multi-model approaches

Automatic speech recognition using acoustic sub-words and no time alignment

Intra‐ and Inter‐frame Features for Automatic Speech Recognition

Conversational speech recognition using acoustic and articulatory input