Speaker‐independent speech recognition using a neural prediction model

doi:10.1002/ECJC.4430740803

Journal Article

Ken-ichi Iso, +1 more

- 01 Jan 1991 -

Electronics and Communications in Japan ...

- Vol. 74, Iss: 8, pp 22-30

TLDR

This paper proposes a speech recognition system based on pattern prediction using neural networks, along with an iterative training algorithm that combines dynamic programming and error backpropagation, together with a proof of its convergence.

Abstract:

This paper proposes a speech recognition system based on pattern prediction using neural networks. In the proposed system, an independent nonlinear predictor composed of a series of multilayer perceptrons (MLPs) is prepared for each class to be recognized. The temporal structure of the speech pattern, in particular the temporal correlation within the feature vector sequence, is represented by the nonlinear mapping between the predictor's input and output and is used as an important feature for recognition. Variation in the temporal structure of the speech pattern, caused by differences between speakers and fluctuations between utterances, is normalized by dynamic programming. To determine the MLP parameters of each predictor, an iterative training algorithm combining dynamic programming and error backpropagation is proposed, together with a proof of its convergence. A speaker-independent isolated-digit recognition experiment was carried out to examine the basic operation of the proposed system. The results indicate that the parameters can be estimated satisfactorily even from a small amount of training data and that high recognition performance is achieved.
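
The recognition step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and the abstract does not give these details, so the following are assumptions: each predictor sees only the previous frame, the prediction error is the squared Euclidean distance, and the dynamic programming path either stays on the current predictor or advances to the next one. The names `prediction_errors`, `dp_align`, and `classify` are hypothetical, and any callable can stand in for a trained MLP. The iterative training loop (alternating alignment and backpropagation) is not shown.

```python
import numpy as np

def prediction_errors(frames, predictors):
    """Squared prediction error of each frame under each predictor.

    frames: array of shape (T, D), the feature vector sequence.
    predictors: list of callables mapping the previous frame to a
        predicted current frame (stand-ins for trained MLPs; assumption).
    Returns an array err of shape (T - 1, N).
    """
    frames = np.asarray(frames, dtype=float)
    T, N = len(frames), len(predictors)
    err = np.zeros((T - 1, N))
    for t in range(1, T):
        for n, predict in enumerate(predictors):
            err[t - 1, n] = np.sum((frames[t] - predict(frames[t - 1])) ** 2)
    return err

def dp_align(err):
    """Align frames to a left-to-right series of predictors.

    Minimizes the accumulated prediction error over paths that start at
    predictor 0, end at predictor N - 1, and at each frame either stay
    or advance by one (assumed transition rule). Needs at least N rows.
    Returns (total_error, path), with path[t] the predictor for row t.
    """
    T, N = err.shape
    cost = np.full((T, N), np.inf)
    back = np.zeros((T, N), dtype=int)
    cost[0, 0] = err[0, 0]
    for t in range(1, T):
        for n in range(N):
            stay = cost[t - 1, n]
            move = cost[t - 1, n - 1] if n > 0 else np.inf
            if move < stay:
                cost[t, n], back[t, n] = move + err[t, n], n - 1
            else:
                cost[t, n], back[t, n] = stay + err[t, n], n
    # Trace the best path backwards from the final predictor.
    path = [N - 1]
    for t in range(T - 1, 0, -1):
        path.append(int(back[t, path[-1]]))
    path.reverse()
    return float(cost[-1, -1]), path

def classify(frames, models):
    """Pick the class whose predictor series gives the lowest aligned error.

    models: dict mapping a class label to its list of predictors.
    """
    scores = {label: dp_align(prediction_errors(frames, preds))[0]
              for label, preds in models.items()}
    return min(scores, key=scores.get)
```

In this sketch, `classify` mirrors the idea of one predictor series per class: an utterance is assigned to the class whose predictors explain its frame-to-frame dynamics with the smallest accumulated error after time alignment.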

Citations

Time series classification for the prediction of dialysis in critically ill patients using echo state networks

Speech Recognition Using Recurrent Neural Prediction Model

Phoneme recognition using time-warping neural networks

References

An introduction to computing with neural nets

An introduction to hidden Markov models

On the approximate realization of continuous mappings by neural networks

Phoneme recognition using time-delay neural networks

Related Papers (5)

Velocity and acceleration features in speaker recognition

Learning statistically efficient features for speaker recognition

Speaker adaptation by a linear transformation with optimised parameters

PC-Based System for Robust Speaker Recognition

Speaker adaptation for recognition systems with a large vocabulary