scispace - formally typeset
Journal ArticleDOI

Speaker‐independent speech recognition using a neural prediction model

Ken-ichi Iso, +1 more
- 01 Jan 1991 - 
- Vol. 74, Iss: 8, pp 22-30
Reads0
Chats0
TLDR
This paper proposes a speech recognition system based on the pattern prediction using neural network and an iterative algorithm combining the dynamic programming and the error backpropagation is proposed, together with the proof for the convergence.
Abstract
This paper proposes a speech recognition system based on the pattern prediction using neural network. In the proposed system, an independent nonlinear predictor composed of a series of multilayer perceptrons (MLP) is prepared for each class which is the object of recognition. The temporal structure of the speech pattern, especially the temporal correlation structure between feature vector sequence, is represented by the nonlinear mapping between the input and the output, and is utilized as the important feature in the recognition. On the other hand, the variation of the temporal structure of the speech pattern, due to the difference of speakers and the fluctuation of the utterance, is normalized by the dynamic programming. As the training algorithm to determine the MLP parameters composing each predictor, an iterative algorithm combining the dynamic programming and the error backpropagation is proposed, together with the proof for the convergence. A speaker independent isolated digit recognition experiment is executed to examine the basic operation of the proposed system. The parameters are estimated in a satisfactory way even from a small number of training data, and it is indicated that a high recognition performance is realized.

read more

Citations
More filters
Journal ArticleDOI

Time series classification for the prediction of dialysis in critically ill patients using echo statenetworks

TL;DR: ESN has an added value in predicting the need for dialysis through the analysis of time series data and requires significantly less processing time, needs no domain knowledge, is easy to implement, and can be configured using rules of thumb.
Journal Article

Speech Recognition Using Recurrent Neural Prediction Model

TL;DR: In this paper, a recurrent neural prediction model (RNPM) was proposed to reduce the size of the network, with as high a recognition rate as the original model, and with a high efficiency of learning, for speaker-independent isolated words.
Journal ArticleDOI

Phoneme recognition using time-warping neural networks

TL;DR: The proposed Time-Warping Neural Network (TWNN) demonstrates a higher phoneme recognition accuracy than a baseline recognizer composed of time-delay neural networks with a linear time alignment mechanism.
References
More filters
Journal ArticleDOI

An introduction to computing with neural nets

TL;DR: This paper provides an introduction to the field of artificial neural nets by reviewing six important neural net models that can be used for pattern classification and exploring how some existing classification and clustering algorithms can be performed using simple neuron-like components.
Journal ArticleDOI

An introduction to hidden Markov models

TL;DR: The purpose of this tutorial paper is to give an introduction to the theory of Markov models, and to illustrate how they have been applied to problems in speech recognition.
Journal ArticleDOI

On the approximate realization of continuous mappings by neural networks

K. Funahashi
- 01 May 1989 - 
TL;DR: It is proved that any continuous mapping can be approximately realized by Rumelhart-Hinton-Williams' multilayer neural networks with at least one hidden layer whose output functions are sigmoid functions.
Book

Phoneme recognition using time-delay neural networks

TL;DR: The authors present a time-delay neural network (TDNN) approach to phoneme recognition which is characterized by two important properties: using a three-layer arrangement of simple computing units, a hierarchy can be constructed that allows for the formation of arbitrary nonlinear decision surfaces, which the TDNN learns automatically using error backpropagation.
Journal ArticleDOI

Phoneme recognition using time-delay neural networks

TL;DR: In this article, the authors presented a time-delay neural network (TDNN) approach to phoneme recognition, which is characterized by two important properties: (1) using a three-layer arrangement of simple computing units, a hierarchy can be constructed that allows for the formation of arbitrary nonlinear decision surfaces, which the TDNN learns automatically using error backpropagation; and (2) the time delay arrangement enables the network to discover acoustic-phonetic features and the temporal relationships between them independently of position in time and therefore not blurred by temporal shifts in the input