scispace - formally typeset
Open Access

On-Line, Real-Time Spoken Words Recognition System with Learning Capability of Speaker Differences.

Toshiyuki Sakai, +1 more
- Vol. 10, Iss: 10, pp 41-59
Reads0
Chats0
TLDR
The LISTEN (LIMITED SPOKEN TEXT ENcODER) system which automatically recognizes spoken words in isolation for a limited vocabulary is developed, a subpart of the LITHAN (LlsTEN-THINK-ANsWER) speech understanding system.
Abstract
SUMMARY We have developed the LISTEN (LIMITED SPOKEN TEXT ENcODER) system which automatically recognizes spoken words in isolation for a limited vocabulary. This system is a subpart of the LITHAN (LlsTEN-THINK-ANsWER) speech understanding system 1 ,2). This makes a great feature of the recognition in real time on a mini-computer. Owing to this development, it became capable of trying the various experiments on many speech data. There are other two features in this system: One is to learn the speaker differences by preliminary uttered vowels. The other is that the system is composed of two stages, i.e., phoneme recognition and word recognition. In the latter stage, the effect of coarticulation is taken into account. The system performance obtained the recognition rate of 98.0% on experiments of spoken digits that were uttered by 40 male adults. And also the system obtained the rate of 98.4% on preliminary learning by some spoken digits. When no learning procedure, however, the rate decreased to 95.8%.

read more

Content maybe subject to copyright    Report

Citations
More filters

Continuous Speech Understanding System LITHAN.

TL;DR: LITHAN (LIsten-THink-ANswer) speech understanding system which automatically recognizes continuously uttered speech utilizing higher linguistic information such as syntactic, semantic, pragmatic information and applies this efficient, flexible system to restricted utterances.

A Pre-Matching Method for a Real Time Spoken Word Recognition System and a Learning Procedure of Speaker Differences.

TL;DR: The method which reduced candidate words in the vocabulary by means of pre-matching using both local and global features of a spoken word was adopted, to eliminate the most unlike group of candidates using the measurements of both features from the vocabulary list to reduce the recognition time.
References
More filters
Journal ArticleDOI

Evaluation of various parameter sets in spoken digits recognition

TL;DR: Two effective means to improve the errors of PAC's are found; one is variable use of the PAC dimensions controlled by computation accuracy, and the other is smoothing along the time axis.

Continuous Speech Understanding System LITHAN.

TL;DR: LITHAN (LIsten-THink-ANswer) speech understanding system which automatically recognizes continuously uttered speech utilizing higher linguistic information such as syntactic, semantic, pragmatic information and applies this efficient, flexible system to restricted utterances.