Proceedings Article

Application of isolated word recognition to a voice controlled repertory dialer system

TLDR
The speaker-trained, voice-controlled repertory dialer system was tested extensively by 6 talkers; there were no recognition errors, and a request to repeat a spoken word occurred about 2% of the time.
Abstract
In this paper we describe a speaker-trained, voice-controlled repertory dialer system. The main elements of the system include: 1. A real-time speech analyzer that detects the presence of speech on the input line and analyzes the speech to give features appropriate for a word recognizer. 2. An isolated word recognizer that decides which of a set of words was spoken. 3. A voice response system that provides spoken commands to guide the user through the repertory dialer system. 4. A dialer (simulated) to outpulse the desired telephone number. The repertory dialer system is implemented on a minicomputer with a high-speed array processor performing the real-time operations. The vocabulary for the system consists of 7 command words, 10 digits, and any number of names up to some specified maximum. Recognition is performed on one or more subsets of the vocabulary, depending on the state of the system. To train the system, the user is requested to speak each of the vocabulary words twice to provide reference templates for the system. Following training, the system can dial the telephone number corresponding to any name in the repertory, or it can dial a 4-digit telephone extension spoken as an isolated string of digits. The system was tested extensively by 6 talkers (3 male, 3 female; 3 of whom were naive and 3 experienced users) over a three-week period. A total of 4620 words were spoken, and during the course of the test there were no recognition errors. A request for a repeat of a spoken word occurred about 2% of the time. These tests demonstrate the reliability and robustness of this voice repertory dialer system.
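
The recognition step described in the abstract (two reference templates per word, matching restricted to the vocabulary subset for the current state, and a request to repeat when no match is good enough) can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not give the distance measure, the state names, or the rejection rule, so the Euclidean dynamic time warping distance, the `ACTIVE_WORDS` map, and the `reject_threshold` value below are all hypothetical.

```python
import math

# Hypothetical state-to-vocabulary map. The abstract only says that recognition
# runs on one or more vocabulary subsets depending on the system state.
ACTIVE_WORDS = {
    "await_command": ["dial", "cancel"],
    "await_digit": ["one", "two"],
}


def dtw_distance(a, b):
    """Length-normalised dynamic time warping distance between two
    sequences of feature vectors (assumed measure, not the paper's)."""
    n, m = len(a), len(b)
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(a[i - 1], b[j - 1])  # frame-to-frame distance
            cost[i][j] = local + min(
                cost[i - 1][j],      # stretch the reference
                cost[i][j - 1],      # stretch the input
                cost[i - 1][j - 1],  # advance both
            )
    return cost[n][m] / (n + m)


def recognize(features, templates, state, reject_threshold=0.5):
    """Return the best-matching word among those active in `state`,
    or None to signal that the user should repeat the word."""
    best_word, best_distance = None, math.inf
    for word in ACTIVE_WORDS[state]:
        for reference in templates[word]:  # e.g. two training tokens per word
            distance = dtw_distance(features, reference)
            if distance < best_distance:
                best_word, best_distance = word, distance
    if best_distance > reject_threshold:
        return None  # no template is close enough: ask for a repeat
    return best_word
```

Returning `None` instead of the nearest word trades a small repeat rate for fewer outright errors, which is one plausible reading of the reported result (no recognition errors, repeats about 2% of the time).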

Citations
Patent

Hands-free control system for a radiotelephone

TL;DR: An improved hands-free, user-interactive control and dialing system is disclosed for use with a speech communications device. The system includes a dynamic noise suppressor (410), a speech recognizer (420) for implementing voice control, a device controller (430) responsive to the speech recognition, and a speech synthesizer (440) that gives the user reply information on the operating status of the communications device.
Patent

Voice-controlled telephone using visual display

TL;DR: A "dialess" telephone communicates with a user via a visual display to provide readily understandable cues that permit voice-controlled dialing. A number to be dialed may be spoken digit-by-digit and dialed automatically, or a name may be spoken and the telephone will automatically dial the number stored in the user's repertory corresponding to the spoken name.
Patent

Method and device for generating user defined spoken speed dial directories

TL;DR: A voice recognition telephone system (10) allows a user to generate a plurality of directories (76); each directory has a corresponding entry list containing the entry names and corresponding phone numbers.
Patent

Method and system for providing adaptive interactive command response

TL;DR: A method and system for providing adaptive interactive command response to a user are described, in which the user may protest when the system incorrectly recognizes a command given by the user in response to a question from the system.
Patent

Communications system with direct access mailbox

TL;DR: A communication system is described in which a user can elect to leave a message directly in the mailbox (11a) of another user (B) without calling or disturbing the other user (B).
References
Journal Article

Minimum prediction residual principle applied to speech recognition

TL;DR: A computer system is described in which isolated words, spoken by a designated talker, are recognized through calculation of a minimum prediction residual through optimally registering the reference LPC onto the input autocorrelation coefficients using the dynamic programming algorithm.
Journal Article

Speech analysis and synthesis by linear prediction of the speech wave.

TL;DR: Application of this method to efficient transmission and storage of speech signals, as well as procedures for determining other speech characteristics, such as formant frequencies and bandwidths, the spectral envelope, and the autocorrelation function, are discussed.
Journal Article

Adaptive quantization in differential PCM coding of speech

TL;DR: It is believed that at bit rates of 24 to 32 kb/s, ADPCM provides a robust and efficient technique for speech communication and for digital storage of speech.
Journal Article

Practical applications of voice input to machines

T.B. Martin
TL;DR: Future developments, both in new applications and in voice input systems of increased capability, can be expected to considerably expand the usage of this form of man-machine communication.
Journal Article

Digital techniques for computer voice response: Implementations and applications

TL;DR: The general method of concatenating isolated words and phrases to form a message is discussed, along with the various possibilities for digital representations of speech signals.