scispace - formally typeset
Patent

Selective speaker adaptation for an in-vehicle speech recognition system

TLDR
In this paper, a method of improving the recognition accuracy of an in-vehicle speech recognition system is presented. But, the method of the present invention selectively adapts the system's speech engine to a speaker's voice characteristics using an N-best matching technique.
Abstract
Disclosed herein is a method of improving the recognition accuracy of an in-vehicle speech recognition system. The method of the present invention selectively adapts the system's speech engine to a speaker's voice characteristics using an N-best matching technique. In this method, the speech recognition system receives and processes a spoken utterance relating to a car command and having particular speaker-dependent speech characteristics so as to select a set of N-best voice commands matching the spoken utterance. Upon receiving a training mode input from the speaker, the system outputs the N-best command set to the speaker who selects the correct car command. The system then adapts the speech engine to recognize a spoken utterance having the received speech characteristics as the user-selected car command.

read more

Citations
More filters
Patent

Method and system for considering information about an expected response when performing speech recognition

TL;DR: In this paper, a speech recognition system receives and analyzes speech input from a user in order to recognize and accept a response from the user, under certain conditions, information about the response expected from user may be available.
Patent

Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment

TL;DR: In this paper, a method and apparatus that dynamically adjust operational parameters of a text-to-speech engine in a speech-based system are disclosed, in response to one or more environmental conditions.
Patent

Mobile terminal and menu control method thereof

TL;DR: In this paper, a mobile terminal including an input unit configured to receive an input to activate a voice recognition function on the mobile terminal, a memory configured to store information related to operations performed on mobile terminals, and a controller configured to activate the voice recognition functions upon receiving the input, is used to determine a meaning of an input voice instruction based on at least one prior operation performed on a mobile device and a language included in the instruction.
Patent

Methods and systems for identifying errors in a speech recognition system

TL;DR: In this article, a method for identifying possible errors made by a speech recognition system without using a transcript of words input to the system is described. But this method does not consider the use of a word-to-word model.
Patent

Method and system for mitigating delay in receiving audio stream during production of sound from audio stream

TL;DR: In this article, a communication component modifies production of an audio waveform at determined modification segments to mitigate the effects of a delay in processing and/or receiving a subsequent audio wave form.
References
More filters
PatentDOI

Method for interactive speech recognition and training

TL;DR: A method for creating word models for a large vocabulary, natural language dictation system that may be used for connected speech as well as for discrete utterances.
PatentDOI

Recognition unit model training based on competing word and word string models

TL;DR: The principle of minimum recognition error rate is applied by the present invention using discriminative training and various issues related to the special structure of HMMs are presented.
Patent

Method and apparatus for speech recognition adapted to an individual speaker

TL;DR: In this paper, a method and apparatus for automatic recognition of speech adapts to a particular speaker by using adaptation data to develop a transformation through which speaker independent models are transformed into speaker adapted models.
Patent

Unsupervised speech model adaptation using reliable information among N-best strings

TL;DR: In this paper, a portable input apparatus for an elevator destination call to be generated remotely from an associated elevator installation with respect to time and location is presented. But it is not shown to the user.
Related Papers (5)