scispace - formally typeset
Patent

Knowledge-based strategies applied to n-best lists in automatic speech recognition systems

Reads0
Chats0
TLDR
In this paper, a highly accurate technique for recognizing spoken digit strings is described, in which a spoken digit string is received and analyzed by a speech recognizer, which generates a list of hypothesized digit strings arranged in ranked order based on a likelihood of matching the spoken string.
Abstract
A highly accurate technique for recognizing spoken digit strings is described. A spoken digit string is received (14) and analyzed by a speech recognizer (18), which generates a list of hypothesized digit strings arranged in ranked order (16) based on a likelihood of matching the spoken digit string (20). The individual hypothesized strings are then analyzed in order beginning with the hypothesized string having the greatest likelihood of matching the spoken string to determine whether they satisfy a given constraint. The first hypothesized string in the list satisfying the constraint is selected as the recognized string (22).

read more

Citations
More filters
Patent

Method and system for considering information about an expected response when performing speech recognition

TL;DR: In this paper, a speech recognition system receives and analyzes speech input from a user in order to recognize and accept a response from the user, under certain conditions, information about the response expected from user may be available.
Patent

Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment

TL;DR: In this paper, a method and apparatus that dynamically adjust operational parameters of a text-to-speech engine in a speech-based system are disclosed, in response to one or more environmental conditions.
Patent

Methods and systems for identifying errors in a speech recognition system

TL;DR: In this article, a method for identifying possible errors made by a speech recognition system without using a transcript of words input to the system is described. But this method does not consider the use of a word-to-word model.
Patent

Method and system for mitigating delay in receiving audio stream during production of sound from audio stream

TL;DR: In this article, a communication component modifies production of an audio waveform at determined modification segments to mitigate the effects of a delay in processing and/or receiving a subsequent audio wave form.
Patent

Word recognition using choice lists

TL;DR: In this article, a scrollable, visually-displayed word recognition choice list, where the recognition candidates on the choice list are each associated with a choice-selecting symbol the user can use to select a desired recognition candidate by pressing an associated button.
References
More filters
Patent

Automated directory assistance system using word recognition and phoneme processing method

TL;DR: A mechanized directory assistance system for use in a telecommunications network includes multiple speech recognition devices comprising a word recognition device, a phoneme recognition device and an alphabet recognition device as mentioned in this paper.
PatentDOI

Locating and correcting erroneously recognized portions of utterances by rescoring based on two n-best lists

TL;DR: In this article, a method of repairing machine-recognized speech is comprised of the steps of receiving from a recognition engine a first n-best list of hypotheses and scores for each hypothesis generated in response to a primary utterance to be recognized.
Journal ArticleDOI

Discriminative utterance verification for connected digits recognition

TL;DR: In this paper, a hidden Markov model-based (HMM-based) utterance verification system using the framework of statistical hypothesis testing is described. But the proposed verification technique was integrated into a state-of-the-art connected digit recognition system, and the string error rate for valid digit strings was found to decrease by 57% when setting the rejection rate to 5% and was able to correctly reject over 999% of nonvocabulary word strings.
PatentDOI

Speech recognition apparatus which predicts word classes from context and words from word classes

TL;DR: In this paper, a language generator for a speech recognition apparatus scores a word-series hypothesis by combining individual scores for each word in the hypothesis, and the hypothesis score for a single word comprises a combination of the estimated conditional probability of occurrence of a first class of words comprising the word being scored, given the occurrence of the context comprising the words in the word series hypothesis other than the word was being scored.
Related Papers (5)