scispace - formally typeset
Patent

Retraining and updating speech models for speech recognition

TLDR
In this article, a technique is provided for updating speech models for speech recognition by identifying, from a class of users, speech data for a predetermined set of utterances that differ from a set of stored speech models by at least a predetermined amount.
Abstract
A technique is provided for updating speech models for speech recognition by identifying, from a class of users, speech data for a predetermined set of utterances that differ from a set of stored speech models by at least a predetermined amount. The identified speech data for similar utterances from the class of users is collected and used to correct the set of stored speech models. As a result, the corrected speech models are a closer match to the utterances than were the set of stored speech models. The set of speech models are subsequently updated with the corrected speech models to provide improved speech recognition of utterances from the class of users. For example, the corrected speech models may be processed and stored at a central database and returned, via a suitable communications channel (e.g. the Internet) to individual user sites to update the speech recognition apparatus at those sites.

read more

Citations
More filters
Patent

Method and system for considering information about an expected response when performing speech recognition

TL;DR: In this paper, a speech recognition system receives and analyzes speech input from a user in order to recognize and accept a response from the user, under certain conditions, information about the response expected from user may be available.
Patent

System and method for a cooperative conversational voice user interface

TL;DR: In this paper, a cooperative conversational voice user interface is presented, which builds upon short-term and long-term shared knowledge to generate one or more explicit and/or implicit hypotheses about an intent of a user utterance.
Patent

Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment

TL;DR: In this paper, a method and apparatus that dynamically adjust operational parameters of a text-to-speech engine in a speech-based system are disclosed, in response to one or more environmental conditions.
Patent

System and method for processing multi-modal device interactions in a natural language voice services environment

TL;DR: In this article, a system and method for processing multi-modal device interactions in a natural language voice services environment is presented, in which context relating to the non-voice interaction and the natural language utterance may be extracted and combined to determine an intent of the multidomal device interaction, and a request may then be routed to one or more of the electronic devices based on the determined intent.
Patent

System and method for an integrated, multi-modal, multi-device natural language voice services environment

TL;DR: In this article, a system and method for an integrated, multi-modal, mult-device natural language voice services environment may be provided, in particular, the environment may include a plurality of voice-enabled devices each having intent determination capabilities for processing multidomain natural language inputs in addition to knowledge of the intent determination capability of other devices in the environment.
References
More filters
Patent

Adaptation of a speech recognition system across multiple remote sessions with a speaker

TL;DR: In this article, a technique for adaptation of a speech recognizing system across multiple remote communication sessions with a speaker is presented. But, the technique requires the speaker to engage in a training session.
Patent

Concurrent multi-lingual use in data processing systems

TL;DR: In this paper, an improvement to a method of providing a distributed, interactive data processing system with concurrent multi-lingual use by a plurality of users is disclosed, which provides message models of informational or error messages generated by application program components, these message models being stored in the message model data collection.
Patent

Speaker model adaptation via network of similar users

TL;DR: In this article, a speech recognition system, method and program product for recognizing speech input from computer users connected together over a network of computers is described, where each computer in the speech recognition network includes at least one user based acoustic model trained for a particular user.
Patent

System and method for resolving decoding ambiguity via dialog

TL;DR: In this article, decoding ambiguities are identified and at least partially resolved intermediate to the language decoding procedures to reduce the subsequent number of final decoding alternatives, where the user is questioned about identified decoding ambiguity as they are being decoded.
PatentDOI

Speech recognition using thresholded speaker class model selection or model adaptation

TL;DR: In this paper, a speaker class processing model which is speaker independent within the class may be trained on one or more members of the class and selected for implementation in a speech recognition processor in accordance with the speaker class recognized to further improve speech recognition to level comparable to that of a speaker dependent model.
Related Papers (5)