Retraining and updating speech models for speech recognition

Patent

TLDR

This patent provides a technique for updating speech models for speech recognition. From a class of users, it identifies speech data for a predetermined set of utterances that differs from a set of stored speech models by at least a predetermined amount.

Abstract:

A technique is provided for updating speech models for speech recognition by identifying, from a class of users, speech data for a predetermined set of utterances that differs from a set of stored speech models by at least a predetermined amount. The identified speech data for similar utterances from the class of users is collected and used to correct the set of stored speech models. As a result, the corrected speech models are a closer match to the utterances than the stored speech models were. The set of speech models is subsequently updated with the corrected speech models to provide improved speech recognition of utterances from the class of users. For example, the corrected speech models may be processed and stored at a central database and returned, via a suitable communications channel (e.g. the Internet), to individual user sites to update the speech recognition apparatus at those sites.
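The steps in the abstract (identify, collect, correct, update) can be sketched as follows. This is only an illustrative sketch, not the patented method: it assumes each speech model is a single mean feature vector, that the mismatch is measured as Euclidean distance, and that correction is a weighted blend of the stored model with the mean of the collected data. The names `find_mismatches`, `correct_models`, `update_models`, `weight` and `min_samples` are hypothetical.

```python
from collections import defaultdict
from math import dist


def find_mismatches(models, samples, threshold):
    """Collect, per utterance, the samples that differ from the stored
    model by at least `threshold` (Euclidean distance, an assumption)."""
    collected = defaultdict(list)
    for utterance, features in samples:
        if utterance in models and dist(models[utterance], features) >= threshold:
            collected[utterance].append(features)
    return collected


def correct_models(models, collected, weight=0.5, min_samples=2):
    """Blend each stored model toward the mean of the collected samples.
    Utterances with fewer than `min_samples` mismatches are left alone."""
    corrected = {}
    for utterance, vectors in collected.items():
        if len(vectors) < min_samples:
            continue
        mean = [sum(column) / len(vectors) for column in zip(*vectors)]
        corrected[utterance] = [
            (1 - weight) * m + weight * x
            for m, x in zip(models[utterance], mean)
        ]
    return corrected


def update_models(models, corrected):
    """Return a new model set in which corrected models replace stored ones."""
    return {**models, **corrected}


# Hypothetical stored models and speech data from one class of users.
models = {"yes": [0.0, 0.0], "no": [4.0, 4.0]}
samples = [
    ("yes", [2.0, 0.0]),  # far from the stored "yes" model
    ("yes", [2.0, 2.0]),  # far from the stored "yes" model
    ("yes", [0.1, 0.0]),  # close enough, ignored
    ("no", [4.0, 4.5]),   # close enough, ignored
]

collected = find_mismatches(models, samples, threshold=1.0)
corrected = correct_models(models, collected)
updated = update_models(models, corrected)
print(updated)
```

In this toy data only the "yes" model is corrected, and the corrected vector lies closer to both mismatched samples than the stored one did. In the arrangement the abstract describes, `correct_models` would run at the central database and the result of `update_models` would be sent back to the individual user sites.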

Citations

Patent

Method and system for considering information about an expected response when performing speech recognition

Keith Braho, +2 more

TL;DR: A speech recognition system receives and analyzes speech input from a user in order to recognize and accept a response from the user. Under certain conditions, information about the response expected from the user may be available.

Patent

System and method for a cooperative conversational voice user interface

Larry Baldwin, +4 more

TL;DR: A cooperative conversational voice user interface is presented that builds upon short-term and long-term shared knowledge to generate one or more explicit and/or implicit hypotheses about the intent of a user utterance.

Patent

Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment

James Hendrickson, +4 more

TL;DR: A method and apparatus are disclosed that dynamically adjust operational parameters of a text-to-speech engine in a speech-based system in response to one or more environmental conditions.

Patent

System and method for processing multi-modal device interactions in a natural language voice services environment

Larry Baldwin, +1 more

TL;DR: A system and method for processing multi-modal device interactions in a natural language voice services environment is presented. Context relating to the non-voice interaction and the natural language utterance may be extracted and combined to determine an intent of the multi-modal device interaction, and a request may then be routed to one or more of the electronic devices based on the determined intent.

Patent

System and method for an integrated, multi-modal, multi-device natural language voice services environment

Robert A. Kennewick, +1 more

TL;DR: A system and method for an integrated, multi-modal, multi-device natural language voice services environment may be provided. In particular, the environment may include a plurality of voice-enabled devices, each having intent determination capabilities for processing multidomain natural language inputs in addition to knowledge of the intent determination capability of other devices in the environment.

References

Patent

Adaptation of a speech recognition system across multiple remote sessions with a speaker

Hy Murveit, +1 more

TL;DR: A technique is presented for adapting a speech recognition system across multiple remote communication sessions with a speaker. However, the technique requires the speaker to engage in a training session.

Patent

Concurrent multi-lingual use in data processing systems

Crabtree Robert Pierre

TL;DR: An improvement is disclosed to a method of providing a distributed, interactive data processing system with concurrent multi-lingual use by a plurality of users. The improvement provides message models of informational or error messages generated by application program components, with these message models stored in the message model data collection.

Patent

Speaker model adaptation via network of similar users

Dimitri Kanevsky, +3 more

TL;DR: A speech recognition system, method and program product are described for recognizing speech input from computer users connected together over a network of computers. Each computer in the speech recognition network includes at least one user-based acoustic model trained for a particular user.

Patent

System and method for resolving decoding ambiguity via dialog

Dimitri Kanevsky, +3 more

- 28 Oct 1999 -

Journal of the Acoustical Society of Ame...

TL;DR: Decoding ambiguities are identified and at least partially resolved intermediate to the language decoding procedures, reducing the subsequent number of final decoding alternatives. The user is questioned about identified decoding ambiguities as they are being decoded.

Patent

Speech recognition using thresholded speaker class model selection or model adaptation

Abraham Ittycheriah, +1 more

- 28 Jan 1997 -

Journal of the Acoustical Society of Ame...

TL;DR: A speaker class processing model, which is speaker-independent within the class, may be trained on one or more members of the class and selected for use in a speech recognition processor according to the speaker class recognized, further improving speech recognition to a level comparable to that of a speaker-dependent model.
