Dynamic switching between local and remote speech rendering

Patent

TLDR

A multimodal browser for rendering a multimodal document on an end system defining a host can include a visual browser component for rendering visual content, if any, of the multimodal document, and a voice browser component for rendering voice-based content, if any. The voice browser component can determine which of a plurality of speech processing configurations the host uses to render the voice-based content.

Abstract:

A multimodal browser for rendering a multimodal document on an end system defining a host can include a visual browser component for rendering visual content, if any, of the multimodal document, and a voice browser component for rendering voice-based content, if any, of the multimodal document. The voice browser component can determine which of a plurality of speech processing configurations is used by the host in rendering the voice-based content. The determination can be based upon the resources of the host running the application. The determination also can be based upon a processing instruction contained in the application.
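The selection step described in the abstract can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the patent: the `SpeechConfig` values, the `HostResources` fields, the `min_memory_mb` threshold, and the rule that a processing instruction overrides the resource check are all assumptions made for illustration.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class SpeechConfig(Enum):
    """Hypothetical speech processing configurations."""
    LOCAL = "local"    # voice content rendered on the host itself
    REMOTE = "remote"  # voice content rendered by a remote server


@dataclass
class HostResources:
    """Hypothetical snapshot of what the host can offer."""
    free_memory_mb: int
    has_local_speech_engine: bool


def choose_speech_config(
    resources: HostResources,
    instruction: Optional[str] = None,
    min_memory_mb: int = 64,  # assumed threshold, not from the patent
) -> SpeechConfig:
    # Assumption: an explicit processing instruction takes priority
    # over the resource-based decision.
    if instruction is not None:
        return SpeechConfig(instruction)
    # Otherwise fall back to the host's resources.
    if resources.has_local_speech_engine and resources.free_memory_mb >= min_memory_mb:
        return SpeechConfig.LOCAL
    return SpeechConfig.REMOTE
```

Under these assumptions, a host with a local engine and enough free memory renders speech locally, a constrained host falls back to remote rendering, and an instruction such as `"remote"` forces that configuration regardless of resources.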

Citations

Patent

Intelligent Automated Assistant

Thomas R. Gruber, +7 more

TL;DR: In this article, an intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions.

Patent

Using context information to facilitate processing of commands in a virtual assistant

Thomas R. Gruber, +4 more

TL;DR: In this article, a virtual assistant uses context information to supplement natural language or gestural input from a user, which helps to clarify the user's intent, reduces the number of candidate interpretations of the user's input, and reduces the need for the user to provide excessive clarification input.

Patent

Method and apparatus for building an intelligent automated assistant

Adam Cheyer, +1 more

TL;DR: In this paper, a method for building an automated assistant includes interfacing a service-oriented architecture that includes a plurality of remote services to an active ontology, where the active ontology includes at least one active processing element that models a domain.

Patent

Automatically adapting user interfaces for hands-free interaction

Thomas R. Gruber, +1 more

TL;DR: In this article, the authors present a method for automatically determining whether a digital assistant application has been separately invoked by a user without regard to whether a user has separately invoked the application.

Patent

Voice trigger for a digital assistant

Justin G. Binder, +3 more

TL;DR: In this paper, a method for operating a voice trigger is presented, which includes determining whether at least a portion of a sound input corresponds to a predetermined type of sound, such as a human voice.

References

Patent

Unified client-server distributed architectures for spoken dialogue systems

Sangita Sharma, +1 more

TL;DR: In this paper, a configuration selection switch located within the client device selects a configuration based upon user functionality and network conditions to implement speech recognition functions for the spoken dialogue system, where each of these configurations is selected to provide the user with the most efficient speech recognition for the function being utilized by the user based upon network conditions.

Patent

System and method for providing remote automatic speech recognition and text to speech services via a packet network

Pamela Leigh Dragosh, +2 more

TL;DR: In this article, a system and method of operating an automatic speech recognition application over an Internet Protocol network is described, where a grammar for recognizing received speech from a user over the IP network is selected from a plurality of grammars according to a user-selected application.

PatentDOI

User model-improvement-data-driven selection and update of user-oriented recognition model of a given type for word recognition at network server

Stefan Besling, +1 more

- 16 Oct 1998 -

Journal of the Acoustical Society of Ame...

TL;DR: A distributed pattern recognition system includes at least one user station and a server station that retrieves a recognition model selected for the user and provides the retrieved recognition model to a recognition unit for recognising the input pattern using the recognition models.

Patent

Multi-modal messaging

Jan Kleindienst, +3 more

TL;DR: In this article, a method for composing messages combines the advantages of a multi-modal interface (e.g., grammar-based speech and touchscreen or similar input devices) and message templates, which allows a user to construct a message with significantly less effort in a fraction of the time required by conventional methods.

PatentDOI

Client-server speech processing system, apparatus, method, and storage medium

Teruhiko Ueyama, +4 more

- 04 Oct 2004 -

Journal of the Acoustical Society of Ame...

TL;DR: In this article, the system implements high-accuracy speech recognition while suppressing the amount of data transferred between the client and the server. The server receives compression-encoded speech parameters from the client, a speech processing unit performs speech recognition on those parameters, and the server sends information corresponding to the recognition result to the client.

Related Papers (5)

Method and system for voice-enabled autofill

A method and system for voice activating web pages

Enabling voice click in a multimodal page

Method of enhancing voice interactions using visual messages

Client / server application task allocation based upon client resources