scispace - formally typeset
Patent

Dynamic switching between local and remote speech rendering

Reads0
Chats0
TLDR
In this paper, a multimodal browser for rendering a multi-modal document on an end system defining a host can include a visual browser component for rendering visual content, if any, of the multimodi-al document, and a voice browser component, which can determine which of a plurality of speech processing configuration is used by the host in rendering the voice-based content.
Abstract
A multimodal browser for rendering a multimodal document on an end system defining a host can include a visual browser component for rendering visual content, if any, of the multimodal document, and a voice browser component for rendering voice-based content, if any, of the multimodal document. The voice browser component can determine which of a plurality of speech processing configuration is used by the host in rendering the voice-based content. The determination can be based upon the resources of the host running the application. The determination also can be based upon a processing instruction contained in the application.

read more

Citations
More filters
Patent

Intelligent Automated Assistant

TL;DR: In this article, an intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions.
Patent

Using context information to facilitate processing of commands in a virtual assistant

TL;DR: In this article, a virtual assistant uses context information to supplement natural language or gestural input from a user, which helps to clarify the user's intent and reduce the number of candidate interpretations of user's input, and reduces the need for the user to provide excessive clarification input.
Patent

Method and apparatus for building an intelligent automated assistant

TL;DR: In this paper, a method for building an automated assistant includes interfacing a service-oriented architecture that includes a plurality of remote services to an active ontology, where the active ontologies includes at least one active processing element that models a domain.
Patent

Automatically adapting user interfaces for hands-free interaction

TL;DR: In this article, the authors present a method for automatically determining whether a digital assistant application has been separately invoked by a user without regard to whether a user has separately invoked the application.
Patent

Voice trigger for a digital assistant

TL;DR: In this paper, a method for operating a voice trigger is presented, which includes determining whether at least a portion of the sound input corresponds to a predetermined type of sound, such as a human voice.
References
More filters
Patent

Unified client-server distributed architectures for spoken dialogue systems

TL;DR: In this paper, a configuration selection switch located within the client device selects a configuration based upon user functionality and network conditions to implement speech recognition functions for the spoken dialogue system, where each of these configurations is selected to provide the user with the most efficient speech recognition for the function being utilized by the user based upon network conditions.
Patent

System and method for providing remote automatic speech recognition and text to speech services via a packet network

TL;DR: In this article, a system and method of operating an automatic speech recognition application over an Internet Protocol network is described, where a grammar for recognizing received speech from a user over the IP network is selected from a plurality of grammars according to a user-selected application.
PatentDOI

User model-improvement-data-driven selection and update of user-oriented recognition model of a given type for word recognition at network server

TL;DR: A distributed pattern recognition system includes at least one user station and a server station that retrieves a recognition model selected for the user and provides the retrieved recognition model to a recognition unit for recognising the input pattern using the recognition models.
Patent

Multi-modal messaging

TL;DR: In this article, a method for composing messages combines the advantages of a multi-modal interface (e.g., grammar-based speech and touchscreen or similar input devices) and message templates, which allows a user to construct a message with significantly less effort in a fraction of the time required by conventional methods.
PatentDOI

Client-server speech processing system, apparatus, method, and storage medium

TL;DR: In this article, the system implements high-accuracy speech recognition while suppressing the amount of data transfer between the client and the server, where the client receives the compression-encoded speech parameters, a speech processing unit makes speech recognition of the compressed speech parameters and sends information corresponding to the speech recognition result to the client.