scispace - formally typeset
Patent

Dynamic switching between local and remote speech rendering

Reads0
Chats0
TLDR
In this paper, a multimodal browser for rendering a multi-modal document on an end system defining a host can include a visual browser component for rendering visual content, if any, of the multimodi-al document, and a voice browser component, which can determine which of a plurality of speech processing configuration is used by the host in rendering the voice-based content.
Abstract
A multimodal browser for rendering a multimodal document on an end system defining a host can include a visual browser component for rendering visual content, if any, of the multimodal document, and a voice browser component for rendering voice-based content, if any, of the multimodal document. The voice browser component can determine which of a plurality of speech processing configuration is used by the host in rendering the voice-based content. The determination can be based upon the resources of the host running the application. The determination also can be based upon a processing instruction contained in the application.

read more

Citations
More filters
Patent

Entropy-guided text prediction using combined word and character n-gram language models

TL;DR: In this paper, a word n-gram language model and a character m-gram model are combined to predict words in a text entry environment, and a reduction in entropy can be determined from integrated candidate word probabilities before and after the entry of the most recent character.
Patent

Application integration with a digital assistant

TL;DR: In this paper, a system and processes for application integration with a digital assistant are provided, where the intent object and the parameter are derived from the natural language user input, and the method further includes identifying a software application associated with the intent objects of the set of intent objects.
Patent

Speech-enabled content navigation and control of a distributed multimodal browser

TL;DR: In this article, a distributed multimodal browser with a graphical user agent (GUA) and a voice user agent(VUA) operating on a voice server is described, with the GUA transmitting a link message to the VUA, the link message specifying voice commands that control the browser and an event corresponding to each voice command.
Patent

Context-sensitive handling of interruptions

TL;DR: In this paper, a speech output to be provided to a user of a device is received, and it is determined if the device is currently receiving speech input from a user and if the speech output is urgent or not.
Patent

Identification of voice inputs providing credentials

TL;DR: In this paper, a system for identifying a voice input providing one or more user credentials is described, where a first character, a phrase identifying a second character, and a word can be identified based on the voice input.
References
More filters
Patent

Universal IP-based and scalable architectures across conversational applications using web services for speech and audio processing resources

TL;DR: In this paper, the authors present systems and methods for building distributed conversational applications using a Web services-based model where speech engines (e.g., speech recognition) and audio I/O systems are programmable services that can be asynchronously programmed by an application using a standard, extensible SERCP (speech engine remote control protocol), to provide scalable and flexible IP-based architectures that enable deployment of the same application or application development environment across a wide range of voice processing platforms and networks/gateways.
PatentDOI

Distributed voice user interface

TL;DR: In this article, a distributed voice user interface system includes a local device which receives speech input issued from a user, such speech input may specify a command or a request by the user, and the local device performs preliminary processing of the speech input and determines whether it is able to respond to the command or request by itself.
PatentDOI

Distributed voice recognition system

TL;DR: In this article, a distributed voice recognition system includes a digital signal processor (DSP), a nonvolatile storage medium (108), and a microprocessor (106), which is configured to extract parameters from digitized input speech samples and provide the extracted parameters to the microprocessor.
Patent

System and method for providing network coordinated conversational services

TL;DR: In this paper, a system and method for providing automatic and coordinated sharing of conversational resources, e.g., functions and arguments, between network-connected servers and devices and their corresponding applications is presented.
Patent

Systems and methods for implementing modular DOM (Document Object Model)-based multi-modal browsers

TL;DR: In this article, the authors present a framework for building modular multi-modal browsers using a DOM (Document Object Model) and MVC (Model-View-Controller) framework that enables a user to interact in parallel with the same information via a multiplicity of channels, devices, and/or user interfaces.