Dynamic switching between local and remote speech rendering

Patent

Dynamic switching between local and remote speech rendering

Chats0

TLDR

In this paper, a multimodal browser for rendering a multi-modal document on an end system defining a host can include a visual browser component for rendering visual content, if any, of the multimodi-al document, and a voice browser component, which can determine which of a plurality of speech processing configuration is used by the host in rendering the voice-based content.

Abstract:

A multimodal browser for rendering a multimodal document on an end system defining a host can include a visual browser component for rendering visual content, if any, of the multimodal document, and a voice browser component for rendering voice-based content, if any, of the multimodal document. The voice browser component can determine which of a plurality of speech processing configuration is used by the host in rendering the voice-based content. The determination can be based upon the resources of the host running the application. The determination also can be based upon a processing instruction contained in the application.

Citations

PDF

Open Access

More filters

Patent

Entropy-guided text prediction using combined word and character n-gram language models

Jerome R. Bellegarda

TL;DR: In this paper, a word n-gram language model and a character m-gram model are combined to predict words in a text entry environment, and a reduction in entropy can be determined from integrated candidate word probabilities before and after the entry of the most recent character.

...read moreread less

Patent

Application integration with a digital assistant

II Robert A. Walker, +9 more

TL;DR: In this paper, a system and processes for application integration with a digital assistant are provided, where the intent object and the parameter are derived from the natural language user input, and the method further includes identifying a software application associated with the intent objects of the set of intent objects.

...read moreread less

Patent

Speech-enabled content navigation and control of a distributed multimodal browser

Soonthorn Ativanichayaphong, +2 more

TL;DR: In this article, a distributed multimodal browser with a graphical user agent (GUA) and a voice user agent(VUA) operating on a voice server is described, with the GUA transmitting a link message to the VUA, the link message specifying voice commands that control the browser and an event corresponding to each voice command.

...read moreread less

Patent

Context-sensitive handling of interruptions

Anthony L. Larson, +2 more

TL;DR: In this paper, a speech output to be provided to a user of a device is received, and it is determined if the device is currently receiving speech input from a user and if the speech output is urgent or not.

...read moreread less

Patent

Identification of voice inputs providing credentials

Murat Akbacak, +2 more

TL;DR: In this paper, a system for identifying a voice input providing one or more user credentials is described, where a first character, a phrase identifying a second character, and a word can be identified based on the voice input.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Patent

Universal IP-based and scalable architectures across conversational applications using web services for speech and audio processing resources

Stephane H. Maes, +2 more

TL;DR: In this paper, the authors present systems and methods for building distributed conversational applications using a Web services-based model where speech engines (e.g., speech recognition) and audio I/O systems are programmable services that can be asynchronously programmed by an application using a standard, extensible SERCP (speech engine remote control protocol), to provide scalable and flexible IP-based architectures that enable deployment of the same application or application development environment across a wide range of voice processing platforms and networks/gateways.

...read moreread less

PatentDOI

Distributed voice user interface

George M. White, +4 more

- 22 Jan 2002 -

Journal of the Acoustical Society of Ame...

TL;DR: In this article, a distributed voice user interface system includes a local device which receives speech input issued from a user, such speech input may specify a command or a request by the user, and the local device performs preliminary processing of the speech input and determines whether it is able to respond to the command or request by itself.

...read moreread less

PatentDOI

Distributed voice recognition system

Paul E. Jacobs, +1 more

- 20 Dec 1994 -

Journal of the Acoustical Society of Ame...

TL;DR: In this article, a distributed voice recognition system includes a digital signal processor (DSP), a nonvolatile storage medium (108), and a microprocessor (106), which is configured to extract parameters from digitized input speech samples and provide the extracted parameters to the microprocessor.

...read moreread less

Patent

System and method for providing network coordinated conversational services

Stephane H. Maes, +1 more

TL;DR: In this paper, a system and method for providing automatic and coordinated sharing of conversational resources, e.g., functions and arguments, between network-connected servers and devices and their corresponding applications is presented.

...read moreread less

Patent

Systems and methods for implementing modular DOM (Document Object Model)-based multi-modal browsers

David Boloker, +7 more

TL;DR: In this article, the authors present a framework for building modular multi-modal browsers using a DOM (Document Object Model) and MVC (Model-View-Controller) framework that enables a user to interact in parallel with the same information via a multiplicity of channels, devices, and/or user interfaces.

...read moreread less

Collapse

Dynamic switching between local and remote speech rendering

Citations

Entropy-guided text prediction using combined word and character n-gram language models

Application integration with a digital assistant

Speech-enabled content navigation and control of a distributed multimodal browser

Context-sensitive handling of interruptions

Identification of voice inputs providing credentials

References

Universal IP-based and scalable architectures across conversational applications using web services for speech and audio processing resources

Distributed voice user interface

Distributed voice recognition system

System and method for providing network coordinated conversational services

Systems and methods for implementing modular DOM (Document Object Model)-based multi-modal browsers

Related Papers (5)

Method and system for voice-enabled autofill

A method and system for voice activating web pages

Enabling voice click in a multimodal page

Method of enhancing voice interactions using visual messages

Client / server application task allocation based upon client resources