Patent

Dynamic switching between local and remote speech rendering

TLDR
A multimodal browser for rendering a multimodal document on an end system defining a host can include a visual browser component for rendering visual content, if any, of the multimodal document, and a voice browser component, which can determine which of a plurality of speech processing configurations is used by the host in rendering the voice-based content.
Abstract
A multimodal browser for rendering a multimodal document on an end system defining a host can include a visual browser component for rendering visual content, if any, of the multimodal document, and a voice browser component for rendering voice-based content, if any, of the multimodal document. The voice browser component can determine which of a plurality of speech processing configurations is used by the host in rendering the voice-based content. The determination can be based upon the resources of the host running the application. The determination also can be based upon a processing instruction contained in the application.
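
The selection step described in the abstract can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the patent: the `SpeechConfig` and `HostResources` types, the `choose_speech_config` function, the 64 MB memory threshold, and the rule that a processing instruction takes precedence over the resource check are all assumptions made for illustration.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class SpeechConfig(Enum):
    """Hypothetical speech processing configurations."""
    LOCAL = "local"    # speech engines run on the host itself
    REMOTE = "remote"  # speech engines run on a remote server


@dataclass
class HostResources:
    """Hypothetical snapshot of the host's resources."""
    free_memory_mb: int
    has_local_engine: bool


# Assumed threshold for illustration; the abstract gives no figure.
MIN_LOCAL_MEMORY_MB = 64


def choose_speech_config(host: HostResources,
                         instruction: Optional[str] = None) -> SpeechConfig:
    """Pick a speech processing configuration for the voice browser.

    Assumption: a processing instruction in the application, if present,
    overrides the resource-based decision.
    """
    if instruction is not None:
        # Raises ValueError for values other than "local" or "remote".
        return SpeechConfig(instruction)
    if host.has_local_engine and host.free_memory_mb >= MIN_LOCAL_MEMORY_MB:
        return SpeechConfig.LOCAL
    return SpeechConfig.REMOTE
```

For example, under these assumptions a host with a local engine and 256 MB free would render speech locally, while the same host would switch to remote rendering if the application carried a `"remote"` processing instruction.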


Citations
Patent

Unified ranking with entropy-weighted information for phrase-based semantic auto-completion

TL;DR: In this paper, predictive information based on usage frequency, usage recency, and semantic information encapsulated in an ontology (e.g., a network of domains) implemented by the digital assistant is integrated in a balanced and sensible way within a unified framework.
Patent

Auto-activating smart responses based on activities from remote devices

TL;DR: In this article, an electronic device with one or more processors and memory includes a procedure for using a digital assistant to automatically respond to incoming communications, such as a speech input from a user, and the device determines whether the speech input includes instructions for performing a specified action in response to receipt of a subsequent incoming communication from specified senders.
Patent

Methods and apparatuses for automatic speech recognition

TL;DR: In this paper, the first representation of the input signal is a discrete parameter representation, and the second representation is a continuous parameter representation of residuals of the input signal, which are mapped into a vector space.
Patent

Intelligent automated assistant for media exploration

TL;DR: In this article, a speech input representing a request for one or more media items is received from a user, and the process determines whether the speech input corresponds to a user intent of obtaining personalized recommendations for media items.
Patent

Actionable reminder entries

TL;DR: In this paper, a task item is electronic data that represents a task to be performed, whether manually or automatically, and it includes one or more details about its corresponding task, such as a description of the task and a location.
References
Patent

Universal IP-based and scalable architectures across conversational applications using web services for speech and audio processing resources

TL;DR: In this paper, the authors present systems and methods for building distributed conversational applications using a Web services-based model where speech engines (e.g., speech recognition) and audio I/O systems are programmable services that can be asynchronously programmed by an application using a standard, extensible SERCP (speech engine remote control protocol), to provide scalable and flexible IP-based architectures that enable deployment of the same application or application development environment across a wide range of voice processing platforms and networks/gateways.
Patent

Distributed voice user interface

TL;DR: In this article, a distributed voice user interface system includes a local device which receives speech input issued from a user; such speech input may specify a command or a request by the user. The local device performs preliminary processing of the speech input and determines whether it is able to respond to the command or request by itself.
Patent

Distributed voice recognition system

TL;DR: In this article, a distributed voice recognition system includes a digital signal processor (DSP), a nonvolatile storage medium (108), and a microprocessor (106); the DSP is configured to extract parameters from digitized input speech samples and provide the extracted parameters to the microprocessor.
Patent

System and method for providing network coordinated conversational services

TL;DR: In this paper, a system and method for providing automatic and coordinated sharing of conversational resources, e.g., functions and arguments, between network-connected servers and devices and their corresponding applications is presented.
Patent

Systems and methods for implementing modular DOM (Document Object Model)-based multi-modal browsers

TL;DR: In this article, the authors present a framework for building modular multi-modal browsers using a DOM (Document Object Model) and MVC (Model-View-Controller) framework that enables a user to interact in parallel with the same information via a multiplicity of channels, devices, and/or user interfaces.