Patent
Speech unit selection using HMM acoustic models
TLDR
In this article, a concatenating speech synthesizer concatenates selected speech units to obtain the desired synthesized speech by selecting replacement speech units based on measures representative of the difference between the HMM acoustic models of the desired speech unit and available speech units.Abstract:
A concatenating speech synthesizer concatenates selected speech units to obtain the desired synthesized speech. When desired speech units of phonetic and/or prosodic context are not available, the synthesizer selects replacement speech units based on measures representative of the difference between the HMM acoustic models of the desired speech unit and available speech units.read more
Citations
More filters
Patent
Intelligent Automated Assistant
Thomas R. Gruber,Adam Cheyer,Dag Kittlaus,Didier Rene Guzzoni,Christopher Dean Brigham,Richard Donald Giuli,Marcello Bastea-Forte,Harry J. Saddler +7 more
TL;DR: In this article, an intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions.
Patent
Using context information to facilitate processing of commands in a virtual assistant
TL;DR: In this article, a virtual assistant uses context information to supplement natural language or gestural input from a user, which helps to clarify the user's intent and reduce the number of candidate interpretations of user's input, and reduces the need for the user to provide excessive clarification input.
Patent
Method and apparatus for building an intelligent automated assistant
Adam Cheyer,Didier Rene Guzzoni +1 more
TL;DR: In this paper, a method for building an automated assistant includes interfacing a service-oriented architecture that includes a plurality of remote services to an active ontology, where the active ontologies includes at least one active processing element that models a domain.
Patent
Electronic Devices with Voice Command and Contextual Data Processing Capabilities
TL;DR: In this paper, an electronic device may capture a voice command from a user and store contextual information about the state of the electronic device when the voice command is received, such as a desktop computer or a remote server.
Patent
Automatically adapting user interfaces for hands-free interaction
TL;DR: In this article, the authors present a method for automatically determining whether a digital assistant application has been separately invoked by a user without regard to whether a user has separately invoked the application.
References
More filters
Patent
Front-end architecture for a multi-lingual text-to-speech system
TL;DR: In this article, a text processing system for processing multi-lingual text for a speech synthesizer includes a first language dependent module for performing at least one of text and prosody analysis on a portion of input text comprising first language.
PatentDOI
Methods and Apparatus for Rapid Acoustic Unit Selection From a Large Speech Corpus
TL;DR: In this article, a method for constructing an efficient concatenation cost database is provided by synthesizing a large body of speech, identifying the acoustic unit sequential pairs generated and their respective concatenations, and storing those concatenated costs likely to occur.
PatentDOI
Concatenation of speech segments by use of a speech synthesizer
Nick Campbell,Andrew Hunt +1 more
TL;DR: In this article, a speech unit selector searches for a combination of phoneme candidates which correspond to a phoneme sequence of an input sentence and which minimizes a cost including a target cost representing approximate costs between a target phoneme and the phoneme candidate and a concatenation cost corresponding approximate costs to be adjacently concatenated, and outputs index information on the searched out combination of candidates.
PatentDOI
Voice log-in using spoken name input
TL;DR: A voice log-in system is based on a person's spoken name input only, using speaker-dependent acoustic name recognition models in a performing speaker-independent name recognition.