Speech unit selection using HMM acoustic models

Patent

TLDR

A concatenating speech synthesizer obtains the desired synthesized speech by concatenating selected speech units. When a desired speech unit is not available, it selects a replacement based on measures representative of the difference between the HMM acoustic models of the desired speech unit and the available speech units.

Abstract:

A concatenating speech synthesizer concatenates selected speech units to obtain the desired synthesized speech. When speech units with the desired phonetic and/or prosodic context are not available, the synthesizer selects replacement speech units based on measures representative of the difference between the HMM acoustic models of the desired speech unit and the available speech units.
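
The selection step described in the abstract can be illustrated with a minimal sketch. The abstract does not say which difference measure is used, so everything below is an assumption made for illustration: each HMM is reduced to a list of single-Gaussian states with diagonal covariance, the models are compared state by state with a symmetric Kullback-Leibler divergence, and the names `GaussianState`, `model_distance`, and `select_replacement` are hypothetical.

```python
import math
from dataclasses import dataclass
from typing import Dict, List, Sequence


@dataclass(frozen=True)
class GaussianState:
    """One HMM state, simplified to a diagonal-covariance Gaussian (assumption)."""
    mean: Sequence[float]
    var: Sequence[float]


def kl_diag(p: GaussianState, q: GaussianState) -> float:
    """KL(p || q) between two diagonal Gaussians, summed over dimensions."""
    total = 0.0
    for mp, vp, mq, vq in zip(p.mean, p.var, q.mean, q.var):
        total += 0.5 * (math.log(vq / vp) + (vp + (mp - mq) ** 2) / vq - 1.0)
    return total


def model_distance(a: List[GaussianState], b: List[GaussianState]) -> float:
    """Assumed difference measure: symmetric KL summed over aligned states."""
    if len(a) != len(b):
        raise ValueError("models must have the same number of states")
    return sum(kl_diag(sa, sb) + kl_diag(sb, sa) for sa, sb in zip(a, b))


def select_replacement(
    desired: List[GaussianState],
    available: Dict[str, List[GaussianState]],
) -> str:
    """Return the name of the available unit whose model is closest to the desired one."""
    return min(available, key=lambda name: model_distance(desired, available[name]))


# Toy one-state, one-dimensional models; the unit names are made up.
desired = [GaussianState(mean=(0.0,), var=(1.0,))]
available = {
    "near": [GaussianState(mean=(0.1,), var=(1.0,))],
    "far": [GaussianState(mean=(3.0,), var=(1.0,))],
}
best = select_replacement(desired, available)
```

In this toy inventory the unit named "near" is chosen because its model has the smallest distance to the desired model. A real system would use full HMMs with transition probabilities and possibly mixture densities, and may use a different measure altogether.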

Citations

Patent

Intelligent Automated Assistant

Thomas R. Gruber, +7 more

TL;DR: An intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions.

Patent

Using context information to facilitate processing of commands in a virtual assistant

Thomas R. Gruber, +4 more

TL;DR: A virtual assistant uses context information to supplement natural language or gestural input from a user. This helps clarify the user's intent, reduces the number of candidate interpretations of the user's input, and reduces the need for the user to provide excessive clarification input.

Patent

Method and apparatus for building an intelligent automated assistant

Adam Cheyer, +1 more

TL;DR: A method for building an automated assistant includes interfacing a service-oriented architecture that includes a plurality of remote services to an active ontology, where the active ontology includes at least one active processing element that models a domain.

Patent

Electronic Devices with Voice Command and Contextual Data Processing Capabilities

Aram Lindahl

TL;DR: An electronic device may capture a voice command from a user, store contextual information about the state of the electronic device when the voice command is received, and transmit both to computing equipment such as a desktop computer or a remote server.

Patent

Automatically adapting user interfaces for hands-free interaction

Thomas R. Gruber, +1 more

TL;DR: A method for automatically adapting a user interface for hands-free interaction operates without regard to whether a digital assistant application has been separately invoked by a user.

References

Book

Speech synthesis

John Nicholas Holmes

Patent

Front-end architecture for a multi-lingual text-to-speech system

Min Chu, +2 more

TL;DR: A text processing system for processing multi-lingual text for a speech synthesizer includes a first language-dependent module that performs at least one of text analysis and prosody analysis on a portion of input text comprising a first language.

PatentDOI

Methods and Apparatus for Rapid Acoustic Unit Selection From a Large Speech Corpus

Mark Charles Beutnagel, +2 more

- 06 Feb 2003 -

Journal of the Acoustical Society of Ame...

TL;DR: A method constructs an efficient concatenation cost database by synthesizing a large body of speech, identifying the acoustic unit sequential pairs generated and their respective concatenation costs, and storing those concatenation costs likely to occur.

PatentDOI

Concatenation of speech segments by use of a speech synthesizer

Nick Campbell, +1 more

- 16 Feb 1999 -

Journal of the Acoustical Society of Ame...

TL;DR: A speech unit selector searches for a combination of phoneme candidates that corresponds to the phoneme sequence of an input sentence and minimizes a cost comprising a target cost, representing the approximate cost between a target phoneme and a phoneme candidate, and a concatenation cost, representing the approximate cost between phoneme candidates to be adjacently concatenated. It outputs index information on the selected combination of candidates.

PatentDOI

Voice log-in using spoken name input

Joseph Picone, +1 more

- 01 Jul 1991 -

Journal of the Acoustical Society of Ame...

TL;DR: A voice log-in system is based on a person's spoken name input only, using speaker-dependent acoustic name recognition models in performing speaker-independent name recognition.

Related Papers (5)

Feature-domain concatenative speech synthesis

Speech synthesis using concatenation of speech waveforms

Synthesis by generation and concatenation of multi-form segments

Modification of sub-phoneme speech spectral models for lombard speech recognition

Apparatus and method for speech recognition in the presence of unnatural speech effects