Speech segment coding and pitch control methods for speech synthesis systems

doi:10.1121/1.420238

PatentDOI

Speech segment coding and pitch control methods for speech synthesis systems

Lee Chong Rak, +1 more

- 14 Jul 1994 -

Journal of the Acoustical Society of Ame...

- Vol. 102, Iss: 6, pp 3251

Chats0

TLDR

In this article, a method and system for synthesizing speech utilizing a periodic waveform decomposition and relocation coding scheme was proposed, where signals of voiced sound interval among original speech are decomposed into wavelets, each of which corresponds to a speech waveform for one period made by each glottal pulse.

Abstract:

The present invention relates to a method and system for synthesizing speech utilizing a periodic waveform decomposition and relocation coding scheme. According to the scheme, signals of voiced sound interval among original speech are decomposed into wavelets, each of which corresponds to a speech waveform for one period made by each glottal pulse. These wavelets are respectively coded and stored. The wavelets nearest to the positions where the wavelets are to be located are selected from stored wavelets and decoded. The decoded wavelets are superposed to each other such that original sound quality can be maintained and duration and pitch frequency of speech segment can be controlled arbitrarily.

Citations

PDF

Open Access

More filters

Patent

Intelligent Automated Assistant

Thomas R. Gruber, +7 more

TL;DR: In this article, an intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions.

...read moreread less

Patent

Automated Response to and Sensing of User Activity in Portable Devices

Brian Q. Huppi, +3 more

TL;DR: In this paper, various methods and devices described herein relate to devices which, in at least certain embodiments, may include one or more sensors for providing data relating to user activity and at least one processor for causing the device to respond based on the user activity which was determined, at least in part, through the sensors.

...read moreread less

Patent

Using context information to facilitate processing of commands in a virtual assistant

Thomas R. Gruber, +4 more

TL;DR: In this article, a virtual assistant uses context information to supplement natural language or gestural input from a user, which helps to clarify the user's intent and reduce the number of candidate interpretations of user's input, and reduces the need for the user to provide excessive clarification input.

...read moreread less

Patent

Method and apparatus for building an intelligent automated assistant

Adam Cheyer, +1 more

TL;DR: In this paper, a method for building an automated assistant includes interfacing a service-oriented architecture that includes a plurality of remote services to an active ontology, where the active ontologies includes at least one active processing element that models a domain.

...read moreread less

Patent

Contextual voice commands

Marcel van Os, +2 more

TL;DR: In this paper, techniques and systems for implementing contextual voice commands are described and a physical input that relates the selected data item to an operation in a second context is received, and the operation is performed on the input data item in the second context.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

A diphone synthesis system based on time-domain prosodic modifications of speech

C. Hamon, +2 more

TL;DR: A novel time-domain algorithm is presented for text-to-speech synthesis using diphone concatenation based on the pitch-synchronous overlap-add (PSOLA) approach and is capable of good quality prosodic modifications of natural speech.

...read moreread less

Patent

Automatic speaker verification by non-linear time alignment of acoustic parameters

George Rowland Doddington, +2 more

TL;DR: In this paper, a nonlinear process is used to align the sample and reference utterances through a piece-wise linear continuous transformation of the time scale, and the extent of time transformation that is required to achieve maximum similarity also influences the decision to accept or reject the identity claim.

...read moreread less

PatentDOI

Phonetic hidden markov model speech synthesizer

Massimo Giustiniani, +1 more

- 07 Jun 1991 -

Journal of the Acoustical Society of Ame...

TL;DR: In this article, a method and a system for synthesizing speech from unrestricted text, based on the principle of associating a written string of text with a sequence of speech features vectors that most probably model the corresponding speech utterance, is presented.

...read moreread less

PatentDOI

Method and apparatus for encoding speech

Israel Bernard Zibman

- 29 Aug 1988 -

Journal of the Acoustical Society of Ame...

TL;DR: In this paper, the Fourier transform is equalized by normalizing the spectrum coefficients to a curve which approximates the shape of the spectrum, and the spectrum is normalized by scaling different subbands differently to flatten the spectrum.

...read moreread less

Proceedings ArticleDOI

Improving naturalness in text-to-speech synthesis using natural glottal source

K. Matsui, +3 more

TL;DR: Various methods to improve text-to-speech in its naturalness and its ability to model individual speakers are discussed, and a multisource method which utilizes different types of glottal source by cross-fading techniques is proposed.

...read moreread less