scispace - formally typeset
PatentDOI

Speech segment coding and pitch control methods for speech synthesis systems

Lee Chong Rak, +1 more
- 14 Jul 1994 - 
- Vol. 102, Iss: 6, pp 3251
Reads0
Chats0
TLDR
In this article, a method and system for synthesizing speech utilizing a periodic waveform decomposition and relocation coding scheme was proposed, where signals of voiced sound interval among original speech are decomposed into wavelets, each of which corresponds to a speech waveform for one period made by each glottal pulse.
Abstract
The present invention relates to a method and system for synthesizing speech utilizing a periodic waveform decomposition and relocation coding scheme. According to the scheme, signals of voiced sound interval among original speech are decomposed into wavelets, each of which corresponds to a speech waveform for one period made by each glottal pulse. These wavelets are respectively coded and stored. The wavelets nearest to the positions where the wavelets are to be located are selected from stored wavelets and decoded. The decoded wavelets are superposed to each other such that original sound quality can be maintained and duration and pitch frequency of speech segment can be controlled arbitrarily.

read more

Citations
More filters
Patent

Intelligent Automated Assistant

TL;DR: In this article, an intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions.
Patent

Automated Response to and Sensing of User Activity in Portable Devices

TL;DR: In this paper, various methods and devices described herein relate to devices which, in at least certain embodiments, may include one or more sensors for providing data relating to user activity and at least one processor for causing the device to respond based on the user activity which was determined, at least in part, through the sensors.
Patent

Using context information to facilitate processing of commands in a virtual assistant

TL;DR: In this article, a virtual assistant uses context information to supplement natural language or gestural input from a user, which helps to clarify the user's intent and reduce the number of candidate interpretations of user's input, and reduces the need for the user to provide excessive clarification input.
Patent

Method and apparatus for building an intelligent automated assistant

TL;DR: In this paper, a method for building an automated assistant includes interfacing a service-oriented architecture that includes a plurality of remote services to an active ontology, where the active ontologies includes at least one active processing element that models a domain.
Patent

Contextual voice commands

TL;DR: In this paper, techniques and systems for implementing contextual voice commands are described and a physical input that relates the selected data item to an operation in a second context is received, and the operation is performed on the input data item in the second context.
References
More filters
Proceedings ArticleDOI

A diphone synthesis system based on time-domain prosodic modifications of speech

TL;DR: A novel time-domain algorithm is presented for text-to-speech synthesis using diphone concatenation based on the pitch-synchronous overlap-add (PSOLA) approach and is capable of good quality prosodic modifications of natural speech.
Patent

Automatic speaker verification by non-linear time alignment of acoustic parameters

TL;DR: In this paper, a nonlinear process is used to align the sample and reference utterances through a piece-wise linear continuous transformation of the time scale, and the extent of time transformation that is required to achieve maximum similarity also influences the decision to accept or reject the identity claim.
PatentDOI

Phonetic hidden markov model speech synthesizer

TL;DR: In this article, a method and a system for synthesizing speech from unrestricted text, based on the principle of associating a written string of text with a sequence of speech features vectors that most probably model the corresponding speech utterance, is presented.
PatentDOI

Method and apparatus for encoding speech

TL;DR: In this paper, the Fourier transform is equalized by normalizing the spectrum coefficients to a curve which approximates the shape of the spectrum, and the spectrum is normalized by scaling different subbands differently to flatten the spectrum.
Proceedings ArticleDOI

Improving naturalness in text-to-speech synthesis using natural glottal source

TL;DR: Various methods to improve text-to-speech in its naturalness and its ability to model individual speakers are discussed, and a multisource method which utilizes different types of glottal source by cross-fading techniques is proposed.
Related Papers (5)