scispace - formally typeset
PatentDOI

Audio analysis/synthesis system

Reads0
Chats0
TLDR
In this article, a method and apparatus for the automatic analysis, synthesis and modification of audio signals, based on an overlap-add sinusoidal model is disclosed, which incorporates successive approximation, yielding synthetic waveforms which are very good approximations to the original waveforms.
Abstract
A method and apparatus for the automatic analysis, synthesis and modification of audio signals, based on an overlap-add sinusoidal model is disclosed. Automatic analysis of amplitude, frequency and phase parameters of the model is achieved using an analysis-by-synthesis procedure (108) which incorporates successive approximation, yielding synthetic waveforms which are very good approximations to the original waveforms. In addition, a new approach to pich-scale modification (111) allows for the use of arbitrary spectral envelope estimates and addresses the problems of high-frequency loss and noise amplification encountered with prior art methods.

read more

Citations
More filters
PatentDOI

Speech synthesis method

TL;DR: In this article, a plurality of synthesis speech segments are generated by synthesizing training speech segments labeled with phonetic contexts and input speech segments while altering the pitch/duration of the input text segments in accordance with the pitch and duration of the training text segments.
PatentDOI

Speech coding system and method using voicing probability determination

TL;DR: A modular system and method is provided for encoding and decoding of speech signals using voicing probability determination and the use of the system in the generation of a variety of voice effects.
PatentDOI

Expressivity of voice synthesis by emphasizing source signal features

TL;DR: In this paper, a library of source sound categories in the source module is used for voice synthesis with improved expressivity, where each source sound category corresponds to a particular morphological category and is derived from analysis of real vocal sounds.
Patent

Low bit-rate speech coding system and method using voicing probability determination

TL;DR: In this paper, a modular system and method for low bit rate encoding and decoding of speech signals using voicing probability determination is provided, where continuous input speech is divided into time segments of a predetermined length.
Patent

Encoder, decoder and methods for encoding and decoding data segments representing a time-domain data stream

TL;DR: An apparatus for decoding data segments representing a time-domain data stream, a data segment being encoded in the time domain or in the frequency domain, having successive blocks of data representing successive and overlapping blocks of time domain data samples was proposed in this article.
References
More filters
Journal ArticleDOI

Speech analysis/Synthesis based on a sinusoidal representation

TL;DR: A sinusoidal model for the speech waveform is used to develop a new analysis/synthesis technique that is characterized by the amplitudes, frequencies, and phases of the component sine waves, which forms the basis for new approaches to the problems of speech transformations including time-scale and pitch-scale modification, and midrate speech coding.
Journal ArticleDOI

Speech transformations based on a sinusoidal representation

TL;DR: In this paper, a speech analysis/synthesis technique is presented which provides the basis for a general class of speech transformations including time-scale modification, frequency scaling, and pitch modification.
PatentDOI

Processing of acoustic waveforms

TL;DR: In this article, a sinusoidal model for acoustic waveforms is applied to develop a new analysis/synthesis technique which characterizes a waveform by the amplitudes, frequencies, and phases of component sine waves.
Proceedings ArticleDOI

Pitch estimation and voicing detection based on a sinusoidal speech model

TL;DR: A pitch estimation criterion is derived that is inherently unambiguous, uses pitch-adaptive resolution, uses small-signal suppression to provide enhanced discrimination, and uses amplitude compression to eliminate the effects of pitch-formant interaction.
PatentDOI

Coding of acoustic waveforms

TL;DR: In this article, a pitch-adaptive channel encoding technique for amplitude coding varies the channel spacing in accordance with the pitch of the speaker's voice, and a phase synthesis technique locks rapidly-varying phases into synchrony with the phase of the fundamental.
Related Papers (5)