Nonstationary spectral modeling of voiced speech

doi:10.1109/TASSP.1983.1164128

Journal ArticleDOI

Nonstationary spectral modeling of voiced speech

Luís B. Almeida, +1 more

- 01 Jun 1983 -

IEEE Transactions on Acoustics, Speech, ...

- Vol. 31, Iss: 3, pp 664-678

Chats0

TLDR

A novel model for voiced speech that allows for local non-stationarities not only in terms of pitch perturbations, but in Terms of vocal tract variations as well, and supports new forms of spectral prediction, which can be put to advantage in speech coding applications.

Abstract:

The main purpose of this paper is to present a novel model for voiced speech. The classical model, which is being used in many applications, assumes local stationarity, and consequently imposes a simple and well known line structure to the short-time spectrum of voiced speech. The model derived in this paper allows for local non-stationarities not only in terms of pitch perturbations, but in terms of vocal tract variations as well. The resulting structure of the short-time spectrum becomes more complex, but can still be interpreted in terms of generalized lines. The proposed model supports new forms of spectral prediction, which can be put to advantage in speech coding applications. Experimental results are presented supporting the validity of both the model itself and the prediction relationships. Finally, a new class of speech coders, denoted harmonic coders, based on the presented model, is proposed, and a specific implementation is presented.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Perceptual coding of digital audio

T. Painter, +1 more

TL;DR: This paper reviews methodologies that achieve perceptually transparent coding of FM- and CD-quality audio signals, including algorithms that manipulate transform components, subband signal decompositions, sinusoidal signal components, and linear prediction parameters, as well as hybrid algorithms that make use of more than one signal model.

...read moreread less

Journal ArticleDOI

Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model

E.B. George, +1 more

- 01 Sep 1997 -

IEEE Transactions on Speech and Audio Pr...

TL;DR: The proposed analysis-by-synthesis/overlap-add (ABS/OLA) system allows for both fixed and time-varying time-, frequency-, and pitch-scale modifications, and computational shortcuts using the FFT algorithm make its implementation feasible using currently available hardware.

...read moreread less

Patent

Method and apparatus for hybrid coding of speech at 4kbps

Allen Gersho, +3 more

TL;DR: In this article, a method and apparatus for encoding speech for communication to a decoder for reproduction of the speech where the speech signal is classified into steady state voiced (harmonic), stationary unvoiced, and "transitory" or "transition" speech.

...read moreread less

Book

Adaptive Signal Models: Theory, Algorithms, and Audio Applications

Michael Mark Goodwin, +1 more

TL;DR: This paper presents a meta-modelling framework for Fourier Series Representations of Signal Models and Analysis-Synthesis and concludes with a comparison of these models against known models for Pitch-Synchronous Modeling.

...read moreread less

Journal ArticleDOI

Encoding speech using prototype waveforms

Willem Bastiaan Kleijn

- 01 Oct 1993 -

IEEE Transactions on Speech and Audio Pr...

TL;DR: The coding method is easily combined with existing LP-based speech coders, such as CELP, for unvoiced signals and excellent voiced speech quality is obtained at rates between 3.0 and 4.0 kb/s.

...read moreread less

Luís B. Almeida, +1 more

TL;DR: This paper discusses a form of non-linear prediction, namely, the prediction of the phase of speech signals, based upon a new treatment of the classical speech production model within a short-time analysis/synthesis framework.

...read moreread less

Nonstationary spectral modeling of voiced speech

Citations

Perceptual coding of digital audio

Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model

Method and apparatus for hybrid coding of speech at 4kbps

Adaptive Signal Models: Theory, Algorithms, and Audio Applications

Encoding speech using prototype waveforms

References

Quantizing for minimum distortion

Frequency domain coding of speech

Real-time digital hardware pitch detector

Short-time Fourier analysis of sampled speech

A model for short-time phase prediction of speech

Related Papers (5)

Speech analysis/Synthesis based on a sinusoidal representation

Multiband excitation vocoder

Speech coding: a tutorial review

Code-excited linear prediction(CELP): High-quality speech at very low bit rates

Efficient vector quantization of LPC parameters at 24 bits/frame