A new model of LPC excitation for producing natural-sounding speech at low bit rates

doi:10.1109/ICASSP.1982.1171649

Proceedings ArticleDOI

A new model of LPC excitation for producing natural-sounding speech at low bit rates

B. Atal, +1 more

- Vol. 7, pp 614-617

Chats0

TLDR

This paper describes a new approach to the excitation problem that does not require a priori knowledge of either the voiced-unvoiced decision or the pitch period, and minimizes a perceptual-distance metric representing subjectively-important differences between the waveforms of the original and the synthetic speech signals.

Abstract:

The excitation for LPC speech synthesis usually consists of two separate signals - a delta-function pulse once every pitch period for voiced speech and white noise for unvoiced speech. This manner of representing excitation requires that speech segments be classified accurately into voiced and unvoiced categories and the pitch period of voiced segments be known. It is now well recognized that such a rigid idealization of the vocal excitation is often responsible for the unnatural quality associated with synthesized speech. This paper describes a new approach to the excitation problem that does not require a priori knowledge of either the voiced-unvoiced decision or the pitch period. All classes of sounds are generated by exciting the LPC filter with a sequence of pulses; the amplitudes and locations of the pulses are determined using a non-iterative analysis-by-synthesis procedure. This procedure minimizes a perceptual-distance metric representing subjectively-important differences between the waveforms of the original and the synthetic speech signals. The distance metric takes account of the finite-frequency resolution as well as the differential sensitivity of the human ear to errors in the formant and inter-formant regions of the speech spectrum.

A new model of LPC excitation for producing natural-sounding speech at low bit rates

Citations

Introduction to data compression

Speech analysis/Synthesis based on a sinusoidal representation

Sparse solutions to linear inverse problems with multiple measurement vectors

Discrete-Time Speech Signal Processing: Principles and Practice

Vector quantization in speech coding

References

Linear Prediction of Speech

Speech Analysis, Synthesis and Perception

Speech analysis and synthesis by linear prediction of the speech wave.

Optimizing digital speech coders by exploiting masking properties of the human ear

Predictive coding of speech signals and subjective error criteria

Related Papers (5)

Code-excited linear prediction(CELP): High-quality speech at very low bit rates

Linear prediction: A tutorial review

Speech analysis and synthesis by linear prediction of the speech wave.

Linear Prediction of Speech

An Algorithm for Vector Quantizer Design