Predictive Coding of Speech at Low Bit Rates

doi:10.1109/TCOM.1982.1095501

Journal ArticleDOI

Predictive Coding of Speech at Low Bit Rates

Bishnu S. Atal

- 01 Apr 1982 -

IEEE Transactions on Communications

- Vol. 30, Iss: 4, pp 600-614

Chats0

TLDR

A new class of speech coders are described which allow one to realize the precise optimum noise spectrum which is crucial to achieving very low bit rates, but also represent the important first step in bridging the gap between waveform coders and vocoders without suffering from their limitations.

Abstract:

Predictive coding is a promising approach for speech coding. In this paper, we review the recent work on adaptive predictive coding of speech signals, with particular emphasis on achieving high speech quality at low bit rates (less than 10 kbits/s). Efficient prediction of the redundant structure in speech signals is obviously important for proper functioning of a predictive coder. It is equally important to ensure that the distortion in the coded speech signal be perceptually small. The subjective loudness of quantization noise depends both on the short-time spectrum of the noise and its relation to the short-time spectrum of the Speech signal. The noise in the formant regions is partially masked by the speech signal itself. This masking of quantization noise by speech signal allows one to use low bit rates while maintaining high speech quality. This paper will present generalizations of predictive coding for minimizing subjective distortion in the reconstructed speech signal at the receiver. The quantizer in predictive coders quantizes its input on a sample-by-sample basis. Such sample-by-sample (instantaneous) quantization creates difficulty in realizing an arbitrary noise spectrum, particularly at low bit rates. We will describe a new class of speech coders in this paper which could be considered to be a generalization of the predictive coder. These new coders not only allow one to realize the precise optimum noise spectrum which is crucial to achieving very low bit rates, but also represent the important first step in bridging the gap between waveform coders and vocoders without suffering from their limitations.

Predictive Coding of Speech at Low Bit Rates

Citations

Predictive coding of speech using analysis-by-synthesis techniques

Optimizing LPC filter parameters for multi-pulse excitation

Analysis-by-synthesis method for whisper-speech reconstruction

Speech and Audio Processing: A MATLAB®-based Approach

Multi-state speech encoder and decoder

References

Speech analysis and synthesis by linear prediction of the speech wave.

Predictive coding--I

Optimizing digital speech coders by exploiting masking properties of the human ear

Predictive coding of speech signals and subjective error criteria

Adaptive predictive coding of speech signals

Related Papers (5)

Code-excited linear prediction(CELP): High-quality speech at very low bit rates

A new model of LPC excitation for producing natural-sounding speech at low bit rates

Linear prediction: A tutorial review

Linear Prediction of Speech

An Algorithm for Vector Quantizer Design