scispace - formally typeset
Journal ArticleDOI

Predictive Coding of Speech at Low Bit Rates

Bishnu S. Atal
- 01 Apr 1982 - 
- Vol. 30, Iss: 4, pp 600-614
Reads0
Chats0
TLDR
A new class of speech coders are described which allow one to realize the precise optimum noise spectrum which is crucial to achieving very low bit rates, but also represent the important first step in bridging the gap between waveform coders and vocoders without suffering from their limitations.
Abstract
Predictive coding is a promising approach for speech coding. In this paper, we review the recent work on adaptive predictive coding of speech signals, with particular emphasis on achieving high speech quality at low bit rates (less than 10 kbits/s). Efficient prediction of the redundant structure in speech signals is obviously important for proper functioning of a predictive coder. It is equally important to ensure that the distortion in the coded speech signal be perceptually small. The subjective loudness of quantization noise depends both on the short-time spectrum of the noise and its relation to the short-time spectrum of the Speech signal. The noise in the formant regions is partially masked by the speech signal itself. This masking of quantization noise by speech signal allows one to use low bit rates while maintaining high speech quality. This paper will present generalizations of predictive coding for minimizing subjective distortion in the reconstructed speech signal at the receiver. The quantizer in predictive coders quantizes its input on a sample-by-sample basis. Such sample-by-sample (instantaneous) quantization creates difficulty in realizing an arbitrary noise spectrum, particularly at low bit rates. We will describe a new class of speech coders in this paper which could be considered to be a generalization of the predictive coder. These new coders not only allow one to realize the precise optimum noise spectrum which is crucial to achieving very low bit rates, but also represent the important first step in bridging the gap between waveform coders and vocoders without suffering from their limitations.

read more

Citations
More filters
PatentDOI

Method for searching an excitation codebook in a code excited linear prediction (CELP) coder

TL;DR: In this paper, the analysis window for the coder is extended beyond the length of the target speech frame by using a one-dimensional autocorrelation matrix to reduce the computational complexity and memory required for the search.
Journal Article

The «(4,2) concept» fault-tolerant computer

Th. Krol
TL;DR: The basic (4,2)-concept has been extended with a special mode in which single-bit faults are masked even in the presence of a completely failing slice, which will be applied in the Philips PRX-D digital telephone exchange.
Journal ArticleDOI

Fractional rate multitree speech coding

TL;DR: The authors present both forward and backward adaptive speech coders that operate at 9.6, 12, and 16 kb/s using integer and fractional rate trees, weighted squared error distortion measures, the (M,L) tree search algorithm, and incremental path map symbol release.
Proceedings ArticleDOI

A unified framework for LPC excitation representation in residual speech coders

TL;DR: It is demonstrated that this approach provides a unified framework for describing and analyzing a wide range of residual speechCoders, from multipulse LPC and code-excited linear prediction to residual transform coders, and leads to generalization of some of these schemes.
Proceedings ArticleDOI

Role of multi-pulse excitation in synthesis of natural-sounding voiced speech

TL;DR: The role of multi-pulse excitation and its importance in the synthesis of natural-sounding voiced speech is discussed and it is suggested, that this irregularity contributes to the "fullness" heard in multi-Pulse synthesized speech.
References
More filters
Journal ArticleDOI

Speech analysis and synthesis by linear prediction of the speech wave.

TL;DR: Application of this method for efficient transmission and storage of speech signals as well as procedures for determining other speechcharacteristics, such as formant frequencies and bandwidths, the spectral envelope, and the autocorrelation function, are discussed.
Journal ArticleDOI

Predictive coding--I

TL;DR: Part II will give the mathematical criterion for the best predictor for use in the predictive coding of particular messages, will give examples of such messages, and will show that the error term which is transmitted in predictive coding may always be coded efficiently.
Journal ArticleDOI

Optimizing digital speech coders by exploiting masking properties of the human ear

TL;DR: New results of masking and loudness reduction of noise are reported and the design principles of speech coding systems exploiting auditory masking are described.
Journal ArticleDOI

Predictive coding of speech signals and subjective error criteria

TL;DR: Improved speech quality is obtained by efficient removal of formant and pitch-related redundant structure of speech before quantizing, and by effective masking of the quantizer noise by the speech signal.
Journal ArticleDOI

Adaptive predictive coding of speech signals

TL;DR: Preliminary studies suggest that the binary difference signal and the predictor parameters together can be transmitted at approximately 10 kilobits/second which is several times less than the bit rate required for log-PCM encoding with comparable speech quality.