scispace - formally typeset
Proceedings ArticleDOI

Predictive coding of speech signals and subjective error criteria

Reads0
Chats0
TLDR
Improved speech quality is obtained a) by efficient removal of formant and pitch related redundant structure of speech before quantizing and b) by effective masking of the quantizer noise by the speech signal.
Abstract
Predictive coding methods attempt to minimize the r.m.s. error in the coded signal. However, the human ear does not perceive signal distortion on the basis of r.m.s. error regardless of its spectral shape relative to the signal spectrum. Specifically, for speech signals, the locations of the formant frequencies and their rates of change with time influence the audibility, and thus the subjective distortion of any quantizing noise. In this paper, methods for reducing the subjective distortion in predictive coders for speech siganls are described and evaluated. Improved speech quality is obtained a) by efficient removal of formant and pitch related redundant structure of speech before quantizing and b) by effective masking of the quantizer noise by the speech signal.

read more

Citations
More filters
Journal ArticleDOI

Vector quantization in speech coding

TL;DR: This tutorial review presents the basic concepts employed in vector quantization and gives a realistic assessment of its benefits and costs when compared to scalar quantization, and focuses primarily on the coding of speech signals and parameters.
Book

Survey of the State of the Art in Human Language Technology

R. Cole
TL;DR: In this article, the authors present a glossary for language analysis and understanding in the context of spoken language input and output technologies, and evaluate their work with a set of annotated corpora.
Book

An Introduction to Digital Speech Processing

TL;DR: A comprehensive overview of digital speech processing that ranges from the basic nature of the speech signal, through a variety of methods of representing speech in digital form, to applications in voice communication and automatic synthesis and recognition of speech.
Journal ArticleDOI

Design and description of CS-ACELP: a toll quality 8 kb/s speech coder

TL;DR: The coder structure is described in detail and the reasons behind certain design choices are discussed and a summary of the subjective test results based on a real-time implementation of this version are presented.
Book

The Theory of Linear Prediction

TL;DR: The text is self-contained for readers with introductory exposure to signal processing, random processes, and the theory of matrices, and a historical perspective and detailed outline are given in the first chapter.
References
More filters
Journal ArticleDOI

Quantizing for minimum distortion

TL;DR: This paper discusses the problem of the minimization of the distortion of a signal by a quantizer when the number of output levels of the quantizer is fixed and an algorithm is developed to simplify their numerical solution.
Journal ArticleDOI

Speech analysis and synthesis by linear prediction of the speech wave.

TL;DR: Application of this method for efficient transmission and storage of speech signals as well as procedures for determining other speechcharacteristics, such as formant frequencies and bandwidths, the spectral envelope, and the autocorrelation function, are discussed.
Journal ArticleDOI

Predictive coding--I

TL;DR: Part II will give the mathematical criterion for the best predictor for use in the predictive coding of particular messages, will give examples of such messages, and will show that the error term which is transmitted in predictive coding may always be coded efficiently.
Journal ArticleDOI

Adaptive predictive coding of speech signals

TL;DR: Preliminary studies suggest that the binary difference signal and the predictor parameters together can be transmitted at approximately 10 kilobits/second which is several times less than the bit rate required for log-PCM encoding with comparable speech quality.
Patent

Predictive coding of speech signals

TL;DR: In this article, an adaptive predictor is employed which is readjusted periodically to match the time-varying characteristics of a speech signal, which is used to reduce the channel capacity required to transmit a signal with specified fidelity.