Proceedings Article
Predictive coding of speech signals and subjective error criteria
Bishnu S. Atal, Manfred R. Schroeder
- Vol. 3, pp 573-576
TLDR
Improved speech quality is obtained a) by efficient removal of formant and pitch related redundant structure of speech before quantizing and b) by effective masking of the quantizer noise by the speech signal.
Abstract:
Predictive coding methods attempt to minimize the r.m.s. error in the coded signal. However, the human ear does not perceive signal distortion on the basis of r.m.s. error regardless of its spectral shape relative to the signal spectrum. Specifically, for speech signals, the locations of the formant frequencies and their rates of change with time influence the audibility, and thus the subjective distortion, of any quantizing noise. In this paper, methods for reducing the subjective distortion in predictive coders for speech signals are described and evaluated. Improved speech quality is obtained a) by efficient removal of formant and pitch related redundant structure of speech before quantizing and b) by effective masking of the quantizer noise by the speech signal.
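As a rough illustration of point a), the sketch below fits a short-time linear predictor to a signal and measures how much the r.m.s. level drops once the predictable (formant-like) structure is removed. It is not the authors' coder: the test signal is a hypothetical two-pole resonator driven by white noise, the predictor is estimated with the standard autocorrelation method and Levinson-Durbin recursion, and all function names and parameter values (500 Hz resonance, 8 kHz sampling rate, order 2) are assumptions chosen for the example. Pitch prediction, quantization, and the noise masking of point b) are left out.

```python
import math
import random


def autocorr(x, max_lag):
    """Autocorrelation sums r[0..max_lag] of the sequence x."""
    n = len(x)
    return [sum(x[i] * x[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Return predictor coefficients a[1..order] and the final error energy.

    The predictor is x_hat[n] = sum_k a[k] * x[n - k].
    """
    a = [0.0] * (order + 1)
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / err                      # reflection coefficient
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= 1.0 - k * k
    return a[1:], err


def prediction_residual(x, a):
    """Subtract the linear prediction from each sample (zero history at start)."""
    out = []
    for n in range(len(x)):
        pred = sum(a[k] * x[n - 1 - k] for k in range(len(a)) if n - 1 - k >= 0)
        out.append(x[n] - pred)
    return out


def rms(x):
    return math.sqrt(sum(v * v for v in x) / len(x))


def synth_resonance(n, freq_hz=500.0, fs_hz=8000.0, radius=0.95, seed=0):
    """Hypothetical test signal: white noise through a two-pole resonator."""
    rng = random.Random(seed)
    theta = 2.0 * math.pi * freq_hz / fs_hz
    a1, a2 = 2.0 * radius * math.cos(theta), -radius * radius
    x = []
    for i in range(n):
        prev1 = x[i - 1] if i >= 1 else 0.0
        prev2 = x[i - 2] if i >= 2 else 0.0
        x.append(a1 * prev1 + a2 * prev2 + rng.gauss(0.0, 1.0))
    return x, (a1, a2)


signal, true_coeffs = synth_resonance(2000)
coeffs, _ = levinson_durbin(autocorr(signal, 2), order=2)
resid = prediction_residual(signal, coeffs)
print("estimated predictor:", coeffs)
print("rms(signal) / rms(residual):", rms(signal) / rms(resid))
```

The residual has a much smaller r.m.s. value than the input, so a quantizer with a fixed number of levels can use a finer step on it. The abstract's second point is that the spectral shape of the remaining quantizing noise, not only its r.m.s. value, determines how audible it is.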
Citations
Journal Article
Vector quantization in speech coding
John Makhoul, S. Roucos, H. Gish
TL;DR: This tutorial review presents the basic concepts employed in vector quantization and gives a realistic assessment of its benefits and costs when compared to scalar quantization, and focuses primarily on the coding of speech signals and parameters.
Book
Survey of the State of the Art in Human Language Technology
TL;DR: In this article, the authors present a glossary for language analysis and understanding in the context of spoken language input and output technologies, and evaluate their work with a set of annotated corpora.
Book
An Introduction to Digital Speech Processing
TL;DR: A comprehensive overview of digital speech processing that ranges from the basic nature of the speech signal, through a variety of methods of representing speech in digital form, to applications in voice communication and automatic synthesis and recognition of speech.
Journal Article
Design and description of CS-ACELP: a toll quality 8 kb/s speech coder
R. Salami, Claude Laflamme, J.-P. Adoul, A. Kataoka, S. Hayashi, Takehiro Moriya, C. Lamblin, D. Massaloux, S. Proust, P. Kroon, Y. Shoham
TL;DR: The coder structure is described in detail and the reasons behind certain design choices are discussed and a summary of the subjective test results based on a real-time implementation of this version are presented.
Book
The Theory of Linear Prediction
TL;DR: The text is self-contained for readers with introductory exposure to signal processing, random processes, and the theory of matrices, and a historical perspective and detailed outline are given in the first chapter.
References
Journal Article
Quantizing for minimum distortion
TL;DR: This paper discusses the problem of the minimization of the distortion of a signal by a quantizer when the number of output levels of the quantizer is fixed and an algorithm is developed to simplify their numerical solution.
Journal Article
Speech analysis and synthesis by linear prediction of the speech wave.
B. S. Atal, Suzanne L. Hanauer
TL;DR: Application of this method for efficient transmission and storage of speech signals as well as procedures for determining other speech characteristics, such as formant frequencies and bandwidths, the spectral envelope, and the autocorrelation function, are discussed.
Journal Article
Predictive coding--I
TL;DR: Part II will give the mathematical criterion for the best predictor for use in the predictive coding of particular messages, will give examples of such messages, and will show that the error term which is transmitted in predictive coding may always be coded efficiently.
Journal Article
Adaptive predictive coding of speech signals
Bishnu S. Atal, M. R. Schroeder
TL;DR: Preliminary studies suggest that the binary difference signal and the predictor parameters together can be transmitted at approximately 10 kilobits/second which is several times less than the bit rate required for log-PCM encoding with comparable speech quality.
Patent
Predictive coding of speech signals
TL;DR: An adaptive predictor, readjusted periodically to match the time-varying characteristics of the speech signal, is used to reduce the channel capacity required to transmit the signal with specified fidelity.