Proceedings Article

Flexible Quantization of Audio and Speech based on the Autoregressive Model

A. Ozerov, +1 more
pp. 535-539
Abstract
We describe a coding scheme based on audio and speech quantization with an adaptive quantizer derived from the autoregressive model under high-rate assumptions. The main advantage of this scheme compared to state-of-the-art training-based coders is its flexibility: the scheme can adapt in real time to any particular rate, and its computational complexity is independent of the rate. Experiments indicate that, compared with a non-scalable conventional fixed-rate code-excited linear predictive (CELP) coding scheme, our real-time scalable coder with scalar quantization performs at least as well in the constrained-entropy case and has nearly identical performance in the constrained-resolution case.
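To illustrate why a quantizer derived from a model under high-rate assumptions can follow any target rate at constant cost, the sketch below may help. It is not the authors' coder. It assumes a first-order autoregressive source with Gaussian innovations, a closed-loop predictor, and the standard high-rate result for entropy-constrained uniform scalar quantization, where the rate is roughly h(X) - log2(step) and the distortion is roughly step^2 / 12. All function names, the AR coefficient, and the rate values are hypothetical choices made for this example.

```python
import math
import random


def entropy_constrained_step(variance, rate_bits):
    """Step size of a uniform scalar quantizer for a Gaussian variable.

    High-rate approximation: R ~ h(X) - log2(step), with
    h(X) = 0.5 * log2(2*pi*e*variance), so step = sqrt(2*pi*e*variance) * 2**(-R).
    The cost of this computation does not depend on the rate.
    """
    return math.sqrt(2.0 * math.pi * math.e * variance) * 2.0 ** (-rate_bits)


def uniform_quantize(value, step):
    """Round value to the nearest multiple of step."""
    return step * math.floor(value / step + 0.5)


def simulate_ar1(n, a, innovation_var, seed=0):
    """Draw n samples of x[t] = a * x[t-1] + e[t], with e[t] ~ N(0, innovation_var)."""
    rng = random.Random(seed)
    sd = math.sqrt(innovation_var)
    x, out = 0.0, []
    for _ in range(n):
        x = a * x + rng.gauss(0.0, sd)
        out.append(x)
    return out


def code_ar1(signal, a, step):
    """Closed-loop predictive coding: quantize the prediction residual.

    The prediction uses the previous reconstruction, so the reconstruction
    error equals the quantization error of the residual.
    """
    recon, prev = [], 0.0
    for x in signal:
        prediction = a * prev
        prev = prediction + uniform_quantize(x - prediction, step)
        recon.append(prev)
    return recon


def mean_squared_error(xs, ys):
    return sum((x - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


if __name__ == "__main__":
    signal = simulate_ar1(20000, a=0.9, innovation_var=1.0, seed=1)
    for rate in (2.0, 3.0, 4.0):  # assumed target rates, bits per sample
        step = entropy_constrained_step(1.0, rate)
        mse = mean_squared_error(signal, code_ar1(signal, 0.9, step))
        print(rate, step, mse, step * step / 12.0)
```

Changing the target rate only changes one closed-form step size; no codebook is retrained, which is the kind of flexibility the abstract contrasts with training-based coders. The sketch covers only a constrained-entropy style setting; the constrained-resolution case needs a non-uniform quantizer and is not shown.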

