Very low delay and high quality coding of 20 Hz-15 kHz speech signals at 64 Kbit/s
read more
Citations
Warped Low-Delay CELP for Wideband Audio Coding
A low delay sub-band audio coder (20 Hz-15 kHz) at 64 kbit/s
The D/sub 5/ lattice quantization for 64 kbit/s low-delay subband audio coder with a 15 kHz bandwidth
Codage audio haute qualité (20 Hz-15 kHz) en sous bandes à débit réduit (64 kbit/s) et à faible retard (5 ms)
References
A low-delay CELP coder for the CCITT 16 kb/s speech coding standard
Real-time vector excitation coding of speech at 4800 bps
High-quality audio transform coding at 64 kbps
Low-delay code-excited linear-predictive coding of wideband speech at 32 kbps
Low-delay code-excited linear-predictive coding of wideband speech at 32 kbps.
Related Papers (5)
Frequently Asked Questions (10)
Q2. How long should the delay be for the encoder-decoder?
To ensure good interactivity in the communication systems, the one-way delay should not exceed 100 ms including processing, sound recording and transmission delays, leaving only about 20 ms for the encoder-decoder delay.
Q3. What is the way to model the spectral envelope?
In their coder, the modelisation of the spectral envelope is provided by the synthesis filter, whereas the pitch modelisation is mainly ensured by the choice of the optimum vector and their gains.
Q4. What is the role of speech coding in modern telecommunication systems?
High quality speech coding plays an important role in modern telecommunication systems, especially in videoconference and teleconference systems.
Q5. What is the optimum code word in the shape codebook?
With a 5-sample frame and at the sampling frequency of 32 kHz the bit rate is 64 kbit/s.The shape codebook has been designed using a closed-loop iterative procedure [8].
Q6. What is the computationally complex block in CELP?
In CELP coders the most computationally complex block is the analysis-by-synthesis research of the optimum code word in the codebook.
Q7. What is the frequency response of the classical weighting filter?
In fact, between 1 kHz and 3 kHz the frequential response of the classical weighting filter is above the optimum perceptual shape, which makes the quantization noise more audible.
Q8. How is the optimum masking shape obtained?
Then the optimum masking shape is obtained by the convolution (in the Bark domain) of this spectrum with the basilar membrane spreading function [6, 7].
Q9. What does the classical weighting filter achieve?
A re-optimisation of classical filter parameters (γ1, γ2) and tilt control filter parameters does not achieve optimum noise shaping.
Q10. What is the way to solve the problem of the synthesis filter order?
This choice allows, by adapting the gain and selecting the optimum excitation vectors, the reproduction of the details and the periodicity of the linear predictive residual, without requiring a too high filter order.