scispace - formally typeset
Search or ask a question
Topic

Enhanced Variable Rate Codec

About: Enhanced Variable Rate Codec is a research topic. Over the lifetime, 357 publications have been published within this topic receiving 4842 citations. The topic is also known as: EVRC.


Papers
More filters
Journal ArticleDOI
TL;DR: In this paper, the adaptive multirate wideband (AMR-WB) speech codec was selected by the Third Generation Partnership Project (3GPP) for GSM and the third generation mobile communication WCDMA system for providing wideband speech services.
Abstract: This paper describes the adaptive multirate wideband (AMR-WB) speech codec selected by the Third Generation Partnership Project (3GPP) for GSM and the third generation mobile communication WCDMA system for providing wideband speech services. The AMR-WB speech codec algorithm was selected in December 2000 and the corresponding specifications were approved in March 2001. The AMR-WB codec was also selected by the International Telecommunication Union-Telecommunication Sector (ITU-T) in July 2001 in the standardization activity for wideband speech coding around 16 kb/s and was approved in January 2002 as Recommendation G.722.2. The adoption of AMR-WB by ITU-T is of significant importance since for the first time the same codec is adopted for wireless as well as wireline services. AMR-WB uses an extended audio bandwidth from 50 Hz to 7 kHz and gives superior speech quality and voice naturalness compared to existing second- and third-generation mobile communication systems. The wideband speech service provided by the AMR-WB codec will give mobile communication speech quality that also substantially exceeds (narrowband) wireline quality. The paper details AMR-WB standardization history, algorithmic description including novel techniques for efficient ACELP wideband speech coding and subjective quality performance of the codec.

312 citations

01 Jan 2002
TL;DR: The adaptive multirate wideband (AMR-WB) speech codec selected by the Third Generation Partnership Project (3GPP) for GSM and the third generation mobile communication WCDMA system for providing wideband speech services is described.
Abstract: This paper describes the adaptive multirate wideband (AMR-WB) speech codec selected by the Third Generation Partnership Project (3GPP) for GSM and the third generation mobile communication WCDMA system for providing wideband speech services. The AMR-WB speech codec algorithm was selected in December 2000 and the corresponding specifications were approved in March 2001. The AMR-WB codec was also selected by the International Telecommunication Union-Telecommunication Sector (ITU-T) in July 2001 in the standardization activity for wideband speech coding around 16 kb/s and was approved in January 2002 as Recommendation G.722.2. The adoption of AMR-WB by ITU-T is of significant importance since for the first time the same codec is adopted for wireless as well as wireline services. AMR-WB uses an extended audio bandwidth from 50 Hz to 7 kHz and gives superior speech quality and voice naturalness compared to existing second- and third-generation mobile communication systems. The wideband speech service provided by the AMR-WB codec will give mobile communication speech quality that also substantially exceeds (narrowband) wireline quality. The paper details AMR-WB standardization history, algorithmic description including novel techniques for efficient ACELP wideband speech coding and subjective quality performance of the codec.

282 citations

Patent
Yang Gao1, Adil Benyassine2, Jes Thyssen2, Eyal Shlomot2, Huan-Yu Su2 
15 Sep 2000
TL;DR: In this paper, a speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed, which optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech.
Abstract: A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. The codecs are selectively activated based on a rate selection. In addition, the full and half-rate codecs are selectively activated based on a type classification. Each codec is selectively activated to encode and decode the speech signals at different bit rates emphasizing different aspects of the speech signal to enhance overall quality of the synthesized speech.

119 citations

PatentDOI
TL;DR: In this paper, a 26-bit spectrum filter coding scheme was used to jointly optimize pitch and gain parameter sets in a speech codec operating at low data rates using an iterative method, where the number of bits allocated to the pitch and excitation signals depend on whether the signals are significant or not.
Abstract: A speech codec operating at low data rates uses an iterative method to jointly optimize pitch and gain parameter sets. A 26-bit spectrum filter coding scheme may be used, involving successive subtractions and quantizations. The codec may preferably use a decomposed multipulse excitation model, wherein the multipulse vectors used as the excitation signal are decomposed into position and amplitude codewords. Multipulse vectors are coded by comparing each vector to a reference multipulse vector and quantizing the resulting difference vector. An expanded multipulse excitation codebook and associated fast search method, optionally with a dynamically-weighted distortion measure, allow selection of the best excitation vector without memory or computational overload. In a dynamic bit allocation technique, the number of bits allocated to the pitch and excitation signals depend on whether the signals are "significant" or "insignificant". Silence/speech detection is based on an average signal energy over an interval and a minimum average energy over a predetermined number of intervals. Adaptive post-filter and the automatic gain control schemes are also provided. Interpolation is used for spectrum filter smoothing, and an algorithm is provided for ensuring stability of the spectrum filter. Specially designed scalar quantizers are provided for the pitch gain and excitation gain.

110 citations

Journal ArticleDOI
TL;DR: A toll quality speech codec at 8 kb/s suitable for the future personal communications system and can support a frame erasure rate up to 3% with a degradation in its performance that is still worse than the ITU-T requirements.
Abstract: A toll quality speech codec at 8 kb/s suitable for the future personal communications system is presented. The codec is currently under standardization by the ITU-T (successor of CCITT) where the codec terms of reference were mainly determined considering PCS application. The encoding algorithm is based on algebraic code-excited linear prediction (ACELP) and has a speech frame of 10 ms. Efficient pitch and codebook search strategies, along with efficient quantization procedures, have been developed to achieve toll quality encoded speech with a complexity implementable on current fixed-point DSP chips. Formal subjective listening tests, performed by ITU-T SG 12, showed that the codec quality is equivalent to that of G.726 ADPCM at 32 kb/s in error-free conditions and it outperforms G.726 under error conditions. The codec performs adequately under tandeming conditions, and can support a frame erasure rate up to 3% with a degradation in its performance that is still worse than the ITU-T requirements, and this is one subject of study for the next phase. The algorithm has been implemented on a single fixed-point DSP for the ITU-T subjective rest, and required about 29 MIPS. An optimized version, however, requires 24 MIPS without any speech quality degradation. >

110 citations


Network Information
Related Topics (5)
Data compression
43.6K papers, 756.5K citations
71% related
Decoding methods
65.7K papers, 900K citations
69% related
Quality of service
77.1K papers, 996.6K citations
69% related
Video tracking
37K papers, 735.9K citations
69% related
Network packet
159.7K papers, 2.2M citations
69% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20176
20167
201513
20149
201311
20128