Topic

Code-excited linear prediction

About: Code-excited linear prediction is a research topic. Over its lifetime, 2,025 publications have appeared within this topic, receiving 28,633 citations. The topic is also known as: CELP.


Papers
Journal ArticleDOI
TL;DR: The results show that TCVQ performs significantly better than VQ, with reasonable complexity, which makes TCVQ a fair choice for trading quality against complexity and/or delay.
Abstract: We analyze the performance of a CELP coder where the vector quantization (VQ) of the excitation is replaced with trellis-coded vector quantization (TCVQ). Our results show that TCVQ performs significantly better than VQ, with reasonable complexity. This makes TCVQ a fair choice for trading quality against complexity and/or delay. We describe a systematic procedure to replace VQ with TCVQ for existing CELP coders. We propose an optimization algorithm to appropriately populate the trellis. We show how pseudo-Gray coding can be applied to the TCVQ codebook to improve intrinsic coder robustness to channel errors. Finally, we evaluate the complexity and performance of the method.

5 citations

Proceedings ArticleDOI
12 May 2008
TL;DR: This paper shows that the G.729.1 extension layers are quite generic for scalable codec design in the sense that they can be applied to EFR with limited adjustments, and proposes a minor modification of the bit allocation procedure in the TDAC stage, exploiting spectral masking only for higher frequency bands.
Abstract: This paper describes a 12.2-32 kbps scalable wideband speech and audio coder interoperable with GSM enhanced full-rate (EFR). This coder, referred to as EFR-EV, is designed using the ITU-T G.729.1 multi-stage coding structure. Specifically, EFR-EV consists of three stages: a code-excited linear prediction (CELP) stage derived from EFR, time-domain bandwidth extension (TDBWE), and time-domain aliasing cancellation (TDAC). In this paper, we show that the G.729.1 extension layers (i.e. TDBWE and TDAC) are quite generic for scalable codec design in the sense that they can be applied to EFR with limited adjustments. In addition, we propose a minor modification of the bit allocation procedure in the TDAC stage, exploiting spectral masking only for higher frequency bands. The performance of EFR-EV and G.729.1 is evaluated in terms of objective/subjective quality, algorithmic delay, and complexity.

5 citations

Proceedings ArticleDOI
Masahiro Serizawa1, Kazunori Ozawa1
09 May 1995
TL;DR: In this article, a pitch prediction method for 4 kbps CELP (code excited LPC) speech coding is proposed, which can carry out the true pitch prediction by utilizing the current subframe excitation codevector signal, when the pitch prediction parameters are determined.
Abstract: This paper proposes a new pitch prediction method for 4 kbps CELP (code excited LPC) speech coding. In the conventional CELP speech coding, synthetic speech quality deteriorates rapidly at 4 kbps, especially for female and children's speech with short pitch period. The important reason is that when the pitch period is shorter than the subframe length, simple repetition of the past excitation signal based on the estimated lag, not the true pitch prediction, is usually used in the adaptive codebook operation. The proposed pitch prediction method can carry out the true pitch prediction by utilizing the current subframe excitation codevector signal, when the pitch prediction parameters are determined. For further improvement, a split weighting method and a low-complexity harmonic and spectral perceptually-weighting method have also been developed. The informal listening test result shows that the 4 kbps coder with 20 msec subframe, utilizing all of the proposed improvements, achieves 0.2 MOS higher results than the coder without them.

5 citations
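
The Serizawa and Ozawa abstract above notes that, when the pitch period is shorter than the subframe, conventional CELP coders fill the adaptive-codebook vector by simply repeating the past excitation at the estimated lag. A minimal sketch of that conventional repetition follows (not the proposed method, which the abstract does not specify in enough detail to reproduce). The function name `adaptive_codebook_vector` and its parameters are hypothetical, and integer lags are assumed.

```python
def adaptive_codebook_vector(past_excitation, lag, subframe_len):
    """Conventional adaptive-codebook vector for one subframe (illustrative).

    past_excitation: list of previously decoded excitation samples,
                     most recent sample last (assumed layout).
    lag:             integer pitch lag in samples (assumed integer-only).
    subframe_len:    number of samples to produce.
    """
    if lag <= 0 or lag > len(past_excitation):
        raise ValueError("lag must be in 1..len(past_excitation)")

    history_len = len(past_excitation)
    vec = []
    for n in range(subframe_len):
        if n < lag:
            # Sample lies in the past excitation buffer: copy it directly.
            vec.append(past_excitation[history_len - lag + n])
        else:
            # Lag shorter than the subframe: the needed sample belongs to the
            # current subframe, so the already-copied segment is repeated.
            vec.append(vec[n - lag])
    return vec


if __name__ == "__main__":
    past = [1, 2, 3, 4, 5]
    # Lag of 2 with a 5-sample subframe: the last two samples repeat.
    print(adaptive_codebook_vector(past, lag=2, subframe_len=5))
```

The `else` branch is the "simple repetition" the abstract criticises: it reuses old excitation rather than the current subframe's actual excitation, which is why quality can suffer for short pitch periods.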

Patent
02 Aug 2007
TL;DR: A method and apparatus for a voice transcoder that converts a bitstream representing frames of data encoded according to a first voice compression standard to a bitstream encoded according to a second voice compression standard, using perceptual weighting with tuned weighting factors to produce a higher quality decoded voice signal than a comparable tandem transcoding solution.
Abstract: A method and apparatus for a voice transcoder that converts a bitstream representing frames of data encoded according to a first voice compression standard to a bitstream representing frames of data according to a second voice compression standard using perceptual weighting that uses tuned weighting factors, such that the bitstream of the second voice compression standard produces a higher quality decoded voice signal than a comparable tandem transcoding solution. The method includes pre-computing weighting factors for a perceptual weighting filter optimized to a specific source and destination codec pair, pre-configuring the transcoding strategies, mapping CELP parameters in the CELP parameter space according to the selected coding strategy, performing Linear Prediction analysis if specified by the transcoding strategy, perceptually weighting the speech using tuned weighting factors, and searching for adaptive codebook and fixed-codebook parameters to obtain a quantized set of destination codec parameters.

5 citations
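
The patent abstract above refers to a perceptual weighting filter with tuned weighting factors. As a rough illustration only, the sketch below uses the form commonly seen in CELP coders, W(z) = A(z/g1) / A(z/g2), where A(z) is the LPC analysis filter. This form, the function names, and the default factors 0.9 and 0.6 are assumptions for illustration; the abstract does not state the filter structure or the tuned values.

```python
def bandwidth_expand(lpc, gamma):
    """Return coefficients of A(z/gamma) given A(z) = 1 + a1 z^-1 + ... .

    lpc: list [1.0, a1, ..., ap] (assumed coefficient convention).
    """
    return [a * gamma ** k for k, a in enumerate(lpc)]


def perceptual_weighting(signal, lpc, gamma1=0.9, gamma2=0.6):
    """Filter `signal` through W(z) = A(z/gamma1) / A(z/gamma2).

    gamma1 and gamma2 are placeholder weighting factors, not tuned values.
    """
    num = bandwidth_expand(lpc, gamma1)  # FIR (numerator) part
    den = bandwidth_expand(lpc, gamma2)  # IIR (denominator) part
    out = []
    for n in range(len(signal)):
        acc = 0.0
        # Feed-forward terms from the input signal.
        for k, b in enumerate(num):
            if n - k >= 0:
                acc += b * signal[n - k]
        # Feedback terms from previous outputs (den[0] is 1 by convention).
        for k in range(1, len(den)):
            if n - k >= 0:
                acc -= den[k] * out[n - k]
        out.append(acc)
    return out


if __name__ == "__main__":
    lpc = [1.0, -0.5]          # toy first-order predictor
    impulse = [1.0, 0.0, 0.0]
    print(perceptual_weighting(impulse, lpc))
```

In a transcoder like the one described, the two factors would presumably be looked up per source and destination codec pair rather than fixed as defaults; that lookup is not shown here.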

Proceedings ArticleDOI
19 Apr 2015
TL;DR: A novel technique is presented to efficiently mix traditional ACELP time domain coding with a frequency domain coding model to improve the quality of generic audio signals coded at low bitrates without additional delay.
Abstract: In this paper a novel technique is presented to efficiently mix traditional ACELP time domain coding with a frequency domain coding model to improve the quality of generic audio signals coded at low bitrates without additional delay. The paper discusses how to integrate parts of a traditional Algebraic Code Excited Linear Prediction (ACELP) speech codec to create a time-domain contribution which coexists with a frequency based coding model. A mechanism to determine the value of the time-domain contribution is proposed, and a method is described for adding the frequency-domain contribution without increasing the overall delay of the codec. The proposed method forms part of the recently standardised 3GPP EVS codec.

5 citations


Network Information
Related Topics (5)
Decoding methods
65.7K papers, 900K citations
83% related
Data compression
43.6K papers, 756.5K citations
83% related
Signal processing
73.4K papers, 983.5K citations
83% related
Feature vector
48.8K papers, 954.4K citations
80% related
Feature extraction
111.8K papers, 2.1M citations
79% related
Performance
Metrics
No. of papers in the topic in previous years
Year    Papers
2022    6
2021    3
2020    7
2019    15
2018    10
2017    13