scispace - formally typeset
Search or ask a question
Topic

Code-excited linear prediction

About: Code-excited linear prediction is a research topic. Over the lifetime, 2025 publications have been published within this topic receiving 28633 citations. The topic is also known as: CELP.


Papers
More filters
Posted Content
TL;DR: The experimental results show that the proposed simple loss function improves the speech enhancement performance compared to a reference DNN with MSE loss in terms of perceptual quality and noise attenuation.
Abstract: Single-channel speech enhancement with deep neural networks (DNNs) has shown promising performance and is thus intensively being studied. In this paper, instead of applying the mean squared error (MSE) as the loss function during DNN training for speech enhancement, we design a perceptual weighting filter loss motivated by the weighting filter as it is employed in analysis-by-synthesis speech coding, e.g., in code-excited linear prediction (CELP). The experimental results show that the proposed simple loss function improves the speech enhancement performance compared to a reference DNN with MSE loss in terms of perceptual quality and noise attenuation. The proposed loss function can be advantageously applied to an existing DNN-based speech enhancement system, without modification of the DNN topology for speech enhancement. The source code for the proposed approach is made available.

3 citations

Dissertation
01 Jan 2003
TL;DR: A novel preprocessing method, which detects irregular pitch variations and modifies the speech signal such as to improve the accuracy of the pitch estimation, is proposed, which results in more regular speech while maintaining perceptual speech quality.
Abstract: Advances in digital technology in the last decade have motivated the development of very efficient and high quality speech compression algorithms. While in the early low bit rate coding systems, the main target was the production of intelligible speech at low bit rates, expansion of new applications such as mobile satellite systems increased the demand for reducing the transmission bandwidth and achieving higher speech quality. This resulted in the development of efficient parametric models for speech production system. These models were the basis of powerful speech compression algorithms such as CELP, MBE, MELP and WI. The performance of a speech coder not only depends on the speech production model employed but also on the accurate estimation of speech parameters. Periodicity, also known as pitch, is one of the speech parameters that greatly affect the synthesised speech quality. Thus, the subject of pitch determination has attracted much research in the area of low bit rate coding. In these studies it is assumed that for a short segment of speech, called frame, the pitch is fixed or smoothly evolving. The pitch estimation algorithms generally fail to determine irregular variations, which can occur at onset and offset speech segments. In order to overcome this problem, a novel preprocessing method, which detects irregular pitch variations and modifies the speech signal such as to improve the accuracy of the pitch estimation, is proposed. This method results in more regular speech while maintaining perceptual speech quality. The perceptual quality of the synthesised speech may also be improved using postfiltering techniques. Conventional postfiltering methods generally consider the enhancement of the whole speech spectrum. This may result in the broadening of the first formant, which leads to the increase of quantisation noise for this formant. A new postfiltering technique, which is based on factorising the linear prediction synthesis filter, is proposed. This provides more control over the formant bandwidth and attenuation of spectral speech valleys. Key words: Pitch smoothing, speech pre-processor, postfiltering.

3 citations

Proceedings ArticleDOI
13 Jun 2000
TL;DR: A novel a audio/speech coding algorithm, hybrid audio coding (HAC), which provides a good quality at a bit rate of 8 to 16 kbps and it is proven that the developed algorithm is effective for both audio and speech signals.
Abstract: A novel a audio/speech coding algorithm, hybrid audio coding (HAC) is described. New features of the algorithm include window switching with generalized MDCT, an improved quantization scheme of the MDCT coefficients, and waveform normalization in the time domain. HAC provides a good quality at a bit rate of 8 to 16 kbps, and it is also proven that the developed algorithm is effective for both audio and speech signals.

3 citations

Proceedings ArticleDOI
01 Jan 2007
TL;DR: Five existing alternative LP methods are described and illustrated how they all attempt to solve the observed frequency distribution problem by examining audio LP from a spectral estimation point of view.
Abstract: Linear prediction (LP) is a valuable tool for speech analysis and coding, due to the efficiency of the autoregressive model for speech signals. In audio analysis and coding, the sinusoidal model is much more popular, which is partly due to the poor performance of audio LP. By examining audio LP from a spectral estimation point of view, we observe that the distribution of the audio signal’s dominant frequencies in the Nyquist interval is a critical factor determining LP performance. In this framework, we describe five existing alternative LP methods and illustrate how they all attempt to solve the observed frequency distribution problem.

3 citations

Proceedings ArticleDOI
30 Oct 1995
TL;DR: Simulation results for the several techniques for efficient variable-rate coding and the analysis cues from which these coding modes are derived are provided.
Abstract: The development of variable-rate speech coding techniques has recently become an area of significant research interest, and several variable-rate schemes have been discussed in the literature. Here, we discuss general techniques for efficient variable-rate coding and the analysis cues from which these coding modes are derived. These techniques include fixed-update, variable-allocation and variable-update, fixed-allocation approaches to formant and pitch filter encoding. Since code-excited linear prediction (CELP) is popular for communications quality speech coding below 8 kbps, we provide simulation results for the several techniques in the context of variable-rate CELP coders with average encoded rate below 2 kbps.

3 citations


Network Information
Related Topics (5)
Decoding methods
65.7K papers, 900K citations
83% related
Data compression
43.6K papers, 756.5K citations
83% related
Signal processing
73.4K papers, 983.5K citations
83% related
Feature vector
48.8K papers, 954.4K citations
80% related
Feature extraction
111.8K papers, 2.1M citations
79% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20226
20213
20207
201915
201810
201713