scispace - formally typeset
Open Access

Bit rate reduction in CELP coders by applying Vocal Tract Filter similarity behaviour

TLDR
The proposed method has brought down the bit rate of CELP to 6.4 kbps or reduced the bit requirement by 12% without compromising on the perceptual quality of reconstructed speech.
Abstract
Speech is a highly complex and dynamic acoustic wave produced by the vocal tract as a result of the excitation in the form of air expelled from lungs The vocal tract characteristics vary in different manner during production of various speech categories This time variant acoustic filter has been represented by a Linear Prediction (LP) filter in Speech Production Model based on which Code Excited Linear Prediction (CELP) and many other speech coders are built The periodic nature of voiced speech due to vocal chord vibration causes slow variation for vocal tract characteristics and thus, similarity exists among nearby portions of voiced speech This similarity property is explored to reduce the count of transmitted Linear Predictive Coding (LPC) coefficients and excitation that are bit consuming and also significant parameters of LP filter This has been implemented in 73 kbps CELP by determining appropriate threshold for similarity values of both parameters The proposed method has brought down the bit rate of CELP to 64 kbps or reduced the bit requirement by 12% without compromising on the perceptual quality of reconstructed speech

read more

References
More filters
Proceedings ArticleDOI

Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs

TL;DR: A new model has been developed for use across a wider range of network conditions, including analogue connections, codecs, packet loss and variable delay, known as perceptual evaluation of speech quality (PESQ).
Proceedings ArticleDOI

A new model of LPC excitation for producing natural-sounding speech at low bit rates

B. Atal, +1 more
TL;DR: This paper describes a new approach to the excitation problem that does not require a priori knowledge of either the voiced-unvoiced decision or the pitch period, and minimizes a perceptual-distance metric representing subjectively-important differences between the waveforms of the original and the synthetic speech signals.
Journal ArticleDOI

Speech coding: a tutorial review

TL;DR: The objective of this paper is to provide a tutorial overview of speech coding methodologies with emphasis on those algorithms that are part of the recent low-rate standards for cellular communications.
Journal ArticleDOI

A mixed excitation LPC vocoder model for low bit rate speech coding

TL;DR: A new mixed excitation LPC vocoder model is presented that preserves the low bit rate of a fully parametric model but adds more free parameters to the excitation signal so that the synthesizer can mimic more characteristics of natural human speech.
Related Papers (5)