Bit rate reduction in CELP coders by applying Vocal Tract Filter similarity behaviour

Open Access

Bit rate reduction in CELP coders by applying Vocal Tract Filter similarity behaviour

- Vol. 2019, pp 1741-1745

TLDR

The proposed method has brought down the bit rate of CELP to 6.4 kbps or reduced the bit requirement by 12% without compromising on the perceptual quality of reconstructed speech.

Abstract:

Speech is a highly complex and dynamic acoustic wave produced by the vocal tract as a result of the excitation in the form of air expelled from lungs The vocal tract characteristics vary in different manner during production of various speech categories This time variant acoustic filter has been represented by a Linear Prediction (LP) filter in Speech Production Model based on which Code Excited Linear Prediction (CELP) and many other speech coders are built The periodic nature of voiced speech due to vocal chord vibration causes slow variation for vocal tract characteristics and thus, similarity exists among nearby portions of voiced speech This similarity property is explored to reduce the count of transmitted Linear Predictive Coding (LPC) coefficients and excitation that are bit consuming and also significant parameters of LP filter This has been implemented in 73 kbps CELP by determining appropriate threshold for similarity values of both parameters The proposed method has brought down the bit rate of CELP to 64 kbps or reduced the bit requirement by 12% without compromising on the perceptual quality of reconstructed speech

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs

Antony William Rix, +3 more

TL;DR: A new model has been developed for use across a wider range of network conditions, including analogue connections, codecs, packet loss and variable delay, known as perceptual evaluation of speech quality (PESQ).

...read moreread less

Darpa Timit Acoustic-Phonetic Continuous Speech Corpus CD-ROM {TIMIT} | NIST

John S. Garofolo, +5 more

Proceedings ArticleDOI

A new model of LPC excitation for producing natural-sounding speech at low bit rates

B. Atal, +1 more

TL;DR: This paper describes a new approach to the excitation problem that does not require a priori knowledge of either the voiced-unvoiced decision or the pitch period, and minimizes a perceptual-distance metric representing subjectively-important differences between the waveforms of the original and the synthetic speech signals.

...read moreread less

Journal ArticleDOI

Speech coding: a tutorial review

Andreas Spanias

TL;DR: The objective of this paper is to provide a tutorial overview of speech coding methodologies with emphasis on those algorithms that are part of the recent low-rate standards for cellular communications.

...read moreread less

Journal ArticleDOI

A mixed excitation LPC vocoder model for low bit rate speech coding

Alan V. McCree, +1 more

- 01 Jul 1995 -

IEEE Transactions on Speech and Audio Pr...

TL;DR: A new mixed excitation LPC vocoder model is presented that preserves the low bit rate of a fully parametric model but adds more free parameters to the excitation signal so that the synthesizer can mimic more characteristics of natural human speech.

...read moreread less