Proceedings ArticleDOI

Neural code-excited linear prediction for low power speech compression

S. Kamarsu, +1 more
Vol. 2, pp. 415–420
TLDR
This paper discusses the use of artificial neural learning methods for low bit-rate speech compression, potentially in non-stationary environments, and employs two unsupervised learning algorithms: frequency-sensitive competitive learning and Kohonen's self-organizing maps.
Abstract
In this paper, we discuss the use of artificial neural learning methods for low bit-rate speech compression, potentially in non-stationary environments. Unsupervised learning algorithms are particularly well suited for vector quantization (VQ), which is used in many speech compression applications. We discuss two unsupervised learning algorithms, frequency-sensitive competitive learning and Kohonen's self-organizing maps, both of which have been investigated for learning the codebook vectors in an adaptive vector quantizer. In contrast with earlier work, we have employed these learning rules in VQ of the linear predictive coding (LPC) prediction residual. The performance of these unsupervised learning algorithms in speaker-dependent and speaker-independent speech compression is presented. Our results compare favourably with those of code-excited linear prediction (CELP), requiring less computational power at the cost of a tolerable reduction in speech quality. We also explore the effects of limited precision on classification and learning in competitive learning algorithms for low power VLSI implementations.
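To make the frequency-sensitive competitive learning step concrete, the sketch below trains a VQ codebook on LPC residual vectors in NumPy. The codebook size, learning rate, epoch count, and initialization are illustrative assumptions, not the configuration reported in the paper.

```python
import numpy as np

def fscl_train(residuals, codebook_size=256, epochs=5, lr=0.05, seed=0):
    """Frequency-sensitive competitive learning for VQ codebook design.

    Each codeword's distance to the input is scaled by its win count,
    so rarely chosen codewords stay competitive and the codebook is
    utilized more uniformly.
    """
    residuals = np.asarray(residuals, dtype=float)
    rng = np.random.default_rng(seed)
    # Initialize codewords from randomly chosen training vectors.
    idx = rng.choice(len(residuals), codebook_size, replace=False)
    codebook = residuals[idx].copy()
    wins = np.ones(codebook_size)

    for _ in range(epochs):
        for x in residuals:
            # Fairness-weighted distortion: win count times squared distance.
            d2 = np.sum((codebook - x) ** 2, axis=1)
            winner = np.argmin(wins * d2)
            wins[winner] += 1
            # Winner-take-all update: move only the winner toward the input.
            codebook[winner] += lr * (x - codebook[winner])
    return codebook

def vq_encode(x, codebook):
    """Transmit only the index of the nearest codeword."""
    return int(np.argmin(np.sum((codebook - x) ** 2, axis=1)))
```

Encoding reduces to a nearest-neighbour search and training touches only one codeword per input, which keeps the arithmetic simple enough for the limited-precision, low-power VLSI setting the abstract mentions.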


Citations
Proceedings ArticleDOI

Adaptive compression of animated sequences

TL;DR: This investigation indicates that a compression ratio of better than 100:1 is a reasonable expectation for highly repetitious video signals such as those found in animated cartoons.
References
Journal Article

Vector quantization

TL;DR: During the past few years several design algorithms have been developed for a variety of vector quantizers and the performance of these codes has been studied for speech waveforms, speech linear predictive parameter vectors, images, and several simulated random processes.
Proceedings ArticleDOI

Code-excited linear prediction (CELP): High-quality speech at very low bit rates

TL;DR: A code-excited linear predictive coder is described in which the optimum innovation sequence is selected from a codebook of stored sequences to optimize a given fidelity criterion; the results indicate that a random codebook has a slight speech-quality advantage at low bit rates.
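For contrast with the neural VQ approach, here is a minimal sketch of the analysis-by-synthesis search that defines CELP: each stored innovation sequence is passed through the synthesis filter, given its optimal gain, and scored by squared error against the target subframe. The filter handling and gain computation are simplified assumptions, not the coder from this reference.

```python
import numpy as np
from scipy.signal import lfilter

def celp_search(target, codebook, lpc_coeffs):
    """Exhaustive codebook search for the best innovation sequence.

    target:     one subframe of (perceptually weighted) speech samples
    codebook:   (K, N) array of stored innovation sequences
    lpc_coeffs: short-term predictor coefficients a_1..a_p
    """
    # Synthesis filter 1/A(z) with A(z) = 1 - sum_i a_i z^-i.
    a = np.concatenate(([1.0], -np.asarray(lpc_coeffs, dtype=float)))
    best = (None, 0.0, np.inf)  # (index, gain, error)
    for k, c in enumerate(codebook):
        y = lfilter([1.0], a, c)                           # synthesize candidate
        g = np.dot(target, y) / max(np.dot(y, y), 1e-12)   # optimal gain
        err = float(np.sum((target - g * y) ** 2))
        if err < best[2]:
            best = (k, g, err)
    return best[0], best[1]
```

Filtering every candidate through 1/A(z) is what makes the exhaustive CELP search expensive, and it is this cost that VQ of the residual with a learned codebook is meant to reduce.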
Journal ArticleDOI

Competitive learning algorithms for vector quantization

TL;DR: A new competitive-learning algorithm based on the “conscience” learning method is introduced and shown to be efficient, yielding near-optimal results in vector quantization for data compression.
Journal ArticleDOI

Neural networks for vector quantization of speech and images

TL;DR: The authors show how a collection of neural units can be used efficiently for VQ encoding, with the units performing the bulk of the computation in parallel, and describe two unsupervised neural network learning algorithms for training the vector quantizer.
Journal ArticleDOI

Pitch prediction filters in speech coding

TL;DR: It is found that the F-P cascade (formant filter before the pitch filter) outperforms the P-F cascade for both transversal- and lattice-structured predictors.
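As a small illustration of that ordering, the sketch below computes the F-P cascade residual: a transversal short-term (formant) predictor is applied first, then a one-tap long-term (pitch) predictor at lag T. The one-tap pitch filter and fixed, precomputed coefficients are simplifying assumptions.

```python
import numpy as np

def fp_cascade_residual(speech, formant_coeffs, pitch_lag, pitch_gain):
    """Residual after formant-then-pitch (F-P) prediction.

    formant_coeffs: short-term predictor taps a_1..a_p
    pitch_lag:      estimated pitch period T in samples
    pitch_gain:     one-tap long-term predictor gain b
    """
    s = np.asarray(speech, dtype=float)
    # Short-term residual d[n] = s[n] - sum_i a_i * s[n - i]
    d = s.copy()
    for i, a in enumerate(formant_coeffs, start=1):
        d[i:] -= a * s[:-i]
    # Long-term residual e[n] = d[n] - b * d[n - T]
    e = d.copy()
    e[pitch_lag:] -= pitch_gain * d[:-pitch_lag]
    return e
```

Swapping the two stages (P-F) would instead predict pitch on the raw speech and formants on its residual; the reference's finding is that the F-P order shown here gives the better prediction gain.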