Speech Coding: Fundamentals and Applications

doi:10.1002/0471219282.EOT156

Reference Entry

TLDR

An overview of the most widely used algorithms, standards, and applications of wideband and narrowband speech coding, including perceptually transparent multi-rate and embedded coding, is presented.

Abstract:

In this chapter, we present an overview of the most widely used algorithms, standards, and applications of wideband and narrowband speech coding. Algorithms for speech coding are classified under four broad headings: (1) waveform coding techniques (including PCM, companded PCM, and DPCM), which are typically used for landline telephony, internet telephony, and secure military communications; (2) subband coding, including perceptually transparent multirate and embedded coding, which is mainly used for internet and digital audio applications; (3) linear predictive analysis-by-synthesis (LPC-AS) coding algorithms, including multipulse LPC, CELP, SELP, VSELP, and low-delay CELP, which are typically used for digital cellular and telephony; and (4) LPC vocoders, including advanced vocoder algorithms (e.g., MELP, MBE, and PWI), which are used for applications such as secure telephony and satellite telephony. Applications in areas such as voice over IP (VoIP) and digital cellular are emerging and require a speech coder to adapt gracefully to rapidly changing channel conditions, a need that is met by embedded and multirate speech coders associated with joint source-channel coding algorithms. Measures of speech coder perceptual quality include subjective measures of intelligibility (DRT and DALT) and naturalness (MOS and DAM), as well as objective measures such as segmental SNR, Bark spectral distortion, PSQM, and PESQ. Speech coding standards are set by organizations including the ITU (for landline telephony), MPEG (for multimedia applications), ETSI (for European digital cellular), TIA (for U.S. digital cellular), and DDVPC (for U.S. military applications).

Keywords: speech coding; PCM; subband coding; CELP; LPC; digital cellular; multimedia; voice over IP; mean opinion score (MOS)
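To make two of the terms in the abstract concrete, the following is a minimal sketch of companded PCM (a waveform coder) evaluated with segmental SNR (an objective quality measure). It is an illustration under stated assumptions, not the chapter's own implementation: the continuous mu-law formula with mu = 255, the 8-bit uniform quantizer, the 160-sample frame length (20 ms at an assumed 8 kHz sampling rate), and all function names are choices made here. Deployed codecs such as G.711 use a segmented approximation of the companding curve, and practical segmental SNR variants often clamp per-frame values to a fixed range.

```python
import math

MU = 255.0  # assumed companding constant; not taken from the chapter

def mu_law_compress(x, mu=MU):
    """Map a sample in [-1, 1] to the companded domain [-1, 1]."""
    return math.copysign(math.log1p(mu * abs(x)) / math.log1p(mu), x)

def mu_law_expand(y, mu=MU):
    """Inverse of mu_law_compress."""
    return math.copysign(math.expm1(abs(y) * math.log1p(mu)) / mu, y)

def quantize_uniform(v, bits):
    """Uniform mid-rise quantizer on [-1, 1] with 2**bits levels."""
    levels = 2 ** bits
    step = 2.0 / levels
    idx = min(levels - 1, max(0, int(math.floor((v + 1.0) / step))))
    return -1.0 + (idx + 0.5) * step

def linear_pcm(signal, bits=8):
    """Quantize samples directly (plain PCM)."""
    return [quantize_uniform(x, bits) for x in signal]

def companded_pcm(signal, bits=8):
    """Compress, quantize uniformly, then expand (companded PCM)."""
    return [mu_law_expand(quantize_uniform(mu_law_compress(x), bits))
            for x in signal]

def segmental_snr(ref, deg, frame_len=160):
    """Average of per-frame SNRs in dB over non-overlapping frames.

    Frames with zero signal energy or zero error are skipped.
    """
    snrs = []
    for start in range(0, len(ref) - frame_len + 1, frame_len):
        r = ref[start:start + frame_len]
        d = deg[start:start + frame_len]
        sig = sum(a * a for a in r)
        err = sum((a - b) ** 2 for a, b in zip(r, d))
        if sig > 0 and err > 0:
            snrs.append(10.0 * math.log10(sig / err))
    return sum(snrs) / len(snrs)

# Low-level 200 Hz tone sampled at an assumed 8 kHz: quiet signals are
# where companding is expected to help most.
signal = [0.01 * math.sin(2 * math.pi * 200 * n / 8000) for n in range(1600)]
snr_linear = segmental_snr(signal, linear_pcm(signal))
snr_mulaw = segmental_snr(signal, companded_pcm(signal))
print(f"linear PCM: {snr_linear:.1f} dB, mu-law PCM: {snr_mulaw:.1f} dB")
```

For this quiet test tone the companded coder should report a clearly higher segmental SNR than linear PCM at the same bit depth, because the logarithmic curve allocates finer quantization steps near zero amplitude. Segmental SNR is only a rough proxy for the perceptual measures listed in the abstract (MOS, PSQM, PESQ).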
