Topic

Speech coding

About: Speech coding is a research topic. Over the lifetime, 14245 publications have been published within this topic receiving 271964 citations.

...read moreread less

Papers published on a yearly basis

1 / 2

Papers

PDF

Open Access

More filters

Patent•

Improving sound quality of established low bit-rate audio coding systems without loss of decoder compatibility

[...]

Yu-Li You¹, William Paul Smith¹, Zoran Fejzo¹, Stephen Smyth¹•Institutions (1)

DTS¹

19 Jun 2000

TL;DR: In this paper, a multi-channel audio compression technology is presented that extends the range of sampling frequencies compared to existing technologies and/or lowers the noise floor while remaining compatible with those earlier generation technologies.

...read moreread less

Abstract: A multi-channel audio compression technology is presented that extends the range of sampling frequencies compared to existing technologies and/or lowers the noise floor while remaining compatible with those earlier generation technologies. The high-sampling frequency multi-channel audio (12) is decomposed into core audio up to the existing sampling frequencies and a difference signal up to the sampling frequencies of the next generation technologies. The core audio is encoded (18) using the first generation technology such as DTS, DOLBY AC-3 or MPEG I or MPEG II such that the encoded core bit stream (20) is fully compatible with a comparable decoder in the market. The difference signal (34) is encoded (36) using technologies that extend the sampling frequency and/or improve the quality of the core audio. The compressed difference signal (38) is attached as an extension to the core bit stream (20). The extension data will be ignored by the first generation decoders but can be decoded by the second generation decoders. By summing the decoded core and extension audio signals together (28), a second generation decoder can effectively extend the audio signal bandwidth and/or improve the signal to noise ratio beyond that available through the core decoder alone.

...read moreread less

176 citations

Patent•DOI•

Reducing noise in audio systems

[...]

Gary W. Elko

05 Feb 2003-Journal of the Acoustical Society of America

TL;DR: In this paper, two or more microphones receive acoustic signals and generate audio signals that are processed to determine what portion of the audio signals result from incoherence between audio signals and/or audio-signal sources having propagation speeds different from the acoustic signals.

...read moreread less

Abstract: Two or more microphones receive acoustic signals and generate audio signals that are processed to determine what portion of the audio signals result from (i) incoherence between the audio signals and/or (ii) audio-signal sources having propagation speeds different from the acoustic signals. The audio signals are filtered to reduce that portion of one or more of the audio signals. The present invention can be used to reduce turbulent wind-noise resulting from wind or other airjets blowing across the microphones. Time-dependent phase and amplitude differences between the microphones can be compensated for based on measurements made in parallel with routine audio system processing.

...read moreread less

176 citations

Proceedings Article•DOI•

Noise-dependent postfiltering

[...]

V. Grancharov, J. Samuelsson, W.B. Kleijn

17 May 2004

TL;DR: The paper introduces a modification of the commonly used postfilter that improves performance when acoustic background noise is present by replacing the nonadaptive postfilter parameters that govern the degree of spectral emphasis with parameters that adapt to the noise statistics.

...read moreread less

Abstract: The paper introduces a modification of the commonly used postfilter that improves performance when acoustic background noise is present. The modification consists of replacing the nonadaptive postfilter parameters that govern the degree of spectral emphasis (commonly denoted as /spl gamma//sub 1/ and /spl gamma//sub 2/) with parameters that adapt to the noise statistics. We describe an effective mapping from the noise statistics to the emphasis parameters and provide a low complexity noise estimation algorithm that is sufficient for this application. The resulting noise-adaptive postfilter successfully attenuates the background noise and naturally converges to the conventional postfilter at high SNR conditions. Thus, the speech enhancement problem is solved with minimal modification of legacy codecs, since the existing structure of the speech codec is used. Test results indicate that the presented algorithm significantly outperforms the standard postfilter with non-adaptive parameters.

...read moreread less

175 citations

Proceedings Article•DOI•

Image coding using vector quantization

[...]

Allen Gersho¹, Bhaskar Ramamurthi•Institutions (1)

University of California, Santa Barbara¹

01 May 1982

TL;DR: Vector quantization appears to be a powerful and promising technique for image coding and results for coding rates from 0.5 to 1.5 bits/pixel are discussed.

...read moreread less

Abstract: An image is partitioned into cells of pxp pixels. Each cell is regarded as a vector of dimension p2and is encoded by searching through a codebook for a nearest matching representative vector. A binary word identifying the selected representative vector is assigned as the codeword to describe the original cell. The decoder uses this codeword to address a codebook. Each entry of the codebook contains a full precision digital representation of one of the N representative vectors. The codebook design is based on a clustering technique for vector quantizer design preceded by a classification of training cells into edge or shade cells. Results for coding rates from 0.5 to 1.5 bits/pixel are discussed. Vector quantization appears to be a powerful and promising technique for image coding.

...read moreread less

175 citations

Patent•

Multi-mode audio recognition and auxiliary data encoding and decoding

[...]

Ravi K. Sharma, Brett A. Bradley, Yang Bai, Shankar Thagadur Shivappa, Ajith Kamath, Aparna Gurijala, Tomas Filler, David A. Cushman - Show less +4 more

15 Mar 2013

TL;DR: In this article, perceptual and robustness evaluation is integrated into audio watermark embedding to optimize audio quality relative the original signal, and to optimize robustness or data capacity, which is applied to audio segments in audio embedder and detector configurations to support real-time operation.

...read moreread less

Abstract: Audio signal processing enhances audio watermark embedding and detecting processes. Audio signal processes include audio classification and adapting watermark embedding and detecting based on classification. Advances in audio watermark design include adaptive watermark signal structure data protocols, perceptual models, and insertion methods. Perceptual and robustness evaluation is integrated into audio watermark embedding to optimize audio quality relative the original signal, and to optimize robustness or data capacity. These methods are applied to audio segments in audio embedder and detector configurations to support real time operation. Feature extraction and matching are also used to adapt audio watermark embedding and detecting.

...read moreread less

174 citations

Collapse

Network Information

Performance

Metrics

14,368

Papers

279,843

Citations

No. of papers in the topic in previous years
Year	Papers
2023	38
2022	84
2021	70
2020	62
2019	77
2018	108

Speech coding

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics