scispace - formally typeset
Search or ask a question
Topic

Speech coding

About: Speech coding is a research topic. Over the lifetime, 14245 publications have been published within this topic receiving 271964 citations.


Papers
More filters
Journal ArticleDOI
01 Apr 1976
TL;DR: The resulting system serves as a model for the cognitive process of reading aloud, and also as a stable practical means for providing speech output in a broad class of computer-based systems.
Abstract: For many applications, it is desirable to be able to convert arbitrary English text to natural and intelligible sounding speech. This transformation between two surface forms is facilitated by first obtaining the common underlying abstract linguistic representation which relates to both text and speech surface representations. Calculation of these abstract bases then permits proper selection of phonetic segments, lexical stress, juncture, and sentence-level stress and intonation. The resulting system serves as a model for the cognitive process of reading aloud, and also as a stable practical means for providing speech output in a broad class of computer-based systems.

116 citations

Proceedings ArticleDOI
06 Jul 2003
TL;DR: A novel gender identification approach based on a general audio classifier that shows robustness to adverse audio compression and it is language independent is introduced.
Abstract: In the context of content-based multimedia indexing gender identification using speech signal is an important task. Existing techniques are dependent on the quality of the speech signal making them unsuitable for the video indexing problems. In this paper we introduce a novel gender identification approach based on a general audio classifier. The audio classifier models the audio signal by the first order spectrum's statistics in 1s windows and uses a set of neural networks as classifiers. The presented technique shows robustness to adverse audio compression and it is language independent. We show how practical considerations about the speech in audio-visual data, such as the continuity of speech, can further improve the classification results which attain 92%.

116 citations

Proceedings ArticleDOI
TL;DR: Experimental results show that the proposed technique can be used to detect the presence of hidden messages in digital audio data.
Abstract: Classification of audio documents as bearing hidden information or not is a security issue addressed in the context of steganalysis. A cover audio object can be converted into a stego-audio object via steganographic methods. In this study we present a statistical method to detect the presence of hidden messages in audio signals. The basic idea is that, the distribution of various statistical distance measures, calculated on cover audio signals and on stego-audio signals vis-a-vis their denoised versions, are statistically different. The design of audio steganalyzer relies on the choice of these audio quality measures and the construction of a two-class classifier. Experimental results show that the proposed technique can be used to detect the presence of hidden messages in digital audio data.

116 citations

Book
09 Feb 2007
TL;DR: This chapter discusses signal processing Essentials, audio Coding Standards and Algorithms, and quality measures for Perceptual Audio Coding.
Abstract: Preface. 1. Introduction. 2. Signal Processing Essentials. 3. Quantization and Entropy Coding. 4. Linear Prediction in Narrowband and Wideband Coding. 5. Psychoacoustic Principles. 6. Time-Frequency Analysis: Filter Banks and Transforms. 7. Transform Coders. 8. Subband Coders. 9. Sinusoidal Coders. 10. Audio Coding Standards and Algorithms. 11. Lossless Audio Coding and Digital Watermarking. 12. Quality Measures for Perceptual Audio Coding. References. Index.

116 citations

Patent
Dipanjan Sen1, Pei Xiang1
15 Mar 2013
TL;DR: In this article, the backward compatible coding of a set of basis function coefficients that describe a sound field is presented, along with methods and apparatus for backward-compatible coding of the coefficients.
Abstract: Systems, methods, and apparatus for backward-compatible coding of a set of basis function coefficients that describe a sound field are presented.

115 citations


Network Information
Related Topics (5)
Signal processing
73.4K papers, 983.5K citations
86% related
Decoding methods
65.7K papers, 900K citations
84% related
Fading
55.4K papers, 1M citations
80% related
Feature vector
48.8K papers, 954.4K citations
80% related
Feature extraction
111.8K papers, 2.1M citations
80% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202338
202284
202170
202062
201977
2018108