Topic
Speech coding
About: Speech coding is a research topic. Over the lifetime, 14245 publications have been published within this topic receiving 271964 citations.
Papers published on a yearly basis
Papers
More filters
••
01 Apr 1976TL;DR: The resulting system serves as a model for the cognitive process of reading aloud, and also as a stable practical means for providing speech output in a broad class of computer-based systems.
Abstract: For many applications, it is desirable to be able to convert arbitrary English text to natural and intelligible sounding speech. This transformation between two surface forms is facilitated by first obtaining the common underlying abstract linguistic representation which relates to both text and speech surface representations. Calculation of these abstract bases then permits proper selection of phonetic segments, lexical stress, juncture, and sentence-level stress and intonation. The resulting system serves as a model for the cognitive process of reading aloud, and also as a stable practical means for providing speech output in a broad class of computer-based systems.
116 citations
••
06 Jul 2003TL;DR: A novel gender identification approach based on a general audio classifier that shows robustness to adverse audio compression and it is language independent is introduced.
Abstract: In the context of content-based multimedia indexing gender identification using speech signal is an important task. Existing techniques are dependent on the quality of the speech signal making them unsuitable for the video indexing problems. In this paper we introduce a novel gender identification approach based on a general audio classifier. The audio classifier models the audio signal by the first order spectrum's statistics in 1s windows and uses a set of neural networks as classifiers. The presented technique shows robustness to adverse audio compression and it is language independent. We show how practical considerations about the speech in audio-visual data, such as the continuity of speech, can further improve the classification results which attain 92%.
116 citations
••
TL;DR: Experimental results show that the proposed technique can be used to detect the presence of hidden messages in digital audio data.
Abstract: Classification of audio documents as bearing hidden information or not is a security issue addressed in the context of steganalysis. A cover audio object can be converted into a stego-audio object via steganographic methods. In this study we present a statistical method to detect the presence of hidden messages in audio signals. The basic idea is that, the distribution of various statistical distance measures, calculated on cover audio signals and on stego-audio signals vis-a-vis their denoised versions, are statistically different. The design of audio steganalyzer relies on the choice of these audio quality measures and the construction of a two-class classifier. Experimental results show that the proposed technique can be used to detect the presence of hidden messages in digital audio data.
116 citations
•
09 Feb 2007
TL;DR: This chapter discusses signal processing Essentials, audio Coding Standards and Algorithms, and quality measures for Perceptual Audio Coding.
Abstract: Preface. 1. Introduction. 2. Signal Processing Essentials. 3. Quantization and Entropy Coding. 4. Linear Prediction in Narrowband and Wideband Coding. 5. Psychoacoustic Principles. 6. Time-Frequency Analysis: Filter Banks and Transforms. 7. Transform Coders. 8. Subband Coders. 9. Sinusoidal Coders. 10. Audio Coding Standards and Algorithms. 11. Lossless Audio Coding and Digital Watermarking. 12. Quality Measures for Perceptual Audio Coding. References. Index.
116 citations
•
15 Mar 2013TL;DR: In this article, the backward compatible coding of a set of basis function coefficients that describe a sound field is presented, along with methods and apparatus for backward-compatible coding of the coefficients.
Abstract: Systems, methods, and apparatus for backward-compatible coding of a set of basis function coefficients that describe a sound field are presented.
115 citations