Topic
Speech coding
About: Speech coding is a research topic. Over its lifetime, 14,245 publications have been published within this topic, receiving 271,964 citations.
Papers
••
27 Dec 2005
TL;DR: The potential of perceptive speech analysis and processing in combination with biologically plausible neural network processors is discussed and a preliminary test with recognition of French spoken digits from a small speech database is illustrated.
Abstract: Speech recognition is very difficult in the context of noisy and corrupted speech. Most conventional techniques need huge databases to estimate speech (or noise) probability densities to perform recognition. We discuss the potential of perceptive speech analysis and processing in combination with biologically plausible neural network processors. We illustrate the potential of such non-linear processing of speech by means of a preliminary test with recognition of French spoken digits from a small speech database.
94 citations
••
10 Apr 1978
TL;DR: Improved speech quality is obtained a) by efficient removal of formant and pitch related redundant structure of speech before quantizing and b) by effective masking of the quantizer noise by the speech signal.
Abstract: Predictive coding methods attempt to minimize the r.m.s. error in the coded signal. However, the human ear does not perceive signal distortion on the basis of r.m.s. error regardless of its spectral shape relative to the signal spectrum. Specifically, for speech signals, the locations of the formant frequencies and their rates of change with time influence the audibility, and thus the subjective distortion of any quantizing noise. In this paper, methods for reducing the subjective distortion in predictive coders for speech signals are described and evaluated. Improved speech quality is obtained a) by efficient removal of formant and pitch related redundant structure of speech before quantizing and b) by effective masking of the quantizer noise by the speech signal.
94 citations
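The first idea in the 1978 abstract above, removing formant-related redundancy before quantizing, can be illustrated with short-term linear prediction. The following is a minimal sketch, not the paper's coder: it uses the textbook autocorrelation method with the Levinson-Durbin recursion, omits pitch prediction and the noise-masking step entirely, and runs on a synthetic resonant signal. The helper names, the filter coefficients 1.6 and -0.8, and the frame length are all assumptions made for illustration.

```python
import random

def autocorrelation(x, max_lag):
    """Autocorrelation r[0..max_lag] of one frame (no windowing, for brevity)."""
    n = len(x)
    return [sum(x[i] * x[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]

def levinson_durbin(r, order):
    """Return (a, err) where x_hat[n] = sum(a[k-1] * x[n-k]) for k = 1..order."""
    a = [0.0] * (order + 1)          # a[0] is unused
    err = r[0]
    for i in range(1, order + 1):
        # Reflection coefficient for this order
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        prev = a[:]
        a[i] = k
        for j in range(1, i):
            a[j] = prev[j] - k * prev[i - j]
        err *= 1.0 - k * k
    return a[1:], err

def prediction_residual(x, a):
    """Prediction error: what remains to be quantized once the predictable part is removed."""
    p = len(a)
    return [x[n] - sum(a[k] * x[n - 1 - k] for k in range(p))
            for n in range(p, len(x))]

def make_resonant_signal(n, seed=0):
    """Hypothetical stand-in for a voiced frame: white noise through a single resonance."""
    rng = random.Random(seed)
    x = [0.0, 0.0]
    for _ in range(n):
        x.append(1.6 * x[-1] - 0.8 * x[-2] + rng.gauss(0.0, 1.0))
    return x[2:]

def energy(v):
    """Mean squared value of a sequence."""
    return sum(s * s for s in v) / len(v)

signal = make_resonant_signal(2000)
coeffs, _ = levinson_durbin(autocorrelation(signal, 2), order=2)
residual = prediction_residual(signal, coeffs)
prediction_gain = energy(signal) / energy(residual)
print(coeffs, prediction_gain)
```

The prediction gain (signal energy divided by residual energy) indicates how much redundancy the predictor removed; a gain well above 1 means the residual needs fewer bits than the raw signal for the same quantizer noise power. Shaping that noise so the speech masks it, the abstract's second idea, would need an additional weighting filter that this sketch does not attempt.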
••
TL;DR: A speech coder apparatus operates to compress speech signals to a low bit rate and includes a continuous speech recognizer (CSR) which has a memory for storing templates.
Abstract: A speech coder apparatus operates to compress speech signals to a low bit rate. The apparatus includes a continuous speech recognizer (CSR) which has a memory for storing templates. Input speech is processed by the CSR where information in the speech is compared against the templates to provide an output digital signal indicative of recognized words, which signal is transmitted along a first path. There is further included a front end processor which is also responsive to the input speech signal for providing output digitized speech samples during a given frame interval. A side information encoder circuit responds to the output from the front end processor to provide at the output of the encoder a parameter signal indicative of the value of the pitch and word duration for each word as recognized by the CSR unit. The output of the encoder is transmitted as a second signal. There is a receiver which includes a synthesizer responsive to the first and second transmitted signals for providing an output synthesized signal for each recognized word where the pitch, duration and amplitude of the synthesized signal are changed according to the parameter signal to preserve the quality of the synthesized speech.
94 citations
•
12 Mar 2009
TL;DR: A speech recognition system comprising a mobile device and a remote server is described; the mobile device receives the user's speech and extracts features and phonemes from it.
Abstract: A speech recognition system includes a mobile device and a remote server. The mobile device receives the speech from the user and extracts the features and phonemes from the speech. Selected phonemes and measures of uncertainty are transmitted to the server, which processes the phonemes for speech understanding and transmits a text of the speech (or the context or understanding of the speech) back to the mobile device.
94 citations
••
TL;DR: The proposed wavelet-threshold-based ECG signal compression technique, which uses a uniform scalar zero zone quantizer (USZZQ) and Huffman coding on the differencing significance map (DSM), achieves the required compression ratio with less reconstruction error for a GSM-based cellular telemedicine system.
94 citations
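The entry above names a uniform scalar quantizer with a zero zone applied to wavelet coefficients. The following is a minimal sketch of that general idea, assuming a one-level Haar transform and a generic dead-zone quantizer; it is not the paper's USZZQ, and the Huffman coding and differencing of the significance map are left out. The function names, the step size, and the sample values are all hypothetical.

```python
import math

def haar_step(x):
    """One level of the orthonormal Haar transform (len(x) must be even)."""
    s = math.sqrt(2.0)
    approx = [(x[i] + x[i + 1]) / s for i in range(0, len(x), 2)]
    detail = [(x[i] - x[i + 1]) / s for i in range(0, len(x), 2)]
    return approx, detail

def inverse_haar_step(approx, detail):
    """Invert haar_step, interleaving the reconstructed sample pairs."""
    s = math.sqrt(2.0)
    out = []
    for a, d in zip(approx, detail):
        out.extend([(a + d) / s, (a - d) / s])
    return out

def deadzone_quantize(c, step):
    """Uniform quantizer with a zero zone: any |c| < step maps to index 0."""
    return int(math.copysign(math.floor(abs(c) / step), c))

def deadzone_dequantize(q, step):
    """Reconstruct at the midpoint of the quantization bin; index 0 stays 0."""
    if q == 0:
        return 0.0
    return math.copysign((abs(q) + 0.5) * step, q)

samples = [0.0, 0.1, 0.1, 1.2, -0.4, 0.1, 0.0, 0.05]  # made-up values, not ECG data
step = 0.25                                            # hypothetical step size

approx, detail = haar_step(samples)
indices = [deadzone_quantize(d, step) for d in detail]
# Significance map: 1 where a detail coefficient survives the zero zone
significance_map = [1 if q != 0 else 0 for q in indices]
decoded = inverse_haar_step(approx, [deadzone_dequantize(q, step) for q in indices])
max_error = max(abs(s - d) for s, d in zip(samples, decoded))
print(indices, significance_map, max_error)
```

In this sketch the zero zone doubles as the threshold: small detail coefficients collapse to index 0, so only the significance map and the few nonzero indices would need to be entropy coded. The approximation coefficients are kept unquantized here purely to keep the example short.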