Proceedings Article
Efficient coding of LPC parameters by temporal decomposition
Bishnu S. Atal
- Vol. 8, pp 81-84
TLDR
The aim is to determine the extent to which the bit rate of LPC parameters can be reduced without sacrificing speech quality.
Abstract:
This paper describes a method for efficient coding of LPC log area parameters. It is now well recognized that sample-by-sample quantization of LPC parameters is not very efficient in minimizing the bit rate needed to code these parameters. Recent methods for reducing the bit rate have used vector and segment quantization methods. Much of the past work in this area has focussed on efficient coding of LPC parameters in the context of vocoders which put a ceiling on achievable speech quality. The results from these studies cannot be directly applied to synthesis of high quality speech. This paper describes a different approach to efficient coding of log area parameters. Our aim is to determine the extent to which the bit rate of LPC parameters can be reduced without sacrificing speech quality. Speech events occur generally at non-uniformly spaced time intervals. Moreover, some speech events are slow while others are fast. Uniform sampling of speech parameters is thus not efficient. We describe a non-uniform sampling and interpolation procedure for efficient coding of log area parameters. A temporal decomposition technique is used to represent the continuous variation of these parameters as a linearly-weighted sum of a number of discrete elementary components. The location and length of each component is automatically adapted to speech events. We find that each elementary component can be coded as a very low information rate signal.
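The linearly-weighted-sum model in the abstract can be illustrated with a minimal sketch of the synthesis side only. It assumes that each elementary component is a smooth bump with its own location and length, and that each component carries one weight per log area parameter. The raised-cosine shape, the function names `event_function` and `reconstruct`, and all numeric values are hypothetical and chosen for illustration; the abstract states that the location and length of each component are adapted automatically to speech events, which this sketch does not attempt.

```python
import math


def event_function(center, half_width, n_frames):
    """Hypothetical elementary component: a raised-cosine bump that is
    1 at `center` and 0 at `half_width` frames or more away from it."""
    phi = []
    for n in range(n_frames):
        d = abs(n - center)
        if d < half_width:
            phi.append(0.5 * (1.0 + math.cos(math.pi * d / half_width)))
        else:
            phi.append(0.0)
    return phi


def reconstruct(targets, events):
    """Linearly weighted sum: y[i][n] = sum over k of targets[k][i] * events[k][n]."""
    n_params = len(targets[0])
    n_frames = len(events[0])
    return [
        [sum(t[i] * phi[n] for t, phi in zip(targets, events))
         for n in range(n_frames)]
        for i in range(n_params)
    ]


# Three components at non-uniformly spaced locations with different lengths
# (made-up values, not taken from the paper).
n_frames = 40
events = [
    event_function(5, 6, n_frames),    # short, fast event
    event_function(14, 10, n_frames),  # medium event
    event_function(30, 16, n_frames),  # long, slow event
]
# One weight vector per component; two parameters per frame for brevity.
targets = [[0.8, -0.3], [-0.5, 0.6], [0.2, 0.1]]

trajectory = reconstruct(targets, events)  # 2 parameter tracks, 40 frames each
```

In this toy setup the 80 trajectory values (2 parameters over 40 frames) are generated from 3 weight vectors plus 3 slowly varying component functions, which is the sense in which a non-uniform representation can be more compact than uniform frame-by-frame sampling.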
Citations
Patent
System and method for detecting errors in interactions with a voice-based digital assistant
TL;DR: In this article, a speech input containing a request is received from a user and at least one action in furtherance of satisfying the request is performed, such as a user interaction to a digital assistant or a physical interaction with a device.
Patent
Mobile device having human language translation capability with positional feedback
TL;DR: In this article, a mobile electronic device has a touch sensitive screen and an accelerometer, and a translator is used to translate a word or phrase that is in a first human language and that is entered via a first virtual keyboard displayed on the touch-sensitive screen, into a second human language.
Patent
Disambiguation based on active input elicitation by intelligent automated assistant
TL;DR: In this article, a user request is received, the user request including at least a speech input received from a user, and two or more alternative interpretations of user intent are obtained based on the received user request.
Patent
Electronic Device with Text Error Correction Based on Voice Recognition Data
TL;DR: In this paper, the autocorrection engine may make word correction decisions based at least partly on information in the spoken word database, and the corrected words may be displayed in real time as the user supplies the text input.
Patent
Systems and methods for selective text to speech synthesis
TL;DR: In this article, an algorithm for synthesizing speech used to identify media assets is presented. The algorithm is implemented on a system that includes several dedicated render engines; the system may be part of a back end coupled to a front end that includes storage for media assets and associated synthesized speech, and a request processor for receiving and processing requests that result in providing the synthesized speech.
References
Book
Digital Processing of Speech Signals
TL;DR: This paper presents a meta-modelling framework for digital Speech Processing for Man-Machine Communication by Voice that automates the very labor-intensive and therefore time-heavy and expensive process of encoding and decoding speech.
Book
Linear Prediction of Speech
John E. Markel, A. Gray
TL;DR: Speech Analysis and Synthesis Models: Basic Physical Principles, Speech Synthesis Structures, and Considerations in Choice of Analysis.
Journal Article
Speech analysis and synthesis by linear prediction of the speech wave.
B. S. Atal, Suzanne L. Hanauer
TL;DR: Application of this method for efficient transmission and storage of speech signals, as well as procedures for determining other speech characteristics, such as formant frequencies and bandwidths, the spectral envelope, and the autocorrelation function, are discussed.
Journal Article
Predictive Coding of Speech at Low Bit Rates
TL;DR: A new class of speech coders is described that not only allows one to realize the precise optimum noise spectrum, which is crucial to achieving very low bit rates, but also represents an important first step in bridging the gap between waveform coders and vocoders without suffering from their limitations.