Showing papers on "Code-excited linear prediction published in 1985"

PDF

Open Access

Proceedings Article•DOI•

Code-excited linear prediction(CELP): High-quality speech at very low bit rates

[...]

Manfred R. Schroeder¹, B. S. Atal²•Institutions (2)

26 Apr 1985

TL;DR: A code-excited linear predictive coder in which the optimum innovation sequence is selected from a code book of stored sequences to optimize a given fidelity criterion, indicating that a random code book has a slight speech quality advantage at low bit rates.

...read moreread less

Abstract: We describe in this paper a code-excited linear predictive coder in which the optimum innovation sequence is selected from a code book of stored sequences to optimize a given fidelity criterion. Each sample of the innovation sequence is filtered sequentially through two time-varying linear recursive filters, one with a long-delay (related to pitch period) predictor in the feedback loop and the other with a short-delay predictor (related to spectral envelope) in the feedback loop. We code speech, sampled at 8 kHz, in blocks of 5-msec duration. Each block consisting of 40 samples is produced from one of 1024 possible innovation sequences. The bit rate for the innovation sequence is thus 1/4 bit per sample. We compare in this paper several different random and deterministic code books for their effectiveness in providing the optimum innovation sequence in each block. Our results indicate that a random code book has a slight speech quality advantage at low bit rates. Examples of speech produced by the above method will be played at the conference.

...read moreread less

1,343 citations

Proceedings Article•DOI•

Regular excitation reduction for effective and efficient LP-coding of speech

[...]

Ed F. Deprettere, P. Kroon

01 Apr 1985

TL;DR: This paper describes an effective and efficient time domain speech encoding technique that has an appealingly low complexity, and produces (near) toll quality speech at rates below 16 kbit/s.

...read moreread less

Abstract: This paper describes an effective and efficient time domain speech encoding technique that has an appealingly low complexity, and produces (near) toll quality speech at rates below 16 kbit/s. The proposed coder uses linear predictive techniques to remove the short-time correlation in the speech signal. The remaining (residual) information is then modeled by a regular (in time) excitation signal that, when inputted to the time-varying model filter, produces a signal that is "close" to the reference speech signal. The procedure for finding the appropriate excitation model parameters incorporates the solution of a few sets of linear equations and is of moderate complexity compared to competing coding systems such as Adaptive Transform Coding and Multi-Pulse Excitation Coding.

...read moreread less

32 citations

Journal Article•DOI•

Linear predictive coding of speech: Review and current directions

[...]

M. Schroeder¹•Institutions (1)

Bell Labs¹

01 Aug 1985-IEEE Communications Magazine

30 citations

Proceedings Article•

Gain-adaptive vector quantization for medium-rate speech coding

[...]

J.-H. Chen¹, Allen Gersho¹•Institutions (1)

University of California, Santa Barbara¹

01 Jan 1985

TL;DR: Experimental results show that a significant gain in segmental SNR can be obtained over nonadaptive VQ with a negligible increase in complexity.

...read moreread less

Abstract: A class of adaptive vector quantizers (VQs) that can dynamically adjust the 'gain' of codevectors according to the input signal level is introduced The encoder uses a gain estimator to determine a suitable normalization of each input vector prior to VQ coding The normalized vectors have reduced dynamic range and can then be more efficiently coded At the receiver, the VQ decoder output is multiplied by the estimated gain Both forward and backward adaptation are considered and several different gain estimators are compared and evaluated An approach to optimizing the design of gain estimators is introduced Some of the more obvious techniques for achieving gain adaptation are substantially less effective than the use of optimized gain estimators A novel design technique that is needed to generate the appropriate gain-normalized codebook for the vector quantizer is introduced Experimental results show that a significant gain in segmental SNR can be obtained over nonadaptive VQ with a negligible increase in complexity

...read moreread less

16 citations

Proceedings Article•DOI•

A speech coding method using thinned-out residual

[...]

Akira Ichikawa¹, S. Takeda, Y. Asakawa•Institutions (1)

Hitachi¹

01 Apr 1985

TL;DR: A new high-quality speech information compression method which introduces techniques of eliminating unnecessary samples of prediction residual wave pulses to obtain a thinned-out residual and produces slightly higher quality speech than does the MPE method.

...read moreread less

Abstract: A new high-quality speech information compression method is developed. This method introduces techniques of eliminating unnecessary samples of prediction residual wave pulses to obtain a thinned-out residual. First, a thinning-out procedure which minimizes the quality degradation is formulated. Next, a procedure which simplifies this thinning-out procedure under several hypotheses is defined. Subjective evaluation of this procedure using preference tests confirms that almost no quality degradation occurs. Pitch information is utilized. Adding the process of repetitive use of the thinned-out residual to the procedure, preference tests are carried out at a bit-rate of 9.6 kb/s for purposes of comparison with the newest MPE which includes the pitch prediction process. The results are that our proposed method produces slightly higher quality speech than does the MPE method. The number of processing steps is less than one-third that of MPE.

...read moreread less

13 citations

Proceedings Article•DOI•

Efficient algorithms for obtaining multipulse excitation for LPC coders

[...]

J.-P. Lefevre¹, O. Passien•Institutions (1)

Alcatel-Lucent¹

01 Apr 1985

TL;DR: Several approaches for pulse amplitude and position determination are described, based on the insertion of a long term pitch predictor in the multi-pulse analysis, and the idea of a two stage modelization is introduced.

...read moreread less

Abstract: Since the presentation of multi-pulse excitation concept for LPC coders, by Atal and Remde, many different analysis techniques have been proposed to derive the excitation waveform. This paper describes several approaches for pulse amplitude and position determination. The original solution is compared to procedures which work directly on the residual signal, or which compute again a jointly optimal set of amplitudes, or which improve the filter parameters by taking into account the computed multi-pulse excitation. In addition, other novel techniques, based on the insertion of a long term pitch predictor in the multi-pulse analysis are presented. Also, the idea of a two stage modelization is introduced. Results of experimental evaluations for typical configurations, with respect to implementation complexity as well as speech quality, are given.

...read moreread less

11 citations

Proceedings Article•DOI•

Diphone synthesis using multipulse coding and a phase vecoder

[...]

M. Stella¹, F. Charpentier•Institutions (1)

Centre national d'études des télécommunications¹

26 Apr 1985

TL;DR: This paper shows that a multipulse LPC synthesizer can also be used in a text-to-speech system based on diphone concatenation, and produces French synthetic speech of fairly good naturalness.

...read moreread less

Abstract: Multipulse Linear Predictive Coding [1] has been shown to produce natural sounding speech at relatively low bit rates. So far, this technique has mostly been used for speech transmission or storage. In this paper, we show that a multipulse LPC synthesizer can also be used in a text-to-speech system based on diphone concatenation. The main problem is how to manipulate the prosodic parameters required for speech synthesis, and it is addressed here by a two-step procedure. First, a speech signal with relatively flat pitch contour is obtained by multipulse synthesis of concatenated diphones. Then the prosodic parameters of this signal are corrected using a special purpose phase vocoder. This method produces French synthetic speech of fairly good naturalness.

...read moreread less

8 citations

Proceedings Article•DOI•

Generalization of the multipulse coding for low bit rate coding purposes: The generalized decimation

[...]

J.-P. Adoul¹, F. Didelot, P. Mabilleau, S. Morissette•Institutions (1)

Université de Sherbrooke¹

01 Apr 1985

TL;DR: This paper shows a technique of encoding the LPC residual which allows the achievement of speech coding with residual excitation at a bit rate as low as 2400 bps, inspired by the multipulse coding approach introduced by Atal.

...read moreread less

Abstract: This paper shows a technique of encoding the LPC residual which allows the achievement of speech coding with residual excitation at a bit rate as low as 2400 bps. The method is inspired by the multipulse coding approach introduced by Atal, associated with an irregular downsampl-ing. The real time implementation of a 4800 bps vocoder on a single TMS 320 DSP is discussed.

...read moreread less

5 citations

Proceedings Article•DOI•

A Comparison of Two Methods for Very-Low-Rate Speech Coding

[...]

Salim Roucos, Mari O. Dunham

01 Oct 1985

TL;DR: This paper describes two systems that use VQ and transmit intelligible speech in the range of 300 to 600 b/s and presents the quantization algorithms and bit allocation for the two vocoders and compares their performance for varying bit rates and different noisy speech conditions.

...read moreread less

Abstract: Vector quantization (VQ) has been used recently for developing vocoders operating below 800 b/s. We describe in this paper two systems that use VQ and transmit intelligible speech in the range of 300 to 600 b/s. The frame vocoder which uses VQ for quantizing the spectral parameters of a single frame of speech was found to be most effective at the higher rate of 600 b/s. The segment vocoder which uses VQ for quantizing the spectral parameters of a sequence of frames yielded better intelligibility at the lower 300 b/s rate. We present the quantization algorithms and bit allocation for the two vocoders and compare their performance for varying bit rates and different noisy speech conditions.

...read moreread less

4 citations

Proceedings Article•DOI•

All-pole speech modeling with a maximally pulse-like residual

[...]

R. Rose¹, Mark A. Clements•Institutions (1)

Georgia Institute of Technology¹

01 Apr 1985

TL;DR: The strategy proposed here selects the all-pole parameters to concentrate the model excitation in a finite number of locations to produce a maximally pulse-like residual as a result of theall-pole parameter estimation.

...read moreread less

Abstract: Multiple pulse excited linear predictive coding (MPLPC) has recently received a great deal of attention in the literature as an attractive means of speech coding at data rates below 10 Kbits/second. The existing approaches to MPLPC analysis arrive at the parameters for an all-pole model by minimizing the mean squared modeling error before attempting to find a set of pulses to excite the model. The strategy proposed here selects the all-pole parameters to concentrate the model excitation in a finite number of locations. The goal is then to produce a maximally pulse-like residual as a result of the all-pole parameter estimation.

...read moreread less

3 citations

Proceedings Article•DOI•

A linear programming approach to multipulse speech coding

[...]

R. Garcia-Gomez, J. Alcazar-Fernandez

01 Apr 1985

TL;DR: A method for estimating the LPC input pulses using L1-norm that renders a preselected signal-to-noise ratio for every analysis frame and includes a general framework capable of yielding multipulse sequences with different characteristics for a given speech signal.

...read moreread less

Abstract: Based on the multipulse model for speech coding, we propose a method for estimating the LPC input pulses using L 1 -norm. Our method renders a preselected signal-to-noise ratio for every analysis frame. Additionally, it includes a general framework capable of yielding multipulse sequences with different characteristics for a given speech signal.

...read moreread less

Proceedings Article•DOI•

Speech Coding Using Forward And Backward Prediction

[...]

S. Maitra¹, D. Parikh, M.A. Haque•Institutions (1)

Advanced Micro Devices¹

06 Nov 1985

Proceedings Article•DOI•

Bit rate reduction by Markov-Huffman coding of speech parameters

[...]

P. Papamichalis¹•Institutions (1)

Texas Instruments¹

01 Apr 1985

TL;DR: It is demonstrated that Markov-Huffman coding can lead to average savings of more than 20% in bit rate and a suboptimal scheme is investigated, which can facilitate the implementation of the method on currently available signal processing chips.

...read moreread less

Abstract: A post-quantization processing method is presented which reduces the bit rate of LPC-coded speech without any effect on the speech quality. A Markov model is applied to the quantization levels of the LPC parameters and the resulting transition probabilities are used co generate Huffman coding tables. The appropriate coding table is selected depending on the quantization level of the parameter in the previous frame. It is demonstrated that Markov-Huffman coding can lead to average savings of more than 20% in bit rate. A suboptimal scheme is also investigated, which can facilitate the implementation of the method on currently available signal processing chips.

...read moreread less

Proceedings Article•DOI•

Tree structures for implementation of a vector quantized speech coding system

[...]

S. Kaul¹, M. Shridhar•Institutions (1)

University of Windsor¹

01 Apr 1985

TL;DR: Three different methods of codebook generation are presented and their performance as evaluated by the average distortion, signal to quantization noise, speed of implementation are discussed.

...read moreread less

Abstract: This paper discusses the results of some experiments that were performed to test the feasibility of high speed vector quantization scheme for low bit rate speech coding. Three different methods of codebook generation are presented and their performance as evaluated by the average distortion, signal to quantization noise, speed of implementation are discussed.

...read moreread less