Topic

Linear predictive coding

About: Linear predictive coding is a research topic. Over its lifetime, 6,565 publications have been published within this topic, receiving 142,991 citations. The topic is also known as LPC.


Papers
Journal ArticleDOI
TL;DR: The experimental results demonstrate that the proposed approach to embed information during linear predictive coding (LPC) process based on Matrix Embedding (ME) leads to a better performance with less speech distortion and better security.
Abstract: The extensive use of Voice over IP (VoIP) applications makes low bit-rate speech streams a very suitable steganographic cover medium. To incorporate steganography into a low bit-rate speech codec, we propose a novel approach to embed information during the linear predictive coding (LPC) process based on Matrix Embedding (ME). In the proposed method, a mapping table is constructed based on the criterion of minimum distance of Linear-Predictive-Coefficient-Vectors, and the embedding position and template are selected according to a private key so as to choose the cover frames. The original speech data of the chosen frames are partially encoded to get the codewords for embedding, and then the codewords that need to be modified for embedding are selected according to the secret bits and the ME algorithm. Each selected codeword is changed into its best replacement codeword according to the mapping table. When embedding k (k > 1) bits into 2^k - 1 codewords, the embedding efficiency of our method is k times that of the LPC-based Quantization Index Modulation method. The performance of the proposed approach is evaluated in two aspects: distortion in speech quality introduced by embedding and security under steganalysis. The experimental results demonstrate that the proposed approach leads to a better performance with less speech distortion and better security.

41 citations
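
The matrix embedding step mentioned in the abstract above can be illustrated with the generic binary construction based on a Hamming code: k message bits are carried by 2^k - 1 cover bits, and at most one cover bit is changed. This is only a minimal sketch under stated assumptions. It works on plain bits rather than LPC codewords, it omits the paper's mapping table and key-driven frame selection, and the function names `embed` and `extract` are hypothetical.

```python
def _syndrome(bits):
    """XOR of the 1-based positions of all bits that are set to 1."""
    s = 0
    for pos, bit in enumerate(bits, start=1):
        if bit:
            s ^= pos
    return s


def embed(cover_bits, message_bits):
    """Hide k message bits in n = 2**k - 1 cover bits, flipping at most one bit."""
    k = len(message_bits)
    n = (1 << k) - 1
    if len(cover_bits) != n:
        raise ValueError("cover must contain exactly 2**k - 1 bits")
    # Interpret the message as an integer, most significant bit first.
    message = 0
    for bit in message_bits:
        message = (message << 1) | bit
    # The position to flip is the difference between current and wanted syndrome.
    flip = _syndrome(cover_bits) ^ message
    stego = list(cover_bits)
    if flip:  # flip == 0 means the cover already carries the message
        stego[flip - 1] ^= 1
    return stego


def extract(stego_bits, k):
    """Recover the k message bits as the syndrome of the stego bits."""
    s = _syndrome(stego_bits)
    return [(s >> i) & 1 for i in reversed(range(k))]


cover = [1, 0, 1, 1, 0, 0, 1]
stego = embed(cover, [1, 0, 1])
```

In a codec setting each "bit" would presumably be derived from a quantized codeword, and flipping it would mean swapping the codeword for a close replacement; that link is an assumption here, not something the sketch implements.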

Proceedings ArticleDOI
09 May 1995
TL;DR: An efficient coding scheme for linear predictive coding (LPC) residuals is proposed based on harmonic and noise representation, which reduces the high complexity of the direct sinusoidal synthesis method with interpolated magnitudes and phases.
Abstract: An efficient coding scheme for linear predictive coding (LPC) residuals is proposed based on harmonic and noise representation. New features of the scheme include classified vector quantization of the spectral envelope of LPC residuals with a weighted distortion measure. The improvement in performance obtained by classifying codebooks based on a voiced/unvoiced (V/UV) decision is shown. Sequences of the short-term RMS power of the time domain waveforms are also vector quantized and transmitted for unvoiced signals. A fast synthesis algorithm for voiced signals using an FFT is also presented, which reduces the high complexity of the direct sinusoidal synthesis method with interpolated magnitudes and phases. Informal listening tests indicate that, in combination with a known LSP quantization technique, this residual coding scheme provides good communication quality at a total bit rate of less than 2.0 kbps.

41 citations

Proceedings ArticleDOI
07 May 1996
TL;DR: It is found that memory VQ can improve the performance with 3-5 bits compared to memoryless VQ for error-free transmission, and the best method in this study is a safety-net extended predictive VQ.
Abstract: This paper addresses the problem of efficient transmission of the LSF parameters in speech coding using vector quantization (VQ). By performing a comparison of several memory VQ methods on the same database, we investigate what gains can be achieved by exploiting interframe correlation. The memory VQ methods studied are finite-state VQ and linear predictive VQ. By combining the memory VQ with a fixed memoryless VQ, called the safety-net, further improvements in performance can be obtained. It is found that memory VQ can improve the performance with 3-5 bits compared to memoryless VQ for error-free transmission. The best method in this study is a safety-net extended predictive VQ. For noisy channels, most memory methods perform worse than memoryless VQ, but the safety-net predictive VQ outperforms memoryless VQ for all tested channel error rates, with 4 bits less.

41 citations
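
The safety-net idea in the abstract above can be sketched as follows: each frame is quantized twice, once with a predictive VQ that codes the residual from a prediction based on the previous quantized frame, and once with a fixed memoryless VQ, and the candidate with the lower distortion is transmitted. This is a rough illustration only. The first-order predictor with a scalar coefficient `rho`, the squared-error distortion, the toy codebooks, and all function names are assumptions, not details taken from the paper.

```python
def nearest(codebook, target):
    """Return (index, squared_error) of the codebook vector closest to target."""
    best_i, best_d = 0, float("inf")
    for i, c in enumerate(codebook):
        d = sum((t - x) ** 2 for t, x in zip(target, c))
        if d < best_d:
            best_i, best_d = i, d
    return best_i, best_d


def encode_frame(x, prev_hat, pred_cb, safety_cb, rho=0.7):
    """Quantize x with both quantizers and keep the better one."""
    prediction = [rho * p for p in prev_hat]
    residual = [a - b for a, b in zip(x, prediction)]
    i_p, d_p = nearest(pred_cb, residual)   # predictive (memory) VQ
    i_s, d_s = nearest(safety_cb, x)        # memoryless safety-net VQ
    if d_p <= d_s:
        x_hat = [p + c for p, c in zip(prediction, pred_cb[i_p])]
        return ("pred", i_p), x_hat
    return ("safety", i_s), list(safety_cb[i_s])


def decode_frame(code, prev_hat, pred_cb, safety_cb, rho=0.7):
    """Rebuild the frame; safety-net frames do not depend on prev_hat."""
    mode, idx = code
    if mode == "safety":
        return list(safety_cb[idx])
    return [rho * p + c for p, c in zip(prev_hat, pred_cb[idx])]


# Toy two-dimensional codebooks (hypothetical values).
pred_cb = [[0.0, 0.0], [0.1, -0.1]]
safety_cb = [[1.0, 2.0], [3.0, 4.0]]
prev = [1.0, 2.0]
code_a, hat_a = encode_frame([0.8, 1.3], prev, pred_cb, safety_cb)
code_b, hat_b = encode_frame([3.0, 4.1], prev, pred_cb, safety_cb)
```

Because a safety-net frame is decoded without reference to the previous frame, it stops the propagation of earlier channel errors, which is one plausible reading of why the combined scheme holds up on noisy channels.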

Proceedings ArticleDOI
14 Apr 1991
TL;DR: The proposed voice conversion algorithm was used with two male speakers and, in terms of speaker identification accuracy, the speech converted by segment-sized units gave a score 20% higher than the speech converted frame-by-frame.
Abstract: A voice conversion algorithm that uses speech segments as conversion units is proposed. Input speech is decomposed into speech segments by a speech recognition module, and the segments are replaced by speech segments uttered by another speaker. This algorithm makes it possible to convert not only the static characteristics but also the dynamic characteristics of speaker individuality. The proposed voice conversion algorithm was used with two male speakers. Spectrum distortion between target speech and the converted speech was reduced to one-third the natural spectrum distortion between the two speakers. A listening experiment showed that, in terms of speaker identification accuracy, the speech converted by segment-sized units gave a score 20% higher than the speech converted frame-by-frame.

41 citations

Proceedings ArticleDOI
03 Apr 1990
TL;DR: It is possible to obtain stable LPC filters that model sinusoids closely and to include perceptual masking in these coders, and it is found that multipulse can compensate for quantization error in LPC filters.
Abstract: Experiments are described in coding broadband audio using multipulse linear predictive coding (LPC). It is possible to obtain stable LPC filters that model sinusoids closely and to include perceptual masking in these coders. The quantization of both the LPC and multipulse parameters is also examined, and it is found that multipulse can compensate for quantization error in LPC filters. With appropriate perceptual masking, these coders can provide high-quality audio output. At 128 kb/s, the coders achieved typical SNR values of 35-40 dB in simulations.

41 citations
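
For readers unfamiliar with how LPC filters are obtained and why stability can be checked cheaply, the textbook autocorrelation method with the Levinson-Durbin recursion is sketched below. It is not claimed to be the procedure used in the paper above. The function names are hypothetical, and the code assumes a nonzero frame whose prediction error never reaches exactly zero.

```python
def autocorr(x, order):
    """Autocorrelation of frame x for lags 0..order."""
    n = len(x)
    return [sum(x[i] * x[i + lag] for i in range(n - lag))
            for lag in range(order + 1)]


def levinson_durbin(r, order):
    """Solve the LPC normal equations for autocorrelation values r[0..order].

    Returns (a, ks, err), where A(z) = 1 + a[1] z^-1 + ... + a[order] z^-order
    is the prediction-error filter, ks are the reflection coefficients,
    and err is the final prediction error energy.
    """
    a = [1.0] + [0.0] * order
    err = r[0]  # assumed > 0
    ks = []
    for m in range(1, order + 1):
        acc = r[m] + sum(a[j] * r[m - j] for j in range(1, m))
        k = -acc / err
        ks.append(k)
        prev = a[:]
        for j in range(1, m):
            a[j] = prev[j] + k * prev[m - j]
        a[m] = k
        err *= 1.0 - k * k
    return a, ks, err


# Autocorrelation of an ideal first-order process with correlation 0.5.
a, ks, err = levinson_durbin([1.0, 0.5, 0.25], order=2)
stable = all(abs(k) < 1.0 for k in ks)
```

When every reflection coefficient has magnitude below 1, the all-pole synthesis filter 1/A(z) is stable, which is why coders often quantize reflection coefficients or related parameters rather than the direct-form coefficients.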


Network Information
Related Topics (5)
Signal processing: 73.4K papers, 983.5K citations, 86% related
Noise: 110.4K papers, 1.3M citations, 81% related
Feature extraction: 111.8K papers, 2.1M citations, 81% related
Feature vector: 48.8K papers, 954.4K citations, 80% related
Filter (signal processing): 81.4K papers, 1M citations, 79% related
Performance Metrics
No. of papers in the topic in previous years
Year  Papers
2023  9
2022  25
2021  26
2020  42
2019  25
2018  37