Topic

Linear predictive coding

About: Linear predictive coding (LPC) is a research topic. Over its lifetime, 6565 publications have been published within this topic, receiving 142991 citations.

Papers

Proceedings Article

Real-time vector excitation coding of speech at 4800 bps

G. Davidson¹, Mei Yong, Allen Gersho

University of California, Santa Barbara¹

01 Apr 1987

TL;DR: A real-time 4.8 kb/s Pulse Excitation VXC coder (PVXC) is presented which achieves high reconstructed speech quality and incorporates new techniques which reduce the codebook search complexity to only 0.55 MFlops.

Abstract: In Vector Excitation Coding (VXC), speech is represented by applying a sequence of excitation vectors to a time-varying speech production filter with each vector chosen from a codebook using a perceptually-based performance measure. Although VXC is a powerful technique for achieving natural and high quality speech compression at low bit-rates, it suffers as other excitation coders do from a very high computational complexity. Recent research has shown that codebook search computation can be reduced to approximately 40 MFlops without compromising speech quality. However, this operation count still prohibits a practical real-time implementation of the coder using today's DSP chips. We present a real-time 4.8 kb/s Pulse Excitation VXC coder (PVXC) which achieves high reconstructed speech quality and incorporates new techniques which reduce the codebook search complexity to only 0.55 MFlops. The coder utilizes an optimized excitation codebook and a promising new interframe vector predictive LPC parameter quantization scheme. A preliminary implementation using a single floating-point signal processor is described.
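
To illustrate the codebook search that the abstract describes as the dominant cost, here is a minimal Python sketch of a plain exhaustive search. It is not the reduced-complexity PVXC search from the paper. It assumes a zero-state all-pole synthesis filter and an unweighted squared-error criterion (the paper uses a perceptually based measure), and the names `synthesize` and `search_codebook` are hypothetical.

```python
def synthesize(excitation, a):
    """Filter an excitation vector through 1 / (1 - sum_k a[k] z^-(k+1)).

    The filter starts from a zero state (an assumption of this sketch).
    """
    out = []
    for n, e in enumerate(excitation):
        pred = sum(a[k] * out[n - 1 - k] for k in range(len(a)) if n - 1 - k >= 0)
        out.append(e + pred)
    return out


def search_codebook(target, codebook, a):
    """Return (index, gain) of the codevector that best matches target.

    For each codevector the optimal gain is corr / energy, and the squared
    error drops by corr^2 / energy, so the search maximizes that score.
    """
    best_index, best_gain, best_score = -1, 0.0, -1.0
    for index, codevector in enumerate(codebook):
        y = synthesize(codevector, a)
        energy = sum(v * v for v in y)
        if energy == 0.0:
            continue  # an all-zero output cannot match anything
        corr = sum(t * v for t, v in zip(target, y))
        score = corr * corr / energy
        if score > best_score:
            best_index, best_gain, best_score = index, corr / energy, score
    return best_index, best_gain


# Toy usage: the target is codevector 1 synthesized and scaled by 2.
a = [0.5]
codebook = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0], [1.0, -1.0, 0.0, 0.0]]
target = [2.0 * v for v in synthesize(codebook[1], a)]
index, gain = search_codebook(target, codebook, a)
```

Every codevector is filtered and correlated against the target, so the cost grows with codebook size times vector length times filter order. That growth is the reason search complexity dominates in coders of this kind.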

67 citations

Proceedings Article

Transform predictive coding of wideband speech signals

Juin-Hwey Chen¹, Dongmei Wang¹

Bell Labs¹

07 May 1996

TL;DR: This paper presents a novel wideband speech coding algorithm called transform predictive coding (TPC), which uses short-term and long-term prediction to remove the redundancy in speech and uses closed-loop quantization techniques to improve TPC output speech quality.

Abstract: This paper presents a novel wideband speech coding algorithm called transform predictive coding (TPC). The main emphasis is on low complexity. TPC uses short-term and long-term prediction to remove the redundancy in speech. The prediction residual is quantized in the frequency domain based on a calculated noise masking threshold. In its simplest form, the TPC coder uses only open-loop quantization and therefore has a low complexity. A 16 kb/s full-duplex, open-loop TPC coder takes only 22% of the CPU load on a 150 MHz SGI Indy workstation and about 34% on a 90 MHz Pentium PC. The speech quality of TPC is almost transparent at 32 kb/s, very good at 24 kb/s, and acceptable at 16 kb/s. In the second half of the paper, we report our recent progress in using closed-loop quantization techniques to improve TPC output speech quality.

67 citations

Journal Article

A Quantitative Assessment of Group Delay Methods for Identifying Glottal Closures in Voiced Speech

Mike Brookes¹, Patrick A. Naylor¹, Jon Gudnason¹

Imperial College London¹

01 Dec 2006-IEEE Transactions on Audio, Speech, and Language Processing

TL;DR: It is found that when using a fixed-length analysis window, the best measures can detect the instant of glottal closure in 97% of larynx cycles with a standard deviation of 0.6 ms and that some improvement in detection rate may be obtained if the analysis window length is adapted to the speech pitch.

Abstract: Measures based on the group delay of the LPC residual have been used by a number of authors to identify the time instants of glottal closure in voiced speech. In this paper, we discuss the theoretical properties of three such measures and we also present a new measure having useful properties. We give a quantitative assessment of each measure's ability to detect glottal closure instants evaluated using a speech database that includes a direct measurement of glottal activity from a Laryngograph/EGG signal. We find that when using a fixed-length analysis window, the best measures can detect the instant of glottal closure in 97% of larynx cycles with a standard deviation of 0.6 ms and that in 9% of these cycles an additional excitation instant is found that normally corresponds to glottal opening. We show that some improvement in detection rate may be obtained if the analysis window length is adapted to the speech pitch. If the measures are applied to the preemphasized speech instead of to the LPC residual, we find that the timing accuracy worsens but the detection rate improves slightly. We assess the computational cost of evaluating the measures and we present new recursive algorithms that give a substantial reduction in computation in all cases.

67 citations

Patent

Voice encoding method and apparatus using modified discrete cosine transform

Jun Matsumoto¹, Omori Shiro¹, Masayuki Nishiguchi¹, Kazuyuki Iijima¹

Sony Broadcast & Professional Research Laboratories¹

24 Oct 1996-Journal of the Acoustical Society of America

TL;DR: A method and apparatus for encoding an input signal, such as a broad-range speech signal, in which a number of decoding operations with different bit rates are enabled for assuring a high encoding bit rate and for minimizing deterioration of the reproduced sound even with a low bit rate.

Abstract: A method and apparatus for encoding an input signal, such as a broad-range speech signal, in which a number of decoding operations with different bit rates are enabled for assuring a high encoding bit rate and for minimizing deterioration of the reproduced sound even with a low bit rate. The signal encoding method includes a band-splitting step for splitting an input signal into a number of bands and a step of encoding signals of the bands in a different manner depending on signal characteristics of the bands. Specifically, a low-range side signal is taken out by a low-pass filter from an input signal entering a terminal, and analyzed by a linear predictive coding (LPC) analysis quantization unit. After the LPC residuals are found as short-term prediction residuals by an LPC inverted filter, the pitch is found by a pitch analysis circuit. Then, pitch residuals are found by long-term prediction by a pitch inverted filter. The pitch residuals are processed by a modified discrete cosine transform (MDCT) circuit and vector-quantized by a vector-quantization circuit. The resulting quantization indices are transmitted along with the pitch lag and the pitch gain. The linear spectral pairs (LSPs) are also sent as parameters representing the LPC coefficients.
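
The two prediction stages in this abstract (short-term LPC inverse filtering, then long-term pitch inverse filtering) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: a one-tap pitch predictor, an open-loop correlation search for the lag, and zero signal history before the frame. The function names are hypothetical and are not taken from the patent.

```python
def lpc_residual(x, a):
    """Short-term residual: e[n] = x[n] - sum_k a[k] * x[n-1-k]."""
    return [
        x[n] - sum(a[k] * x[n - 1 - k] for k in range(len(a)) if n - 1 - k >= 0)
        for n in range(len(x))
    ]


def estimate_pitch(e, min_lag, max_lag):
    """Open-loop one-tap pitch search (an assumed, simplified method).

    Returns the (lag, gain) that gives the largest reduction in residual
    energy, or (min_lag, 0.0) when no lag shows positive correlation.
    """
    best_lag, best_gain, best_score = min_lag, 0.0, 0.0
    for lag in range(min_lag, max_lag + 1):
        corr = sum(e[n] * e[n - lag] for n in range(lag, len(e)))
        energy = sum(e[n - lag] ** 2 for n in range(lag, len(e)))
        if energy > 0.0 and corr > 0.0:
            score = corr * corr / energy
            if score > best_score:
                best_lag, best_gain, best_score = lag, corr / energy, score
    return best_lag, best_gain


def pitch_residual(e, lag, gain):
    """Long-term residual: r[n] = e[n] - gain * e[n-lag], zero history."""
    return [e[n] - (gain * e[n - lag] if n >= lag else 0.0) for n in range(len(e))]


# Toy usage: an impulse train with period 5 stands in for an LPC residual.
e = [1.0, 0.0, 0.0, 0.0, 0.0] * 4
lag, gain = estimate_pitch(e, 2, 12)
r = pitch_residual(e, lag, gain)
```

In a coder of the kind described, `r` would then go to the transform and vector-quantization stages, while the lag and gain are sent as side information.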

67 citations

Journal Article

Recursive windowing for generating autocorrelation coefficients for LPC analysis

Thomas P. Barnwell¹

Georgia Institute of Technology¹

01 Oct 1981-IEEE Transactions on Acoustics, Speech, and Signal Processing

TL;DR: Experimental results showed the speech quality to be comparable to and slightly better than that produced by an auto-correlation LPC vocoder using a Hamming window.

Abstract: A method for recursively computing the autocorrelation estimates needed for LPC analysis in a vocoder environment has been developed theoretically and studied experimentally. The method has three specific advantages: 1) it requires very little memory for its implementation; 2) it is realized by a structure consisting of several identical modules; and 3) the effective window length may be changed without varying the structure. Experimental results showed the speech quality to be comparable to and slightly better than that produced by an auto-correlation LPC vocoder using a Hamming window.
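
For context, the baseline this abstract compares against (autocorrelation LPC analysis with a Hamming window) can be sketched as follows. This is the conventional block method with a Levinson-Durbin solve, not Barnwell's recursive windowing scheme. The function names and the sign convention for the predictor coefficients are assumptions of this sketch.

```python
import math


def hamming(n):
    """Return an n-point Hamming window."""
    if n == 1:
        return [1.0]
    return [0.54 - 0.46 * math.cos(2.0 * math.pi * i / (n - 1)) for i in range(n)]


def autocorrelation(frame, order):
    """Autocorrelation lags r[0..order] of a frame."""
    n = len(frame)
    return [sum(frame[i] * frame[i + k] for i in range(n - k)) for k in range(order + 1)]


def levinson_durbin(r, order):
    """Solve the LPC normal equations from autocorrelation lags.

    Returns (a, err), where the predictor is
    x_hat[n] = sum_k a[k] * x[n-1-k] and err is the residual energy.
    """
    a = [0.0] * order
    err = r[0]
    for i in range(order):
        if err <= 0.0:
            break  # degenerate (e.g. silent) frame: stop early
        k = (r[i + 1] - sum(a[j] * r[i - j] for j in range(i))) / err
        new_a = a[:]
        new_a[i] = k
        for j in range(i):
            new_a[j] = a[j] - k * a[i - 1 - j]
        a = new_a
        err *= 1.0 - k * k
    return a, err


def lpc(frame, order):
    """Hamming-window a frame, then run autocorrelation LPC analysis."""
    window = hamming(len(frame))
    windowed = [s * w for s, w in zip(frame, window)]
    return levinson_durbin(autocorrelation(windowed, order), order)
```

The block method must buffer a whole frame before it can window it and compute the lags. The abstract lists low memory use as the first advantage of the recursive alternative.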

66 citations

Performance

Metrics

Papers: 6,598

Citations: 148,119

No. of papers in the topic in previous years
Year	Papers
2023	9
2022	25
2021	26
2020	42
2019	25
2018	37