Home
/
Topics
/
Linear predictive coding

Topic

Linear predictive coding

About: Linear predictive coding is a research topic. Over the lifetime, 6565 publications have been published within this topic receiving 142991 citations. The topic is also known as: Linear predictive coding, LPC.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1989
1988
1987
1986
1985
1984
1983
1982
1981
1980
1979
1978
1977
1976
1975
1974
1973
1972
1971
1970
1969

1 / 2

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Low-delay CELP with multi-pulse VQ and fast search for GSM EFR

[...]

Shin-Ichi Taumi¹, K. Ozawa², T. Nomura², M. Serizawa²•Institutions (2)

NEC¹, Carnegie Mellon University²

07 May 1996

TL;DR: A novel multi-pulse excitation signal quantization method is proposed, where the pulse amplitudes are vector-quantized (VQ), which remarkably enhances the performance and drastically reduces the position search complexity.

...read moreread less

Abstract: This paper proposes a speech codec, named MP-CELP (multi-pulse-based CELP), with a 10 msec frame length, which has been developed for the GSM EFR (enhanced full-rate) codec standardization. A novel multi-pulse excitation signal quantization method is proposed, where the pulse amplitudes are vector-quantized (VQ). The combination search of the pulse position and the amplitude VQ remarkably enhances the performance. By restricting the pulse positions based on the algebraic-type structure, the search complexity and the bits are reduced. The divided pulse position search drastically reduces the position search complexity. The speech quality for MP-CELP is higher than that for G.728 LD-CELP. MP-CELP also satisfies all the speech quality requirements of the GSM EFR standardization except for the background noise condition.

...read moreread less

36 citations

Proceedings Article•DOI•

The effect of speech and audio compression on speech recognition performance

[...]

Laurent Besacier, Carole Bergamini, Dominique Vaufreydaz, Eric Castelli

03 Oct 2001

TL;DR: An in-depth look at the influence of different speech and audio codecs on the performance of the continuous speech recognition engine and a new strategy is proposed to cope with degradation due to low bitrate coding.

...read moreread less

Abstract: This paper proposes an in-depth look at the influence of different speech and audio codecs on the performance of our continuous speech recognition engine. GSM full rate, G711, G723.1 and MPEG coders are investigated. It is shown that MPEG transcoding degrades the speech recognition performance for low bitrates whereas performance remains acceptable for specialized speech coders like GSM or G711. A new strategy is proposed to cope with degradation due to low bitrate coding. The acoustic models of the speech recognition system are trained with transcoded speech (one acoustic model for each speech/audio codec). First results show that this strategy allows one to recover acceptable performance.

...read moreread less

36 citations

Proceedings Article•DOI•

Combined spectral envelope normalization and subtraction of sinusoidal components in the ODFT and MDCT frequency domains

[...]

Aníbal Ferreira

21 Oct 2001

TL;DR: It is shown how a parametrization of L stationary sinusoids in the complex ODFT spectrum can lead to the effective subtraction, in the real MDCT spectrum, of 3L spectral lines.

...read moreread less

Abstract: Recent research in high-quality audio coding seeks not only improved coding gains but also new functionalities such as easy semantic access to compressed audio material and audio modification in the compressed domain. These objectives imply the decomposition of the audio signal into several components of specific semantic value, such as sinusoidal components, that take advantage of selective coding and parametrization tools. We presume an MDCT based audio coding environment and present a new technique combining spectral envelope normalization with accurate subtraction of sinusoidal components in the MDCT frequency domain. It is shown how a parametrization of L stationary sinusoids in the complex ODFT spectrum can lead to the effective subtraction, in the real MDCT spectrum, of 3L spectral lines. A demonstration of the implementation of the technique is available on the Internet (see http://www.inescn.pt//spl sim/ajf/waspaa01/flattening.html.).

...read moreread less

36 citations

Patent•

Using signal to noise ratio of a speech signal to adjust thresholds for extracting speech parameters for coding the speech signal

[...]

Adil Benyassine¹, Huan-Yu Su¹•Institutions (1)

Mindspeed Technologies¹

16 Aug 2000

TL;DR: In this article, the authors provided speech coding methods and systems for estimating a plurality of speech parameters of a speech signal for coding the speech signal using one or more speech coding algorithms.

...read moreread less

Abstract: There are provided speech coding methods and systems for estimating a plurality of speech parameters of a speech signal for coding the speech signal using one of a plurality of speech coding algorithms, the plurality of speech parameters includes pitch information, the plurality of speech parameters is calculated using a plurality of thresholds. An example method includes estimating a background noise level in the speech signal to determine a signal to noise ratio (SNR) for the speech signal, adjusting one or more of the plurality of thresholds based on the SNR to generate one or more SNR adjusted thresholds, analyzing the speech signal to extract the pitch information using the one or more SNR adjusted thresholds, and repeating the estimating, the adjusting and the analyzing to code the speech signal using one the plurality of speech coding algorithms.

...read moreread less

36 citations

Journal Article•DOI•

A frequency-weighted Itakura spectral distortion measure and its application to speech recognition in noise

[...]

F.K. Soong¹, Man Mohan Sondhi¹•Institutions (1)

Bell Labs¹

01 Jan 1988-IEEE Transactions on Acoustics, Speech, and Signal Processing

TL;DR: The authors propose an adaptively weighted Itakura distortion measure, which they studied its effects on the performance of a conventional dynamic time-warping (DTW)-based speech recognizer in a series of speaker-independent, isolated-digit-recognition experiments.

...read moreread less

Abstract: The authors propose an adaptively weighted Itakura distortion measure. They studied its effects on the performance of a conventional dynamic time-warping (DTW)-based speech recognizer in a series of speaker-independent, isolated-digit-recognition experiments. The equivalent SNR improvement achieved by using the proposed weighted Itakura distortion at low SNRs is about 5-7 dB. >

...read moreread less

36 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
…
142
143
144
145
146
147
148
…
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

6,598

Papers

148,119

Citations

No. of papers in the topic in previous years
Year	Papers
2023	9
2022	25
2021	26
2020	42
2019	25
2018	37

Linear predictive coding

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics