Home
/
Topics
/
Speech coding

Topic

Speech coding

About: Speech coding is a research topic. Over the lifetime, 14245 publications have been published within this topic receiving 271964 citations.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1989
1988
1987
1986
1985
1984
1983
1982
1981
1980
1979
1978
1977
1976
1975
1974
1973
1972
1971
1970
1969

1 / 2

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

High quality coding of wideband audio signals using transform coded excitation (TCX)

[...]

Roch Lefebvre¹, R. Salami¹, Claude Laflamme¹, J.-P. Adoul¹•Institutions (1)

Université de Sherbrooke¹

19 Apr 1994

TL;DR: This paper describes the application of transform coded excitation (TCX) coding to encoding wideband speech and audio signals in the bit rate range of 16 k bits/s to 32 kbits/s and proposes novel quantization procedures including inter-frame prediction in the frequency domain.

...read moreread less

Abstract: This paper describes the application of transform coded excitation (TCX) coding to encoding wideband speech and audio signals in the bit rate range of 16 kbits/s to 32 kbits/s. The approach uses a combination of time domain (linear prediction; pitch prediction) and frequency domain (transform coding; dynamic bit allocation) techniques, and utilizes a synthesis model similar to that of linear prediction coders such as CELP. However, at the encoder, the high complexity analysis-by-synthesis technique is bypassed by directly quantizing the so-called target signal in the frequency domain. The innovative excitation is derived at the decoder by inverse filtering the quantized target signal. The algorithm is intended for applications whereby a large number of bits is available for the innovative excitation. The TCX algorithm is utilized to encode wideband speech and audio signals with a 50-7000 Hz bandwidth. Novel quantization procedures including inter-frame prediction in the frequency domain are proposed to encode the target signal. The proposed algorithm achieves very high quality for speech at 16 kbits/s, and for music at 24 kbits/s. >

...read moreread less

93 citations

Proceedings Article•DOI•

A speech coder based on decomposition of characteristic waveforms

[...]

Willem Bastiaan Kleijn¹, J. Haagen¹•Institutions (1)

Bell Labs¹

09 May 1995

TL;DR: A 2.4 kb/s coder using waveform interpolation principles to represent the speech signal as an evolving characteristic waveform (CW) and a significant increase in coding efficiency is obtained by coding these two components separately.

...read moreread less

Abstract: For low-rate speech coding it is advantageous to represent the speech signal as an evolving characteristic waveform (CW). The CW evolves slowly when the speech signal is clearly voiced and rapidly when the speech signal is clearly unvoiced. The voiced (periodic) and unvoiced (nonperiodic) components of the speech signal can be separated by a simple nonadaptive filter in the CW domain. Because of perceptual effects, a significant increase in coding efficiency is obtained by coding these two components separately. A 2.4 kb/s coder using these principles was developed. In an independent evaluation, the performance of the 2.4 kb/s waveform interpolation (WI) coder was found to be at least equivalent to the 4.8 kb/s FS1016 standard for all of the many tests.

...read moreread less

93 citations

Proceedings Article•DOI•

Modeling of speech signals using fractional calculus

[...]

Khaled Assaleh¹, Wajdi M. Ahmad²•Institutions (2)

American University of Sharjah¹, University of Sharjah²

01 Feb 2007

TL;DR: A novel approach for speech signal modeling using fractional calculus that has the merit of requiring a smaller number of model parameters, and is demonstrated to be superior to the LPC approach in capturing the details of the modeled signal.

...read moreread less

Abstract: In this paper, we present a novel approach for speech signal modeling using fractional calculus. This approach is contrasted with the celebrated Linear Predictive Coding (LPC) approach which is based on integer order models. It is demonstrated via numerical simulations that by using a few integrals of fractional orders as basis functions, the speech signal can be modeled accurately. The new approach has the merit of requiring a smaller number of model parameters, and is demonstrated to be superior to the LPC approach in capturing the details of the modeled signal.

...read moreread less

93 citations

Journal Article•DOI•

Techniques and Standards for Image, Video and Audio Coding

[...]

Venkatesha R. Prasad

01 Apr 1998-Journal of Electronic Imaging

93 citations

Patent•

System and method for scaleable streamed audio transmission over a network

[...]

Philippe Ferriere¹•Institutions (1)

Microsoft¹

11 Oct 1995

TL;DR: In this article, an audio data transmission system uses computing units which are designed to select an appropriate combination of block size and input sampling rate to maximize the available bandwidth of the receiving modem.

...read moreread less

Abstract: An audio data transmission system encodes audio files into individual audio data blocks which contain a variable number bits of digital audio data that were sampled at a selectable sample rate. The number of bits of digital data and the input sampling rate are scaleable to produce an encoded bit stream bit rate that is less than or equal to an effective operational bit rate of a recipient's modem. The audio data transmission system uses computing units which are designed to select an appropriate combination of block size and input sampling rate to maximize the available bandwidth of the receiving modem. For example, if the modem connection speed for one modem is 14.4 kbps, a version of the audio data compressed at 13000 bits/s might be sent to the recipient; if the modem connection speed for another modem is 28.8 kbps, a version of the audio data compressed at 24255 bits/s might be sent to the receiver. The audio data blocks are then transmitted at the encoded bit stream bit rate to the intended recipient's modem. The audio data blocks are decoded at the recipient to reconstruct the audio file and immediately play the audio file as it is received. The audio data transmission system can be implemented in online service systems, ITV systems, computer data network systems, and communication systems.

...read moreread less

93 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
…
102
103
104
105
106
107
108
…
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

14,368

Papers

279,843

Citations

No. of papers in the topic in previous years
Year	Papers
2023	38
2022	84
2021	70
2020	62
2019	77
2018	108

Speech coding

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics