Home
/
Topics
/
Speech coding

Topic

Speech coding

About: Speech coding is a research topic. Over the lifetime, 14245 publications have been published within this topic receiving 271964 citations.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1989
1988
1987
1986
1985
1984
1983
1982
1981
1980
1979
1978
1977
1976
1975
1974
1973
1972
1971
1970
1969

1 / 2

Papers

PDF

Open Access

More filters

Patent•

Hearing enhancement system and components thereof

[...]

Dale Lott, William T. Newton

01 Sep 2009

TL;DR: In this paper, an active noise reduction (ANR) circuit is used to adjust the hearing compensated audio signal based on the ANR signal to produce an output audio signal, wherein the ANRs signal is generated based on output audio signals.

...read moreread less

Abstract: A circuit includes a microphone circuit, an audio processing module, a digital audio processing module, and an active noise reduction (ANR) circuit. The microphone circuit receives acoustic vibrations and generates an audio signal therefrom. The audio processing module generates a representation of the audio signal. The digital audio processing module compensates the representation of the audio signal based on hearing compensation data to produce a hearing compensated audio signal. The ANR circuit receives the hearing compensated audio signal and an ANR signal. The ANR circuit further functions to adjust the hearing compensated audio signal based on the ANR signal to produce an output audio signal, wherein the ANR signal is generated based on the output audio signal.

...read moreread less

65 citations

Journal Article•DOI•

High-quality audio transform coding at 64 kbps

[...]

Y. Mahieux¹, J.P. Petit¹•Institutions (1)

CNET¹

01 Nov 1994-IEEE Transactions on Communications

TL;DR: In this article, a transform coding algorithm devoted to high quality audio coding at a bit rate of 64 kbps per monophonic channel is presented. But, although a complete system including framing, synchronization and error correction has been developed, only the bit rate compression algorithm is described.

...read moreread less

Abstract: This paper presents a transform coding algorithm devoted to high quality audio coding at a bit rate of 64 kbps per monophonic channel. It enables the transmission of a high quality stereo sound through the basic access (2B channels) of ISDN. Although a complete system including framing, synchronization and error correction has been developed, only the bit rate compression algorithm is described here. A detailed analysis of the signal processing techniques such as the time/frequency transformation, the pre-echo reduction by adaptive filtering, the fast algorithm computations, etc., is provided. The use of psychoacoustical properties is also precisely reported. Finally, some subjective evaluation results and one real time implementation of the coder using the ATT DSP32C digital signal processor are presented. >

...read moreread less

65 citations

Journal Article•DOI•

Interframe LSF quantization for noisy channels

[...]

Thomas Eriksson¹, J. Linden, Jan Skoglund•Institutions (1)

Chalmers University of Technology¹

01 Sep 1999-IEEE Transactions on Speech and Audio Processing

TL;DR: By combining an interframe quantizer and a memoryless "safety-net" quantizer, it is demonstrated that the advantages of both quantization strategies can be utilized, and the performance for both noiseless and noisy channels improves.

...read moreread less

Abstract: In linear predictive speech coding algorithms, transmission of linear predictive coding (LPC) parameters-often transformed to the line spectrum frequencies (LSF) representation-consumes a large part of the total bit rate of the coder. Typically, the LSF parameters are highly correlated from one frame to the next, and a considerable reduction in bit rate can be achieved by exploiting this interframe correlation. However, interframe coding leads to error propagation if the channel is noisy, which possibly cancels the achievable gain. In this paper, several algorithms for exploiting interframe correlation of LSF parameters are compared. Especially, performance for transmission over noisy channels is examined, and methods to improve noisy channel performance are proposed. By combining an interframe quantizer and a memoryless "safety-net" quantizer, we demonstrate that the advantages of both quantization strategies can be utilized, and the performance for both noiseless and noisy channels improves. The results indicate that the best interframe method performs as good as a memoryless quantizing scheme, with 4 bits less per frame. Subjective listening tests have been employed that verify the results from the objective measurements.

...read moreread less

65 citations

Proceedings Article•DOI•

Description of ITU-T Recommendation G.729 Annex A: reduced complexity 8 kbit/s CS-ACELP codec

[...]

R. Salami¹, Claude Laflamme¹, B. Bessette¹, J.-P. Adoul¹•Institutions (1)

Université de Sherbrooke¹

21 Apr 1997

TL;DR: Several algorithmic changes have been introduced into G.729 which resulted in 50% drop in its complexity, enabling a DSP implementation with a complexity of about 10-12 MIPS, while meeting the terms of reference.

...read moreread less

Abstract: This paper describes the recently adopted ITU-T Recommendation G.729 Annex A (G.729A) for encoding speech signals at 8 kbit/s with low complexity. G.729A has been selected as the standard speech coding algorithm for multimedia digital simultaneous voice and data (DSVD). G.729A is bitstream interoperable with G.729; i.e., speech coded with G.729A can be decoded with G.729, and vice versa. As G.729, it uses the conjugate structure algebraic code excited linear prediction (CS-ACELP) algorithm with 10 ms frames. However, several algorithmic changes have been introduced into G.729 which resulted in 50% drop in its complexity, enabling a DSP implementation with a complexity of about 10-12 MIPS. This paper describes the algorithmic changes which have been introduced in order to achieve the low complexity goal while meeting the terms of reference. Subjective tests have been performed by ITU-T in both the selection phase and the characterization phase and the results showed that the performance of G.729A is equivalent to both G.729 and G.726 at 32 kbit/s in most operating conditions; however, it is slightly worse in case of three tandems and in the presence of background noise. A breakdown of the complexities of both G.729 and G.729A is given at the end of the paper.

...read moreread less

65 citations

Book•

Spatial Audio Processing

[...]

Jeroen Breebaart, Christof Faller

10 Dec 2007

65 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
…
160
161
162
163
164
165
166
…
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

14,368

Papers

279,843

Citations

No. of papers in the topic in previous years
Year	Papers
2023	38
2022	84
2021	70
2020	62
2019	77
2018	108

Speech coding

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics