Home
/
Topics
/
Speech coding

Topic

Speech coding

About: Speech coding is a research topic. Over the lifetime, 14245 publications have been published within this topic receiving 271964 citations.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1989
1988
1987
1986
1985
1984
1983
1982
1981
1980
1979
1978
1977
1976
1975
1974
1973
1972
1971
1970
1969

1 / 2

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Entropy-based variable frame rate analysis of speech signals and its application to ASR

[...]

H. You¹, Qifeng Zhu¹, Abeer Alwan¹•Institutions (1)

University of California, Los Angeles¹

17 May 2004

TL;DR: This paper compares entropy and Euclidian distance measures for VFR in ASR experiments using the Aurora2 and T146 databases and finds better performance is observed for the entropy-based VFR over the earlier VFR approach and over the fixed-rate system.

...read moreread less

Abstract: Most speech processing algorithms analyze speech signals frame by frame with a fixed frame rate. Fixed-rate analysis is inconsistent with human speech perception and effectively assigns the same importance or 'weight' to all equi-duration frames. In Zhu et al. (2000), we proposed a variable frame rate (VFR) analysis technique that is based on a Euclidian distance measure. In this paper, we propose another approach for VFR based on the entropy of the signal. We compare entropy and Euclidian distance measures for VFR in ASR experiments using the Aurora2 and T146 databases. Better performance is observed for the entropy-based VFR over our earlier VFR approach and over the fixed-rate system.

...read moreread less

59 citations

Proceedings Article•DOI•

Concepts and solutions for link adaptation and inband signaling for the GSM AMR speech coding standard

[...]

S. Bruhn¹, Peter Blöcher², Karl Hellwig², J. Sjoberg²•Institutions (2)

Ericsson Radio Systems¹, Ericsson²

16 May 1999

TL;DR: Various approaches for link adaptation with respect to varying radio channel conditions are described and the method of inband signaling that is standardized is discussed and motivated.

...read moreread less

Abstract: The European Telecommunications Standards Institute (ETSI) has just defined an adaptive multi rate (AMR) speech codec standard for the GSM system with a multitude of source and channel coding rates. The standard aims to provide robust high quality speech together with the flexibility to deliver radio network capacity enhancements by means of low bit-rate operation. The codec rates are dynamically selected with respect to the rapidly changing radio conditions and to local capacity requirements. This paper describes various approaches for link adaptation with respect to varying radio channel conditions and puts a focus on the solution in the AMR standard. Moreover the method of inband signaling that is standardized is discussed and motivated.

...read moreread less

59 citations

Journal Article•DOI•

A comparative study of various quantization schemes for speech encoding

[...]

P. Noll

01 Nov 1975-Bell System Technical Journal

TL;DR: The performance limits, as given by the signal-to-noise ratio (s/n), are described for different speech-encoding schemes including adaptive quantization and (linear) adaptive prediction schemes.

...read moreread less

Abstract: In this paper, the performance limits, as given by the signal-to-noise ratio (s/n), are described for different speech-encoding schemes including adaptive quantization and (linear) adaptive prediction schemes. The comparison is made on the basis of computer simulations using 8-kHz-sampled speech signals of one speaker. Different bit rates (two bits per sample–five bits per sample) have been used. A three-bit-per-sample pcm scheme with a nonadaptive μ100 quantizer leads to an s/n value of approximately 9 dB. A maximum s/n value of approximately 25 dB has been reached using an encoding scheme including both adaptive quantization and adaptive prediction. Entropy coding of the quantizer output symbols leads to an additional gain in s/n of nearly 3 dB.

...read moreread less

59 citations

Proceedings Article•DOI•

Selecting the modeling order for the ESPRIT high resolution method: an alternative approach

[...]

Roland Badeau¹, Bertrand David¹, Gael Richard¹•Institutions (1)

Télécom ParisTech¹

17 May 2004

TL;DR: This work proposes a new method for selecting an appropriate modeling order, which outperformed the classical information theoretic criteria and was applied to both synthetic and musical signals.

...read moreread less

Abstract: High resolution methods, such as the ESPRIT (estimation of signal parameters by rotational invariance techniques) algorithm, perform an accurate representation of a harmonic signal as a sum of exponentially damped sinusoids. However, in coding applications, the signal must be represented with a minimum number of parameters. Unfortunately, it is well known that applying the ESPRIT algorithm with an under-estimated model order generates biased frequency estimates. We propose a new method for selecting an appropriate modeling order, which minimizes this bias. This approach was applied to both synthetic and musical signals and outperformed the classical information theoretic criteria.

...read moreread less

58 citations

Proceedings Article•DOI•

Optimal time segmentation for signal modeling and compression

[...]

P. Prandom¹, Michael M. Goodwin, Martin Vetterli•Institutions (1)

École Normale Supérieure¹

21 Apr 1997

TL;DR: Two immediate applications of the dynamic programming approach to LPC speech coding and to sinusoidal modeling of musical signals are presented.

...read moreread less

Abstract: The idea of optimal joint time segmentation and resource allocation for signal modeling is explored with respect to arbitrary segmentations and arbitrary representation schemes. When the chosen signal modeling techniques can be quantified in terms of a cost function which is additive over distinct segments, a dynamic programming approach guarantees the global optimality of the scheme while keeping the computational requirements of the algorithm sufficiently low. Two immediate applications of the algorithm to LPC speech coding and to sinusoidal modeling of musical signals are presented.

...read moreread less

58 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
…
185
186
187
188
189
190
191
…
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

14,368

Papers

279,843

Citations

No. of papers in the topic in previous years
Year	Papers
2023	38
2022	84
2021	70
2020	62
2019	77
2018	108

Speech coding

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics