Home
/
Topics
/
Speech coding

Topic

Speech coding

About: Speech coding is a research topic. Over the lifetime, 14245 publications have been published within this topic receiving 271964 citations.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1989
1988
1987
1986
1985
1984
1983
1982
1981
1980
1979
1978
1977
1976
1975
1974
1973
1972
1971
1970
1969

1 / 2

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Adapted local trigonometric transforms and speech processing

[...]

E. Wesfreid¹, Mladen Victor Wickerhauser²•Institutions (2)

CEREMADE¹, Washington University in St. Louis²

01 Dec 1993-IEEE Transactions on Signal Processing

TL;DR: This decomposition provides a method of parameter simplification which appears to be useful for detecting fundamental frequencies, and characterizing formants.

...read moreread less

Abstract: Uses an algorithm based on the adapted-window Malvar transform to decompose digitized speech signals into a local time-frequency representation. The authors present some applications and experimental results for a signal compression and automatic voiced-unvoiced segmentation. This decomposition provides a method of parameter simplification which appears to be useful for detecting fundamental frequencies, and characterizing formants. >

...read moreread less

85 citations

Patent•DOI•

Decomposition in noise and periodic signal waveforms in waveform interpolation

[...]

Willem Bastiaan Kleijn¹•Institutions (1)

AT&T¹

02 Feb 1995-Journal of the Acoustical Society of America

TL;DR: In this article, a plurality of sets of indexed parameters are generated based on samples of the speech signal, each set corresponds to a waveform characterizing the speech signals at a discrete point in time.

...read moreread less

Abstract: A method of coding a speech signal is described. In accordance with the method, a plurality of sets of indexed parameters are generated based on samples of the speech signal. Each set of indexed parameters corresponds to a waveform characterizing the speech signal at a discrete point in time. Parameters of the plurality of sets are grouped based on index value to form a first set of signals which represents the evolution of characterizing waveform shape; the signals of the first set are filtered to remove low frequency components and thereby produce a second set of signals which represents relatively high rates of evolution of characterizing waveform shape. The speech signal is then coded based on the second set of signals representing high rates of characterizing waveform shape evolution. Coding of the speech signal may further be based on a set of smoothed first signals.

...read moreread less

85 citations

Proceedings Article•DOI•

VAD techniques for real-time speech transmission on the Internet

[...]

A. Sangwan¹, M.C. Chiranth¹, H. S. Jamadagni², R. Sah¹, R. Venkatesha Prasad², Vishal Gaurav¹ - Show less +2 more•Institutions (2)

PES University¹, Indian Institute of Science²

07 Nov 2002

TL;DR: A comparison of the relative merits and demerits along with the subjective quality of speech after the pruning of silence periods for four time-domain VAD algorithms in terms of speech quality, compression level and computational complexity.

...read moreread less

Abstract: We discuss techniques for voice activity detection (VAD) for voice over Internet Protocol (VoIP). VAD aids in reducing the bandwidth requirement of a voice session, thereby using bandwidth efficiently. Such a scheme would be implemented in the application layer. Thus the VAD is independent of the lower layers in the network stack (see Flood, J.E., "Telecommunications Switching - Traffic and Networks", Prentice Hall India). We compare four time-domain VAD algorithms in terms of speech quality, compression level and computational complexity. A comparison of the relative merits and demerits along with the subjective quality of speech after the pruning of silence periods is presented for all the algorithms. A quantitative measurement of speech quality for different algorithms is also presented.

...read moreread less

84 citations

Patent•

Method and apparatus for processing an input speech signal during presentation of an output audio signal

[...]

Ira A. Gerson

05 Oct 1999

TL;DR: In this paper, a start of an input speech signal is detected during presentation of an output audio signal and an input start time, relative to the output audio signals, is determined.

...read moreread less

Abstract: A start of an input speech signal is detected during presentation of an output audio signal and an input start time, relative to the output audio signal, is determined. The input start time is then provided for use in responding to the input speech signal. In another embodiment, the output audio signal has a corresponding identification. When the input speech signal is detected during presentation of the output audio signal, the identification of the output audio signal is provided for use in responding to the input speech signal. Information signals comprising data and/or control signals are provided in response to at least the contextual information provided, i.e., the input start time and/or the identification of the output audio signal. In this manner, the present invention accurately establishes a context of an input speech signal relative to an output audio signal regardless of the delay characteristics of the underlying communication system.

...read moreread less

84 citations

Journal Article•DOI•

An efficient implementation of the forward and inverse MDCT in MPEG audio coding

[...]

Vladimir Britanak¹, K. R. Rao•Institutions (1)

Slovak Academy of Sciences¹

01 Feb 2001-IEEE Signal Processing Letters

TL;DR: The most efficient implementation of theforward and inverse MDCT computation for layer III in MPEG-1 and MPEG-2 international audio coding standards is proposed, based on a new fast algorithm for the forward and inverseMDCT computation in the oddly stacked system.

...read moreread less

Abstract: The modified discrete cosine transform (MDCT) is employed in subband/transform coding schemes as the analysis/synthesis filter bank based on time domain aliasing cancellation (TDAC). The most efficient implementation of the forward and inverse MDCT computation for layer III in MPEG-1 and MPEG-2 international audio coding standards is proposed. It is based on a new fast algorithm for the forward and inverse MDCT computation in the oddly stacked system. The complete signal flow graphs for the implementation of MDCT and inverse MDCT in layer III are also provided.

...read moreread less

84 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
…
113
114
115
116
117
118
119
…
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

14,368

Papers

279,843

Citations

No. of papers in the topic in previous years
Year	Papers
2023	38
2022	84
2021	70
2020	62
2019	77
2018	108

Speech coding

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics