Speech coding

About: Speech coding is a research topic. Over its lifetime, 14,245 publications have been published within this topic, receiving 271,964 citations.

Papers published on a yearly basis

[Chart: number of papers per year, 1969 to 2023]

Papers

Journal Article

Synthesis of speech from unrestricted text

J. Allen¹

Massachusetts Institute of Technology¹

01 Apr 1976

TL;DR: The resulting system serves as a model for the cognitive process of reading aloud, and also as a stable practical means for providing speech output in a broad class of computer-based systems.

Abstract: For many applications, it is desirable to be able to convert arbitrary English text to natural and intelligible sounding speech. This transformation between two surface forms is facilitated by first obtaining the common underlying abstract linguistic representation which relates to both text and speech surface representations. Calculation of these abstract bases then permits proper selection of phonetic segments, lexical stress, juncture, and sentence-level stress and intonation. The resulting system serves as a model for the cognitive process of reading aloud, and also as a stable practical means for providing speech output in a broad class of computer-based systems.

116 citations

Proceedings Article

Gender identification using a general audio classifier

Hadi Harb¹, Liming Chen¹

École centrale de Lyon¹

06 Jul 2003

TL;DR: Introduces a novel gender identification approach based on a general audio classifier; the approach is robust to adverse audio compression and is language independent.

Abstract: In the context of content-based multimedia indexing, gender identification using the speech signal is an important task. Existing techniques depend on the quality of the speech signal, making them unsuitable for video indexing problems. In this paper we introduce a novel gender identification approach based on a general audio classifier. The audio classifier models the audio signal by the first-order statistics of the spectrum in 1 s windows and uses a set of neural networks as classifiers. The presented technique is robust to adverse audio compression and is language independent. We show how practical considerations about the speech in audio-visual data, such as the continuity of speech, can further improve the classification results, which attain 92%.

116 citations

Proceedings Article

Steganalysis of audio based on audio quality metrics

Hamza Ozer, Ismail Avcibas¹, Bulent Sankur¹, Nasir Memon

Boğaziçi University¹

13 Jun 2003, Electronic Imaging

TL;DR: Experimental results show that the proposed technique can be used to detect the presence of hidden messages in digital audio data.

Abstract: Classification of audio documents as bearing hidden information or not is a security issue addressed in the context of steganalysis. A cover audio object can be converted into a stego-audio object via steganographic methods. In this study we present a statistical method to detect the presence of hidden messages in audio signals. The basic idea is that the distributions of various statistical distance measures, calculated on cover audio signals and on stego-audio signals vis-a-vis their denoised versions, are statistically different. The design of the audio steganalyzer relies on the choice of these audio quality measures and the construction of a two-class classifier. Experimental results show that the proposed technique can be used to detect the presence of hidden messages in digital audio data.

116 citations

Book

Audio Signal Processing and Coding

Andreas Spanias, Ted Painter, Venkatraman Atti

09 Feb 2007

TL;DR: This book covers signal processing essentials, audio coding standards and algorithms, and quality measures for perceptual audio coding.

Abstract: Preface. 1. Introduction. 2. Signal Processing Essentials. 3. Quantization and Entropy Coding. 4. Linear Prediction in Narrowband and Wideband Coding. 5. Psychoacoustic Principles. 6. Time-Frequency Analysis: Filter Banks and Transforms. 7. Transform Coders. 8. Subband Coders. 9. Sinusoidal Coders. 10. Audio Coding Standards and Algorithms. 11. Lossless Audio Coding and Digital Watermarking. 12. Quality Measures for Perceptual Audio Coding. References. Index.

116 citations

Patent

Systems, methods, apparatus, and computer-readable media for backward-compatible audio coding

Dipanjan Sen¹, Pei Xiang¹

Qualcomm¹

15 Mar 2013

TL;DR: Presents systems, methods, and apparatus for backward-compatible coding of a set of basis function coefficients that describe a sound field.

Abstract: Systems, methods, and apparatus for backward-compatible coding of a set of basis function coefficients that describe a sound field are presented.

115 citations

Metrics

Papers: 14,368

Citations: 279,843

No. of papers in the topic in previous years
Year	Papers
2023	38
2022	84
2021	70
2020	62
2019	77
2018	108
