Home
/
Topics
/
Speech coding

Topic

Speech coding

About: Speech coding is a research topic. Over the lifetime, 14245 publications have been published within this topic receiving 271964 citations.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1989
1988
1987
1986
1985
1984
1983
1982
1981
1980
1979
1978
1977
1976
1975
1974
1973
1972
1971
1970
1969

1 / 2

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Speech/music discrimination for multimedia applications

[...]

Khaled Helmi El-Maleh¹, M. Klein¹, G. Petrucci¹, Peter Kabal¹•Institutions (1)

McGill University¹

05 Jun 2000

TL;DR: This work presents the results of combining the line spectral frequencies (LSFs) and zero crossing-based features for frame-level narrowband speech/music discrimination and shows the good discriminating power of these features.

...read moreread less

Abstract: Automatic discrimination of speech and music is an important tool in many multimedia applications. Previous work has focused on using long-term features such as differential parameters, variances and time-averages of spectral parameters. These classifiers use features estimated over windows of 0.5-5 seconds, and are relatively complex. We present our results of combining the line spectral frequencies (LSFs) and zero crossing-based features for frame-level narrowband speech/music discrimination. Our classification results for different types of music and speech show the good discriminating power of these features. Our classification algorithms operate using only a frame delay of 20 ms, making them suitable for real-time multimedia applications.

...read moreread less

229 citations

Patent•

Method and system for customizing voice translation of text to speech

[...]

Steve Tischer¹•Institutions (1)

AT&T¹

10 Dec 2001

TL;DR: In this paper, a method and system of customizing voice translation of a text to speech includes digitally recording speech samples of a known speaker, correlating each of the speech samples with a standardized audio representation, and organizing the recorded speech samples and correlated audio representations into a collection.

...read moreread less

Abstract: A method and system of customizing voice translation of a text to speech includes digitally recording speech samples of a known speaker, correlating each of the speech samples with a standardized audio representation, and organizing the recorded speech samples and correlated audio representations into a collection. The collection of speech samples correlated with audio representations is saved as a single voice file and stored in a device capable of translating the text to speech. The voice file is applied to a translation of text to speech so that the translated speech is customized according to the applied voice file.

...read moreread less

229 citations

Patent•

System and method for flexible coding, modulation, and time slot allocation in a radio telecommunications network

[...]

Torbjorn Ward¹, Anders B Sandell¹•Institutions (1)

Ericsson¹

23 Sep 1996

TL;DR: In this paper, a system and method for dynamically adapting the user bit rate of a time division multiple access (TDMA) cellular telecommunication system to achieve optimum voice quality over a broad range of radio channel conditions are disclosed.

...read moreread less

Abstract: A system and method for dynamically adapting the user bit rate of a time division multiple access (TDMA) cellular telecommunication system to achieve optimum voice quality over a broad range of radio channel conditions are disclosed. The system continuously monitors radio channel quality on both the uplink and the downlink, and dynamically adapts the system's combination of speech coding (21), channel coding (22), modulation (23), a number of assignable time slots per call (27) to optimize voice quality of the measured conditions. Various combinations of the system's speech coding, channel coding, modulation, and assignable time slots are identified as combination types (1-5) and corresponding cost functions are defined. By idendifying and selecting the cost function with the lowest cost for the measured radio channel conditions, the system provides the maximum voice quality achievable within the limits of the system design.

...read moreread less

229 citations

Book•

Real-time digital signal processing

[...]

Sen M. Kuo¹, Bob H. Lee¹, Wenshun Tian•Institutions (1)

Northern Illinois University¹

01 Jan 2001

TL;DR: This book presents an introduction to Real-Time Digital Signal Processing, a branch of Digital Image Processing, and some of the techniques used in this area, as well as some new ideas on how to implement these techniques in the real-time.

...read moreread less

Abstract: Preface. Chapter 1. Introduction to Real-Time Digital Signal Processing. Chapter 2. Introduction to TMS320C55x Digital Signal Processor. Chapter 3. DSP Fundamentals and Implementation Considerations. Chapter 4. Design and Implementation of FIR Filters. Chapter 5. Design and Implementation of IIR Filters. Chapter 6. Frequency Analysis and Fast Fourier Transform. Chapter 7. Adaptive Filtering. Chapter 8. Digital Signal Generators. Chapter 9. Dual-Tone Multi-Frequency Detection. Chapter 10. Adaptive Echo Cancellation. Chapter 11. Speech Coding Techniques. Chapter 12. Speech Enhancement Techniques. Chapter 13. Audio Signal Processing. Chapter 14. Channel Coding Techniques. Chapter 15. Introduction to Digital Image Processing. Appendix A: Some Useful Formulas and Definitions. A.1 Trigonometric Identities. A.2 Geometric Series. A.3 Complex Variables. A.4 Units of Power. References. Appendix B: Software Organization and List of Experiments. Index.

...read moreread less

228 citations

Proceedings Article•DOI•

Automatic audio content analysis

[...]

Silvia Pfeiffer¹, Stephan Fischer¹, Wolfgang Effelsberg¹•Institutions (1)

University of Mannheim¹

01 Feb 1997

TL;DR: The theoretic framework and applications of automatic audio content analysis, including analysis of amplitude, frequency and pitch, and simulations of human audio perception, are described.

...read moreread less

Abstract: This paper describes the theoretic framework and applications of automatic audio content analysis. Research in multimedia content analysis has so far concentrated on the video domain. We demonstrate the strength of automatic audio content analysis. We explain the algorithms we use, including analysis of amplitude, frequency and pitch, and simulations of human audio perception. These algorithms serve us as tools for further audio content analysis. We use these tools in applications like the segmentation of audio data streams into logical units for further processing, the analysis of music, as well as the recognition of sounds indicative of violence like shots, explosions and cries.

...read moreread less

227 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
…
24
25
26
27
28
29
30
…
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

14,368

Papers

279,843

Citations

No. of papers in the topic in previous years
Year	Papers
2023	38
2022	84
2021	70
2020	62
2019	77
2018	108

Speech coding

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics