Spectrogram

About: Spectrogram is a research topic. Over its lifetime, 5813 publications have been published within this topic, receiving 81547 citations.

Papers published on a yearly basis

[Chart: number of papers per year, 1970 to 2024]

Papers

Journal Article

A Comparison on Data Augmentation Methods Based on Deep Learning for Audio Classification

Shengyun Wei¹, Shun Zou¹, Feifan Liao¹, Lang Weimin¹

National University of Defense Technology¹

01 Jan 2020

TL;DR: In this paper, a mixed frequency masking data augmentation method is proposed for audio classification, which adopts a nonlinear combination method to construct new samples and a linear method for constructing labels.

Abstract: Deep learning focuses on the representation of the input data and the generalization of the model. It is well known that data augmentation can combat overfitting and improve the generalization ability of deep neural networks. In this paper, we summarize and compare multiple data augmentation methods for audio classification. These strategies include traditional methods on the raw audio signal, as well as the currently popular augmentations of linear interpolation and nonlinear mixing on the spectrum. We explore the generation of new samples, the transformation of labels, and the combination patterns of samples and labels for each data augmentation method. Finally, inspired by SpecAugment and Mixup, we propose an effective and easy-to-implement data augmentation method, which we call Mixed Frequency Masking data augmentation. This method adopts a nonlinear combination method to construct new samples and a linear method to construct labels. All methods are verified on the Freesound Dataset Kaggle2018 dataset, with ResNet adopted as the classifier. The baseline system uses the log-mel spectrogram feature as the input. We use mean Average Precision @3 (mAP@3) as the evaluation metric for all data augmentation methods.
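The abstract does not spell out the exact procedure, so the following is only a rough sketch of one plausible reading: a band of frequency bins in one spectrogram is replaced by the same band from a second spectrogram (a nonlinear, SpecAugment-like combination of samples), and the two labels are blended linearly (Mixup-like) in proportion to the share of bins each sample contributes. The function name `mixed_frequency_mask`, its parameters, and the label-weighting rule are all assumptions made for illustration, not the authors' published method.

```python
import numpy as np

def mixed_frequency_mask(spec_a, spec_b, label_a, label_b, max_width, rng):
    """Hypothetical sketch: fill one frequency band of spec_a from spec_b.

    spec_a, spec_b: arrays of shape (n_freq, n_frames), e.g. log-mel spectrograms.
    label_a, label_b: one-hot or soft label vectors of equal length.
    """
    n_freq = spec_a.shape[0]
    width = int(rng.integers(0, max_width + 1))       # band width in bins
    start = int(rng.integers(0, n_freq - width + 1))  # first bin of the band
    mixed = spec_a.copy()
    # Nonlinear sample combination: the masked band comes entirely from spec_b.
    mixed[start:start + width, :] = spec_b[start:start + width, :]
    # Linear label combination (assumed rule): weight by the share of bins kept.
    lam = 1.0 - width / n_freq
    label = lam * label_a + (1.0 - lam) * label_b
    return mixed, label, (start, width)

rng = np.random.default_rng(0)
spec_a, spec_b = np.zeros((8, 4)), np.ones((8, 4))
mixed, label, band = mixed_frequency_mask(
    spec_a, spec_b, np.array([1.0, 0.0]), np.array([0.0, 1.0]), max_width=4, rng=rng)
```

Copying `spec_a` before writing keeps the original training sample intact, which matters when the same clip is augmented several times per epoch.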

37 citations

Journal Article

Feature extraction from EEG spectrograms for epileptic seizure detection

Ricardo Ramos-Aguilar¹, J. Arturo Olvera-López¹, Ivan Olmos-Pineda¹, Susana Sánchez-Urrieta¹

Benemérita Universidad Autónoma de Puebla¹

01 May 2020-Pattern Recognition Letters

TL;DR: An approach to extracting features from EEG signals is proposed, based on spectrograms that use the STFT to obtain time-frequency representations. It is evaluated on the dataset from Bonn University, with the main task of distinguishing the healthy-person class from the epileptic-attack class.
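As a minimal illustration of how an STFT yields a spectrogram, the sketch below windows a signal into overlapping frames, takes the FFT of each frame, and keeps the squared magnitude. The function name `stft_spectrogram`, the Hann window, the frame length, the hop size, and the synthetic test tone are all assumptions chosen for the example; they are not taken from the paper, and no EEG data is used here.

```python
import numpy as np

def stft_spectrogram(x, frame_len=256, hop=128):
    """Power spectrogram from a Hann-windowed short-time Fourier transform.

    Returns an array of shape (frame_len // 2 + 1, n_frames).
    Assumes len(x) >= frame_len.
    """
    x = np.asarray(x, dtype=float)
    window = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // hop
    # Slice the signal into overlapping frames and taper each one.
    frames = np.stack([x[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    spectrum = np.fft.rfft(frames, axis=1)   # one-sided FFT of every frame
    return (np.abs(spectrum) ** 2).T         # rows: frequency, columns: time

# Synthetic example: a 32 Hz tone sampled at 256 Hz (not real EEG).
fs = 256
t = np.arange(1024) / fs
spec = stft_spectrogram(np.sin(2 * np.pi * 32 * t))
peak_bins = spec.argmax(axis=0)  # strongest frequency bin in each frame
```

With these settings the bin spacing is `fs / frame_len`, that is 1 Hz, so each row index of `spec` can be read directly as a frequency in hertz.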

37 citations

Proceedings Article

The why and how of time-frequency reassignment

François Auger, Patrick Flandrin

25 Oct 1994

TL;DR: A general methodology for improving the readability of any bilinear distribution, referred to as reassignment, is presented; it is essentially a generalization of an improvement of the spectrogram proposed by Kodera, Gendrin and de Villedary (1978).

Abstract: A general methodology providing better readability of any bilinear distribution has been proposed. This methodology, referred to as reassignment, is essentially a generalization of an improvement of the spectrogram proposed by Kodera, Gendrin and de Villedary (1978). After a presentation of this original work, its generalization to a wide range of distributions is shown. The close connections of this method with some related approaches are also underlined.

37 citations

Journal Article

Griffin–Lim Like Phase Recovery via Alternating Direction Method of Multipliers

Yoshiki Masuyama¹, Kohei Yatabe¹, Yasuhiro Oikawa¹

Waseda University¹

01 Jan 2019-IEEE Signal Processing Letters

TL;DR: Two novel algorithms based on GLA and the alternating direction method of multipliers (ADMM) are proposed for better phase recovery with fewer iterations.

Abstract: Recovering a signal from its amplitude spectrogram, or phase recovery, has many applications in acoustic signal processing. When only an amplitude spectrogram is available and no explicit information is given for the phases, the Griffin–Lim algorithm (GLA) is one of the most widely used methods for phase recovery. However, GLA often requires many iterations and results in low perceptual quality in some cases. In this letter, we propose two novel algorithms based on GLA and the alternating direction method of multipliers (ADMM) for better recovery with fewer iterations. Some interpretation of the existing methods and their relation to the proposed methods are also provided. The methods are evaluated both objectively and subjectively.
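For orientation, here is a minimal sketch of the classical Griffin–Lim iteration that the letter builds on. It is not the ADMM-based variants proposed by the authors. The helper names `stft`, `istft` and `griffin_lim`, the Hann window, the hop size, and the random initial phase are assumptions made for this example, and the code expects `(length - len(win))` to be a multiple of `hop`.

```python
import numpy as np

def stft(x, win, hop):
    """One-sided STFT; returns an array of shape (n_frames, len(win) // 2 + 1)."""
    n = len(win)
    n_frames = 1 + (len(x) - n) // hop
    frames = np.stack([x[i * hop:i * hop + n] * win for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def istft(X, win, hop, length):
    """Least-squares inverse STFT by window-weighted overlap-add."""
    n = len(win)
    frames = np.fft.irfft(X, n=n, axis=1) * win
    x = np.zeros(length)
    norm = np.zeros(length)
    for i, frame in enumerate(frames):
        x[i * hop:i * hop + n] += frame
        norm[i * hop:i * hop + n] += win ** 2
    return x / np.maximum(norm, 1e-8)   # guard samples no window covers

def griffin_lim(magnitude, win, hop, length, n_iter=50, seed=0):
    """Classical GLA: alternate between the given magnitude and a consistent STFT."""
    rng = np.random.default_rng(seed)
    phase = np.exp(2j * np.pi * rng.random(magnitude.shape))  # random start
    for _ in range(n_iter):
        x = istft(magnitude * phase, win, hop, length)  # back to the time domain
        X = stft(x, win, hop)                           # consistent spectrogram
        phase = X / np.maximum(np.abs(X), 1e-8)         # keep only its phase
    return istft(magnitude * phase, win, hop, length)

win, hop = np.hanning(64), 16
length = 64 + hop * 30
t = np.arange(length)
target = np.abs(stft(np.sin(2 * np.pi * 0.05 * t), win, hop))
estimate = griffin_lim(target, win, hop, length)
```

Each pass replaces the magnitude of the current estimate with the target while keeping the phase of a spectrogram that some real signal actually produces. The large number of such passes that this scheme can need is the cost the abstract refers to.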

37 citations

Journal Article

A new variable frame analysis method for speech recognition

P. Le Cerf¹, D. Van Compernolle¹

Katholieke Universiteit Leuven¹

01 Jan 1994-IEEE Signal Processing Letters

TL;DR: A new method for VFR using the norm of the derivative parameters in deciding to retain or to discard a frame is introduced, and informal inspection of speech spectrograms shows that this new method puts more emphasis on the transient regions of the speech signal.

Abstract: Variable frame rate (VFR) analysis is a technique used in speech processing and recognition for discarding frames that are too much alike. The article introduces a new method for VFR. Instead of calculating the distance between frames, the norm of the derivative parameters is used in deciding to retain or to discard a frame. Informal inspection of speech spectrograms shows that this new method puts more emphasis on the transient regions of the speech signal. Experimental results with a hidden Markov model (HMM) based system show that the new method outperforms the classical method.
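The abstract states only that the norm of the derivative parameters drives the retain-or-discard decision, so the sketch below is one hypothetical way to realise that idea rather than the authors' rule. It estimates the time derivative of each feature with a central difference, accumulates the Euclidean norm of that derivative frame by frame, and retains a frame whenever the running total reaches a threshold. The name `select_frames`, the accumulation scheme, and the threshold value are all assumptions.

```python
import numpy as np

def select_frames(features, threshold):
    """Hypothetical VFR selection driven by the norm of the feature derivative.

    features: array of shape (n_frames, n_coeffs) with at least two frames.
    Returns the indices of the retained frames.
    """
    features = np.asarray(features, dtype=float)
    # Central-difference estimate of the time derivative of each coefficient.
    delta = np.gradient(features, axis=0)
    norms = np.linalg.norm(delta, axis=1)
    kept, accumulated = [0], 0.0      # the first frame is always retained
    for i in range(1, len(features)):
        accumulated += norms[i]
        if accumulated >= threshold:  # enough spectral change since last keep
            kept.append(i)
            accumulated = 0.0
    return kept

# Toy one-dimensional feature track: flat, then a ramp, then flat again.
track = np.array([0, 0, 0, 0, 0, 1, 2, 3, 4, 4, 4, 4, 4, 4], dtype=float)
retained = select_frames(track[:, None], threshold=1.0)
```

Under this rule, frames in steady regions contribute almost nothing to the running total and are mostly dropped, while frames on the ramp are retained, which mirrors the emphasis on transient regions described in the abstract.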

37 citations

Performance

Metrics

Papers: 7,848
Citations: 107,060

No. of papers in the topic in previous years
Year	Papers
2024	1
2023	627
2022	1,396
2021	488
2020	595
2019	593
