Topic
Spectrogram
About: Spectrogram is a research topic. Over its lifetime, 5,813 publications have been published within this topic, receiving 81,547 citations.
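For context, a spectrogram is the squared-magnitude short-time Fourier transform of a signal: the signal is cut into overlapping windowed frames and each frame is Fourier-transformed. A minimal NumPy sketch (the function name and parameters here are illustrative, not from any paper on this page):

```python
import numpy as np

def spectrogram(x, n_fft=256, hop=128):
    """Magnitude-squared STFT: frame the signal into overlapping
    windows, apply a Hann window, and FFT each frame."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack([x[i * hop : i * hop + n_fft] * window
                       for i in range(n_frames)])
    # Keep only the non-negative frequency bins.
    return np.abs(np.fft.rfft(frames, axis=1)) ** 2  # (frames, n_fft//2 + 1)

# Example: a 440 Hz tone sampled at 8 kHz for one second.
fs = 8000
t = np.arange(fs) / fs
S = spectrogram(np.sin(2 * np.pi * 440 * t))
peak_bin = S.mean(axis=0).argmax()   # strongest frequency bin
peak_hz = peak_bin * fs / 256        # bin index -> Hz (resolution fs/n_fft)
```

The peak lands in the bin nearest 440 Hz (437.5 Hz at this 31.25 Hz bin spacing), which is the resolution tradeoff the FRCNN abstract below refers to.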
Papers published on a yearly basis
Papers
••
TL;DR: A novel spatiotemporal and frequential cascaded attention network with large-margin learning is proposed that achieves promising performance in speech emotion recognition.
33 citations
••
TL;DR: FRCNN is proposed to embrace the progress in very deep architecture, feature fusion and convolutional operation, and depth-wise separable convolution is utilized to reduce the number of trainable parameters.
Abstract: Convolutional neural networks with spectrogram feature representations for acoustic scene classification are attracting increasing attention due to their favorable performance. However, most existing methods remain constrained by the tradeoff between time-frequency feature resolution (the minimum coverage area across the time-frequency representation) and the depth of the CNN model, so performance cannot be improved by simply deepening the network. In this paper, a fine-resolution convolutional neural network (FRCNN) is proposed to embrace progress in very deep architectures, feature fusion, and convolutional operations. Specifically, lateral construction is applied to generate a fine-resolution feature map with semantic information, and depth-wise separable convolution is used to reduce the number of trainable parameters. Extensive experiments demonstrate that the proposed FRCNN achieves high performance on several metrics with low computational complexity.
33 citations
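The depth-wise separable convolution mentioned in the FRCNN abstract replaces one full C_in x C_out x k x k kernel with a per-channel k x k depthwise filter followed by a 1x1 pointwise mix. A quick parameter-count comparison (the layer sizes are illustrative, not taken from the paper):

```python
def conv_params(c_in, c_out, k):
    # Standard 2-D convolution: one k x k kernel per (input, output) pair.
    return c_in * c_out * k * k

def separable_params(c_in, c_out, k):
    # Depthwise: one k x k kernel per input channel,
    # plus a 1 x 1 pointwise convolution mixing channels.
    return c_in * k * k + c_in * c_out

standard = conv_params(64, 128, 3)        # 73728 parameters
separable = separable_params(64, 128, 3)  # 8768 parameters
ratio = standard / separable              # roughly 8x fewer parameters
```

This is why separable convolutions let a network go deeper at the same parameter budget, which is the lever the FRCNN design exploits.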
••
TL;DR: The HR-NMF model is extended to multichannel signals and to convolutive mixtures, and a fast variational expectation-maximization (EM) algorithm is proposed to estimate the enhanced model.
Abstract: Several probabilistic models involving latent components have been proposed for modeling time-frequency (TF) representations of audio signals such as spectrograms, notably in the nonnegative matrix factorization (NMF) literature. Among them, the recent high-resolution NMF (HR-NMF) model is able to take both phases and local correlations in each frequency band into account, and its potential has been illustrated in applications such as source separation and audio inpainting. In this paper, HR-NMF is extended to multichannel signals and to convolutive mixtures. The new model can represent a variety of stationary and non-stationary signals, including autoregressive moving average (ARMA) processes and mixtures of damped sinusoids. A fast variational expectation-maximization (EM) algorithm is proposed to estimate the enhanced model. This algorithm is applied to piano signals, and proves capable of accurately modeling reverberation, restoring missing observations, and separating pure tones with close frequencies.
33 citations
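HR-NMF itself additionally models phase and local correlation, but the core idea it extends can be illustrated with plain NMF on a magnitude spectrogram using the classic multiplicative updates (this is the standard Lee-Seung algorithm, not the HR-NMF model from the paper):

```python
import numpy as np

def nmf(V, rank, n_iter=500, eps=1e-9):
    """Factor a nonnegative matrix V ~= W @ H with multiplicative
    updates minimizing the Euclidean distance."""
    rng = np.random.default_rng(0)
    m, n = V.shape
    W = rng.random((m, rank)) + eps
    H = rng.random((rank, n)) + eps
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)  # update activations
        W *= (V @ H.T) / (W @ H @ H.T + eps)  # update spectral templates
    return W, H

# Toy "spectrogram": two spectral templates active at different times.
templates = np.array([[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]])
activations = np.array([[1.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 1.0]])
V = templates @ activations
W, H = nmf(V, rank=2)
err = np.linalg.norm(V - W @ H)
```

In source-separation use, the columns of W play the role of the "spectral dictionary" and the rows of H the "temporal codes" referred to in the next entry.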
••
TL;DR: Experimental tests have been conducted, which show that the extraction of the spectral dictionary and temporal codes is more efficient using sparsity learning and subsequently leads to better separation performance.
Abstract: An unsupervised single-channel audio separation method is presented from a pattern recognition viewpoint. The proposed method requires no training knowledge, and the separation system is based on non-uniform time-frequency (TF) analysis and feature extraction. Unlike conventional research that concentrates on the spectrogram or its variants, the proposed separation algorithm uses an alternative TF representation based on the gammatone filterbank. In particular, the monaural mixed audio signal is shown to be considerably more separable in this non-uniform TF domain; an analysis of signal separability is provided to verify this finding. In addition, a variational Bayesian approach is derived to learn the sparsity parameters for optimizing the matrix factorization. Experimental tests show that extraction of the spectral dictionary and temporal codes is more efficient with sparsity learning, which subsequently leads to better separation performance.
33 citations
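The gammatone filterbank behind that non-uniform TF representation has a well-known impulse response, g(t) = t^(n-1) e^(-2*pi*b*t) cos(2*pi*fc*t). A minimal sketch of one such filter applied by convolution (the bandwidth formula follows the common ERB convention; exact constants vary across implementations):

```python
import numpy as np

def gammatone_ir(fc, fs, n=4, duration=0.05):
    """Impulse response of an n-th order gammatone filter centred at fc,
    with bandwidth tied to the equivalent rectangular bandwidth (ERB)."""
    t = np.arange(int(duration * fs)) / fs
    erb = 24.7 + fc / 9.265        # Glasberg & Moore ERB approximation
    b = 1.019 * erb                # common bandwidth scaling
    g = t ** (n - 1) * np.exp(-2 * np.pi * b * t) * np.cos(2 * np.pi * fc * t)
    return g / np.abs(g).max()     # crude peak normalization

fs = 16000
ir = gammatone_ir(1000.0, fs)
# Filter a signal by convolution with the impulse response.
x = np.random.default_rng(1).standard_normal(fs // 4)
y = np.convolve(x, ir, mode="same")
```

A full filterbank repeats this for center frequencies spaced on an ERB scale, yielding the non-uniform frequency resolution the paper exploits.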
••
18 Mar 2005
TL;DR: A monaural noise suppression algorithm is proposed based on filtering the spectrotemporal modulations of noisy speech, estimated from a multiscale representation of the signal spectrogram generated by a model of sound processing in the auditory system.
Abstract: A monaural noise suppression algorithm is proposed based on filtering the spectrotemporal modulations of noisy speech. The modulations are estimated from a multiscale representation of the signal spectrogram generated by a model of sound processing in the auditory system. A significant advantage of this method is its ability to suppress noise that has distinctive modulation patterns, despite being spectrally overlapping with the speech. The performance of the algorithm is evaluated using subjective and objective tests and compared to the optimal smoothing and minimum statistics approach (Martin, 2001). The results demonstrate the efficacy of the spectrotemporal filtering approach in the conditions examined.
32 citations
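Spectrotemporal modulation filtering can be caricatured as masking in the 2-D Fourier domain of the spectrogram: slow temporal and spectral modulations, where speech energy concentrates, are kept, and faster modulations are attenuated. A toy low-pass sketch, not the auditory-model pipeline from the paper:

```python
import numpy as np

def modulation_lowpass(S, keep_frac=0.25):
    """Keep only the lowest-rate modulations of a (freq x time)
    spectrogram via a box mask in its 2-D Fourier transform."""
    M = np.fft.fft2(S)
    kf = max(1, int(S.shape[0] * keep_frac))
    kt = max(1, int(S.shape[1] * keep_frac))
    mask = np.zeros_like(S)
    # Low modulation rates live at the corners of the unshifted FFT.
    mask[:kf, :kt] = 1.0
    mask[:kf, -kt:] = 1.0
    mask[-kf:, :kt] = 1.0
    mask[-kf:, -kt:] = 1.0
    return np.real(np.fft.ifft2(M * mask))

rng = np.random.default_rng(0)
S = rng.random((64, 128))          # stand-in for a noisy spectrogram
S_smooth = modulation_lowpass(S)   # fast modulations suppressed
```

Noise with distinctive (fast) modulation patterns loses energy under such a mask while slowly modulated speech structure survives, which is the intuition behind the paper's approach.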