scispace - formally typeset
Search or ask a question
Topic

Spectrogram

About: Spectrogram is a research topic. Over the lifetime, 5813 publications have been published within this topic receiving 81547 citations.


Papers
More filters
01 Jan 2008
TL;DR: A low-dimensional feature is defined which captures the shape of the modulation spectra and is improved from the previous result of Eer=25.1% to EER=17.4% on the NIST 2001 speaker recognition task.
Abstract: A so-called modulation spectrogram is obtained from the conventional speech spectrogram by short-term spectral analysis along the temporal trajectories of the frequency bins. In its original definition, the modulation spectrogram is a highdimensional representation and it is not clear how to extract features from it. In this paper, we define a low-dimensional feature which captures the shape of the modulation spectra. The recognition accuracy of the modulation spectrogram based classifier is improved from our previous result of EER=25.1% to EER=17.4% on the NIST 2001 speaker recognition task.

45 citations

Journal ArticleDOI
TL;DR: The RID gave a detailed representation of the TMJ signals' relative energy distribution in the time and frequency domains, with a great reduction in the interference or cross terms, and appears to be most useful in the application of time-frequency distributions in classification of TMJ sounds.
Abstract: summary For analysis of time-varying signals such as the TMJ sounds, it is often desirable to know how the frequency components change with time, using methods of timefrequency analysis. The aim of this study was to compare two of the most familiar methods for energy density representation with a newly developed technique. The sounds were recorded with a microphone fastened to the subject’s forehead, transformed to the time-frequency domain and displayed as 3D- and contour plots using spectrogram, Wigner distribution (WD), and the reduced interference distribution (RID) to display their time-frequency energy distributions. The spectrogram resolved only the lowfrequency components. The WD provided higher resolution but also exhibited strong interference between components. The RID gave a detailed representation of the TMJ signals’ relative energy distribution in the time and frequency domains, with a great reduction in the interference or cross terms. The RID therefore appears to be most useful in the application of time-frequency distributions in classification of TMJ sounds.

45 citations

Journal ArticleDOI
TL;DR: An orthogonal linear chirp modulation scheme that is based on assigning different users with optimally designed parameters in order to reduce multiple-access interference is designed and improved performance when compared with frequency-shift-keying (FSK) modulation is demonstrated.
Abstract: We propose the use of time-varying (TV) signaling in modulation schemes to provide multiuser detection and multipath diversity in TV wireless channels. Specifically, we design an orthogonal linear chirp modulation scheme that is based on assigning different users with optimally designed parameters in order to reduce multiple-access interference. We also derive conditions on the parameters of the modulation signals to achieve multipath diversity. Furthermore, we propose the use of TV pilot signals with nonlinear instantaneous frequency and matched time-frequency (TF) techniques to estimate fast-fading channels with unknown state information. The proposed algorithm simplifies to the estimation of the parameters of multiple linear chirps, which we perform using the modified matching pursuit decomposition. We compare our estimation method with the use of pilot signals with linear instantaneous frequency, which we implement using the reassigned spectrogram. The proposed modulation scheme is applied to a frequency-hopped code-division multiple-access system for which we demonstrate improved performance when compared with frequency-shift-keying (FSK) modulation due to the designed multipath diversity and low multiple-access interference. Our simulations also demonstrate the increased estimation performance when pilot signals with nonlinear structures are used instead of linear structured ones to estimate TV channel parameters

45 citations

Journal ArticleDOI
TL;DR: An extension of non-negative matrix factorization where the temporal activations become frequency dependent and follow a time-varying autoregressive moving average (ARMA) modeling leads to an efficient single-atom decomposition for a single audio event with strong spectral variation (but with constant pitch).
Abstract: Real-world sounds often exhibit time-varying spectral shapes, as observed in the spectrogram of a harpsichord tone or that of a transition between two pronounced vowels. Whereas the standard non-negative matrix factorization (NMF) assumes fixed spectral atoms, an extension is proposed where the temporal activations (coefficients of the decomposition on the spectral atom basis) become frequency dependent and follow a time-varying autoregressive moving average (ARMA) modeling. This extension can thus be interpreted with the help of a source/filter paradigm and is referred to as source/filter factorization. This factorization leads to an efficient single-atom decomposition for a single audio event with strong spectral variation (but with constant pitch). The new algorithm is tested on real audio data and shows promising results.

45 citations

Journal ArticleDOI
TL;DR: In this paper, the authors explore the possibilities of the spectrogram representation of one-point measurements of the ship wake to identify these components and to quantify their main properties, and demonstrate that the wake energy from strongly powered large ferries that sail at depth Froude numbers is concentrated in components that are located near the edge of the wake wedge.
Abstract: The wake of a ship that sails at relatively large Froude numbers usually contains a number of components of different nature and with different heights, lengths, timings and propagation directions. We explore the possibilities of the spectrogram representation of one-point measurements of the ship wake to identify these components and to quantify their main properties. This representation, based on the short-time Fourier transform, facilitates a reliable decomposition of the wake into constituent components and makes it possible to quantify their variations in the time–space domain and the energy content of each component, from very low-frequency precursor waves up to high-frequency signals within the frequency range of typical wind-generated waves. A method for estimation of the ship speed and the distance of its sailing line from the measurement site is proposed, which only uses information available within the record of the ship wake surface elevation, but where it is assumed that the wake pattern does not deviate significantly from the classical Kelvin wake structure. The wake decomposition using the spectrogram method allows investigation of the energy content that can be attributed to each individual component of the wake. We demonstrate that the majority (60–80 %) of wake energy from strongly powered large ferries that sail at depth Froude numbers is concentrated in components that are located near the edge of the wake wedge. Finally, we demonstrate that the spectrogram representation offers a convenient way to identify a specific signature of single types of ships.

45 citations


Network Information
Related Topics (5)
Deep learning
79.8K papers, 2.1M citations
79% related
Convolutional neural network
74.7K papers, 2M citations
78% related
Feature extraction
111.8K papers, 2.1M citations
77% related
Wavelet
78K papers, 1.3M citations
76% related
Support vector machine
73.6K papers, 1.7M citations
75% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20241
2023627
20221,396
2021488
2020595
2019593