scispace - formally typeset
Search or ask a question
Topic

Spectrogram

About: Spectrogram is a research topic. Over the lifetime, 5813 publications have been published within this topic receiving 81547 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: The ConditionaL Neural Network (CLNN) 1 and its extension, the MaskedconditionaL neural network (MCLNN), designed to exploit the nature of sound in a time–frequency representation and surpasses neural networks based architectures including state-of-the-art Convolutional Neural Networks and several hand-crafted attempts.

32 citations

Patent
16 Apr 2008
TL;DR: In this paper, an FPGA-based random signal generator comprising a PC, a USB controller, an MCU3, a MCU interface module, a crystal resonator, an EPC2, a time controller, a dual-channel DA output circuit, a frequency controller, register matrix unit, a keyboard, a key scanning module, FLASH, a Flash control module, TFT display, a TFT control module and a DDS signal generator, a waveform synthesis module are presented.
Abstract: The invention discloses an FPGA-based random signal generator comprising a PC, a USB controller, an MCU3, an MCU interface module, a crystal resonator, an EPC2, a time controller, a dual-channel DA output circuit, a frequency controller, a register matrix unit, a keyboard, a keyboard scanning module, a FLASH, a Flash control module, a TFT display, a TFT control module, a DDS signal generator, a waveform synthesis module and other waveform generators. When the invention is used, software can automatically complete frequency spectrum information identification and get frequency point amplitude and phase parameters after a frequency spectrogram and phase spectrogram parameters are inputted into a software control interface; then a time domain information table is obtained after sampling values are quantized and encoded, the time domain information table is downloaded to the RAM of a DDS generating circuit to realize periodic or nonperiodic time domain signal reduction output; furthermore, a waveform amplitude is online stepped and adjustable, thereby realizing the aims of frequency domain output and time domain output.

32 citations

Journal ArticleDOI
TL;DR: This approach combines a preprocessing based on functional principles of the human auditory system and a probabilistic tracking scheme with an algorithm for adaptive frequency range segmentation as well as Bayesian smoothing to derive an efficient framework for estimating formant trajectories.
Abstract: We present a framework for estimating formant trajectories. Its focus is to achieve high robustness in noisy environments. Our approach combines a preprocessing based on functional principles of the human auditory system and a probabilistic tracking scheme. For enhancing the formant structure in spectrograms we use a Gammatone filterbank, a spectral preemphasis, as well as a spectral filtering using difference-of-Gaussians (DoG) operators. Finally, a contrast enhancement mimicking a competition between filter responses is applied. The probabilistic tracking scheme adopts the mixture modeling technique for estimating the joint distribution of formants. In conjunction with an algorithm for adaptive frequency range segmentation as well as Bayesian smoothing an efficient framework for estimating formant trajectories is derived. Comprehensive evaluations of our method on the VTR-formant database emphasize its high precision and robustness. We obtained superior performance compared to existing approaches for clean as well as echoic noisy speech. Finally, an implementation of the framework within the scope of an online system using instantaneous feature-based resynthesis demonstrates its applicability to real-world scenarios.

32 citations

Journal ArticleDOI
TL;DR: The Locally Optimized Spectrogram (LOS) defines a novel method for obtaining a high-resolution time-frequency representation based on the short-time fractional Fourier transform (STFrFT) and shows the trade-off between the cross-terms suppression and auto-terms resolution.

32 citations

Journal ArticleDOI
TL;DR: In this paper optical implementation of a generalized space-spatial frequency (GSF) function is discussed and experimental results using coherent space-integrating techniques for the realization of different GSF are presented.
Abstract: One-dimensional signals have complementary attributes, their space and spatial frequency properties. Recently, another signal representation was introduced in optics, the Wigner distribution (WD) function, which allows the simultaneous display of the two attributes of a 1-D signal. In this paper optical implementation of a generalized space–spatial frequency (GSF) function is discussed. Special cases of the GSF representations are the WD, the radar ambiguity function (AF), the energy distribution function, the various pseudo-WD functions, the spectrogram, the local frequency and the local Doppler frequency spectrum. The GSF representation allows the simultaneous filtering of both the spatial and the spatial frequency content of a 1-D signal. The 2-D filtering of a 1-D signal allows sidelobe reduction of the WD and AF of quasi-periodic 1-D signals, display of the instantaneous spatial frequency content of a 1-D signal, space–spatial frequency excision of a desired portion of a 1-D signal as well as the usual matched filter detection. Experimental results using coherent space-integrating techniques for the realization of different GSF are presented.

32 citations


Network Information
Related Topics (5)
Deep learning
79.8K papers, 2.1M citations
79% related
Convolutional neural network
74.7K papers, 2M citations
78% related
Feature extraction
111.8K papers, 2.1M citations
77% related
Wavelet
78K papers, 1.3M citations
76% related
Support vector machine
73.6K papers, 1.7M citations
75% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20241
2023627
20221,396
2021488
2020595
2019593