Topic

Audio signal processing

About: Audio signal processing is a research topic. Over the lifetime, 21463 publications have been published within this topic receiving 319597 citations. The topic is also known as: audio processing & Acoustic signal processing.


Papers
Patent
Soochan Lim
03 Mar 2008
TL;DR: A method for equalizing audio, and a video apparatus using that method, are presented. The method detects the distance between a speaker mounted in the video apparatus and a reflective surface, and equalizes the audio signal to be output from the speaker based on the detected distance.
Abstract: A method for equalizing audio and a video apparatus using the audio equalizing method are provided. The method for equalizing audio includes detecting the distance between a speaker mounted in a video apparatus and a reflective surface, and equalizing an audio signal to be output from the speaker based on the detected distance. Accordingly, attenuation of audio output is reduced, so audio output is optimized.

87 citations

Proceedings ArticleDOI
04 May 2014
TL;DR: A multi-resolution approach based on discrete wavelet transform and linear prediction filtering that improves time resolution and performance of onset detection in different musical scenarios and significantly outperforms existing methods in terms of F-Measure is presented.
Abstract: A plethora of different onset detection methods have been proposed in recent years. However, few attempts have been made with respect to widely-applicable approaches in order to achieve superior performances over different types of music and with considerable temporal precision. In this paper, we present a multi-resolution approach based on discrete wavelet transform and linear prediction filtering that improves time resolution and performance of onset detection in different musical scenarios. In our approach, wavelet coefficients and forward prediction errors are combined with auditory spectral features and then processed by a bidirectional Long Short-Term Memory recurrent neural network, which acts as a reduction function. The network is trained with a large database of onset data covering various genres and onset types. We compare results with state-of-the-art methods on a dataset that includes Bello, Glover and ISMIR 2004 Ballroom sets, and we conclude that our approach significantly outperforms existing methods in terms of F-Measure. For pitched non-percussive music an absolute improvement of 7.5% is reported.

87 citations
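The onset-detection abstract above reports results in terms of F-Measure. As a rough illustration of how such a score is commonly computed for onsets, the sketch below matches detected onset times to reference onset times one-to-one within a tolerance window. The function name `onset_f_measure`, the greedy matching strategy, and the 25 ms default tolerance are assumptions made for this example; the abstract does not state the evaluation settings the authors used.

```python
def onset_f_measure(detected, reference, tolerance=0.025):
    """Return (precision, recall, f_measure) for onset times in seconds.

    Assumption: a detection counts as correct if it lies within
    `tolerance` seconds of a not-yet-matched reference onset.
    Matching is greedy over the two sorted lists.
    """
    detected = sorted(detected)
    reference = sorted(reference)
    i = j = 0
    true_positives = 0
    while i < len(detected) and j < len(reference):
        diff = detected[i] - reference[j]
        if abs(diff) <= tolerance:
            # Close enough: consume both the detection and the reference.
            true_positives += 1
            i += 1
            j += 1
        elif diff < 0:
            # Detection is too early to match anything: false positive.
            i += 1
        else:
            # Reference onset was passed without a match: false negative.
            j += 1
    if true_positives == 0:
        return 0.0, 0.0, 0.0
    precision = true_positives / len(detected)
    recall = true_positives / len(reference)
    f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure


if __name__ == "__main__":
    # Hypothetical onset times, for illustration only.
    p, r, f = onset_f_measure([0.50, 1.02, 1.50, 2.40], [0.51, 1.00, 2.00])
    print(f"precision={p:.3f} recall={r:.3f} F={f:.3f}")
```

Greedy matching keeps the sketch short; an evaluation toolkit may use an optimal bipartite matching instead, which can give a slightly different count when onsets are densely spaced.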

Proceedings Article
01 Aug 2009
TL;DR: A nearly ideal VAD algorithm is proposed that is both easy to implement and noise-robust compared to some previous methods, and that uses short-term features such as Spectral Flatness and Short-term Energy.
Abstract: Voice Activity Detection (VAD) is a very important front-end processing step in all speech and audio processing applications. The performance of most if not all speech/audio processing methods is crucially dependent on the performance of Voice Activity Detection. An ideal voice activity detector needs to be independent of application area and noise condition and require the least parameter tuning in real applications. In this paper, a nearly ideal VAD algorithm is proposed which is both easy to implement and noise-robust compared to some previous methods. The proposed method uses short-term features such as Spectral Flatness (SF) and Short-term Energy. This makes the method appropriate for online processing tasks. The proposed method was evaluated on several speech corpora with additive noise and is compared with some of the most recently proposed algorithms. The experiments show satisfactory performance in various noise conditions.

87 citations
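The two short-term features named in the VAD abstract above can be computed per frame with very little code. The sketch below uses the usual definitions: short-term energy as the mean squared sample value, and spectral flatness as the ratio of the geometric mean to the arithmetic mean of the power spectrum. The helper names, the naive DFT, the thresholds, and the final decision rule in `is_tonal_and_energetic` are all assumptions for illustration; the abstract does not describe the paper's actual decision logic.

```python
import cmath
import math


def power_spectrum(frame):
    """Power of the non-negative-frequency DFT bins (naive O(n^2) DFT)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n)
                  for t in range(n))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def short_term_energy(frame):
    """Mean squared amplitude of the frame."""
    return sum(x * x for x in frame) / len(frame)


def spectral_flatness(frame, eps=1e-12):
    """Geometric mean over arithmetic mean of the power spectrum.

    Values near 1 indicate a flat, noise-like spectrum; values near 0
    indicate a peaky, tonal spectrum. `eps` avoids log(0).
    """
    power = [p + eps for p in power_spectrum(frame)]
    geometric_mean = math.exp(sum(math.log(p) for p in power) / len(power))
    arithmetic_mean = sum(power) / len(power)
    return geometric_mean / arithmetic_mean


def is_tonal_and_energetic(frame, energy_threshold=0.01,
                           flatness_threshold=0.3):
    """Hypothetical frame decision: enough energy and a non-flat spectrum."""
    return (short_term_energy(frame) > energy_threshold
            and spectral_flatness(frame) < flatness_threshold)


if __name__ == "__main__":
    n = 64
    tone = [math.sin(2 * math.pi * 8 * t / n) for t in range(n)]
    impulse = [1.0] + [0.0] * (n - 1)  # perfectly flat spectrum
    print(is_tonal_and_energetic(tone), is_tonal_and_energetic(impulse))
```

A practical detector would replace the naive DFT with an FFT and would adapt the thresholds to the noise floor rather than fixing them.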

Patent
26 Sep 1980
TL;DR: In this article, a radio apparatus has a section for receiving an analog signal and a digital angle-modulated carrier wave signal, and a clock signal is regenerated from the output of either the demodulating means or receiver section.
Abstract: According to the present invention, a radio apparatus has a section for receiving an analog signal and a digital angle-modulated carrier wave signal. The analog signal and digital angle-modulated carrier wave output of the receiver section are demodulated to provide first and second demodulated signals. A clock signal is regenerated from the output of either the demodulating means or receiver section. A control signal selectively operates a switch for passing either the first or the second demodulated signals. The regenerated clock signal controls the switch.

87 citations

Journal ArticleDOI
TL;DR: To achieve real-time processing, independent of signal length, slice-wise processing of the full input signal is proposed and referred to as sliCQ transform, and overcomes computational inefficiency and lack of invertibility of classical constant-Q transform implementations.
Abstract: Audio signal processing frequently requires time-frequency representations and in many applications, a non-linear spacing of frequency bands is preferable. This paper introduces a framework for efficient implementation of invertible signal transforms allowing for non-uniform frequency resolution. Non-uniformity in frequency is realized by applying nonstationary Gabor frames with adaptivity in the frequency domain. The realization of a perfectly invertible constant-Q transform is described in detail. To achieve real-time processing, independent of signal length, slice-wise processing of the full input signal is proposed and referred to as sliCQ transform. By applying frame theory and FFT-based processing, the presented approach overcomes computational inefficiency and lack of invertibility of classical constant-Q transform implementations. Numerical simulations evaluate the efficiency of the proposed algorithm and the method's applicability is illustrated by experiments on real-life audio signals.

87 citations
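To make the "non-linear spacing of frequency bands" in the constant-Q abstract above concrete, the sketch below lists the geometrically spaced center frequencies and bandwidths of a constant-Q filter bank, where every band has the same ratio of center frequency to bandwidth. It only illustrates the frequency layout under the textbook definition; it does not implement the nonstationary Gabor frame or sliCQ machinery of the paper, and the function name and parameters are assumptions chosen for this example.

```python
import math


def constant_q_bins(f_min, f_max, bins_per_octave):
    """Return (q, centers, bandwidths) for a constant-Q frequency layout.

    Centers follow f_k = f_min * 2**(k / bins_per_octave), and each
    bandwidth is f_k / q, so center / bandwidth is the same for all bins.
    """
    q = 1.0 / (2.0 ** (1.0 / bins_per_octave) - 1.0)
    n_bins = int(math.floor(bins_per_octave * math.log2(f_max / f_min))) + 1
    centers = [f_min * 2.0 ** (k / bins_per_octave) for k in range(n_bins)]
    bandwidths = [fc / q for fc in centers]
    return q, centers, bandwidths


if __name__ == "__main__":
    # Hypothetical range: three octaves above 55 Hz, 12 bins per octave.
    q, centers, bandwidths = constant_q_bins(55.0, 440.0, 12)
    print(f"Q = {q:.2f}, {len(centers)} bins")
    for fc, bw in list(zip(centers, bandwidths))[::12]:
        print(f"center {fc:7.2f} Hz, bandwidth {bw:6.2f} Hz")
```

Because the bandwidth grows in proportion to the center frequency, low bands get fine frequency resolution and high bands get fine time resolution, which is the trade-off a uniform FFT grid cannot provide.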


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
81% related
Feature (computer vision)
128.2K papers, 1.7M citations
79% related
Robustness (computer science)
94.7K papers, 1.6M citations
78% related
Noise
110.4K papers, 1.3M citations
77% related
Image segmentation
79.6K papers, 1.8M citations
77% related
Performance
Metrics
No. of papers in the topic in previous years
Year  Papers
2023  19
2022  63
2021  217
2020  525
2019  659
2018  597