scispace - formally typeset
Search or ask a question
Topic

Audio signal processing

About: Audio signal processing is a research topic. Over the lifetime, 21463 publications have been published within this topic receiving 319597 citations. The topic is also known as: audio processing & Acoustic signal processing.


Papers
More filters
Journal ArticleDOI
TL;DR: A simulated study using a large data set, including nearly 10 000 songs and requiring over a billion audio pairwise comparisons, shows that modulation-scale features improves content identification accuracy substantially, especially when time and frequency distortions are imposed.
Abstract: For nonstationary signal classification, e.g., speech or music, features are traditionally extracted from a time-shifted, yet short data window. For many applications, these short-term features do not efficiently capture or represent longer term signal variation. Partially motivated by human audition, we overcome the deficiencies of short-term features by employing modulation-scale analysis for long-term feature analysis. Our analysis, which uses time-frequency theory integrated with psychoacoustic results on modulation frequency perception, not only contains short-term information about the signals, but also provides long-term information representing patterns of time variation. This paper describes these features and their normalization. We demonstrate the effectiveness of our long-term features over conventional short-term features in content-based audio identification. A simulated study using a large data set, including nearly 10 000 songs and requiring over a billion audio pairwise comparisons, shows that modulation-scale features improves content identification accuracy substantially, especially when time and frequency distortions are imposed.

85 citations

Proceedings ArticleDOI
04 Oct 1998
TL;DR: It is proposed to use audio information along with image and motion information to accomplish segmentation at different levels with promising results with videos digitized from TV programs.
Abstract: A video sequence usually consists of separate scenes, and each scene includes many shots. For video understanding purposes, it is most important to detect scene breaks. To analyze the content of each scene, detection of shot breaks is also required. Usually, a scene break is associated with a simultaneous change of image, motion, and audio characteristics, while a shot break is only accompanied with changes in image or motion or both. We propose to use audio information along with image and motion information to accomplish segmentation at different levels. Promising results have been obtained with videos digitized from TV programs.

85 citations

Patent
Sung Yong Yoon1, Hee Suk Pang1, Hyunkook Lee1, Dong Soo Kim1, Jae Hyun Lim1 
24 Nov 2007
TL;DR: In this article, a method and apparatus for encoding and decoding object-based audio signals is presented. But this method requires the first audio signal and a first audio parameter to be extracted from an audio signal, and the second audio signal to be encoded on an object basis.
Abstract: The present invention relates to a method and apparatus for encoding and decoding object- based audio signals. This audio decoding method includes extracting a first audio signal and a first audio parameter in which a music object are encoded on a channel basis and a second audio signal and a second audio parameter in which a vocal object are encoded on an object basis, from an audio signal, generating a third audio signal by employing at least one of the first and second audio signals, and generating a multi-channel audio signal by employing at least one of the first and second audio parameters and the third audio signal. Accordingly, the amount of calculation in encoding and decoding processes and the size of a bitstream that is encoded can be reduced efficiently.

85 citations

Patent
21 May 2003
TL;DR: In this paper, an auditory prosthesis (30) comprising a microphone (27) for receiving the sound and producing a microphone signal corresponding to the received sound, an output device for providing audio signals in a form receivable by a user of the prosthesis, and a sound processing unit (33) operable to receive the microphone signal and carry out a processing operation on the signal to produce an output signal.
Abstract: An auditory prosthesis (30) comprising a microphone (27) for receiving the sound and producing a microphone signal corresponding to the received sound, an output device for providing audio signals in a form receivable by a user of the prosthesis (30), a sound processing unit (33) operable to receive the microphone signal and carry out a processing operation on the microphone signal to produce an output signal in a form suitable to operate the output device, wherein the sound processing unit (33) is operable in a first mode in which the processing operation comprises at least one variable processing factor which is adjustable by a user to a setting which causes the output signal of the sound processing unit (33) to be adjusted according to the preference of the user for the characteristics of the current acoustic environment.

85 citations

PatentDOI
TL;DR: In this paper, a sound processing method for auditory prostheses, such as cochlear implants, is adapted to improve the perception of loudness by users, and to improve speech perception.
Abstract: A sound processing method for auditory prostheses, such as cochlear implants, which is adapted to improve the perception of loudness by users, and to improve speech perception. The overall contribution of stimuli to simulated loudness is compared with an estimate of acoustic loudness for a normally hearing listener based on the input sound signal. A weighting is applied to the filter channels to emphasize those frequencies which are most important to speech perception for normal hearing listeners when selecting channels as a basis for stimulation.

85 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
81% related
Feature (computer vision)
128.2K papers, 1.7M citations
79% related
Robustness (computer science)
94.7K papers, 1.6M citations
78% related
Noise
110.4K papers, 1.3M citations
77% related
Image segmentation
79.6K papers, 1.8M citations
77% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202319
202263
2021217
2020525
2019659
2018597