Topic

Audio signal processing

About: Audio signal processing is a research topic. To date, 21,463 publications have appeared within this topic, receiving 319,597 citations. The topic is also known as audio processing and acoustic signal processing.


Papers
Patent
TL;DR: A signal processor formed using digital waveguide networks is described. The signal processor is typically used for digital reverberation and for synthesis of reed, string or other instruments.
Abstract: Disclosed is a signal processor formed using digital waveguide networks. The digital waveguide networks have signal scattering junctions. A junction connects two waveguide sections together or terminates a waveguide. The junctions are constructed from conventional digital components such as multipliers, adders, and delay elements. The signal processor of the present invention is typically used for digital reverberation and for synthesis of reed, string or other instruments.

80 citations
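
To make the waveguide abstract above more concrete, here is a minimal sketch of two waveguide sections joined by one scattering junction, built only from multiplies, adds, and delay lines. It is not the patented design: the Kelly-Lochbaum form of the junction, the names `scatter` and `simulate`, the delay length, and the reflection coefficients are all assumptions chosen for illustration.

```python
from collections import deque


def scatter(f_in, b_in, k):
    """Two-port scattering junction (Kelly-Lochbaum form, assumed here).

    f_in: wave arriving from the left section (travelling right)
    b_in: wave arriving from the right section (travelling left)
    k:    reflection coefficient of the junction, with |k| <= 1
    """
    f_out = (1.0 + k) * f_in - k * b_in  # continues into the right section
    b_out = k * f_in + (1.0 - k) * b_in  # returns into the left section
    return f_out, b_out


def simulate(n_samples, delay=4, k=0.3, r_left=-0.95, r_right=-0.95):
    """Impulse response of two sections joined by one junction.

    Each section is a pair of delay lines (one per travel direction).
    r_left and r_right are hypothetical lossy terminations at the far ends.
    """
    right1 = deque([0.0] * delay)  # section 1, right-going
    left1 = deque([0.0] * delay)   # section 1, left-going
    right2 = deque([0.0] * delay)  # section 2, right-going
    left2 = deque([0.0] * delay)   # section 2, left-going
    right1[0] = 1.0                # unit impulse injected at the left end

    out = []
    for _ in range(n_samples):
        # Oldest samples leave the delay lines.
        f_in = right1.pop()
        b_in = left2.pop()
        at_left_end = left1.pop()
        at_right_end = right2.pop()

        # Junction: scatter the two incoming waves.
        f_out, b_out = scatter(f_in, b_in, k)
        right2.appendleft(f_out)
        left1.appendleft(b_out)

        # Terminations: reflect with loss back into the sections.
        right1.appendleft(r_left * at_left_end)
        left2.appendleft(r_right * at_right_end)

        out.append(at_right_end)   # observe the wave reaching the right end
    return out


if __name__ == "__main__":
    print(simulate(24))
```

With `k = 0` the junction passes both waves through unchanged, so the two sections behave like one longer waveguide; a non-zero `k` splits each arriving wave into a transmitted and a reflected part.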

Proceedings Article
14 May 2006
TL;DR: A new system for translating infant cries using the infant's facial image and cry sounds is presented; its sound processing module uses k-means clustering to derive the reason why the infant is crying.
Abstract: A new system for translating infant cries using the infant's facial image and cry sounds is presented in this paper. The system is designed to analyze the facial image and sound of the crying infant to derive the reason why the infant is crying. The image and the sound represent the same cry event. The image processing module determines the state of certain facial features, certain combinations of which determine the reason for crying. The sound processing module analyzes the data for the fundamental frequency and the first two formants and uses k-means clustering to determine the reason for the cry. The decisions from the image and sound processing modules are then fused using a decision-level fusion system. The overall accuracies of the image and sound processing modules are 64% and 74.2%, respectively, and that of the fused decision is 75.2%.

80 citations
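
The sound processing step in the abstract above (k-means clustering on the fundamental frequency and the first two formants) can be sketched as follows. This is a plain k-means illustration, not the authors' implementation: the function name `kmeans`, the explicit starting centroids, and the feature values in the usage section are made up for the example and are not data from the paper.

```python
def kmeans(points, centroids, n_iter=20):
    """Plain k-means on small feature tuples.

    points:    list of (f0, formant1, formant2) tuples
    centroids: starting centroids, passed explicitly to keep the run deterministic
    Returns (labels, centroids).
    """
    centroids = [tuple(c) for c in centroids]
    labels = []
    for _ in range(n_iter):
        # Assignment step: nearest centroid by squared Euclidean distance.
        labels = [
            min(
                range(len(centroids)),
                key=lambda j: sum((p - c) ** 2 for p, c in zip(pt, centroids[j])),
            )
            for pt in points
        ]
        # Update step: each centroid moves to the mean of its members.
        updated = []
        for j, old in enumerate(centroids):
            members = [pt for pt, lab in zip(points, labels) if lab == j]
            if members:
                updated.append(tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                updated.append(old)  # keep an empty cluster where it was
        if updated == centroids:
            break  # converged
        centroids = updated
    return labels, centroids


if __name__ == "__main__":
    # Made-up feature vectors in Hz, for illustration only.
    cries = [
        (400.0, 1000.0, 3000.0),
        (420.0, 1100.0, 3100.0),
        (600.0, 1500.0, 3500.0),
        (620.0, 1600.0, 3600.0),
    ]
    labels, centers = kmeans(cries, [cries[0], cries[2]])
    print(labels, centers)
```

In practice the features would usually be scaled before clustering so that no single one dominates the distance; that step is omitted here for brevity.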

Patent
Ole Kirkeby1, Jussi Virolainen1
29 Oct 2008
TL;DR: The user provides a desired direction of spatial attention so that audio processing can focus on that direction and render a corresponding multi-channel audio signal, analogous to a magnifying glass being used to pick out details in a picture.
Abstract: Aspects of the invention provide methods, computer-readable media, and apparatuses for spatially manipulating sound that is played back to a listener over a set of output transducers, e.g., headphones. The listener can direct spatial attention to focus on a portion of an audio scene, analogous to a magnifying glass being used to pick out details in a picture. An input multi-channel audio signal that is generated by audio sources is obtained, and directional information is determined for each of the audio sources. The user provides a desired direction of spatial attention so that audio processing can focus on the desired direction and render a corresponding multi-channel audio signal to the user. A region of an audio scene is expanded around the desired direction while the audio scene is compressed in another portion of the audio scene.

79 citations

Proceedings Article
29 Nov 1996
TL;DR: Digital Alias-free Signal Processing is discussed in this paper to draw attention to the facts that this technique has already reached a considerable degree of maturity so that it can now be used as a widely applicable Digital Signal Processing (DSP) tool and that it is especially competitive in the area of Microwave and Radio Frequency signal processing.
Abstract: The advanced Information Technology we call Digital Alias-free Signal Processing (DASP) is discussed in this paper to draw attention to the facts that, first, this technique has already reached a considerable degree of maturity so that it can now be used as a widely applicable Digital Signal Processing (DSP) tool and, second, that it is especially competitive in the area of Microwave and Radio Frequency (RF) signal processing. Its utility arises from its applicability to digital processing of signals at frequencies considerably exceeding half of the mean sampling rate, which traditionally limit classical DSP applications.

79 citations

Patent
TL;DR: A system and method, for use in a video signal processor, for locating program boundaries and commercial boundaries using audio categories is described. An audio classifier controller determines the rates of change of the audio categories and compares them with a threshold value to locate the boundaries.
Abstract: For use in a video signal processor, there is disclosed a system and method for locating program boundaries and commercial boundaries using audio categories. The system comprises an audio classifier controller that obtains information concerning the audio categories of the segments of an audio signal. Audio categories include such categories as silence, music, noise and speech. The audio classifier controller determines the rates of change of the audio categories. The audio classifier controller then compares each rate of change of the audio categories with a threshold value to locate the boundaries of the programs and commercials. The audio classifier controller is also capable of classifying at least one feature of an audio category change rate using a multifeature classifier to locate the boundaries of the programs and commercials.

79 citations
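
The thresholding idea in the abstract above can be sketched roughly as below. The abstract does not define how the rate of change is computed, so this sketch assumes one simple definition: the fraction of adjacent segment pairs within a sliding window whose category differs. The names `change_rate` and `candidate_boundaries`, the window length, and the threshold value are hypothetical.

```python
def change_rate(labels, window):
    """Sliding-window rate of category change.

    labels: one audio category per segment, e.g. "silence", "music",
            "noise", "speech"
    window: number of adjacent-segment transitions per window (assumed)
    Returns one rate in [0, 1] per window position.
    """
    # 1 where two neighbouring segments have different categories, else 0.
    changes = [int(a != b) for a, b in zip(labels, labels[1:])]
    return [
        sum(changes[i:i + window]) / window
        for i in range(len(changes) - window + 1)
    ]


def candidate_boundaries(labels, window=4, threshold=0.5):
    """Window start indices whose change rate exceeds the threshold."""
    return [
        i for i, rate in enumerate(change_rate(labels, window))
        if rate > threshold
    ]


if __name__ == "__main__":
    # Made-up category sequence: steady speech, a burst of rapid
    # category changes, then steady music.
    segments = (
        ["speech"] * 6
        + ["silence", "music", "silence", "speech"]
        + ["music"] * 6
    )
    print(candidate_boundaries(segments))
```

Stretches of steady category give a rate near zero, while the rapid alternation around the burst pushes the rate above the threshold, which is where this sketch reports candidate boundaries.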


Network Information
Related Topics (5)
Feature extraction: 111.8K papers, 2.1M citations, 81% related
Feature (computer vision): 128.2K papers, 1.7M citations, 79% related
Robustness (computer science): 94.7K papers, 1.6M citations, 78% related
Noise: 110.4K papers, 1.3M citations, 77% related
Image segmentation: 79.6K papers, 1.8M citations, 77% related
Performance
Metrics
Number of papers in the topic in previous years
Year    Papers
2023    19
2022    63
2021    217
2020    525
2019    659
2018    597