Topic

Audio signal processing

About: Audio signal processing is a research topic. Over its lifetime, 21,463 publications have been published within this topic, receiving 319,597 citations. The topic is also known as audio processing and acoustic signal processing.

Papers

Proceedings Article•DOI•

Score informed audio source separation using a parametric model of non-negative spectrogram

Romain Hennequin¹, Bertrand David¹, Roland Badeau¹•Institutions (1)

Télécom ParisTech¹

22 May 2011

TL;DR: A new technique for monaural source separation in musical mixtures, which uses the knowledge of the musical score to initialize an algorithm which computes a parametric decomposition of the spectrogram based on non-negative matrix factorization (NMF).

Abstract: In this paper we present a new technique for monaural source separation in musical mixtures, which uses the knowledge of the musical score. This information is used to initialize an algorithm which computes a parametric decomposition of the spectrogram based on non-negative matrix factorization (NMF). This algorithm provides time-frequency masks which are used to separate the sources with Wiener filtering.
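The pipeline described in this abstract (score-based initialization, NMF of the spectrogram, then time-frequency masks) can be illustrated with a minimal sketch. It is not the authors' parametric model: it swaps in plain NMF with standard multiplicative updates, and it assumes the score is available as a binary piano roll already aligned to the spectrogram frames. The toy matrices and all function names are hypothetical. Activations initialized to zero stay zero under multiplicative updates, which is how the score constrains the decomposition here.

```python
import random

EPS = 1e-12  # guards against division by zero

def matmul(a, b):
    """Matrix product on plain nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(m):
    return [list(col) for col in zip(*m)]

def sq_error(v, w, h):
    """Squared Euclidean distance between V and the model W H."""
    model = matmul(w, h)
    return sum((x - y) ** 2 for rv, rm in zip(v, model) for x, y in zip(rv, rm))

def nmf(v, w, h, n_iter=200):
    """Multiplicative updates for Euclidean NMF, V ~ W H (not the paper's parametric model)."""
    for _ in range(n_iter):
        wt = transpose(w)
        num, den = matmul(wt, v), matmul(matmul(wt, w), h)
        h = [[hv * n / (d + EPS) for hv, n, d in zip(hr, nr, dr)]
             for hr, nr, dr in zip(h, num, den)]
        ht = transpose(h)
        num, den = matmul(v, ht), matmul(w, matmul(h, ht))
        w = [[wv * n / (d + EPS) for wv, n, d in zip(wr, nr, dr)]
             for wr, nr, dr in zip(w, num, den)]
    return w, h

def wiener_masks(w, h):
    """One soft mask per component: (w_k h_k) / (W H), with values in [0, 1]."""
    model = matmul(w, h)
    return [[[w[f][k] * h[k][t] / (model[f][t] + EPS) for t in range(len(h[0]))]
             for f in range(len(w))] for k in range(len(h))]

# Toy magnitude spectrogram (4 frequency bins x 6 frames) built from two "notes".
w_true = [[1.0, 0.0], [0.5, 0.0], [0.0, 1.0], [0.0, 0.5]]
h_true = [[1.0, 0.8, 0.6, 0.4, 0.0, 0.0], [0.0, 0.0, 0.5, 1.0, 0.7, 0.3]]
v = matmul(w_true, h_true)

# Assumed score: binary piano roll, one row per note, one column per frame.
score = [[1, 1, 1, 1, 0, 0], [0, 0, 1, 1, 1, 1]]
h0 = [[float(s) for s in row] for row in score]  # zero where the note is silent
random.seed(0)
w0 = [[random.uniform(0.1, 1.0) for _ in range(2)] for _ in range(4)]

err_before = sq_error(v, w0, h0)
w, h = nmf(v, w0, h0)
err_after = sq_error(v, w, h)

masks = wiener_masks(w, h)
# Each source estimate is the mixture spectrogram weighted by that source's mask.
estimates = [[[m * x for m, x in zip(mr, vr)] for mr, vr in zip(mask, v)]
             for mask in masks]
```

In a real system the masks would be applied to the complex short-time Fourier transform of the mixture, followed by an inverse transform to recover each source's waveform.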

100 citations

Proceedings Article•DOI•

The 9th annual MLSP competition: New methods for acoustic classification of multiple simultaneous bird species in a noisy environment

Forrest Briggs¹, Yonghong Huang², Raviv Raich¹, Konstantinos Eftaxias³, Zhong Lei, William Cukierski⁴, Sarah Frey Hadley, Adam S. Hadley¹, Matthew G. Betts¹, Xiaoli Z. Fern¹, Jed Irvine¹, Lawrence Neal¹, Anil Thomas⁵, Gabor Fodor⁶, Grigorios Tsoumakas⁷, Hong-Wei Ng⁸, Thi Ngoc Tho Nguyen⁹, Heikki Huttunen¹⁰, Pekka Ruusuvuori¹¹, Tapio Manninen¹⁰, Aleksandr Diment¹⁰, Tuomas Virtanen¹⁰, Julien Marzat¹², Joseph Defretin, David R. Callender, Chris Hurlburt, Ken Larrey, Maxim Milakov⁴•Institutions (12)

Oregon State University¹, Intel², University of Surrey³, Université de Montréal⁴, Cisco Systems, Inc.⁵, Ericsson⁶, Aristotle University of Thessaloniki⁷, University of Illinois at Urbana–Champaign⁸, Agency for Science, Technology and Research⁹, Tampere University of Technology¹⁰, Institute for Systems Biology¹¹, Supélec¹²

14 Nov 2013

TL;DR: It is an open problem for signal processing and machine learning to reliably identify bird sounds in real-world audio data collected in an acoustic monitoring scenario.

Abstract: Birds have been widely used as biological indicators for ecological research. They respond quickly to environmental changes and can be used to draw inferences about other organisms (e.g., insects they feed on). Traditional methods for collecting data about birds involve costly human effort. A promising alternative is acoustic monitoring. There are many advantages to recording audio of birds compared to human surveys, including increased temporal and spatial resolution and extent, applicability in remote sites, reduced observer bias, and potentially lower cost. However, it is an open problem for signal processing and machine learning to reliably identify bird sounds in real-world audio data collected in an acoustic monitoring scenario. Some of the major challenges include multiple simultaneously vocalizing birds, other sources of non-bird sound (e.g., buzzing insects), and background noise like wind, rain, and motor vehicles.

100 citations

Proceedings Article•DOI•

Environmental sound recognition using MP-based features

Selina Chu¹, Shrikanth S. Narayanan¹, C.-C. Jay Kuo¹•Institutions (1)

University of Southern California¹

12 May 2008

TL;DR: A novel method based on matching pursuit to analyze environment sounds for their feature extraction that is flexible, yet intuitive and physically interpretable, and can be used to supplement another well-known audio feature, i.e. MFCC, to yield higher recognition accuracy for environmental sounds.

Abstract: Defining suitable features for environmental sounds is an important problem in an automatic acoustic scene recognition system. As with most pattern recognition problems, extracting the right feature set is the key to effective performance. A variety of features have been proposed for audio recognition, but the vast majority of the past work utilizes features that are well-known for structured data, such as speech and music, and assumes this association will transfer naturally well to unstructured sounds. In this paper, we propose a novel method based on matching pursuit (MP) to analyze environment sounds for their feature extraction. The proposed MP-based method utilizes a dictionary from which to select features, resulting in a representation that is flexible, yet intuitive and physically interpretable. We will show that these features are less sensitive to noise and are capable of effectively representing sounds that originate from different sources and different frequency ranges. The MP-based feature can be used to supplement another well-known audio feature, i.e. MFCC, to yield higher recognition accuracy for environmental sounds.
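The greedy selection at the heart of matching pursuit can be sketched in a few lines. This is only an illustration of the generic algorithm, not the authors' feature extractor: the cosine dictionary, the function names, and the toy signal are assumptions made for this example, and the abstract does not say which dictionary or which derived features the paper uses. At each step the atom most correlated with the residual is picked and its contribution is subtracted.

```python
import math

def make_cosine_dictionary(n_samples, n_atoms):
    """Hypothetical dictionary: unit-norm cosines at integer frequencies 1..n_atoms."""
    atoms = []
    for k in range(1, n_atoms + 1):
        atom = [math.cos(2 * math.pi * k * n / n_samples) for n in range(n_samples)]
        norm = math.sqrt(sum(a * a for a in atom))
        atoms.append([a / norm for a in atom])
    return atoms

def matching_pursuit(signal, atoms, n_select):
    """Greedy MP: returns [(atom_index, coefficient), ...] and the final residual."""
    residual = list(signal)
    picks = []
    for _ in range(n_select):
        # Correlate the residual with every atom and keep the strongest match.
        corrs = [sum(r * a for r, a in zip(residual, atom)) for atom in atoms]
        best = max(range(len(atoms)), key=lambda i: abs(corrs[i]))
        coef = corrs[best]
        # Remove the selected atom's contribution from the residual.
        residual = [r - coef * a for r, a in zip(residual, atoms[best])]
        picks.append((best, coef))
    return picks, residual

atoms = make_cosine_dictionary(n_samples=64, n_atoms=16)
# Toy signal: a strong component on atom 2 and a weaker one on atom 6.
signal = [2.0 * a + 0.5 * b for a, b in zip(atoms[2], atoms[6])]

picks, residual = matching_pursuit(signal, atoms, n_select=2)
residual_energy = sum(r * r for r in residual)
```

The indices and coefficients of the selected atoms are the raw material from which descriptors could be computed; which statistics to derive from them is a design choice left open here.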

100 citations

Patent•

Segmenting Audio Signals into Auditory Events

Brett G. Crockett¹•Institutions (1)

Dolby Laboratories¹

26 Feb 2002

TL;DR: This patent divides an audio signal into auditory events, each of which tends to be perceived as separate and distinct, by calculating the spectral content of successive time blocks of the audio signal and marking a boundary where the difference between successive blocks exceeds a threshold.

Abstract: In one aspect, the invention divides an audio signal into auditory events, each of which tends to be perceived as separate and distinct, by calculating the spectral content of successive time blocks of the audio signal, calculating the difference in spectral content between successive time blocks of the audio signal, and identifying an auditory event boundary as the boundary between successive time blocks when the difference in the spectral content between such successive time blocks exceeds a threshold. In another aspect, the invention generates a reduced-information representation of an audio signal by dividing an audio signal into auditory events, each of which tends to be perceived as separate and distinct, and formatting and storing information relating to the auditory events. Optionally, the invention may also assign a characteristic to one or more of the auditory events. Auditory events may be determined according to the first aspect of the invention or by another method.
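The first aspect described above (spectrum per block, difference between successive blocks, threshold) can be sketched as follows. This is a minimal illustration, not the patented method: the block size, the normalization of each spectrum, the use of a summed absolute difference, and the threshold value are all assumptions chosen for the example, and a naive DFT is used only to keep the code free of dependencies.

```python
import cmath
import math

def magnitude_spectrum(block):
    """Normalized magnitude spectrum of one block (naive DFT, bins 0..N/2)."""
    n = len(block)
    mags = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n) for i, x in enumerate(block))
        mags.append(abs(acc))
    total = sum(mags) or 1.0  # avoid dividing by zero on silent blocks
    return [m / total for m in mags]

def event_boundaries(signal, block_size, threshold):
    """Indices j where the spectrum of block j differs from block j-1 by more than threshold."""
    blocks = [signal[i:i + block_size]
              for i in range(0, len(signal) - block_size + 1, block_size)]
    spectra = [magnitude_spectrum(b) for b in blocks]
    boundaries = []
    for j in range(1, len(spectra)):
        diff = sum(abs(a - b) for a, b in zip(spectra[j], spectra[j - 1]))
        if diff > threshold:
            boundaries.append(j)
    return boundaries

def tone(freq_bin, n_blocks, block_size):
    """Sinusoid with an integer number of cycles per block."""
    return [math.sin(2 * math.pi * freq_bin * i / block_size)
            for i in range(n_blocks * block_size)]

BLOCK = 64
# Four blocks of one tone followed by four blocks of a different tone.
signal = tone(4, 4, BLOCK) + tone(12, 4, BLOCK)
boundaries = event_boundaries(signal, block_size=BLOCK, threshold=0.5)
```

With this toy input a single boundary is reported at block 4, where the tone changes. Real audio would need a window function and a threshold tuned to the material.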

100 citations

Patent•DOI•

Binaural-signal-processing techniques

Albert S. Feng¹, Chen Liu¹, Robert C. Bilger¹, Douglas L. Jones¹, Charissa R. Lansing¹, William D. O'Brien¹, Bruce C. Wheeler¹•Institutions (1)

University of Illinois at Urbana–Champaign¹

16 Nov 1999-Journal of the Acoustical Society of America

TL;DR: A desired acoustic signal is extracted from a noisy environment by generating a signal representative of the desired signal with processor (30) using a discrete Fourier transform process.

Abstract: A desired acoustic signal is extracted from a noisy environment by generating a signal representative of the desired signal with processor (30). Processor (30) receives aural signals from two sensors (22, 24) each at a different location. The two inputs to processor (30) are converted from analog to digital format and then submitted to a discrete Fourier transform process to generate discrete spectral signal representations. The spectral signals are delayed to provide a number of intermediate signals, each corresponding to a different spatial location relative to the two sensors. Locations of the noise source and the desired source, and the spectral content of the desired signal are determined from the intermediate signal corresponding to the noise source locations. Inverse transformation of the selected intermediate signal followed by digital to analog conversion provides an output signal representative of the desired signal with output device (90). Techniques to localize multiple acoustic sources are also disclosed. Further, a technique to enhance noise reduction from multiple sources based on two-sensor reception is described.

100 citations

Performance

Metrics

Papers: 21,541

Citations: 328,867

No. of papers in the topic in previous years
Year	Papers
2023	19
2022	63
2021	217
2020	525
2019	659
2018	597
