Sound-source recognition: a theory and computational model

Open AccessDissertation

Sound-source recognition: a theory and computational model

Chats0

TLDR

A computer model of the recognition process is developed that is capable of “listening” to a recording of a musical instrument and classifying the instrument as one of 25 possibilities, based on current models of signal processing in the human auditory system.

Abstract:

The ability of a normal human listener to recognize objects in the environment from only the sounds they produce is extraordinarily robust with regard to characteristics of the acoustic environment and of other competing sound sources. In contrast, computer systems designed to recognize sound sources function precariously, breaking down whenever the target sound is degraded by reverberation, noise, or competing sounds. Robust listening requires extensive contextual knowledge, but the potential contribution of sound-source recognition to the process of auditory scene analysis has largely been neglected by researchers building computational models of the scene analysis process. This thesis proposes a theory of sound-source recognition, casting recognition as a process of gathering information to enable the listener to make inferences about objects in the environment or to predict their behavior. In order to explore the process, attention is restricted to isolated sounds produced by a small class of sound sources, the non-percussive orchestral musical instruments. Previous research on the perception and production of orchestral instrument sounds is reviewed from a vantage point based on the excitation and resonance structure of the sound-production process, revealing a set of perceptually salient acoustic features. A computer model of the recognition process is developed that is capable of “listening” to a recording of a musical instrument and classifying the instrument as one of 25 possibilities. The model is based on current models of signal processing in the human auditory system. It explicitly extracts salient acoustic features and uses a novel improvisational taxonomic architecture (based on simple statistical pattern-recognition techniques) to classify the sound source. The performance of the model is compared directly to that of skilled human listeners, using

Sound-source recognition: a theory and computational model

Citations

Intuitive computing methods and systems

Experiments in hearing

Automatic Musical Genre Classification of Audio Signals.

Musical instrument recognition using cepstral coefficients and temporal features

The science of sound

References

Classification and Regression Trees.

Pattern Classification

Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference

Classification and regression trees

The senses considered as perceptual systems

Related Papers (5)

Musical instrument recognition using cepstral coefficients and temporal features

Multidimensional perceptual scaling of musical timbres

Prediction-driven computational auditory scene analysis

Perceptual scaling of synthesized musical timbres: Common dimensions, specificities, and latent subject classes

Construction and evaluation of a robust multifeature speech/music discriminator