scispace - formally typeset
Open AccessDissertation

Sound-source recognition: a theory and computational model

Reads0
Chats0
TLDR
A computer model of the recognition process is developed that is capable of “listening” to a recording of a musical instrument and classifying the instrument as one of 25 possibilities, based on current models of signal processing in the human auditory system.
Abstract
The ability of a normal human listener to recognize objects in the environment from only the sounds they produce is extraordinarily robust with regard to characteristics of the acoustic environment and of other competing sound sources. In contrast, computer systems designed to recognize sound sources function precariously, breaking down whenever the target sound is degraded by reverberation, noise, or competing sounds. Robust listening requires extensive contextual knowledge, but the potential contribution of sound-source recognition to the process of auditory scene analysis has largely been neglected by researchers building computational models of the scene analysis process. This thesis proposes a theory of sound-source recognition, casting recognition as a process of gathering information to enable the listener to make inferences about objects in the environment or to predict their behavior. In order to explore the process, attention is restricted to isolated sounds produced by a small class of sound sources, the non-percussive orchestral musical instruments. Previous research on the perception and production of orchestral instrument sounds is reviewed from a vantage point based on the excitation and resonance structure of the sound-production process, revealing a set of perceptually salient acoustic features. A computer model of the recognition process is developed that is capable of “listening” to a recording of a musical instrument and classifying the instrument as one of 25 possibilities. The model is based on current models of signal processing in the human auditory system. It explicitly extracts salient acoustic features and uses a novel improvisational taxonomic architecture (based on simple statistical pattern-recognition techniques) to classify the sound source. The performance of the model is compared directly to that of skilled human listeners, using

read more

Citations
More filters
Patent

Intuitive computing methods and systems

TL;DR: A smart phone senses audio, imagery, and/or other stimulus from a user's environment, and acts autonomously to fulfill inferred or anticipated user desires as discussed by the authors, and can apply more or less resources to an image processing task depending on how successfully the task is proceeding or based on the user's apparent interest in the task.
Journal ArticleDOI

Experiments in hearing

TL;DR: In what case do you like reading so much? What about the type of the experiments in hearing book? The needs to read? Well, everybody has their own reason why should read some books.
Proceedings Article

Automatic Musical Genre Classification of Audio Signals.

TL;DR: Algorithms for the automatic genre categorization of audio signals are described and a set of features for representing texture and instrumentation and a novel set of Features for representing rhythmic structure and strength are proposed.
Proceedings ArticleDOI

Musical instrument recognition using cepstral coefficients and temporal features

TL;DR: A wide set of features covering both spectral and temporal properties of sounds was investigated, and their extraction algorithms were designed and validated using test data that consisted of 1498 samples covering the full pitch ranges of 30 orchestral instruments, played with different techniques.
Journal Article

The science of sound

A Lewis
- 01 Sep 1997 - 
TL;DR: In this article, the authors describe research and development on car noise and vibration by the German car manufacturer Opel, which has one of the world's largest anechoic chambers, 22m long by 16m wide, which allows the whole powertrain and exhaust system to be examined away from the bodywork.
References
More filters
Book

Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference

TL;DR: Probabilistic Reasoning in Intelligent Systems as mentioned in this paper is a complete and accessible account of the theoretical foundations and computational methods that underlie plausible reasoning under uncertainty, and provides a coherent explication of probability as a language for reasoning with partial belief.
Book

Classification and regression trees

Leo Breiman
TL;DR: The methodology used to construct tree structured rules is the focus of a monograph as mentioned in this paper, covering the use of trees as a data analysis method, and in a more mathematical framework, proving some of their fundamental properties.
Related Papers (5)