scispace - formally typeset
PatentDOI

System and methods for recognizing sound and music signals in high noise and distortion

Avery Li-Chun Wang, +1 more
- 20 Apr 2001 - 
- Vol. 121, Iss: 4, pp 1832
Reads0
Chats0
TLDR
In this article, the authors proposed a method for recognizing audio samples that locates an audio file that most closely matches the audio sample from a database indexing a large set of original recordings, where each indexed audio file is represented in the database index by a set of landmark timepoints and associated fingerprints.
Abstract
A method for recognizing an audio sample locates an audio file that most closely matches the audio sample from a database indexing a large set of original recordings. Each indexed audio file is represented in the database index by a set of landmark timepoints and associated fingerprints. Landmarks occur at reproducible locations within the file, while fingerprints represent features of the signal at or near the landmark timepoints. To perform recognition, landmarks and fingerprints are computed for the unknown sample and used to retrieve matching fingerprints from the database. For each file containing matching fingerprints, the landmarks are compared with landmarks of the sample at which the same fingerprints were computed. If a large number of corresponding landmarks are linearly related, i.e., if equivalent fingerprints of the sample and retrieved file have the same time evolution, then the file is identified with the sample. The method can be used for any type of sound or music, and is particularly effective for audio signals subject to linear and nonlinear distortion such as background noise, compression artifacts, or transmission dropouts. The sample can be identified in a time proportional to the logarithm of the number of entries in the database; given sufficient computational power, recognition can be performed in nearly real time as the sound is being sampled.

read more

Citations
More filters
Patent

Methods and apparatus to identify media using hash keys

TL;DR: In this paper, the authors describe a method to identify media using hash keys, using bitwise comparison of the first metered hash key and a second reference hash key corresponding to a second meta-key.
Patent

Method for user context recognition using sound signatures

TL;DR: In this paper, a method for user micro context recognition using sound signatures was proposed, which includes: recording an environment sound sample from a microphone into a mobile device by a trigger stimulus; simultaneous to recording a sound sample, collecting hardware and software macro context data from available mobile devices; extracting a sound signature from the recorded sound sample based on sound features and spectrograms; comparing for similar patterns the recorded signature with reference sound signatures stored in a sound database; updating the sound database, checking if the recorded sounds was recognized; performing a logical association between the sound label and the available
Patent

Method of and a system for matching audio tracks using chromaprints with a fast candidate selection routine

TL;DR: A computer-implemented method of matching of a first incoming audio track with an indexed audio track, executable at a server, the method comprising: selecting the indexed audio track as a candidate audio track from a plurality of indexed audio tracks; validating the candidate audio tracks against the first audio track as mentioned in this paper.
Patent

Deriving or calculating identifying data from video signals

TL;DR: In this paper, an analyzer comprising an electronic processor programmed for analyzing a video signal to derive or calculate identifying data from data representing picture elements of the video signal or from audio portions accompanying it, and a communications module to communicate the identifying data to a remote repository to obtain advertizing information therefrom.
Patent

Method and system for coupons based on automatic content recognition

TL;DR: In this article, an automatic content recognition (ACR)-enabled connected TV device may be operable to present an overlay offering a coupon utilizing an ACR system, where the overlay may also be presented on a paired device.
References
More filters
Journal ArticleDOI

Content-based classification, search, and retrieval of audio

TL;DR: The audio analysis, search, and classification engine described here reduces sounds to perceptual and acoustical features, which lets users search or retrieve sounds by any one feature or a combination of them, by specifying previously learned classes based on these features.
Proceedings Article

Finding similar files in a large file system

TL;DR: Application of sif can be found in file management, information collecting, program reuse, file synchronization, data compression, and maybe even plagiarism detection.
Patent

Method and article of manufacture for content-based analysis, storage, retrieval, and segmentation of audio information

TL;DR: In this paper, a system that performs analysis and comparison of audio data files based upon the content of the data files is presented, which produces a set of numeric values (a feature vector) that can be used to classify and rank the similarity between individual audio files typically stored in a multimedia database or on the Web.
Proceedings ArticleDOI

Content-based retrieval of music and audio

TL;DR: In this article, a supervised vector quantizer is used to learn audio features from a corpus of simple sounds and musical excerpts, and the similarity measure is based on statistics derived from a supervised quantizer, rather than matching simple pitch or spectral characteristics.
Journal ArticleDOI

An overview of audio information retrieval

TL;DR: The state of the art in audio information retrieval is reviewed, and recent advances in automatic speech recognition, word spotting, speaker and music identification, and audio similarity are presented with a view towards making audio less “opaque”.
Related Papers (5)