Patent
System and methods for recognizing sound and music signals in high noise and distortion
TL;DR: The patent describes a method for recognizing an audio sample by locating, in a database indexing a large set of original recordings, the audio file that most closely matches the sample; each indexed audio file is represented in the database index by a set of landmark timepoints and associated fingerprints.
Abstract:
A method for recognizing an audio sample locates an audio file that most closely matches the audio sample from a database indexing a large set of original recordings. Each indexed audio file is represented in the database index by a set of landmark timepoints and associated fingerprints. Landmarks occur at reproducible locations within the file, while fingerprints represent features of the signal at or near the landmark timepoints. To perform recognition, landmarks and fingerprints are computed for the unknown sample and used to retrieve matching fingerprints from the database. For each file containing matching fingerprints, the landmarks are compared with landmarks of the sample at which the same fingerprints were computed. If a large number of corresponding landmarks are linearly related, i.e., if equivalent fingerprints of the sample and retrieved file have the same time evolution, then the file is identified with the sample. The method can be used for any type of sound or music, and is particularly effective for audio signals subject to linear and nonlinear distortion such as background noise, compression artifacts, or transmission dropouts. The sample can be identified in a time proportional to the logarithm of the number of entries in the database; given sufficient computational power, recognition can be performed in nearly real time as the sound is being sampled.
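The matching step described in the abstract can be illustrated with a minimal sketch. It is not the patent's implementation: it assumes fingerprints are already computed as hashable values, that landmark times are integers (for example, frame numbers), and that the linear relation between sample and file landmarks has slope one, so a match shows up as many fingerprint pairs sharing the same time offset. The names `build_index`, `recognize`, and `min_matches` are hypothetical. A plain dictionary stands in for the database index; the logarithmic lookup time mentioned in the abstract would correspond to a sorted index searched by bisection.

```python
from collections import Counter, defaultdict


def build_index(files):
    """Map each fingerprint to the (file_id, landmark_time) pairs where it occurs.

    `files` is {file_id: [(fingerprint, landmark_time), ...]} (assumed layout).
    """
    index = defaultdict(list)
    for file_id, pairs in files.items():
        for fingerprint, t_file in pairs:
            index[fingerprint].append((file_id, t_file))
    return index


def recognize(index, sample_pairs, min_matches=3):
    """Return (file_id, score) for the best match, or (None, score) if too weak.

    For every fingerprint shared by the sample and a file, record the offset
    between the file landmark and the sample landmark. If the landmarks are
    linearly related with slope one, the true match piles up at one offset.
    """
    offsets = defaultdict(Counter)  # file_id -> histogram of time offsets
    for fingerprint, t_sample in sample_pairs:
        for file_id, t_file in index.get(fingerprint, ()):
            offsets[file_id][t_file - t_sample] += 1

    best_id, best_score = None, 0
    for file_id, histogram in offsets.items():
        _, score = histogram.most_common(1)[0]  # height of the tallest bin
        if score > best_score:
            best_id, best_score = file_id, score

    if best_score < min_matches:
        return None, best_score
    return best_id, best_score


# Toy data: fingerprints are small integers, times are frame numbers.
files = {
    "song_a": [(0xA1, 10), (0xB2, 12), (0xC3, 15), (0xD4, 19)],
    "song_b": [(0xA1, 3), (0xE5, 8), (0xC3, 30), (0xF6, 31)],
}
index = build_index(files)

# The sample is an excerpt of song_a starting 10 frames in, plus one stray hit.
sample = [(0xB2, 2), (0xC3, 5), (0xD4, 9), (0xE5, 4)]
print(recognize(index, sample))
```

In this toy run, three of the sample's fingerprints align with `song_a` at the same offset, while the hits on `song_b` fall at scattered offsets, so only `song_a` clears the threshold. A real system would likely bin real-valued offsets and tolerate small timing errors rather than require exact equality.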
Citations
Patent
Palette-based classifying and synthesizing of auditory information
TL;DR: In this paper, spectral representations of an input sequence are used to synthesize a class of data, which can include individual events, distributions of events, and/or environments relating to the input sequence.
Patent
Low complexity repetition detection in media data
TL;DR: In this article, a subset of offset values is located in a set of media data using a first type of one or more types of features, which are extractable from (e.g., derivable from components of) the media data.
Patent
Linear predictive coding implementation of digital watermarks
TL;DR: In this article, the carrier-signal-independent data is encoded such that it is restricted or concentrated primarily in the non-deterministic signal components of the carrier signal; the signal components can include a discrete series of digital samples and/or a discrete series of carrier frequency sub-bands.
Patent
User system providing previews to an associated portable media player
TL;DR: In this article, a system and method for providing previews, such as song and video previews, to a portable media player is presented, where a play history is generated as media files are played by the portable player and provided to a central system hosting an e-commerce service providing media content.
Patent
Detecting click spam
TL;DR: In this article, a computer-implemented method for processing network activities is described, which includes identifying a model that specifies attributes for network objects, identifying a network object having one or more attributes that deviate from the model, and providing as an input to a ranking algorithm a value associated with the deviance of the identified network object.
References
Journal Article
Content-based classification, search, and retrieval of audio
TL;DR: The audio analysis, search, and classification engine described here reduces sounds to perceptual and acoustical features, which lets users search or retrieve sounds by any one feature or a combination of them, by specifying previously learned classes based on these features.
Proceedings Article
Finding similar files in a large file system
TL;DR: Application of sif can be found in file management, information collecting, program reuse, file synchronization, data compression, and maybe even plagiarism detection.
Patent
Method and article of manufacture for content-based analysis, storage, retrieval, and segmentation of audio information
TL;DR: In this paper, a system that performs analysis and comparison of audio data files based upon the content of the data files is presented, which produces a set of numeric values (a feature vector) that can be used to classify and rank the similarity between individual audio files typically stored in a multimedia database or on the Web.
Proceedings Article
Content-based retrieval of music and audio
TL;DR: In this article, a supervised vector quantizer is used to learn audio features from a corpus of simple sounds and musical excerpts, and the similarity measure is based on statistics derived from a supervised quantizer, rather than matching simple pitch or spectral characteristics.
Journal Article
An overview of audio information retrieval
TL;DR: The state of the art in audio information retrieval is reviewed, and recent advances in automatic speech recognition, word spotting, speaker and music identification, and audio similarity are presented with a view towards making audio less “opaque”.