Journal ArticleDOI
Robust audio fingerprinting using peak-pair-based hash of non-repeating foreground audio in a real environment
TLDR
A high-performance audio fingerprinting system used in real-world query-by-example applications for acoustic audio-based content identification, especially for use in heterogeneous portable consumer devices or on-line audio distributed system is proposed.Abstract:
In this paper, we propose a high-performance audio fingerprinting system used in real-world query-by-example applications for acoustic audio-based content identification, especially for use in heterogeneous portable consumer devices or on-line audio distributed system. In the proposed method, audio fingerprints are generated using a modulated complex lapped transform-based non-repeating foreground audio extraction and an adaptive thresholding method for prominent peak detection. Effective matching is performed using a robust peak-pair-based hash function of non-repeating foreground audio to protect against noise, echo, artifacts from pitch-shifting, time-stretching, resampling, equalization, or compression. Experimental results confirm that the proposed method is quite robust in various distorted conditions and achieves preliminarily promising accuracy results.read more
Citations
More filters
Journal ArticleDOI
A high-performance speech perceptual hashing authentication algorithm based on discrete wavelet transform and measurement matrix
TL;DR: The experimental results demonstrates the proposed speech perceptual hashing authentication algorithm has high efficiency in perceptual robustness, discrimination, time consumption and security, as well as having a high accuracy of tampering detection and localization.
Journal ArticleDOI
Multi-format speech BioHashing based on spectrogram
TL;DR: The experimental results show that the proposed multi-format speech BioHashing algorithm has the characteristics of good security, strong robustness, high real-time performance and wide application range.
Proceedings ArticleDOI
A fast speech feature extraction method based on perceptual hashing
TL;DR: A fast speech perceptual hashing feature extraction method based on modified discrete cosine transform (MDCT) and compressed sensing that has a high efficiency in terms of time consumption, security and distinction.
Journal ArticleDOI
Video hashing with secondary frames and invariant moments
TL;DR: Performance comparisons with some state-of-the-art algorithms illustrate that the proposed video hashing outperforms the compared algorithms in classification in terms of receiver operating characteristic results.
Proceedings ArticleDOI
Multi-format speech perception hashing based on time-frequency parameter fusion of energy zero ratio and frequency band variance
Yibo Huang,Yong Wang +1 more
TL;DR: Experiments show that the proposed multi-format speech perception hashing based on time-frequency parameter fusion of energy zero ratio and frequency band bariance is robustness, discrimination and key dependent.
References
More filters
Proceedings Article
A Highly Robust Audio Fingerprinting System.
Jaap A. Haitsma,Ton Kalker +1 more
TL;DR: An audio fingerprinting system that uses the fingerprint of an unknown audio clip as a query on a fingerprint database, which contains the fingerprints of a large library of songs, the audio clip can be identified.
Proceedings Article
An Industrial Strength Audio Search Algorithm.
TL;DR: The algorithm is noise and distortion resistant, computationally efficient, and massively scalable, capable of quickly identifying a short segment of music captured through a cellphone microphone in the presence of foreground voices and other dominant noise, out of a database of over a million tracks.
Proceedings ArticleDOI
A review of algorithms for audio fingerprinting
TL;DR: Different techniques mapping functional parts to blocks of a unified framework for audio fingerprinting are reviewed, with a focus on pattern matching and robust hashing.
Journal ArticleDOI
REpeating Pattern Extraction Technique (REPET): A Simple Method for Music/Voice Separation
Zafar Rafii,Bryan Pardo +1 more
TL;DR: The REpeating Pattern Extraction Technique (REPET), a novel and simple approach for separating the repeating “background” from the non-repeating “foreground” in a mixture, can be successfully applied for music/voice separation, competing with two recent state-of-the-art approaches.
Proceedings ArticleDOI
Audio Fingerprinting: Combining Computer Vision & Data Stream Processing
Shumeet Baluja,Michele Covell +1 more
TL;DR: The waveprint system, a novel system for audio identification that uses a combination of computer-vision techniques and large-scale-data-stream processing algorithms to create compact fingerprints of audio data that can be efficiently matched, is presented.
Related Papers (5)
Scalable and robust audio fingerprinting method tolerable to time-stretching
Jacob George,Ashok Jhunjhunwala +1 more