Journal Article

Robust audio fingerprinting using peak-pair-based hash of non-repeating foreground audio in a real environment

TLDR
This paper proposes a high-performance audio fingerprinting system for real-world query-by-example identification of acoustic audio content, aimed in particular at heterogeneous portable consumer devices and online distributed audio systems.
Abstract
In this paper, we propose a high-performance audio fingerprinting system for real-world query-by-example applications that identify content from acoustic audio, especially on heterogeneous portable consumer devices or online distributed audio systems. In the proposed method, audio fingerprints are generated using non-repeating foreground audio extraction based on the modulated complex lapped transform, together with an adaptive thresholding method for prominent peak detection. Matching uses a robust peak-pair-based hash function of the non-repeating foreground audio, which protects against noise, echo, and artifacts from pitch-shifting, time-stretching, resampling, equalization, or compression. Experimental results confirm that the proposed method is robust under a range of distortions and achieves promising preliminary accuracy.
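
The peak-detection and peak-pair hashing steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it omits the modulated complex lapped transform and the non-repeating foreground extraction, and it assumes a magnitude spectrogram is already available as a list of frames. The function names, the per-frame threshold (mean plus `k` standard deviations), and the `fan_out` and `max_dt` parameters are all hypothetical choices made for illustration.

```python
from statistics import mean, pstdev


def detect_peaks(spec, k=1.0):
    """Return (time, freq) local maxima of spec[t][f] above a per-frame threshold.

    Assumption: the threshold is mean + k * std of each frame, so it adapts
    to the level of that frame. The paper's actual rule is not given here.
    """
    peaks = []
    for t, frame in enumerate(spec):
        threshold = mean(frame) + k * pstdev(frame)
        for f, mag in enumerate(frame):
            if mag <= threshold:
                continue
            left = frame[f - 1] if f > 0 else float("-inf")
            right = frame[f + 1] if f + 1 < len(frame) else float("-inf")
            if mag > left and mag > right:
                peaks.append((t, f))
    return peaks


def pair_hashes(peaks, fan_out=3, max_dt=8):
    """Pair each anchor peak with up to fan_out later peaks within max_dt frames.

    Each hash is (anchor_freq, target_freq, time_delta); it is returned
    together with the anchor time so matches can be aligned later.
    """
    peaks = sorted(peaks)
    hashes = []
    for i, (t1, f1) in enumerate(peaks):
        paired = 0
        for t2, f2 in peaks[i + 1:]:
            dt = t2 - t1
            if dt == 0:
                continue  # skip peaks in the same frame as the anchor
            if dt > max_dt:
                break  # peaks are time-sorted, so later ones are too far
            hashes.append(((f1, f2, dt), t1))
            paired += 1
            if paired == fan_out:
                break
    return hashes


def build_index(hashes):
    """Map each hash to the list of anchor times at which it occurs."""
    index = {}
    for h, t in hashes:
        index.setdefault(h, []).append(t)
    return index


def best_offset(query_hashes, index):
    """Vote on (reference time - query time); return (offset, votes) or None."""
    votes = {}
    for h, tq in query_hashes:
        for tr in index.get(h, []):
            votes[tr - tq] = votes.get(tr - tq, 0) + 1
    if not votes:
        return None
    return max(votes.items(), key=lambda kv: kv[1])
```

Because each hash stores only two frequencies and a time difference, a query excerpt taken from the middle of a reference produces the same hashes as the reference, and the matching hashes agree on a single time offset. Voting on that offset is what makes this kind of matching tolerant of spurious or missing peaks.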


Citations
Journal Article

A high-performance speech perceptual hashing authentication algorithm based on discrete wavelet transform and measurement matrix

TL;DR: The experimental results demonstrate that the proposed speech perceptual hashing authentication algorithm is highly efficient in terms of perceptual robustness, discrimination, time consumption, and security, and that it detects and localizes tampering with high accuracy.
Journal Article

Multi-format speech BioHashing based on spectrogram

TL;DR: The experimental results show that the proposed multi-format speech BioHashing algorithm has the characteristics of good security, strong robustness, high real-time performance and wide application range.
Proceedings Article

A fast speech feature extraction method based on perceptual hashing

TL;DR: A fast speech perceptual hashing feature extraction method based on modified discrete cosine transform (MDCT) and compressed sensing that has a high efficiency in terms of time consumption, security and distinction.
Journal Article

Video hashing with secondary frames and invariant moments

TL;DR: Performance comparisons with some state-of-the-art algorithms illustrate that the proposed video hashing outperforms the compared algorithms in classification in terms of receiver operating characteristic results.
Proceedings Article

Multi-format speech perception hashing based on time-frequency parameter fusion of energy zero ratio and frequency band variance

TL;DR: Experiments show that the proposed multi-format speech perception hashing, based on time-frequency parameter fusion of energy zero ratio and frequency band variance, is robust, discriminative, and key-dependent.
References
Proceedings Article

A Highly Robust Audio Fingerprinting System.

TL;DR: An audio fingerprinting system in which the fingerprint of an unknown audio clip is used as a query on a fingerprint database containing the fingerprints of a large library of songs, so that the audio clip can be identified.
Proceedings Article

An Industrial Strength Audio Search Algorithm.

TL;DR: The algorithm is noise and distortion resistant, computationally efficient, and massively scalable, capable of quickly identifying a short segment of music captured through a cellphone microphone in the presence of foreground voices and other dominant noise, out of a database of over a million tracks.
Proceedings Article

A review of algorithms for audio fingerprinting

TL;DR: Different techniques mapping functional parts to blocks of a unified framework for audio fingerprinting are reviewed, with a focus on pattern matching and robust hashing.
Journal Article

REpeating Pattern Extraction Technique (REPET): A Simple Method for Music/Voice Separation

TL;DR: The REpeating Pattern Extraction Technique (REPET), a novel and simple approach for separating the repeating “background” from the non-repeating “foreground” in a mixture, can be successfully applied for music/voice separation, competing with two recent state-of-the-art approaches.
Proceedings Article

Audio Fingerprinting: Combining Computer Vision & Data Stream Processing

TL;DR: The waveprint system, a novel system for audio identification that uses a combination of computer-vision techniques and large-scale-data-stream processing algorithms to create compact fingerprints of audio data that can be efficiently matched, is presented.