Scalable and robust audio fingerprinting method tolerable to time-stretching
TL;DR: The experimental results show that the method is scalable and considerably more tolerant to time-stretching than Shazam's state-of-the-art audio fingerprinting.
Abstract: This paper proposes a robust, time-stretching-invariant audio fingerprinting method based on landmarks in the audio spectrogram. Audio clips or songs are time-stretched to evade copyright detection, as most fingerprinting techniques are time dependent. Time-stretching is also used in the music industry to produce remixes and song mash-ups, and in multimedia broadcasting to fit content within the required duration. The proposed algorithm is based on hashing the frequency peaks in the spectrogram. It is scalable and tolerant to time-stretching. The experimental results show that the method is considerably more tolerant to time-stretching than Shazam's state-of-the-art audio fingerprinting.
Citations
18 citations
Cites background or methods from "Scalable and robust audio fingerpri..."
...The performance of the algorithm decreases at higher additive noise in comparison with other algorithms [19], reporting an accuracy around 96....
[...]
...In [19], the authors proposed an audio fingerprinting method, based on landmarks in the audio spectrogram....
[...]
...[19] 2015 1500 audio files Proposes an audio fingerprinting method based on landmarks in the audio spectrogram Computer No No...
[...]
...[19] The authors propose an audio fingerprinting method that is tolerant to time-stretching and is scalable....
[...]
...The algorithm is based on the audio hashing of frequency peaks in the spectrogram [19]....
[...]
8 citations
Cites background from "Scalable and robust audio fingerpri..."
...Several works extend this approach to allow for tempo changes [10], pitch shifts [11], or both [12], [13]....
[...]
5 citations
5 citations
4 citations
Cites background or methods from "Scalable and robust audio fingerpri..."
...In [10], fingerprint modeling is tied to robustness against the audio degradation caused by time-stretching....
[...]
...For example, in [8], [10], [12] and [15], a hash table is used to look up candidates, and a similarity measure is then applied to reject candidates....
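
The two-stage lookup described in this excerpt can be illustrated with a small Python sketch. It is only a generic illustration under stated assumptions, not the implementation of any of the cited methods: the function names, the integer hash values, the vote-based candidate selection, the set-overlap similarity, and the 0.6 threshold are all hypothetical choices made for the example.

```python
from collections import Counter, defaultdict


def build_index(tracks):
    """Map each hash value to the set of track ids that contain it."""
    index = defaultdict(set)
    for track_id, hashes in tracks.items():
        for h in hashes:
            index[h].add(track_id)
    return index


def find_candidates(index, query_hashes, top_k=3):
    """Stage 1: use the hash table to vote for tracks sharing hashes with the query."""
    votes = Counter()
    for h in set(query_hashes):
        for track_id in index.get(h, ()):
            votes[track_id] += 1
    return [track_id for track_id, _ in votes.most_common(top_k)]


def overlap(query_hashes, track_hashes):
    """Stage 2 similarity (assumed): fraction of distinct query hashes found in the track."""
    q = set(query_hashes)
    return len(q & set(track_hashes)) / len(q) if q else 0.0


def identify(tracks, index, query_hashes, threshold=0.6):
    """Return the best candidate whose similarity reaches the threshold, else None."""
    best, best_score = None, threshold
    for track_id in find_candidates(index, query_hashes):
        score = overlap(query_hashes, tracks[track_id])
        if score >= best_score:
            best, best_score = track_id, score
    return best


# Made-up hash sequences for three reference tracks.
tracks = {"a": [1, 2, 3, 4, 5], "b": [4, 5, 6, 7, 8], "c": [9, 10, 11]}
index = build_index(tracks)
```

With these made-up values, `identify(tracks, index, [2, 3, 4, 5])` selects track `"a"`, while track `"b"` is retrieved as a candidate in stage 1 but rejected in stage 2 because only half of the query hashes match it.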
[...]
References
730 citations
"Scalable and robust audio fingerpri..." refers methods in this paper
...There are different algorithms to compute the LCS [12]....
[...]
...It is the implementation of LCS that gives the proposed method its tolerance against time-stretching....
[...]
...The longest common subsequence (LCS) between two sequences is the longest possible combination of the elements common to both the sequences such that the order of the elements in both sequences is preserved....
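
The definition above can be made concrete with a minimal Python sketch of the textbook dynamic-programming LCS length. The excerpts do not say which LCS algorithm the paper uses, so this is an assumption made for illustration, and the hash sequences below are invented. The point it shows is that LCS depends only on the order of matching elements, not on how far apart they occur, which is consistent with the excerpt's claim that LCS provides tolerance to time-stretching.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of sequences a and b.

    Standard O(len(a) * len(b)) dynamic programme that keeps only one row.
    """
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                # Extend the best subsequence ending before both elements.
                curr.append(prev[j - 1] + 1)
            else:
                # Skip one element from either a or b.
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


# Hypothetical hash sequences: the query keeps the reference order
# but has lost two hashes, as might happen after degradation.
reference = [17, 42, 8, 23, 99, 5]
query = [42, 8, 99, 5]
print(lcs_length(reference, query))  # prints 4
```

Because only element order matters, the same score would result however the matching hashes are spaced in time in the query.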
[...]
648 citations
"Scalable and robust audio fingerpri..." refers background or methods in this paper
...The samples were tested on the proposed method as well as on Shazam....
[...]
...in [3], the peaks of the spectrogram of a digital audio signal are less susceptible to noise, and features derived from those peaks provide a robust solution to the audio fingerprinting problem....
[...]
...Wang et al. [3] proposed a highly robust audio fingerprinting technique based on landmarks of peaks in the spectrogram, which is perhaps the most popular audio fingerprinting technique and is commercially available as Shazam....
[...]
...It clearly shows that the proposed method is far more tolerant to time-stretching than Shazam....
[...]
...The results obtained by querying the second set of test samples on both the proposed method and Shazam are displayed in Table II....
[...]
341 citations
"Scalable and robust audio fingerpri..." refers methods in this paper
...It is commonly implemented using the phase-vocoder method [6][7]....
[...]
329 citations
"Scalable and robust audio fingerpri..." refers methods in this paper
...It is commonly implemented using the phase-vocoder method [6][7]....
[...]
109 citations
"Scalable and robust audio fingerpri..." refers methods in this paper
...The application of audio fingerprinting has been a significant solution to this problem and enables automatic identification and monitoring of copyrighted audio content [1]....
[...]