scispace - formally typeset
Journal ArticleDOI

Known-Artist Live Song Identification Using Audio Hashprints

TJ Tsai, +2 more
- 15 Feb 2017 - 
- Vol. 19, Iss: 7, pp 1569-1582
TLDR
This paper proposes a multistep approach to address the problem of live song identification for popular bands by representing the audio as a sequence of binary codes called hashprints, derived from a set of spectrotemporal filters that are learned in an unsupervised artist-specific manner.
Abstract
The goal of live song identification is to allow concertgoers to identify a live performance by recording a few seconds of the performance on their cell phone. This paper proposes a multistep approach to address this problem for popular bands. In the first step, GPS data are used to associate the audio query with a concert in order to infer who the musical artist is. This reduces the search space to a dataset containing the artist's studio recordings. In the next step, the known-artist search is solved by representing the audio as a sequence of binary codes called hashprints, which can be efficiently matched against the database using a two-stage cross-correlation approach. The hashprint representation is derived from a set of spectrotemporal filters that are learned in an unsupervised artist-specific manner. On the Gracenote live song identification benchmark, the proposed system outperforms five other baseline systems and improves the mean reciprocal rank of the previous state of the art from 0.68 to 0.79, while simultaneously reducing the average runtime per query from 10 to 0.9 s. We conduct extensive analyses of major factors affecting system performance.

read more

Citations
More filters
Posted Content

Now Playing: Continuous low-power music recognition

TL;DR: A low-power music recognizer that runs entirely on a mobile device and automatically recognizes music without user interaction is presented, which respects user privacy by running entirely on-device and can passively recognize a wide range of music.
Journal ArticleDOI

Learning Low-Dimensional Embeddings of Audio Shingles for Cross-Version Retrieval of Classical Music

Frank Zalkow, +1 more
- 18 Dec 2019 - 
TL;DR: A more detailed view into the retrieval problem is provided by analyzing the distances that appear in the nearest neighbor search, and it is shown that, using neural networks, one can reduce the audio shingles from 240 to fewer than 8 dimensions with only a moderate loss in retrieval accuracy.
Posted Content

Large-Scale Cover Song Detection in Digital Music Libraries Using Metadata, Lyrics and Audio Features.

TL;DR: This work investigates whether textual music information (such as metadata and lyrics) can be used along with audio for large-scale cover identification problem in a wide digital music library and benchmarks this problem using standard text and state of the art audio similarity measures.
Journal ArticleDOI

Large-Scale Multimodal Piano Music Identification Using Marketplace Fingerprinting

TL;DR: This paper studies the problem of identifying piano music in various modalities using a single, unified approach called marketplace fingerprinting, which substantially outperforms previous methods while simultaneously reducing average runtime.
Proceedings ArticleDOI

Investigating the Efficacy of Music Version Retrieval Systems for Setlist Identification

TL;DR: This paper proposes an end-to-end workflow that identifies relevant metadata and timestamps of live music performances using a version identification system and contributes a new dataset that contains 99.5 h of concerts with annotated metadata and times-tamps, along with the corresponding reference set.
References
More filters
Journal ArticleDOI

Reducing the Dimensionality of Data with Neural Networks

TL;DR: In this article, an effective way of initializing the weights that allows deep autoencoder networks to learn low-dimensional codes that work much better than principal components analysis as a tool to reduce the dimensionality of data is described.
Book

Adaptive Filter Theory

Simon Haykin
TL;DR: In this paper, the authors propose a recursive least square adaptive filter (RLF) based on the Kalman filter, which is used as the unifying base for RLS Filters.
Proceedings ArticleDOI

Locality-sensitive hashing scheme based on p-stable distributions

TL;DR: A novel Locality-Sensitive Hashing scheme for the Approximate Nearest Neighbor Problem under lp norm, based on p-stable distributions that improves the running time of the earlier algorithm and yields the first known provably efficient approximate NN algorithm for the case p<1.
Proceedings Article

Spectral Hashing

TL;DR: The problem of finding a best code for a given dataset is closely related to the problem of graph partitioning and can be shown to be NP hard and a spectral method is obtained whose solutions are simply a subset of thresholded eigenvectors of the graph Laplacian.
Journal ArticleDOI

Semantic hashing

TL;DR: In this paper, a deep graphical model of the word-count vectors obtained from a large set of documents is proposed. But the model is restricted to the deep layer of the deep neural network and cannot handle large numbers of documents.
Related Papers (5)