scispace - formally typeset
Proceedings ArticleDOI

Robust audio identification for MP3 popular music

Reads0
Chats0
TLDR
Experiments show that compressed-domain spectral entropy as the audio feature to implement a novel audio fingerprinting algorithm exhibits strong robustness against various audio signal distortions like recompression, noise interference, echo addition, equalization, band-pass filtering, pitch shifting, and slight time-scale modification.
Citations
More filters
Patent

Method for Embedding Voice Mail in a Spoken Utterance Using a Natural Language Processing Computer System

TL;DR: In this article, a method for processing a voice message in a computerized system is presented, which receives and records a speech utterance including a message portion and a communication portion.
Patent

Systems and methods for sound recognition

TL;DR: In this paper, a system and methods for recognizing sounds are provided, where user input relating to one or more sounds is received from a computing device, and instructions are executed by a processor to discriminate the one or multiple sounds, extract music features from the sounds, analyze the music features using databases, and obtain information regarding the features based on the analysis.
Patent

Systems and methods for enabling natural language processing

TL;DR: In this paper, the authors present a system and methods for searching databases by sound data input, and present a search technology that furnishes search results in a fast and accurate manner.
Journal ArticleDOI

SIFT-based local spectrogram image descriptor: a novel feature for robust music identification

TL;DR: In this article, scale invariant feature transform (SIFT) local descriptors computed from a spectrogram image were used as sub-fingerprints for music identification. But, their robustness is limited by the time-frequency misalignments caused by time stretching and pitch shifting.
Journal ArticleDOI

An analysis of content-based classification of audio signals using a fuzzy c-means algorithm

TL;DR: An efficient content-based audio classification approach to classify audio signals into broad genres using a fuzzy c-means (FCM) algorithm that outperforms the existing state-of-the-art audio classification systems by more than 11% in classification performance.
References
More filters
Book

Digital Watermarking and Steganography

TL;DR: This new edition now contains essential information on steganalysis and steganography, and digital watermark embedding is given a complete update with new processes and applications.
Proceedings Article

A Highly Robust Audio Fingerprinting System.

TL;DR: An audio fingerprinting system that uses the fingerprint of an unknown audio clip as a query on a fingerprint database, which contains the fingerprints of a large library of songs, the audio clip can be identified.
Journal ArticleDOI

A Review of Audio Fingerprinting

TL;DR: Different techniques describing its functional blocks as parts of a common, unified framework for audio fingerprinting are reviewed.
Journal ArticleDOI

A feature-based robust digital image watermarking scheme

TL;DR: A robust digital image watermarking scheme that combines image feature extraction and image normalization is proposed to resist both geometric distortion and signal processing attacks.
Proceedings ArticleDOI

A feature-based robust digital image watermarking scheme

TL;DR: The overall architecture for a feature-based robust digital image watermarking scheme is designed and a simulated attacking procedure is performed using predefined attacks to evaluate the robustness of every candidate feature region selected.