An Industrial Strength Audio Search Algorithm.

Proceedings Article•

An Industrial Strength Audio Search Algorithm.

01 Jan 2003-

TL;DR: The algorithm is noise and distortion resistant, computationally efficient, and massively scalable, capable of quickly identifying a short segment of music captured through a cellphone microphone in the presence of foreground voices and other dominant noise, out of a database of over a million tracks.

read less

Abstract: We have developed and commercially deployed a flexible audio search engine. The algorithm is noise and distortion resistant, computationally efficient, and massively scalable, capable of quickly identifying a short segment of music captured through a cellphone microphone in the presence of foreground voices and other dominant noise, and through voice codec compression, out of a database of over a million tracks. The algorithm uses a combinatorially hashed time-frequency constellation analysis of the audio, yielding unusual properties such as transparency, in which multiple tracks mixed together may each be identified. Furthermore, for applications such as radio monitoring, search times on the order of a few milliseconds per query are attained, even on a massive music database.

...read moreread less

Citations

PDF

Open Access

More filters

Patent•

Intelligent Automated Assistant

[...]

Thomas R. Gruber¹, Adam Cheyer¹, Dag Kittlaus¹, Didier Rene Guzzoni¹, Christopher Dean Brigham¹, Richard Donald Giuli¹, Marcello Bastea-Forte¹, Harry J. Saddler¹ - Show less +4 more•Institutions (1)

Apple Inc.¹

11 Jan 2011

TL;DR: In this article, an intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions.

...read moreread less

Abstract: An intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions. The system can be implemented using any of a number of different platforms, such as the web, email, smartphone, and the like, or any combination thereof. In one embodiment, the system is based on sets of interrelated domains and tasks, and employs additional functionally powered by external services with which the system can interact.

...read moreread less

1,462 citations

Journal Article•DOI•

Content-Based Music Information Retrieval: Current Directions and Future Challenges

[...]

Michael A. Casey¹, Remco C. Veltkamp², Masataka Goto, Marc Leman³, Christophe Rhodes¹, Malcolm Slaney⁴ - Show less +2 more•Institutions (4)

University of London¹, Utrecht University², Ghent University³, Yahoo!⁴

14 Mar 2008

TL;DR: In this paper, the authors outline the problems of content-based music information retrieval and explore the state-of-the-art methods using audio cues (e.g., query by humming, audio fingerprinting, contentbased music retrieval) and other cues such as music notation and symbolic representation.

...read moreread less

Abstract: The steep rise in music downloading over CD sales has created a major shift in the music industry away from physical media formats and towards online products and services. Music is one of the most popular types of online information and there are now hundreds of music streaming and download services operating on the World-Wide Web. Some of the music collections available are approaching the scale of ten million tracks and this has posed a major challenge for searching, retrieving, and organizing music content. Research efforts in music information retrieval have involved experts from music perception, cognition, musicology, engineering, and computer science engaged in truly interdisciplinary activity that has resulted in many proposed algorithmic and methodological solutions to music search using content-based methods. This paper outlines the problems of content-based music information retrieval and explores the state-of-the-art methods using audio cues (e.g., query by humming, audio fingerprinting, content-based music retrieval) and other cues (e.g., music notation and symbolic representation), and identifies some of the major challenges for the coming years.

...read moreread less

670 citations

Patent•

Using context information to facilitate processing of commands in a virtual assistant

[...]

Thomas R. Gruber¹, Christopher Dean Brigham¹, Daniel S. Keen¹, Gregory Novick¹, Phipps Benjamin S¹ - Show less +1 more•Institutions (1)

Apple Inc.¹

28 Sep 2012

TL;DR: In this article, a virtual assistant uses context information to supplement natural language or gestural input from a user, which helps to clarify the user's intent and reduce the number of candidate interpretations of user's input, and reduces the need for the user to provide excessive clarification input.

...read moreread less

Abstract: A virtual assistant uses context information to supplement natural language or gestural input from a user. Context helps to clarify the user's intent and to reduce the number of candidate interpretations of the user's input, and reduces the need for the user to provide excessive clarification input. Context can include any available information that is usable by the assistant to supplement explicit user input to constrain an information-processing problem and/or to personalize results. Context can be used to constrain solutions during various phases of processing, including, for example, speech recognition, natural language processing, task flow processing, and dialog generation.

...read moreread less

593 citations

Journal Article•DOI•

Detection and Classification of Acoustic Scenes and Events

[...]

Dan Stowell¹, Dimitrios Giannoulis¹, Emmanouil Benetos¹, Mathieu Lagrange², Mark D. Plumbley³ - Show less +1 more•Institutions (3)

Queen Mary University of London¹, École centrale de Nantes², University of Surrey³

01 May 2015-IEEE Transactions on Multimedia

TL;DR: The state of the art in automatically classifying audio scenes, and automatically detecting and classifyingaudio events is reported on.

...read moreread less

Abstract: For intelligent systems to make best use of the audio modality, it is important that they can recognize not just speech and music, which have been researched as specific tasks, but also general sounds in everyday environments. To stimulate research in this field we conducted a public research challenge: the IEEE Audio and Acoustic Signal Processing Technical Committee challenge on Detection and Classification of Acoustic Scenes and Events (DCASE). In this paper, we report on the state of the art in automatically classifying audio scenes, and automatically detecting and classifying audio events. We survey prior work as well as the state of the art represented by the submissions to the challenge from various research groups. We also provide detail on the organization of the challenge, so that our experience as challenge hosts may be useful to those organizing challenges in similar domains. We created new audio datasets and baseline systems for the challenge; these, as well as some submitted systems, are publicly available under open licenses, to serve as benchmarks for further research in general-purpose machine listening.

...read moreread less

468 citations

Cites background from "An Industrial Strength Audio Search..."

...Digital Object Identifier 10.1109/TMM.2015.2428998 can transcribe the notes and chords in music [3], or identify the track title and artist from a low-quality sound snippet [4]....
[...]

Patent•

Method and apparatus for building an intelligent automated assistant

[...]

Adam Cheyer¹, Didier Rene Guzzoni¹•Institutions (1)

Apple Inc.¹

08 Sep 2006

TL;DR: In this paper, a method for building an automated assistant includes interfacing a service-oriented architecture that includes a plurality of remote services to an active ontology, where the active ontologies includes at least one active processing element that models a domain.

...read moreread less

Abstract: A method and apparatus are provided for building an intelligent automated assistant. Embodiments of the present invention rely on the concept of “active ontologies” (e.g., execution environments constructed in an ontology-like manner) to build and run applications for use by intelligent automated assistants. In one specific embodiment, a method for building an automated assistant includes interfacing a service-oriented architecture that includes a plurality of remote services to an active ontology, where the active ontology includes at least one active processing element that models a domain. At least one of the remote services is then registered for use in the domain.

...read moreread less

389 citations

Collapse

An Industrial Strength Audio Search Algorithm.

Citations

Cites background from "An Industrial Strength Audio Search..."

Related Papers (5)