scispace - formally typeset
Patent

Method and article of manufacture for content-based analysis, storage, retrieval, and segmentation of audio information

Reads0
Chats0
TLDR
In this paper, a system that performs analysis and comparison of audio data files based upon the content of the data files is presented, which produces a set of numeric values (a feature vector) that can be used to classify and rank the similarity between individual audio files typically stored in a multimedia database or on the Web.
Abstract
A system that performs analysis and comparison of audio data files based upon the content of the data files is presented. The analysis of the audio data produces a set of numeric values (a feature vector) that can be used to classify and rank the similarity between individual audio files typically stored in a multimedia database or on the World Wide Web. The analysis also facilitates the description of user-defined classes of audio files, based on an analysis of a set of audio files that are members of a user-defined class. The system can find sounds within a longer sound, allowing an audio recording to be automatically segmented into a series of shorter audio segments.

read more

Citations
More filters
Patent

Intelligent electronic appliance system and method

TL;DR: In this paper, a set top box for interacting with broadband media streams, with an adaptive user interface, content-based media processing and/or media metadata processing, and telecommunications integration, is presented.
Proceedings Article

Mel frequency cepstral coefficients for music modeling

Beth Logan
TL;DR: The results show that the use of the Mel scale for modeling music is at least not harmful for this problem, although further experimentation is needed to verify that this is the optimal scale in the general case and whether this transform is valid for music spectra.
Patent

Adaptive pattern recognition based control system and method

TL;DR: An adaptive interface for a programmable system, for predicting a desired user function, based on user history, as well as machine internal status and context, is presented for confirmation by the user, and the predictive mechanism is updated based on this feedback as mentioned in this paper.
Patent

Intuitive computing methods and systems

TL;DR: A smart phone senses audio, imagery, and/or other stimulus from a user's environment, and acts autonomously to fulfill inferred or anticipated user desires as discussed by the authors, and can apply more or less resources to an image processing task depending on how successfully the task is proceeding or based on the user's apparent interest in the task.
Patent

Connected audio and other media objects

TL;DR: In this paper, a decoding process extracts the identifier from a media object and possibly additional context information and forwards it to a server, in turn, maps the identifier to an action, such as returning metadata, re-directing the request to one or more other servers, requesting information from another server to identify the media object, etc.
References
More filters
Proceedings ArticleDOI

Construction and evaluation of a robust multifeature speech/music discriminator

TL;DR: A real-time computer system capable of distinguishing speech signals from music signals over a wide range of digital audio input is constructed and extensive data on system performance and the cross-validated training/test setup used to evaluate the system is provided.
Journal ArticleDOI

Tempo and beat analysis of acoustic musical signals

TL;DR: A method is presented for using a small number of bandpass filters and banks of parallel comb filters to analyze the tempo of, and extract the beat from, musical signals of arbitrary polyphonic complexity and containing arbitrary timbres that can be used predictively to guess when beats will occur in the future.
Patent

Computing and multimedia entertainment system

TL;DR: A remotely controllable computing and multimedia entertainment system includes a personal computer (24) having an entertainment circuit (12) made up of a radio frequency circuit (48), a television circuit (46), and an audio multimedia circuit (18) as mentioned in this paper.
Book

Principles of Digital Audio

TL;DR: The Principles of Digital Audio (PDA) as discussed by the authors has been updated and expanded to introduce both audio and computer users to the myriad new technologies that are now transforming the field of digital audio.
Patent

Method for controlling real-time presentation of audio/visual data on a computer system

TL;DR: In this article, a method of recording a real-time multimedia presentation and replaying a missed portion at an accelerated rate until the missed portion catches up to the current point in the presentation is presented.