scispace - formally typeset
Patent

System and method for multiresolution scalable audio signal encoding

TLDR
In this article, the authors proposed a model that considers audio signals to be composed of deterministic or sinusoidal components, transient components representing the onset of notes or other events in an audio signal, and stochastic components.
Abstract: 
An audio signal analyzer and encoder is based on a model that considers audio signals to be composed of deterministic or sinusoidal components, transient components representing the onset of notes or other events in an audio signal, and stochastic components. Deterministic components are represented as a series of overlapping sinusoidal waveforms. To generate the deterministic components, the input signal is divided into a set of frequency bands by a multi-complementary filter bank. The frequency band signals are oversampled so as to suppress cross-band aliasing energy in each band. Each frequency band is analyzed and encoded as a set of spectral components using a windowing time frame whose length is inversely proportional to the frequency range in that band. Low frequency bands are encoded using longer time frames than higher frequency bands. Transient components are represented by parameters denoting sinusoidal shaped waveforms produced when the transient components are transformed into a real valued frequency domain waveform. Stochastic or noise components are represented as a series of spectral envelopes. The parameters representing the three signal components compose a stream of compressed encoded audio data that can be further compressed so as to meet a specified transmission bandwidth limit by the deleting the least significant bits of quantized parameter values, reducing the update rates of parameters, and/or deleting the parameters used to encode higher frequency bands until the bandwidth of the compressed audio data meets the bandwidth requirement. Signal quality degrades in a graduated manner with successive reductions in the transmitted data rate.

read more

Citations
More filters
Patent

Client/server architecture for text-to-speech synthesis

TL;DR: In this paper, a client/server text-to-speech synthesis system and method is proposed, where the server stores large databases for pronunciation analysis, prosody generation, and acoustic unit selection corresponding to a normalized text, while the client performs computationally intensive decompression and concatenation of selected acoustic units to generate speech.
Patent

Battery cooling system

TL;DR: In this article, a cordless power tool has a housing which includes a mechanism to couple with a removable battery pack, which includes one or more battery cells as well as a vent system in the battery pack housing which enables fluid to move through the housing.
Patent

Wireless multimedia player

TL;DR: In this paper, a wireless device, system and method for receiving and playing multimedia files streamed from a multimedia server over a wireless telecommunications network is described, where a desired multimedia file is selected from one or more multimedia files stored in the multimedia server, which server is operatively connected to the wireless network.
Patent

Quality improvement techniques in an audio encoder

TL;DR: In this paper, an audio encoder dynamically selects between joint and independent coding of a multi-channel audio signal via an open-loop decision based upon energy separation between the coding channels, and the disparity between excitation patterns of the separate input channels.
PatentDOI

Real-time control of playback rates in presentations

TL;DR: In this paper, a multi-channel architecture with different audio channels corresponding to different playback rates for a presentation to be transmitted over a network is proposed, where a user can make a real-time change in playback rate causing selection of a channel corresponding to the new playback rate and a frame required for prompt and smooth transition in the playback rate of the presentation.
References
More filters
Journal ArticleDOI

Speech analysis/Synthesis based on a sinusoidal representation

TL;DR: A sinusoidal model for the speech waveform is used to develop a new analysis/synthesis technique that is characterized by the amplitudes, frequencies, and phases of the component sine waves, which forms the basis for new approaches to the problems of speech transformations including time-scale and pitch-scale modification, and midrate speech coding.
Proceedings ArticleDOI

Low bit rate high quality audio coding with combined harmonic and wavelet representations

TL;DR: A novel high quality audio coding method using adaptive signal representation, based on sinusoidal and wavelet analysis of signals, which separates out tones, transients, and broadband noise.
Journal Article

A Method for Extrapolation of Missing Digital Audio Data

TL;DR: A method for extrapolating missing or corrupted samples in a digital audio data stream is presented and involves spectral extrapolation to synthesize an estimate of the missing material using a sinusoidal representation.
Patent

Electronic musical instrument with a note detector capable of detecting a plurality of notes sounded simultaneously

TL;DR: In this article, a voice input to a microphone is converted by an A/D converter to a digital signal which is then delivered to a DSP, which extracts the notes of the input voice for determining same.