scispace - formally typeset
Patent

Low bit rate audio coder and decoder operating in a transform domain using vector quantization

TLDR
In this paper, a pre-emphasis step is performed to perform gross decorrelation, followed by an adaptive linear prediction to perform further decorrelation and a transform is performed on the residual of the linear prediction, to obtain transform coefficients representing the residual in the frequency domain.
Abstract
Audio source data is subjected to a pre-emphasis step (302) to perform gross decorrelation, followed by an adaptive linear prediction (306) to perform further decorrelation. A transform is performed on the residual of the linear prediction, to obtain transform coefficients representing the residual in the frequency domain. A number of tonal components are identified (310), subtracted from the transform coefficients and encoded by vector quantization. The transform coefficients are then grouped into sub-bands, and each sub-band encoded in the frequency domain by vector quantization. The sub-bands are of uniform width on an auditory scale, so that each vector may comprise a different number of transform coefficients.

read more

Citations
More filters
Patent

System and mobile cellular telephone device for playing recorded music

Devon A. Rolf
TL;DR: In this article, a mobile cellular telephone is used to select a music recording from a remote source, such as online music recording storage facility, and wirelessly receive the selected music recording.
Patent

Multi-channel signal encoding and decoding

TL;DR: In this paper, a multi-channel linear predictive analysis-by-synthesis signal encoding method was proposed to detect inter-channel correlation and select one of several possible encoding modes (S24, S29, S30) based on the detected correlation.
Patent

Acoustic communication system

TL;DR: In this article, the authors described a number of encoders for encoding a data signal within an audio signal, where the data signal is separated into a tonal part and a residual part.
Patent

Method and apparatus for seamlessly switching reception between multimedia streams in a wireless communication system

TL;DR: In this article, the authors describe techniques to seamlessly switch reception between multimedia programs by identifying a program with potential for user selection, and then decoding the identified program prior to its selection so that the program can be decompressed and displayed earlier if it is subsequently selected.
Patent

Method and apparatus to recover a high frequency component of audio data

TL;DR: In this article, a method and an apparatus to recover a high frequency component of an MP3 encoded audio signal in an audio decoder is presented, which includes: generating a filter bank value of a low frequency band from a modified discrete cosine transform (MDCT) coefficient, which is extracted from an input bitstream according to a window type, extracting transient information of a frame according to the window type and selecting a weight coefficient according to extracted transient information, and adjusting the recovered filter bank values of recovered high frequency components according to weight coefficient.
References
More filters
Journal ArticleDOI

Linear prediction: A tutorial review

TL;DR: This paper gives an exposition of linear prediction in the analysis of discrete signals as a linear combination of its past values and present and past values of a hypothetical input to a system whose output is the given signal.
Book

Readings in speech recognition

Alex Waibel, +1 more
TL;DR: This chapter discusses four main approaches to speech recognition: template-based, knowledge-Based, Stochastic, connectionist, and connectionist.
Journal ArticleDOI

Analytical expressions for critical‐band rate and critical bandwidth as a function of frequency

TL;DR: In this paper, the critical band rate and the critical bandwidth are expressed as functions of frequency, and relatively simple equations are given to express the dependence of critical bands rate on frequency with an accuracy better than 0.2 Bark.
Journal ArticleDOI

Optimizing digital speech coders by exploiting masking properties of the human ear

TL;DR: New results of masking and loudness reduction of noise are reported and the design principles of speech coding systems exploiting auditory masking are described.
Journal ArticleDOI

A tutorial on MPEG/audio compression

TL;DR: This tutorial covers the theory behind MPEG/audio compression and the basics of psychoacoustic modeling and the methods the algorithm uses to compress audio data with the least perceptible degradation.