Patent
Low bit rate audio coder and decoder operating in a transform domain using vector quantization
TLDR
A pre-emphasis step performs gross decorrelation and an adaptive linear prediction performs further decorrelation; a transform is then applied to the residual of the linear prediction to obtain transform coefficients representing the residual in the frequency domain.
Abstract:
Audio source data is subjected to a pre-emphasis step (302) to perform gross decorrelation, followed by an adaptive linear prediction (306) to perform further decorrelation. A transform is performed on the residual of the linear prediction, to obtain transform coefficients representing the residual in the frequency domain. A number of tonal components are identified (310), subtracted from the transform coefficients and encoded by vector quantization. The transform coefficients are then grouped into sub-bands, and each sub-band encoded in the frequency domain by vector quantization. The sub-bands are of uniform width on an auditory scale, so that each vector may comprise a different number of transform coefficients.
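The analysis chain in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and several choices are assumptions made only for illustration: the pre-emphasis factor 0.95, a fixed predictor order of 10 fitted per frame by the autocorrelation method (standing in for the adaptive predictor), a DCT-II as the transform, and the Bark scale of Zwicker and Terhardt as the auditory scale. All function names are hypothetical. Tonal-component extraction and vector quantization are omitted.

```python
import numpy as np

def pre_emphasis(x, alpha=0.95):
    """Gross decorrelation: y[n] = x[n] - alpha * x[n-1] (alpha is an assumed value)."""
    x = np.asarray(x, dtype=float)
    y = x.copy()
    y[1:] -= alpha * x[:-1]
    return y

def lpc_coefficients(x, order=10):
    """Predictor coefficients a[1..order] from the autocorrelation normal equations."""
    n = len(x)
    r = np.array([np.dot(x[: n - k], x[k:]) for k in range(order + 1)])
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    R += 1e-9 * r[0] * np.eye(order)  # small ridge for numerical stability
    return np.linalg.solve(R, r[1:])

def lpc_residual(x, a):
    """e[n] = x[n] - sum_k a[k] * x[n-k], assuming zero history before the frame."""
    e = x.copy()
    for k, ak in enumerate(a, start=1):
        e[k:] -= ak * x[:-k]
    return e

def dct_ii(x):
    """Unnormalised DCT-II computed as a matrix product (assumed transform)."""
    n = len(x)
    k = np.arange(n)[:, None]
    m = np.arange(n)[None, :]
    return np.cos(np.pi / n * (m + 0.5) * k) @ x

def hz_to_bark(f):
    """Critical-band rate in Bark (Zwicker and Terhardt approximation)."""
    f = np.asarray(f, dtype=float)
    return 13.0 * np.arctan(0.00076 * f) + 3.5 * np.arctan((f / 7500.0) ** 2)

def bark_subbands(n_coeffs, sample_rate, width_bark=1.0):
    """Index arrays for sub-bands of equal width on the Bark scale."""
    # Approximate frequency of each coefficient, spread evenly up to Nyquist.
    freq_hz = (np.arange(n_coeffs) + 0.5) * sample_rate / (2.0 * n_coeffs)
    band = np.floor(hz_to_bark(freq_hz) / width_bark).astype(int)
    return [np.flatnonzero(band == b) for b in np.unique(band)]

def analyse_frame(frame, sample_rate, order=10):
    """Pre-emphasis, linear prediction, transform of the residual, sub-band grouping."""
    y = pre_emphasis(frame)
    a = lpc_coefficients(y, order)
    coeffs = dct_ii(lpc_residual(y, a))
    vectors = [coeffs[idx] for idx in bark_subbands(len(coeffs), sample_rate)]
    return a, vectors
```

Calling `analyse_frame(frame, 16000)` on a 512-sample frame returns the predictor coefficients and one vector per sub-band. Because the bands have equal width in Bark rather than in Hz, the low-frequency vectors hold only a few coefficients and the high-frequency vectors hold many, which is the property noted in the last sentence of the abstract.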
Citations
Patent
System and mobile cellular telephone device for playing recorded music
TL;DR: A mobile cellular telephone is used to select a music recording from a remote source, such as an online music recording storage facility, and to wirelessly receive the selected music recording.
Patent
Multi-channel signal encoding and decoding
TL;DR: A multi-channel linear predictive analysis-by-synthesis signal encoding method detects inter-channel correlation and selects one of several possible encoding modes (S24, S29, S30) based on the detected correlation.
Patent
Acoustic communication system
Aled Wynne Jones,Michael Raymond Reynolds,David Bartlett,Ian Michael Hosking,Donald Glenn Guy,Peter John Kelly,Daniel R. E. Timson,Nicolas Vasilopolous,Alan Michael Hart,Robert John Morland +9 more
TL;DR: In this article, the authors described a number of encoders for encoding a data signal within an audio signal, where the data signal is separated into a tonal part and a residual part.
Patent
Method and apparatus for seamlessly switching reception between multimedia streams in a wireless communication system
TL;DR: In this article, the authors describe techniques to seamlessly switch reception between multimedia programs by identifying a program with potential for user selection, and then decoding the identified program prior to its selection so that the program can be decompressed and displayed earlier if it is subsequently selected.
Patent
Method and apparatus to recover a high frequency component of audio data
Oh Yoon-Hark,Hyuck-Jae Lee +1 more
TL;DR: A method and an apparatus to recover a high frequency component of an MP3 encoded audio signal in an audio decoder. The method includes: generating a filter bank value of a low frequency band from a modified discrete cosine transform (MDCT) coefficient, which is extracted from an input bitstream according to a window type; extracting transient information of a frame according to the window type and selecting a weight coefficient according to the extracted transient information; and adjusting the filter bank values of the recovered high frequency components according to the weight coefficient.
References
Journal ArticleDOI
Linear prediction: A tutorial review
TL;DR: This paper gives an exposition of linear prediction in the analysis of discrete signals, in which the signal is modeled as a linear combination of its past values and the present and past values of a hypothetical input to a system whose output is the given signal.
Book
Readings in speech recognition
Alex Waibel,Kai-Fu Lee +1 more
TL;DR: This chapter discusses four main approaches to speech recognition: template-based, knowledge-based, stochastic, and connectionist.
Journal ArticleDOI
Analytical expressions for critical‐band rate and critical bandwidth as a function of frequency
E. Zwicker,Ernst Terhardt +1 more
TL;DR: In this paper, the critical-band rate and the critical bandwidth are expressed as functions of frequency, and relatively simple equations are given to express the dependence of critical-band rate on frequency with an accuracy better than 0.2 Bark.
Journal ArticleDOI
Optimizing digital speech coders by exploiting masking properties of the human ear
TL;DR: New results of masking and loudness reduction of noise are reported and the design principles of speech coding systems exploiting auditory masking are described.
Journal ArticleDOI
A tutorial on MPEG/audio compression
TL;DR: This tutorial covers the theory behind MPEG/audio compression and the basics of psychoacoustic modeling and the methods the algorithm uses to compress audio data with the least perceptible degradation.