Showing papers on "Code-excited linear prediction published in 2011"

PDF

Open Access

Patent•

Estimation of speech energy based on code excited linear prediction (CELP) parameters extracted from a partially- decoded CELP-encoded bit stream and applications of same

[...]

Emmanuel Rossignol Thepie Fapi, Eric Poulin, Jean Pierre Doyon

22 Aug 2011

TL;DR: In this paper, a method for estimating speech energy of an encoded bit stream based on coding parameters extracted from the partially-decoded bit stream is presented, which is based on the extracted at least one CELP parameter from the bit stream, without calculating a linear prediction coding (LPC) filter response energy.

...read moreread less

Abstract: Methods, systems, and non-transitory computer readable media for estimating speech energy of an encoded bit stream based on coding parameters extracted from the partially-decoded bit stream are disclosed. According to one aspect, a method for estimating speech energy of an encoded bit stream based on coding parameters extracted from the partially-decoded bit stream includes receiving a CELP-encoded bit stream, partially decoding the bit stream, and estimating the speech energy of the bit stream based a set of four or fewer CELP parameters extracted from the partially decoded bit stream. According to another aspect, a method for estimating speech energy of an encoded bit stream based on coding parameters extracted from the partially-decoded bit stream includes receiving a CELP-encoded bit stream, partially decoding the bit stream, extracting at least one CELP parameter from the partially-decoded bit stream, and estimating the speech energy of the bit stream based on the extracted at least one CELP parameter without calculating a linear prediction coding (LPC) filter response energy.

...read moreread less

43 citations

Patent•

MFCC and CELP to detect turbine engine faults

[...]

Andrew F. Geib, Chung Chiech Kuo, Martin Gawecki, EnShuo Tsau, Je Won Kang, Paul Raymond Scheid - Show less +2 more

23 Jun 2011

TL;DR: In this article, a fault detection and diagnosis method for a gas turbine engines comprises collecting a sensor signal from an acoustic or vibrational sensor at the gas turbine engine, pre-processing the sensor signal to remove predictable background, and extracting a feature set from the sensor signals using Mel-Frequency Cepstral Coefficients (MFCC) algorithms and/or Code Excited Linear Prediction (CELP) algorithms.

...read moreread less

Abstract: A fault detection and diagnosis method for a gas turbine engines comprises collecting a sensor signal from an acoustic or vibrational sensor at the gas turbine engine, pre-processing the sensor signal to remove predictable background, and extracting a feature set from the sensor signal using Mel-Frequency Cepstral Coefficients (MFCC) algorithms and/or Code Excited Linear Prediction (CELP) algorithms. Fault and non-fault states are reported based on comparison between the feature set and a library of fault and non-fault feature profiles corresponding to fault and non-fault states of the gas turbine engine.

...read moreread less

38 citations

Patent•

Audio encoding device and audio decoding device

[...]

Shuji Miyasaka, Kazutaka Abe, Zong Xian Liu, Yong Hwee Sim, Anh Tuan Tran - Show less +1 more

14 Sep 2011

TL;DR: In this article, the spectral coefficients of synthesized signal from CELP core layer are utilized to fulfill spectral gaps in error signal spectrum coefficients from a transform coding layer, and decoded signal spectral coefficients are generated.

...read moreread less

Abstract: Provided is an audio encoding device that can suppress degradation of audio quality. Spectral coefficients of synthesized signal from CELP core layer are utilized to fulfill spectral gaps in error signal spectrum coefficients from a transform coding layer. By both spectral coefficients, decoded signal spectral coefficients are generated. The decoded signal spectral coefficients and the input signal spectral coefficients are divided into a plurality of sub bands. In each sub band, the energy of the input signal spectral coefficient corresponding to a zero decoded error signal spectral coefficient is calculated, and the energy of the decoded signal spectral coefficient corresponding to the zero decoding error signal spectral coefficient is calculated, and their energy ratio is calculated and is quantized and transmitted.

...read moreread less

20 citations

Journal Article•DOI•

Burst packet loss concealment using multiple codebooks and comfort noise for CELP-type speech coders in wireless sensor networks.

[...]

Nam In Park¹, Hong Kook Kim², Jung Min A², Seong Ro Lee³, Seung Ho Choi¹ - Show less +1 more•Institutions (3)

Gwangju Institute of Science and Technology¹, Mokpo National University², Seoul National University of Science and Technology³

17 May 2011-Sensors

TL;DR: A packet loss concealment (PLC) algorithm for CELP-type speech coders is proposed in order to improve the quality of decoded speech under burst packet loss conditions in a wireless sensor network and provides significantly better speech quality than the PLC algorithm employed in G.729.

...read moreread less

Abstract: In this paper, a packet loss concealment (PLC) algorithm for CELP-type speech coders is proposed in order to improve the quality of decoded speech under burst packet loss conditions in a wireless sensor network. Conventional receiver-based PLC algorithms in the G.729 speech codec are usually based on speech correlation to reconstruct the decoded speech of lost frames by using parameter information obtained from the previous correctly received frames. However, this approach has difficulty in reconstructing voice onset signals since the parameters such as pitch, linear predictive coding coefficient, and adaptive/fixed codebooks of the previous frames are mostly related to silence frames. Thus, in order to reconstruct speech signals in the voice onset intervals, we propose a multiple codebook-based approach that includes a traditional adaptive codebook and a new random codebook composed of comfort noise. The proposed PLC algorithm is designed as a PLC algorithm for G.729 and its performance is then compared with that of the PLC algorithm currently employed in G.729 via a perceptual evaluation of speech quality, a waveform comparison, and a preference test under different random and burst packet loss conditions. It is shown from the experiments that the proposed PLC algorithm provides significantly better speech quality than the PLC algorithm employed in G.729 under all the test conditions.

...read moreread less

17 citations

Proceedings Article•

Automatic Speech Codec Identification with Applications to Tampering Detection of Speech Recordings

[...]

Jingting Zhou

01 Jan 2011

TL;DR: Taking advantage of noise patterns they generated, an algorithm was proposed to detect GSM-AMR,EFR,HR and SILK codecs and extended to identify subframe offset to do tampering detection of cellphone speech recordings.

...read moreread less

Abstract: In this paper we explored many versions of CELP codecs and studied different codebooks they use to encode noisy part of residual. Taking advantage of noise patterns they generated, an algorithm was proposed to detect GSM-AMR,EFR,HR and SILK codecs. Then it’s extended to identify subframe offset to do tampering detection of cellphone speech recordings.

...read moreread less

14 citations

Patent•

Audio signal bandwidth extension in celp-based speech coder

[...]

Jonathan Alastair Gibbs¹, James P. Ashley², Udar Mittal¹•Institutions (2)

Motorola¹, Google²

28 Sep 2011

TL;DR: In this paper, the authors proposed a method for decoding an audio signal having a bandwidth that extends beyond a bandwidth of a CELP excitation signal in an audio decoder including an audio-based decoder element.

...read moreread less

Abstract: A method for decoding an audio signal having a bandwidth that extends beyond a bandwidth of a CELP excitation signal in an audio decoder including a CELP-based decoder element. The method includes obtaining a second excitation signal having an audio bandwidth extending beyond the audio bandwidth of the CELP excitation signal, obtaining a set of signals by filtering the second excitation signal with a set of bandpass filters, scaling the set of signals using a set of energy-based parameters, and obtaining a composite output signal by combining the scaled set of signals with a signal based on the audio signal decoded by the CELP-based decoder element.

...read moreread less

13 citations

Proceedings Article•DOI•

Environmental sound recognition with CELP-based features

[...]

EnShuo Tsau¹, Seung-Hwan Kim¹, C.-C. Jay Kuo¹•Institutions (1)

University of Southern California¹

08 Aug 2011

TL;DR: It is shown by experimental results that the CELP-based features outperform the MFCC features in the ESR problem by a significant 9% margin in average and the integrated MFCC and CELp-based feature set can even reach a correct classification rate of 95.2% using the Bayesian network classifier.

...read moreread less

Abstract: In this work, we propose the use of a set of new features based on CELP (Code Excited Linear Prediction) to enhance the performance of the environmental sound recognition (ESR) problem. Traditionally, Mel Frequency Cepstral Coefficients (MFCC) have been used for the recognition of structured data like speech and music. However, their performance for the ESR problem is limited. An audio signal can be well preserved by its highly compressed CELP bit streams, which motivates us to study the CELP-based features for the audio scene recognition problem. We present a way to extract a set of features from the CELP bit streams and compare the performance of ESR using different feature sets with the Bayesian network classifier. It is shown by experimental results that the CELP-based features outperform the MFCC features in the ESR problem by a significant 9% margin in average and the integrated MFCC and CELP-based feature set can even reach a correct classification rate of 95.2% using the Bayesian network classifier.

...read moreread less

12 citations

Journal Article•DOI•

One-Pulse FEC Coding for Robust CELP-Coded Speech Transmission Over Erasure Channels

[...]

Angel M. Gomez¹, Jose L. Carmona¹, José A. González¹, Victoria E. Sánchez¹•Institutions (1)

University of Granada¹

01 Oct 2011-IEEE Transactions on Multimedia

TL;DR: An optimization procedure, which takes into account the overall synthesis error, is proposed in order to provide better pulse-position and pulse-amplitude quantization codebooks and shows that the number of bits required to represent the resynchronization pulse is effectively reduced.

...read moreread less

Abstract: In this paper, we present an improved quantization scheme for the redundancy data of a forward error correction (FEC) technique proposed for the transmission of code-excited linear prediction (CELP)-coded speech over erasure channels. The use of a FEC-based error protection scheme is motivated by the well-known fact that, after a frame erasure, the previous excitation is not available and a desynchronization between the encoder and the decoder long-term prediction (LTP) filters appears, causing an additional distortion which is propagated to subsequent frames. LTP synchronization can be recovered by means of a single-pulse representation of the previous excitation. No additional delay is introduced by this technique which only requires a small transmission bandwidth increase. In this paper, we focus on the efficient encoding of this pulse. Thus, an optimization procedure, which takes into account the overall synthesis error, is proposed in order to provide better pulse-position and pulse-amplitude quantization codebooks. Moreover, by extending the previous procedure, an efficient joint position-amplitude quantization can be obtained. Objective quality tests applied to our proposal show that, by means of the proposed codebooks, the number of bits required to represent the resynchronization pulse is effectively reduced. In addition, a discontinuous transmission mechanism is derived from the cost functional used during joint position-amplitude quantization, further reducing the bit-rate.

...read moreread less

11 citations

Patent•

Predictive coding with block shapes derived from a prediction error

[...]

Matthias Narroschke¹, Florian Knicker¹•Institutions (1)

Panasonic¹

26 Apr 2011

TL;DR: In this article, the first component is coded by using prediction and the second component is segmented to different parts used for its coding according to the prediction error, and then the last component is used for decoding according to its prediction error.

...read moreread less

Abstract: The present invention relates to block-wise coding and decoding of a video signal including at least two color components. The first component is coded by using prediction and the second component is segmented to different parts used for its coding according to the prediction error.

...read moreread less

11 citations

Patent•

Low-delay unified speech and audio codec

[...]

Ralf Geiger, Markus Schnell, Guillaume Fuchs, Ravelli Emmanuel, Tom Bäckström, Jeremie Lecomte, Konstantin Schmidt, Nikolaus Rettelbach, Manfred Lutzky, Bernhard Grill - Show less +6 more

27 May 2011

TL;DR: In this paper, a unified speech and audio decoder is described, which comprises a frame buffer configured to buffer a sub-part of a datastream composed of consecutive frames in units of the frames so that the subpart continuously comprises at least one frame, each frame representing a coded version of a respective portion of consecutive portions of an audio signal.

...read moreread less

Abstract: A unified speech and audio decoder is described, which comprises a frame buffer configured to buffer a sub-part of a datastream composed of consecutive frames in units of the frames so that the subpart continuously comprises at least one frame, each frame representing a coded version of a respective portion of consecutive portions of an audio signal, and each frame comprising a mode identifier assigning the respective frame to a respective one of a plurality of coding modes comprising a CELP (codebook excitation linear prediction) coding mode and a transform coded excitation linear prediction coding mode. Further, the unified speech and audio decoder comprises a CELP decoder configured to decode the frames to which the CELP coding mode is assigned to reconstruct the respective portions, and a transform coded excitation linear prediction decoder configured to decode the frames to which the transform coded excitation linear prediction coding mode is assigned, to reconstruct the respective portions, wherein the frame buffer is configured to distribute the frames buffered to the CELP decoder and the transform coded excitation linear prediction decoder under removal of the respective frames from the frame buffer, frame-wise.

...read moreread less

11 citations

Proceedings Article•DOI•

Effect of Low Bit Rate Speech Coding on Epoch Extraction

[...]

Anil Kumar Vuppala¹, Jainath Yadav¹, Saswat Chakrabarti¹, K. Sreenivasa Rao¹•Institutions (1)

Indian Institute of Technology Kharagpur¹

24 Mar 2011

TL;DR: The effect of low bit rate speech coding on the accuracy of detection of epochs is explored in this paper using CMU-Arctic data using the epoch locations from electro-glottograph as reference.

...read moreread less

Abstract: Speech coding is one of the major degradation involved in building the speech systems in mobile environment. In this paper, we are exploring the effect of low bit rate speech coding on the accuracy of detection of epochs. Epoch is referred as the instant of significant excitation of the vocal-tract system during production of speech. Many speech applications depend on the the accurate estimation of the epoch locations. Epoch extraction from speech signal is challenging due to time-varying characteristics of the excitation source and vocal-tract system. For determining the epochs, two recently developed accurate methods (i) zero frequency filter (ZFF) (ii) Dynamic programming projected phase slope (DYPSA) are used. Most of the epoch extraction methods except ZFF method, attempt to remove the characteristics of the vocal-tract system, in order to emphasize the excitation characteristics in the residual. ZFF method extracts the epoch locations directly from the speech signals using impulse like nature of excitation. Speech coders used in this study are GSM full rate (ETSI 06.10), CELP (FS-1016), and MELP (TI 2.4 kbps). Performance of epoch extraction methods is evaluated using CMU-Arctic data using the epoch locations from electro-glottograph as reference.

...read moreread less

Proceedings Article•DOI•

Implementation of CELP CODER and to evaluate the performance in terms of bit rate, coding delay and quality of speech

[...]

Milind Tandel¹, Vandana Shah², Bhavina Patel²•Institutions (2)

G H Patel College Of Engineering & Technology¹, Sarvajanik College of Engineering and Technology²

08 Apr 2011

TL;DR: The implementation of CELP CODEC and its analytical evaluation of performance in terms of bit rate, coding delay and Quality of speech are discussed.

...read moreread less

Abstract: Factors serving as constraints in today's wireless communication system include bandwidth and power. In wireless systems that require the transmission of speech, these goals are addressed by developing efficient methods of reducing the amount of information required to transmit and receive quality speech. For this reason, speech coding has been, and remains, the topic of aggressive research. This paper discusses the implementation of CELP CODEC and its analytical evaluation of performance in terms of bit rate, coding delay and Quality of speech. The CELP coder is one of the best methods for producing high quality speech at bit rates between 4.8 and 9.6 Kbps.

...read moreread less

Journal Article•

Comparison of coding schemes for genetic algorithms

[...]

Zhang Chao-qun¹, Zheng Jian-guo, Qian Jie¹•Institutions (1)

Guangxi University for Nationalities¹

01 Jan 2011-Application Research of Computers

TL;DR: This paper focused on analyzing and comparing several popular coding schemes for genetic algorithms such as binary coding, real coding,matrix coding, tree coding and quantum coding, and summarized their principles, advantages and disadvantages, application scopes and application trends.

...read moreread less

Abstract: Designing a rational coding scheme for a concrete problem is one of application difficulties of genetic algorithms,but up to now there is no uniform solution to itThis paper focused on analyzing and comparing several popular coding schemes for genetic algorithms such as binary coding,real coding,matrix coding,tree coding and quantum coding,then summarized their principles,advantages and disadvantages,application scopes and application trendsFurthermore,pointed out some future research directions for coding schemes

...read moreread less

Proceedings Article•DOI•

Hybrid Scalar/Vector Quantization of Mel-Frequency Cepstral Coefficients for Low Bit-Rate Coding of Speech

[...]

Laura E. Boucheron¹, Phillip L. De Leon¹, Steven Sandoval¹•Institutions (1)

New Mexico State University¹

29 Mar 2011

TL;DR: The results show perceptual evaluation of speech quality (PESQ) of the MFCC-based codec matches the state-of-the-art MELPe codec at 600 bps and exceeds the CELP codec at 2000 -- 4000 bps coding rates.

...read moreread less

Abstract: In this paper, we propose a low bit-rate speech codec based on a hybrid scalar/vector quantization of the mel-frequency cepstral coefficients (MFCCs). We begin by showing that if a high-resolution mel-frequency cepstrum (MFC) is computed, good-quality speech reconstruction is possible from the MFCCs despite the lack of explicit phase information. By evaluating the contribution toward speech quality that individual MFCCs make and applying appropriate quantization, our results show perceptual evaluation of speech quality (PESQ) of the MFCC-based codec matches the state-of-the-art MELPe codec at 600 bps and exceeds the CELP codec at 2000 -- 4000 bps coding rates. The main advantage of the proposed codec is in distributed speech recognition (DSR) since speech features based on MFCCs can be directly obtained from code words thus eliminating additional decode and feature extract stages.

...read moreread less

Patent•

Encoder apparatus and encoding method

[...]

Takuya Kawashima¹, Masahiro Oshikiri¹•Institutions (1)

Panasonic¹

05 Sep 2011

TL;DR: In this paper, an encoder apparatus that can suppress the quality degradation of encoding processes is provided. But the spectrum of an input signal and a residual spectrum are used to determine a given number of pre-selected suppression factors to a CELP component suppressing unit (104), and a distortion evaluating unit (112) determines one of the designated suppression factors by use of the spectrum generated by decoding a second code obtained by the second encoding process.

...read moreread less

Abstract: Provided is an encoder apparatus that can suppress the quality degradation of encoding processes. An ultimate selection candidate limiting unit (109) uses the spectrum of an input signal and a residual spectrum to designate a given number of pre-selected suppression factors to a CELP component suppressing unit (104); the CELP component suppressing unit (104) uses the designated suppression factors to generate a suppressed spectrum; a CELP residual signal spectrum calculating unit (105), to which the suppressed spectrum is input, calculates a residual spectrum; a conversion encoding unit (110) uses the residual spectrum to performs a second encoding process; and a distortion evaluating unit (112) determines one of the designated suppression factors by use of the spectrum of a second decoded signal generated by decoding a second code obtained by the second encoding process and further by use of the suppressed spectrum and the spectrum of the input signal.

...read moreread less

Journal Article•DOI•

Speech compression exploiting linear prediction coefficients codebook and hamming correction code algorithm

[...]

Islam Amro¹, Raed Abu Zitar², Hussein Bahadili•Institutions (2)

Al-Quds Open University¹, New York Institute of Technology²

01 Jun 2011-International Journal of Speech Technology

TL;DR: A replacement algorithm for Linear Prediction Coefficients (LPC) along with Hamming Correction Code based Compressor (HCDC) algorithms are investigated for speech compression, based on constructing dynamic reflection coefficients codebook.

...read moreread less

Abstract: In this paper, a replacement algorithm for Linear Prediction Coefficients (LPC) along with Hamming Correction Code based Compressor (HCDC) algorithms are investigated for speech compression. We started with an CELP system with order 12 and with Discrete Cosine Transform (DCT) based residual excitation. Forty coefficients with transmission rate of 5.14 kbps were first used. For each frame of the testing signals we applied a multistage HCDC, we tested the compression performance for parities from 2 to 7, we were able to achieve compression only at parity 4. This rate reduction was made with no compromise in the original CELP signal quality since compression is lossless. The compression approach is based on constructing dynamic reflection coefficients codebook, this codebook is constructed and used simultaneously using a certain store/retrieve threshold. The initial linear prediction codec we used is excited by a discrete cosine transform (DCT) residual, the results were tested using the MOS and SSNR, we had acceptable ranges for the MOS (average 3.6), and small variations of the SSNR (±5 db).

...read moreread less

Patent•

Flexible and scalable combined innovation codebook for use in CELP coder and decoder

[...]

Bessette Bruno

08 Apr 2011

TL;DR: In this paper, a combined innovation codebook coding device comprises a pre-quantizer of a first, adaptive codebook excitation residual, and a CELP innovation-codebook search module responsive to the second excitation contribution produced from the first adaptive code book excitation.

...read moreread less

Abstract: In a CELP coder, a combined innovation codebook coding device comprises a pre-quantizer of a first, adaptive-codebook excitation residual, and a CELP innovation-codebook search module responsive to a second excitation residual produced from the first, adaptive-codebook excitation residual. In a CELP decoder, a combined innovation codebook comprises a de-quantizer of pre-quantized coding parameters into a first excitation contribution, and a CELP innovation-codebook structure responsive to CELP innovation-codebook parameters to produce a second excitation contribution.

...read moreread less

Patent•

Encoding device, decoding device, spectrum fluctuation calculation method, and spectrum amplitude adjustment method

[...]

Masahiro Oshikiri¹•Institutions (1)

Panasonic¹

13 Jan 2011

TL;DR: In this paper, an encoding device whereby it is possible to improve the quality of an encoded signal, even when encoding music signals, is described. But it is not shown how to use it in practice.

...read moreread less

Abstract: Disclosed is an encoding device whereby it is possible to improve the quality of an encoded signal, even when encoding music signals. In the encoding device, a Code-Excited Linear Prediction (CELP) encoder (101) generates first encoded data by encoding an input signal, a CELP decoder (102) generates a decoded signal by decoding the first encoded data input from the CELP encoder (101), and a characteristic parameter encoder (106) calculates a parameter that expresses the degree of fluctuation in the ratio of the peak components and the floor components between the spectra of the decoded signal and the input signal.

...read moreread less

Patent•

Decoder, encoder, and methods thereof

[...]

Takuya Kawashima¹, Masahiro Oshikiri¹•Institutions (1)

Panasonic¹

27 May 2011

TL;DR: In this paper, a decoder capable of improving the sound quality of a decoded sound signal in an encoding method which combines speech encoding and music encoding in a hierarchical structure is presented.

...read moreread less

Abstract: Disclosed is a decoder capable of improving the sound quality of a decoded sound signal in an encoding method which combines speech encoding and music encoding in a hierarchical structure. A transform-encoding decoding unit (202) decodes transform-encoded data to generate a spectrum of a decoded transform-encoded signal. A band decision unit (203) uses the spectrum of the decoded transform-encoded signal to decide whether each of a plurality of bands in which frequency components of an input signal are divided constitute a first band in which a transform encoded pulse is not established or a second band in which said pulse is established. A CELP component suppression unit (207) suppresses the spectrum of a CELP decoded signal, which is the frequency component of a decoded signal of CELP encoded data, to the extent that suppression in the first band is weaker than suppression in the second band.

...read moreread less

Proceedings Article•DOI•

Compressed domain speech enhancement based on the joint modification of codebook gains

[...]

Bing-yin Xia¹, Changchun Bao¹, Yan Liang¹, Xuan Zhou¹, Yu-wen He¹, Ru-wei Li¹ - Show less +2 more•Institutions (1)

Beijing University of Technology¹

14 Dec 2011

TL;DR: The proposed compressed domain speech enhancement method could provide larger amount of noise reduction in both white and colored noise with smaller attenuation on the speech level, and the objective speech quality is improved evidently.

...read moreread less

Abstract: A compressed domain speech enhancement method based on the joint modification of adaptive and algebraic codebook gains for the codec of ITU-T G.722.2 is proposed in this paper. First the power of excitation signal corresponding to the noise is estimated by the method of minimum statistics. Then the decision-directed approach is used to get an estimate of the a priori SNR. And the algebraic codebook gain is modified by multiplying a Wiener-type modification factor. In order to solve the problem of power loss in voiced segment, the modified adaptive codebook gain is got by keeping the power of the modified excitation signal equal to the scaled version of the noisy one. The result of performance evaluation under ITU-T G.160 shows that, in comparison with the method that only modifies the algebraic codebook gain, the proposed method could provide larger amount of noise reduction in both white and colored noise with smaller attenuation on the speech level, and the objective speech quality is improved evidently.

...read moreread less

The Equivalence of ADPCM and CELP Coding

[...]

Peter Kabal

01 Jan 2011

TL;DR: This work is licensed under the Creative Commons Attribution 3.0 Unported License to view a copy of this license, visit.

...read moreread less

Abstract: to Remix – to adapt this work Subject to conditions outlined in the license. This work is licensed under the Creative Commons Attribution 3.0 Unported License. To view a copy of this license, visit

...read moreread less

Proceedings Article•DOI•

An interactive speech coding tool using LabVIEW

[...]

Karthikeyan Natesan Ramamurthy¹, Jayaraman J. Thiagarajan¹, Andreas Spanias¹•Institutions (1)

Arizona State University¹

24 Mar 2011

TL;DR: This paper discusses the development of an interactive speech coding tool in National Instruments LabVIEW™ software for the Federal Standard-1016 CELP algorithm, and how this tool can be used to teach the various modules of the CelP based speech coders to undergraduate and graduate students.

...read moreread less

Abstract: Code Excited Linear Prediction (CELP) is a closed-loop analysis-by-synthesis speech coding algorithm that has been standardized in Federal Standard-1016. Variants of the CELP algorithm form the core of many speech coding standards that exist today. In this paper, we discuss the development of an interactive speech coding tool in National Instruments LabVIEW™ software for the Federal Standard-1016 CELP algorithm. A brief description of the speech coding algorithm and the features of the LabVIEW speech coding tool are presented. Illustrations demonstrating the use of the interactive software tool in analyzing the speech coding algorithm are provided. This tool can be used to teach the various modules of the CELP based speech coders to undergraduate and graduate students.

...read moreread less

Journal Article•DOI•

Analysis of Fundamental Frequency Contour of Coded Speech Based on Multi-Pulse Based Code Excited Linear Prediction Algorithm

[...]

Suphattharachai Chomphan

14 Jun 2011-Journal of Computer Science

TL;DR: It is proved that the MP-CELP speech coder deteriorates the fundamental frequency contour of the transmitted speech.

...read moreread less

Abstract: Problem statement: In low-bit-rate speech communication, speech coding deteriorates the characteristics of the coded speech significantly. An important feature of the speech is the fundamental frequency contour which determines the pitch information of the speech. It has been known that pitch information is one of the core parameter of the multi-pulse based code excited linear prediction (MPCELP) speech coder. Therefore the study of the deteriorated fundamental frequency contour should be conducted properly. Approach: This study proposes an analysis of the fundamental frequency contour of the coded speech based on MP-CELP speech coder. The comparison of the fundamental frequency contour of the natural speech and that of the coded speech has been performed. The MP-CELP with three levels of bitrate scalability is selected as the core speech coder. The speech material includes a hundred of male speech utterances and a hundred of female speech utterances. Results: The experimental results show that the speech coder causes the deterioration of the fundamental frequency contour empirically. The Root Mean Square Error (RMSE) between the fundamental frequency contour of the natural speech and that of the coded speech for three different bitrates has been conducted. The lower bitrate causes the higher value of RMSE. Conclusion: From the study, it is a proved that the MP-CELP speech coder deteriorates the fundamental frequency contour of the transmitted speech.

...read moreread less

Patent•

Encoding method, decoding method, encoding device, decoding device, and program

[...]

Moriya Takehiro, Harada Noboru, Kamamoto Masaru

07 Jul 2011

TL;DR: In this article, the authors propose a method to improve encoding quality when a frame to be encoded by a first encoding system does not continue when each frame is encoded by the CELP (Code-Excited Linear Prediction) encoding system or a second encoding system such as a frequency encoding system.

...read moreread less

Abstract: PROBLEM TO BE SOLVED: To improve encoding quality when a frame to be encoded by a first encoding system does not continue when each frame is encoded by the first encoding system such as a CELP (Code-Excited Linear Prediction) encoding system or a second encoding system such as a frequency encoding system. SOLUTION: When a frame is encoded by the first encoding system or when a code encoded by the first encoding system is decoded, a decoded signal corresponding to a time-series signal at each point belonging to a past frame is linearly predicted and analyzed, and the obtained residual signal is used as a substitute for an excitation signal. COPYRIGHT: (C)2011,JPO&INPIT

...read moreread less

Patent•

Encoder apparatus and method that perform preliminary signal selection for transform coding before main signal selection for transform coding

[...]

Takuya Kawashima¹, Masahiro Oshikiri¹•Institutions (1)

Panasonic¹

05 Sep 2011

TL;DR: In this paper, an encoder apparatus is provided that suppresses the quality degradation of encoding processes, and an ultimate selection candidate limiting unit uses the spectrum of an input signal and a residual spectrum to designate a given number of pre-selected suppression factors to a CELP component suppressing unit, which uses the designated suppressions to generate a suppressed spectrum.

...read moreread less

Abstract: An encoder apparatus is provided that suppresses the quality degradation of encoding processes. An ultimate selection candidate limiting unit uses the spectrum of an input signal and a residual spectrum to designate a given number of pre-selected suppression factors to a CELP component suppressing unit, which uses the designated suppression factors to generate a suppressed spectrum. A CELP residual signal spectrum calculating unit, to which the suppressed spectrum is input, calculates a residual spectrum. A conversion encoding unit uses the residual spectrum to perform a second encoding process. A distortion evaluating unit determines one of the designated suppression factors by use of the spectrum of a second decoded signal generated by decoding a second code obtained by the second encoding process, and further by use of the suppressed spectrum and the spectrum of the input signal.

...read moreread less

Proceedings Article•DOI•

Complexity scalability design in coding of the adaptive codebook for ITU-T G.729 speech coder

[...]

Fu-Kun Chen¹, Guan-Ming Chen¹, Yue-Dar Jou²•Institutions (2)

National Taiwan University¹, R.O.C Military Academy²

01 Dec 2011

TL;DR: Simulation results show that the proposed approaches can effectively reduce the computational complexity of the close-loop pitch search with imperceptible degradation of the speech quality.

...read moreread less

Abstract: Using the adaptive codebook and stochastic codebook, the code-excited linear prediction (CELP) speech coders can achieve a good speech quality at low bit rate. When the complexity reduction of the stochastic codebook search reaches a bottleneck, the only way to reduce the complexity of the speech codec would be simplifying the computation of the adaptive codebook. This paper introduces and describes the complexity scalability design for the adaptive codebook search of the G.729 speech coder. Two schemes, the search-range reduction and the dimension-reduced method, for complexity reduction will be discussed. Simulation results show that the proposed approaches can effectively reduce the computational complexity of the close-loop pitch search with imperceptible degradation of the speech quality.

...read moreread less

Proceedings Article•

Sparse signal approximation algorithms in a CELP coder

[...]

Przemyslaw Dymarski¹, Rafal Romaniuk¹•Institutions (1)

Warsaw University of Technology¹

01 Aug 2011

TL;DR: New algorithms are proposed, based on cyclic and parallel use of a fast implementation of the optimized orthogonal matching pursuit algorithm, i.e. the recursive modified Gram - Schmidt algorithm, that yield a statistically significant reduction of signal approximation error at a controllable computational complexity.

...read moreread less

Abstract: This work presents a series of sparse signal modeling algorithms implemented in a typical CELP coder in order to compare their performances at a reasonable computational load. New algorithms are proposed, based on cyclic and parallel use of a fast implementation of the optimized orthogonal matching pursuit algorithm, i.e. the recursive modified Gram - Schmidt algorithm. These algorithms yield a statistically significant reduction of signal approximation error at a controllable computational complexity.

...read moreread less

Proceedings Article•DOI•

Frame error-robust MDCT bit reduction scheme using inter-frame correlation for G.729.1

[...]

Keunseok Cho¹, Sangbae Jung², Hyun-woo Kim³, Minsoo Hahn¹•Institutions (3)

KAIST¹, Gyeongsang National University², Electronics and Telecommunications Research Institute³

22 May 2011

TL;DR: A new algorithm for encoding spectral envelope in G.729.1 for VoIP is proposed and a new bit allocation strategy using reduced bits from the proposed algorithm is also proposed.

...read moreread less

Abstract: In this paper, a new algorithm for encoding spectral envelope in G.729.1 for VoIP is proposed. In the TDAC part of G.729.1, the spectral envelope and MDCT coefficients extracted in the weighted CELP coding error (lower-band) and the higher-band input signal are encoded. In order to reduce allocation bits for spectral envelope coding, a new algorithm using sub-band correlation between adjacent frames is proposed. In addition, a new bit allocation strategy using reduced bits from the proposed algorithm is also proposed. The performance of the proposed algorithm is evaluated by sound quality and bit reduction rates in clean and frame loss conditions.

...read moreread less

Book Chapter•DOI•

Chaos for speech coding and production

[...]

Behnam Kia¹, William L. Ditto², Mark L. Spano¹•Institutions (2)

Arizona State University¹, University of Hawaii at Manoa²

07 Nov 2011

TL;DR: The presence of nonlinearity and chaotic behavior in the human speech production system has been reported previously, but chaotic dynamics has not been widely exploited in speech coding and artificial speech production algorithms.

...read moreread less

Abstract: The presence of nonlinearity and chaotic behavior in the human speech production system has been reported previously; however to date chaotic dynamics has not been widely exploited in speech coding and artificial speech production algorithms. In this paper we illustrate how we can utilize chaotic dynamics in speech coding and synthesis and discuss how it can improve the performance of these processes. As an example we choose code-excited linear predictive coding and, instead of an excitation codebook consisting of Gaussian random waveforms, we use chaotic systems to produce chaotic excitations. This simple technique has the potential to greatly improve the efficiency of modern communication devices such as cell phones. We call the resulting scheme chaos-excited linear predictive coding.

...read moreread less

Patent•

Encoding apparatus, decoding apparatus, encoding method, and decoding method for adjusting a spectrum amplitude

[...]

Masahiro Oshikiri¹•Institutions (1)

Panasonic¹

13 Jan 2011

TL;DR: In this article, a Code-Excited Linear Prediction (CELP) encoder generates first encoded data by encoding an input signal, and a CELP decoder generates a decoded signal by decoding the first encoded data input from the encoder, while a characteristic parameter encoder calculates a parameter that expresses the degree of fluctuation in the ratio of the peak components and the floor components between the spectra of the decoded signals and the input signal.

...read moreread less

Abstract: An encoding device is provided for increasing the quality of an encoded signal, even when encoding music signals. In the encoding device, a Code-Excited Linear Prediction (CELP) encoder generates first encoded data by encoding an input signal, and a CELP decoder generates a decoded signal by decoding the first encoded data input from the CELP encoder. Additionally, a characteristic parameter encoder calculates a parameter that expresses the degree of fluctuation in the ratio of the peak components and the floor components between the spectra of the decoded signal and the input signal.

...read moreread less