Proceedings ArticleDOI
Methods for nonlinear spectral distortion of speech signals
John Makhoul
- Vol. 1, pp 87-90
TLDR
This paper presents a general analysis-synthesis scheme for the arbitrary spectral distortion of speech signals without the need for pitch extraction, and linear predictive warping, cepstral Warping, and autocorrelation warping are given as examples of the general scheme.Abstract:
The spectral distortion of speech signals, without affecting the pitch or the speed of the signal, has met with some difficulty due to the need for pitch extraction. This paper presents a general analysis-synthesis scheme for the arbitrary spectral distortion of speech signals without the need for pitch extraction. Linear predictive warping, cepstral warping, and autocorrelation warping, are given as examples of the general scheme. Applications include the unscrambling of helium speech, spectral compression for the hard of hearing, bit rate reduction in speech compression systems, and efficiency of spectral representation for speech recognition systems.read more
Citations
More filters
Journal ArticleDOI
Comparative study of several distortion measures for speech recognition
TL;DR: This study compared several different spectral distortion measures including the Itakura-Saito distortion measure, the log likelihood ratio and weighted slope metric distortion measures, and two proposed perceptually based distortion measures in terms of their effects on the performance of standard dynamic time warping (DTW) based, isolated word, speech recognizer.
Proceedings ArticleDOI
LPCW: An LPC vocoder with linear predictive spectral warping
John Makhoul,L. Cosell +1 more
TL;DR: An LPC vocoder employing the recently developed method of linear predictive warping (LPW), which achieves improved speech quality for the same bit rate in a minimally redundant model.
Journal ArticleDOI
Helium speech enhancement using the short-time Fourier transform
TL;DR: In this paper, a short-time Fourier transform (FTFT) signal representation is used to enhance the intelligibility of hyperbaric helium-oxygen atmosphere. But the results of formal intelligibility tests are reviewed, these tests show that the noise reduction scheme is detrimental to intelligibility, but fail to conclusively resolve the importance of a non-linear formant frequency shift.
Journal ArticleDOI
Time-frequency speech transformation based on an elementary waveform representation
TL;DR: The classical theory of speech production proves the validity of the EWSM parameters; their modifications yield well-localized time-frequency transformations, including frequency compression/expansion, pitch, formant and noise modification.
Journal ArticleDOI
Speech processing at BBN
TL;DR: This survey of speech processing activities covers a period that began around 1971, and a number of today's best-regarded techniques in speech and language processing stem from BBN's early work.
References
More filters
Journal ArticleDOI
Discrete representation of signals
Alan V. Oppenheim,Don H. Johnson +1 more
TL;DR: The requirements for digital sequences by other digital sequences and the use of such representations to implement a nonlinear warping of the digital frequency axis are discussed within the framework of simulating linear time-invariant systems.
Journal ArticleDOI
Spectral linear prediction: Properties and applications
TL;DR: LP is compared with traditional analysis-by-synthesis (AbS) techniques for spectral modeling and it is found that linear prediction offers computational advantages over AbS, as well as better modeling properties if the variations of the signal spectrum from the desired spectral model are large.
Proceedings ArticleDOI
LPCW: An LPC vocoder with linear predictive spectral warping
John Makhoul,L. Cosell +1 more
TL;DR: An LPC vocoder employing the recently developed method of linear predictive warping (LPW), which achieves improved speech quality for the same bit rate in a minimally redundant model.
Proceedings ArticleDOI
New lattice methods for linear prediction
TL;DR: This paper presents a new formulation for linear prediction, which is viewed as one of a class of lattice methods which guarantee the stability of the all-pole filter, with or without windowing of the signal, with finite wordlength computations, and with the number of computations being comparable to the traditional autocorrelation and covariance methods.