scispace - formally typeset
Proceedings ArticleDOI

Methods for nonlinear spectral distortion of speech signals

John Makhoul
- Vol. 1, pp 87-90
TLDR
This paper presents a general analysis-synthesis scheme for the arbitrary spectral distortion of speech signals without the need for pitch extraction, and linear predictive warping, cepstral Warping, and autocorrelation warping are given as examples of the general scheme.
Abstract
The spectral distortion of speech signals, without affecting the pitch or the speed of the signal, has met with some difficulty due to the need for pitch extraction. This paper presents a general analysis-synthesis scheme for the arbitrary spectral distortion of speech signals without the need for pitch extraction. Linear predictive warping, cepstral warping, and autocorrelation warping, are given as examples of the general scheme. Applications include the unscrambling of helium speech, spectral compression for the hard of hearing, bit rate reduction in speech compression systems, and efficiency of spectral representation for speech recognition systems.

read more

Citations
More filters
Journal ArticleDOI

Comparative study of several distortion measures for speech recognition

TL;DR: This study compared several different spectral distortion measures including the Itakura-Saito distortion measure, the log likelihood ratio and weighted slope metric distortion measures, and two proposed perceptually based distortion measures in terms of their effects on the performance of standard dynamic time warping (DTW) based, isolated word, speech recognizer.
Proceedings ArticleDOI

LPCW: An LPC vocoder with linear predictive spectral warping

TL;DR: An LPC vocoder employing the recently developed method of linear predictive warping (LPW), which achieves improved speech quality for the same bit rate in a minimally redundant model.
Journal ArticleDOI

Helium speech enhancement using the short-time Fourier transform

TL;DR: In this paper, a short-time Fourier transform (FTFT) signal representation is used to enhance the intelligibility of hyperbaric helium-oxygen atmosphere. But the results of formal intelligibility tests are reviewed, these tests show that the noise reduction scheme is detrimental to intelligibility, but fail to conclusively resolve the importance of a non-linear formant frequency shift.
Journal ArticleDOI

Time-frequency speech transformation based on an elementary waveform representation

TL;DR: The classical theory of speech production proves the validity of the EWSM parameters; their modifications yield well-localized time-frequency transformations, including frequency compression/expansion, pitch, formant and noise modification.
Journal ArticleDOI

Speech processing at BBN

TL;DR: This survey of speech processing activities covers a period that began around 1971, and a number of today's best-regarded techniques in speech and language processing stem from BBN's early work.
References
More filters
Journal ArticleDOI

Discrete representation of signals

TL;DR: The requirements for digital sequences by other digital sequences and the use of such representations to implement a nonlinear warping of the digital frequency axis are discussed within the framework of simulating linear time-invariant systems.
Journal ArticleDOI

Spectral linear prediction: Properties and applications

TL;DR: LP is compared with traditional analysis-by-synthesis (AbS) techniques for spectral modeling and it is found that linear prediction offers computational advantages over AbS, as well as better modeling properties if the variations of the signal spectrum from the desired spectral model are large.
Proceedings ArticleDOI

LPCW: An LPC vocoder with linear predictive spectral warping

TL;DR: An LPC vocoder employing the recently developed method of linear predictive warping (LPW), which achieves improved speech quality for the same bit rate in a minimally redundant model.
Proceedings ArticleDOI

New lattice methods for linear prediction

TL;DR: This paper presents a new formulation for linear prediction, which is viewed as one of a class of lattice methods which guarantee the stability of the all-pole filter, with or without windowing of the signal, with finite wordlength computations, and with the number of computations being comparable to the traditional autocorrelation and covariance methods.