Robust recognition of loud and Lombard speech in the fighter cockpit environment

doi:10.1109/ICASSP.1989.266517

Proceedings ArticleDOI

Robust recognition of loud and Lombard speech in the fighter cockpit environment

B.J. Stanton, +2 more

- pp 675-678

Chats0

TLDR

In this paper, a method is devised that uses the differences in spectral slope between linear predictive coding log magnitude spectra to weight the point-by-point energy differences between the spectra.

Abstract:

The major goal of this research is to reduce the discrepancy in recognition performance between normal and abnormal speech, given that reference templates were derived only from normal speech. A method is devised that uses the differences in spectral slope between linear predictive coding log magnitude spectra to weight the point-by-point energy differences between the spectra. The distances of all reference tokens of like phonemes are combined to form a smallest cumulative distance (SCD) method. When SCD is combined with the method of slope-dependent weighting (SDW), the most significant success is obtained. In terms of error rates for a fixed phoneme vector length of five, SDW+SCD is found to reduce the difference in error rate between normal and abnormal speech by approximately 50%. >

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Nonlinear feature based classification of speech under stress

G. Zhou, +2 more

- 01 Mar 2001 -

IEEE Transactions on Speech and Audio Pr...

TL;DR: Three new features derived from the nonlinear Teager (1980) energy operator (TEO) are investigated for stress classification and it is shown that the TEO-CB-Auto-Env feature outperforms traditional pitch and mel-frequency cepstrum coefficients (MFCC) substantially.

...read moreread less

Journal ArticleDOI

A comparative study of traditional and newly proposed features for recognition of speech under stress

Sahar E. Bou-Ghazale, +1 more

- 01 Jul 2000 -

IEEE Transactions on Speech and Audio Pr...

TL;DR: The results show that unlike fast Fourier transform's (FFT) immunity to noise, the linear prediction power spectrum is more immune than FFT to stress as well as to a combination of a noisy and stressful environment.

...read moreread less

Journal ArticleDOI

Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition

John H. L. Hansen

- 01 Nov 1996 -

Speech Communication

TL;DR: It is suggested that recent studies based on a Source Generator Framework can provide a viable foundation in which to establish robust speech recognition techniques, and three novel approaches for signal enhancement and stress equalization are considered to address the issue of recognition under noisy stressful conditions.

...read moreread less

Journal ArticleDOI

On the relationship between, and measurement of, amplitude and frequency in birdsong

Sue Anne Zollinger, +4 more

- 01 Oct 2012 -

Animal Behaviour

TL;DR: A growing number of studies ask whether and how bird songs vary between areas with low versus high levels of anthropogenic noise as discussed by the authors and find that birds are seen to sing at higher frequencies in urban versus rural populations, presumably because of selection for higher-pitched songs in the face of low-frequency urban noise.

...read moreread less

Journal ArticleDOI

Feature analysis and neural network-based classification of speech under stress

John H. L. Hansen, +1 more

- 01 Jul 1996 -

IEEE Transactions on Speech and Audio Pr...

TL;DR: Several speech features are considered as potential stress-sensitive relayers using a previously established stressed speech database (SUSAS) and a neural network-based classifier is formulated based on an extended delta-bar-delta learning rule.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Dynamic programming algorithm optimization for spoken word recognition

H. Sakoe, +1 more

- 01 Feb 1978 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: This paper reports on an optimum dynamic progxamming (DP) based time-normalization algorithm for spoken word recognition, in which the warping function slope is restricted so as to improve discrimination between words in different categories.

...read moreread less

Journal ArticleDOI

Minimum prediction residual principle applied to speech recognition

F. Itakura

- 01 Feb 1975 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: A computer system is described in which isolated words, spoken by a designated talker, are recognized through calculation of a minimum prediction residual through optimally registering the reference LPC onto the input autocorrelation coefficients using the dynamic programming algorithm.

...read moreread less

Journal ArticleDOI

Distance measures for speech processing

A. Gray, +1 more

- 01 Oct 1976 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: The likelihood ratio, cepstral measure, and cosh measure are easily evaluated recursively from linear prediction filter coefficients, and each has a meaningful and interrelated frequency domain interpretation.

...read moreread less

Journal ArticleDOI

On the performance of the quefrency-weighted cepstral coefficients in vowel recognition

Kuldip K. Paliwal

- 01 Jan 1982 -

Speech Communication

TL;DR: The quefrency-weighted cepstral coefficients (also known as the root-power sums) are studied as to their effectiveness in a vowel recognition experiment and found to perform better than the cepStral coefficients with a Euclidean distance measure.

...read moreread less

Journal ArticleDOI

Spectral slope distance measures with linear prediction analysis for word recognition in noise

B. Hanson, +1 more

- 01 Jul 1987 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: Initial testing of spectral slope distance measures derived from linear prediction analysis models of speech for speaker-dependent isolated word recognition indicates that they give considerable performance improvement over the standard cepstral distance measure in several noise conditions.

...read moreread less