Speech Enhancement from Additive Noise and Channel Distortion - a Corpus-Based Approach

doi:10.21437/INTERSPEECH.2014-579

Proceedings ArticleDOI

Speech Enhancement from Additive Noise and Channel Distortion - a Corpus-Based Approach

- pp 2710-2714

TLDR

This paper presents a new approach to single-channel speech enhancement involving both noise and channel distortion (i.e., convolutional noise) based on finding longest matching segments (LMS) from a corpus of clean, wideband speech.

Abstract:

This paper presents a new approach to single-channel speech enhancement involving both noise and channel distortion (i.e., convolutional noise). The approach is based on finding longest matching segments (LMS) from a corpus of clean, wideband speech. The approach adds three novel developments to our previous LMS research. First, we address the problem of channel distortion as well as additive noise. Second, we present an improved method for modeling noise. Third, we present an iterative algorithm for improved speech estimates. In experiments using speech recognition as a test with the Aurora 4 database, the use of our enhancement approach as a preprocessor for feature extraction significantly improved the performance of a baseline recognition system. In another comparison against conventional enhancement algorithms, both the PESQ and the segmental SNR ratings of the LMS algorithm were superior to the other methods for noisy speech enhancement.

Speech Enhancement from Additive Noise and Channel Distortion - a Corpus-Based Approach

Citations

HMM-Based Speech Enhancement Using Sub-Word Models and Noise Adaptation

Reconstruction-based speech enhancement from robust acoustic features

Hidden Markov model-based speech enhancement

References

Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator

Speech enhancement using a minimum mean square error short-time spectral amplitude estimator

Noise power spectral density estimation based on optimal smoothing and minimum statistics

A statistical model-based voice activity detection

Noise Spectrum Estimation in Adverse Environments: Improved Minima Controlled

Related Papers (5)

Signal subspace speech enhancement with perceptual post-filtering

Event driven speech enhancement

A single channel speech enhancement technique using psychoacoustic principles

All-pole modeling of degraded speech

Two-stage data-driven single channel speech enhancement with cepstral analysis pre-processing