Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator

doi:10.1109/TASSP.1985.1164550

Journal ArticleDOI

Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator

Yariv Ephraim, +1 more

- 01 Dec 1984 -

IEEE Transactions on Acoustics, Speech, ...

- Vol. 33, Iss: 2, pp 443-445

TLDR

In this article, a system which utilizes a minimum mean square error (MMSE) estimator is proposed and then compared with other widely used systems which are based on Wiener filtering and the "spectral subtraction" algorithm.

Abstract:

This paper focuses on the class of speech enhancement systems which capitalize on the major importance of the short-time spectral amplitude (STSA) of the speech signal in its perception. A system which utilizes a minimum mean-square error (MMSE) STSA estimator is proposed and then compared with other widely used systems which are based on Wiener filtering and the "spectral subtraction" algorithm. In this paper we derive the MMSE STSA estimator, based on modeling speech and noise spectral components as statistically independent Gaussian random variables. We analyze the performance of the proposed STSA estimator and compare it with a STSA estimator derived from the Wiener estimator. We also examine the MMSE STSA estimator under uncertainty of signal presence in the noisy observations. In constructing the enhanced signal, the MMSE STSA estimator is combined with the complex exponential of the noisy phase. It is shown here that the latter is the MMSE estimator of the complex exponential of the original phase, which does not affect the STSA estimation. The proposed approach results in a significant reduction of the noise, and provides enhanced speech with colorless residual noise. The complexity of the proposed algorithm is approximately that of other systems in the discussed class.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech

Cees H. Taal, +3 more

- 01 Sep 2011 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: A short-time objective intelligibility measure (STOI) is presented, which shows high correlation with the intelligibility of noisy and time-frequency weighted noisy speech (e.g., resulting from noise reduction) of three different listening experiments and showed better correlation with speech intelligibility compared to five other reference objective intelligible models.

...read moreread less

Journal ArticleDOI

Noise power spectral density estimation based on optimal smoothing and minimum statistics

Rainer Martin

- 01 Jul 2001 -

IEEE Transactions on Speech and Audio Pr...

TL;DR: An unbiased noise estimator is developed which derives the optimal smoothing parameter for recursive smoothing of the power spectral density of the noisy speech signal by minimizing a conditional mean square estimation error criterion in each time step.

...read moreread less

Journal ArticleDOI

A statistical model-based voice activity detection

Jongseo Sohn, +2 more

- 01 Jan 1999 -

IEEE Signal Processing Letters

TL;DR: An effective hang-over scheme which considers the previous observations by a first-order Markov process modeling of speech occurrences is proposed which shows significantly better performances than the G.729B VAD in low signal-to-noise ratio (SNR) and vehicular noise environments.

...read moreread less

Journal ArticleDOI

A regression approach to speech enhancement based on deep neural networks

Yong Xu, +3 more

- 01 Jan 2015 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: The proposed DNN approach can well suppress highly nonstationary noise, which is tough to handle in general, and is effective in dealing with noisy speech data recorded in real-world scenarios without the generation of the annoying musical artifact commonly observed in conventional enhancement methods.

...read moreread less

Journal ArticleDOI

Supervised Speech Separation Based on Deep Learning: An Overview

DeLiang Wang, +1 more

- 01 Oct 2018 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: A comprehensive overview of deep learning-based supervised speech separation can be found in this paper, where three main components of supervised separation are discussed: learning machines, training targets, and acoustic features.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

Table of Integrals, Series, and Products

I.S. Gradshteyn, +5 more

TL;DR: Combinations involving trigonometric and hyperbolic functions and power 5 Indefinite Integrals of Special Functions 6 Definite Integral Integral Functions 7.Associated Legendre Functions 8 Special Functions 9 Hypergeometric Functions 10 Vector Field Theory 11 Algebraic Inequalities 12 Integral Inequality 13 Matrices and related results 14 Determinants 15 Norms 16 Ordinary differential equations 17 Fourier, Laplace, and Mellin Transforms 18 The z-transform

...read moreread less

Table of integrals, series and products

I.S. Gradshteyn, +1 more

Journal ArticleDOI

Suppression of acoustic noise in speech using spectral subtraction

S. Boll

- 01 Apr 1979 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: A stand-alone noise suppression algorithm that resynthesizes a speech waveform and can be used as a pre-processor to narrow-band voice communications systems, speech recognition systems, or speaker authentication systems.

...read moreread less

Proceedings ArticleDOI

Enhancement of speech corrupted by acoustic noise

M. Berouti, +2 more

TL;DR: This paper describes a method for enhancing speech corrupted by broadband noise based on the spectral noise subtraction method, which can automatically adapt to a wide range of signal-to-noise ratios, as long as a reasonable estimate of the noise spectrum can be obtained.

...read moreread less

Book ChapterDOI

An Introduction to Statistical Communication Theory

David Middleton

- 01 Dec 1961 -

Biometrika

TL;DR: This IEEE Classic Reissue provides at an advanced level, a uniquely fundamental exposition of the applications of Statistical Communication Theory to a vast spectrum of important physical problems.

...read moreread less

Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator

Citations

An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech

Noise power spectral density estimation based on optimal smoothing and minimum statistics

A statistical model-based voice activity detection

A regression approach to speech enhancement based on deep neural networks

Supervised Speech Separation Based on Deep Learning: An Overview

References

Table of Integrals, Series, and Products

Table of integrals, series and products

Suppression of acoustic noise in speech using spectral subtraction

Enhancement of speech corrupted by acoustic noise

An Introduction to Statistical Communication Theory

Related Papers (5)

Suppression of acoustic noise in speech using spectral subtraction

Speech Enhancement: Theory and Practice

Noise power spectral density estimation based on optimal smoothing and minimum statistics

Enhancement of speech corrupted by acoustic noise

Evaluation of Objective Quality Measures for Speech Enhancement