A Data-Driven Approach to A Priori SNR Estimation

doi:10.1109/TASL.2010.2045799

Journal ArticleDOI

A Data-Driven Approach to A Priori SNR Estimation

Suhadi Suhadi, +1 more

- 01 Jan 2011 -

IEEE Transactions on Audio, Speech, and ...

- Vol. 19, Iss: 1, pp 186-195

TLDR

A data-driven approach to a priori SNR estimation is presented, which reduces speech distortion, particularly in speech onset, while retaining a high level of noise attenuation in speech absence.

Abstract:

The a priori signal-to-noise ratio (SNR) plays an important role in many speech enhancement algorithms. In this paper, we present a data-driven approach to a priori SNR estimation. It may be used with a wide range of speech enhancement techniques, such as, e.g., the minimum mean square error (MMSE) (log) spectral amplitude estimator, the super Gaussian joint maximum a posteriori (JMAP) estimator, or the Wiener filter. The proposed SNR estimator employs two trained artificial neural networks, one for speech presence, one for speech absence. The classical decision-directed a priori SNR estimator by Ephraim and Malah is broken down into its two additive components, which now represent the two input signals to the neural networks. Both output nodes are combined to represent the new a priori SNR estimate. As an alternative to the neural networks, also simple lookup tables are investigated. Employment of these data-driven nonlinear a priori SNR estimators reduces speech distortion, particularly in speech onset, while retaining a high level of noise attenuation in speech absence.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Experimental Study on Extreme Learning Machine Applications for Speech Enhancement

Tassadaq Hussain, +5 more

- 26 Oct 2017 -

IEEE Access

TL;DR: The experimental results indicate that when the amount of training data is limited, both ELM- and H-ELM-based speech enhancement techniques consistently outperform the conventional BP-based shallow and deep learning algorithms, in terms of standardized objective evaluations, under various testing conditions.

...read moreread less

Proceedings ArticleDOI

An investigation of spectral restoration algorithms for deep neural networks based noise robust speech recognition.

Bo Li, +2 more

TL;DR: The preliminary experimental results on the Aurora 2 speech database show that with multi-condition training data the DNN itself is capable of learning robust representations, however, if only clean data is available, the MLSA algorithm is the best spectral restoration training method for DNNs.

...read moreread less

Journal ArticleDOI

Generalized maximum a posteriori spectral amplitude estimation for speech enhancement

Yu Tsao, +1 more

- 01 Feb 2016 -

Speech Communication

TL;DR: The proposed generalized maximum a posteriori spectral amplitude (GMAPA) algorithm dynamically specifies the weight of prior density of speech spectra according to the SNR of the testing speech signals to calculate the optimal gain function.

...read moreread less

Journal ArticleDOI

Optimization and evaluation of sigmoid function with a priori SNR estimate for real-time speech enhancement

Pei Chee Yong, +2 more

- 01 Feb 2013 -

Speech Communication

TL;DR: Simulation results show that the proposed gain function, which can flexibly model exponential distributions, is a potential alternative speech enhancement gain function.

...read moreread less

Journal ArticleDOI

Instantaneous A Priori SNR Estimation by Cepstral Excitation Manipulation

Samy Elshamy, +3 more

- 01 Aug 2017 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: A novel a priori SNR estimator based on synthesizing an idealized excitation signal in the cepstral domain is introduced, which is less prone to sudden acoustic environmental changes and musical noise and able to preserve weak harmonic structures resulting in a spectrum that is more full-bodied.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

Adaptive Filter Theory

Simon Haykin

TL;DR: In this paper, the authors propose a recursive least square adaptive filter (RLF) based on the Kalman filter, which is used as the unifying base for RLS Filters.

...read moreread less

Proceedings ArticleDOI

A direct adaptive method for faster backpropagation learning: the RPROP algorithm

Martin Riedmiller, +1 more

TL;DR: A learning algorithm for multilayer feedforward networks, RPROP (resilient propagation), is proposed that performs a local adaptation of the weight-updates according to the behavior of the error function to overcome the inherent disadvantages of pure gradient-descent.

...read moreread less

Journal ArticleDOI

Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator

Yariv Ephraim, +1 more

- 01 Dec 1984 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: In this article, a system which utilizes a minimum mean square error (MMSE) estimator is proposed and then compared with other widely used systems which are based on Wiener filtering and the "spectral subtraction" algorithm.

...read moreread less

Journal Article

Speech enhancement using a minimum mean square error short-time spectral amplitude estimator

Ephraim

- 01 Jan 1984 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: This paper derives a minimum mean-square error STSA estimator, based on modeling speech and noise spectral components as statistically independent Gaussian random variables, which results in a significant reduction of the noise, and provides enhanced speech with colorless residual noise.

...read moreread less

Book

Speech Enhancement: Theory and Practice

Philipos C. Loizou

TL;DR: Clear and concise, this book explores how human listeners compensate for acoustic noise in noisy environments and suggests steps that can be taken to realize the full potential of these algorithms under realistic conditions.

...read moreread less

A Data-Driven Approach to A Priori SNR Estimation

Citations

Experimental Study on Extreme Learning Machine Applications for Speech Enhancement

An investigation of spectral restoration algorithms for deep neural networks based noise robust speech recognition.

Generalized maximum a posteriori spectral amplitude estimation for speech enhancement

Optimization and evaluation of sigmoid function with a priori SNR estimate for real-time speech enhancement

Instantaneous A Priori SNR Estimation by Cepstral Excitation Manipulation

References

Adaptive Filter Theory

A direct adaptive method for faster backpropagation learning: the RPROP algorithm

Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator

Speech enhancement using a minimum mean square error short-time spectral amplitude estimator

Speech Enhancement: Theory and Practice

Related Papers (5)

Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator

Noise power spectral density estimation based on optimal smoothing and minimum statistics

Speech enhancement based on a priori signal to noise estimation

Speech Enhancement: Theory and Practice

Suppression of acoustic noise in speech using spectral subtraction