HMM-based strategies for enhancement of speech signals embedded in nonstationary noise

doi:10.1109/89.709670

Journal ArticleDOI

HMM-based strategies for enhancement of speech signals embedded in nonstationary noise

Hossein Sameti, +3 more

- 01 Jan 1998 -

IEEE Transactions on Speech and Audio Pr...

- Vol. 6, Iss: 5, pp 445-455

TLDR

Both objective (global SNR) and subjective mean opinion score (MOS) evaluations demonstrate consistent superiority of the HMM-based enhancement systems that incorporate the innovations described in this paper over the conventional spectral subtraction method.

Abstract:

An improved hidden Markov model-based (HMM-based) speech enhancement system designed using the minimum mean square error principle is implemented and compared with a conventional spectral subtraction system. The improvements to the system are: (1) incorporation of mixture components in the HMM for noise in order to handle noise nonstationarity in a more flexible manner, (2) two efficient methods in the speech enhancement system design that make the system real-time implementable, and (3) an adaptation method to the noise type in order to accommodate a wide variety of noise expected under the enhancement system's operating environment. The results of the experiments designed to evaluate the performance of the HMM-based speech enhancement systems in comparison with spectral subtraction are reported. Three types of noise-white noise, simulated helicopter noise, and multitalker (cocktail party) noise-were used to corrupt the test speech signals. Both objective (global SNR) and subjective mean opinion score (MOS) evaluations demonstrate consistent superiority of the HMM-based enhancement systems that incorporate the innovations described in this paper over the conventional spectral subtraction method.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

New insights into the noise reduction Wiener filter

Jingdong Chen, +3 more

- 01 Jul 2006 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: This paper studies the quantitative performance behavior of the Wiener filter in the context of noise reduction and shows that in the single-channel case the a posteriori signal-to-noise ratio (SNR) is greater than or equal to the a priori SNR (defined before theWiener filter), indicating that the Wieners filter is always able to achieve noise reduction.

...read moreread less

Monaural speech segregation based on pitch tracking and

Biophysics Program, +1 more

TL;DR: In this paper, the authors propose a system for speech segmentation that deals with low-frequency and high-frequency signals differently, based on temporal continuity and cross-channel correlation, and groups segments according to periodicity.

...read moreread less

Journal ArticleDOI

Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization

Nasser Mohammadiha, +2 more

- 01 Oct 2013 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: This paper proposes a novel speech enhancement method that is based on a Bayesian formulation of NMF (BNMF), and compares the performance of the developed algorithms with state-of-the-art speech enhancement schemes using various objective measures.

...read moreread less

Journal ArticleDOI

Monaural speech segregation based on pitch tracking and amplitude modulation

Guoning Hu, +1 more

- 01 Sep 2004 -

IEEE Transactions on Neural Networks

TL;DR: This work proposes a novel system for voiced speech segregation that segregates resolved and unresolved harmonics differently, and it yields substantially better performance, especially for the high-frequency part of speech.

...read moreread less

Proceedings Article

Large-vocabulary speech recognition under adverse acoustic environments.

Li Deng, +3 more

TL;DR: Three key innovations are developed and evaluated in this work: a new model learning paradigm that comprises a noise-insertion process followed by noise reduction, a noise adaptive training algorithm that integrates noise reduction into probabilistic multi-style system training, and a new algorithm for noise reduction that makes no assumptions about noise stationarity.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

A tutorial on hidden Markov models and selected applications in speech recognition

Lawrence R. Rabiner

TL;DR: In this paper, the authors provide an overview of the basic theory of hidden Markov models (HMMs) as originated by L.E. Baum and T. Petrie (1966) and give practical details on methods of implementation of the theory along with a description of selected applications of HMMs to distinct problems in speech recognition.

...read moreread less

Book

Vector Quantization and Signal Compression

Allen Gersho, +1 more

TL;DR: The author explains the design and implementation of the Levinson-Durbin Algorithm, which automates the very labor-intensive and therefore time-heavy and expensive process of designing and implementing a Quantizer.

...read moreread less

Journal ArticleDOI

Suppression of acoustic noise in speech using spectral subtraction

S. Boll

- 01 Apr 1979 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: A stand-alone noise suppression algorithm that resynthesizes a speech waveform and can be used as a pre-processor to narrow-band voice communications systems, speech recognition systems, or speaker authentication systems.

...read moreread less

Journal ArticleDOI

Adaptive noise cancelling: Principles and applications

Bernard Widrow, +8 more

TL;DR: It is shown that in treating periodic interference the adaptive noise canceller acts as a notch filter with narrow bandwidth, infinite null, and the capability of tracking the exact frequency of the interference; in this case the canceller behaves as a linear, time-invariant system, with the adaptive filter converging on a dynamic rather than a static solution.

...read moreread less

Journal ArticleDOI

On the convergence properties of the em algorithm

C. F. Jeff Wu

- 01 Mar 1983 -

Annals of Statistics

TL;DR: In this paper, the EM algorithm converges to a local maximum or a stationary value of the (incomplete-data) likelihood function under conditions that are applicable to many practical situations.

...read moreread less