Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression

doi:10.1109/TASL.2011.2174223

Journal ArticleDOI

Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression

Manish Narwaria, +4 more

- 01 May 2012 -

IEEE Transactions on Audio, Speech, and ...

- Vol. 20, Iss: 4, pp 1217-1232

Chats0

TLDR

This paper proposes a nonintrusive metric for the quality assessment of noise-suppressed speech and utilizes the sensitivity of FBEs to noise in order to obtain an effective representation of speech towards quality assessment.

Abstract:

Objective speech quality assessment is a challenging task which aims to emulate human judgment in the complex and time consuming task of subjective assessment. It is difficult to perform in line with the human perception due the complex and nonlinear nature of the human auditory system. The challenge lies in representing speech signals using appropriate features and subsequently mapping these features into a quality score. This paper proposes a nonintrusive metric for the quality assessment of noise-suppressed speech. The originality of the proposed approach lies primarily in the use of Mel filter bank energies (FBEs) as features and the use of support vector regression (SVR) for feature mapping. We utilize the sensitivity of FBEs to noise in order to obtain an effective representation of speech towards quality assessment. In addition, the use of SVR exploits the advantages of kernels which allow the regression algorithm to learn complex data patterns via nonlinear transformation for an effective and generalized mapping of features into the quality score. Extensive experiments conducted using two third party databases with different noise-suppressed speech signals show the effectiveness of the proposed approach.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Reconstruction of Phonated Speech from Whispers Using Formant-Derived Plausible Pitch Modulation

Ian McLoughlin, +4 more

- 11 May 2015 -

ACM Transactions on Accessible Computing

TL;DR: Conversion of whispers into natural-sounding phonated speech as a noninvasive prosthetic aid for people with voice impairments who can only whisper is considered.

...read moreread less

Journal ArticleDOI

Non-intrusive speech quality assessment using multi-resolution auditory model features for degraded narrowband speech

Rajesh Kumar Dubey, +1 more

- 01 Dec 2015 -

Iet Signal Processing

TL;DR: It is found that the proposed method that uses a combination of MRAM features, MFCC, and LSF feature vectors for non-intrusive speech quality performs better than both the other algorithms.

...read moreread less

Proceedings ArticleDOI

Non-intrusive quality assessment for enhanced speech signals based on spectro-temporal features

Qiaohong Li, +3 more

TL;DR: Experimental results on NOIZEUS dataset demonstrate that proposed non-intrusive quality assessment metric by using spectro-temporal features can obtain better performance for enhanced speech signals.

...read moreread less

Proceedings ArticleDOI

Novel Subband Autoencoder Features for Non-Intrusive Quality Assessment of Noise Suppressed Speech.

Meet H. Soni, +1 more

TL;DR: A novel feature extraction architecture of Deep Neural Network (DNN), namely, subband autoencoder (SBAE) is proposed, inspired by the Human Auditory System (HAS) and extracts features from speech spectrum in an unsupervised manner.

...read moreread less

Journal ArticleDOI

A training-based speech regeneration approach with cascading mapping models

Hamid Sharifzadeh, +5 more

- 01 Aug 2017 -

Computers & Electrical Engineering

TL;DR: By considering the current limitations of speech reconstruction methods, a novel algorithm for converting whispers to normal speech is proposed and the efficiency of the algorithm is explored, and the algorithm relies upon cascading mapping models and makes use of artificially generated whispers to regenerate natural phonated speech from whispers.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

LIBSVM: A library for support vector machines

Chih-Chung Chang, +1 more

- 06 May 2011 -

ACM Transactions on Intelligent Systems ...

TL;DR: Issues such as solving SVM optimization problems theoretical convergence multiclass classification probability estimates and parameter selection are discussed in detail.

...read moreread less

Book

Fundamentals of speech recognition

Lawrence R. Rabiner, +1 more

TL;DR: This book presents a meta-modelling framework for speech recognition that automates the very labor-intensive and therefore time-heavy and therefore expensive and expensive process of manually modeling speech.

...read moreread less

Book

Kernel Methods for Pattern Analysis

John Shawe-Taylor, +1 more

TL;DR: This book provides an easy introduction for students and researchers to the growing field of kernel-based pattern analysis, demonstrating with examples how to handcraft an algorithm or a kernel for a new specific application, and covering all the necessary conceptual and mathematical tools to do so.

...read moreread less

Journal ArticleDOI

Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences

S. Davis, +1 more

- 01 Aug 1980 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: In this article, several parametric representations of the acoustic signal were compared with regard to word recognition performance in a syllable-oriented continuous speech recognition system, and the emphasis was on the ability to retain phonetically significant acoustic information in the face of syntactic and duration variations.

...read moreread less

Proceedings Article

The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions

David Pearce, +1 more

TL;DR: A database designed to evaluate the performance of speech recognition algorithms in noisy conditions and recognition results are presented for the first standard DSR feature extraction scheme that is based on a cepstral analysis.

...read moreread less