Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression

doi:10.1109/TASL.2011.2174223

Journal ArticleDOI

Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression

Manish Narwaria, +4 more

- 01 May 2012 -

IEEE Transactions on Audio, Speech, and ...

- Vol. 20, Iss: 4, pp 1217-1232

Chats0

TLDR

This paper proposes a nonintrusive metric for the quality assessment of noise-suppressed speech and utilizes the sensitivity of FBEs to noise in order to obtain an effective representation of speech towards quality assessment.

Abstract:

Objective speech quality assessment is a challenging task which aims to emulate human judgment in the complex and time consuming task of subjective assessment. It is difficult to perform in line with the human perception due the complex and nonlinear nature of the human auditory system. The challenge lies in representing speech signals using appropriate features and subsequently mapping these features into a quality score. This paper proposes a nonintrusive metric for the quality assessment of noise-suppressed speech. The originality of the proposed approach lies primarily in the use of Mel filter bank energies (FBEs) as features and the use of support vector regression (SVR) for feature mapping. We utilize the sensitivity of FBEs to noise in order to obtain an effective representation of speech towards quality assessment. In addition, the use of SVR exploits the advantages of kernels which allow the regression algorithm to learn complex data patterns via nonlinear transformation for an effective and generalized mapping of features into the quality score. Extensive experiments conducted using two third party databases with different noise-suppressed speech signals show the effectiveness of the proposed approach.

Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression

Citations

Mulsemedia: State of the Art, Perspectives, and Challenges

Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model Based on BLSTM.

Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM

Long-Term Spectral Statistics for Voice Presentation Attack Detection

Novel deep autoencoder features for non-intrusive speech quality assessment

References

Robust Acoustic Speech Feature Prediction From Noisy Mel-Frequency Cepstral Coefficients

On the use of channel-attentive MFCC for robust recognition of partially corrupted speech

Two-scale Auditory Feature Based Non-intrusive Speech Quality Evaluation

Visually-Derived Wiener Filters for Speech Enhancement

Noise-Robust Speech Recognition Using Top-Down Selective Attention With an HMM Classifier

Related Papers (5)

Low-Complexity, Nonintrusive Speech Quality Assessment

Single-Ended Speech Quality Measurement Using Machine Learning Methods

P.563—The ITU-T Standard for Single-Ended Speech Quality Assessment

Subjective comparison and evaluation of speech enhancement algorithms

Non-intrusive GMM-based speech quality measurement

Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression

Citations

Mulsemedia: State of the Art, Perspectives, and Challenges

Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model Based on BLSTM.

Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM

Long-Term Spectral Statistics for Voice Presentation Attack Detection

Novel deep autoencoder features for non-intrusive speech quality assessment

References

Robust Acoustic Speech Feature Prediction From Noisy Mel-Frequency Cepstral Coefficients

On the use of channel-attentive MFCC for robust recognition of partially corrupted speech

Two-scale Auditory Feature Based Non-intrusive Speech Quality Evaluation

Visually-Derived Wiener Filters for Speech Enhancement

Noise-Robust Speech Recognition Using Top-Down Selective Attention With an HMM Classifier

Related Papers (5)

Low-Complexity, Nonintrusive Speech Quality Assessment

Single-Ended Speech Quality Measurement Using Machine Learning Methods

P.563&#8212;The ITU-T Standard for Single-Ended Speech Quality Assessment

Subjective comparison and evaluation of speech enhancement algorithms

Non-intrusive GMM-based speech quality measurement

P.563—The ITU-T Standard for Single-Ended Speech Quality Assessment