Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression

doi:10.1109/TASL.2011.2174223

Journal ArticleDOI

Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression

Manish Narwaria, +4 more

- 01 May 2012 -

IEEE Transactions on Audio, Speech, and ...

- Vol. 20, Iss: 4, pp 1217-1232

Chats0

TLDR

This paper proposes a nonintrusive metric for the quality assessment of noise-suppressed speech and utilizes the sensitivity of FBEs to noise in order to obtain an effective representation of speech towards quality assessment.

Abstract:

Objective speech quality assessment is a challenging task which aims to emulate human judgment in the complex and time consuming task of subjective assessment. It is difficult to perform in line with the human perception due the complex and nonlinear nature of the human auditory system. The challenge lies in representing speech signals using appropriate features and subsequently mapping these features into a quality score. This paper proposes a nonintrusive metric for the quality assessment of noise-suppressed speech. The originality of the proposed approach lies primarily in the use of Mel filter bank energies (FBEs) as features and the use of support vector regression (SVR) for feature mapping. We utilize the sensitivity of FBEs to noise in order to obtain an effective representation of speech towards quality assessment. In addition, the use of SVR exploits the advantages of kernels which allow the regression algorithm to learn complex data patterns via nonlinear transformation for an effective and generalized mapping of features into the quality score. Extensive experiments conducted using two third party databases with different noise-suppressed speech signals show the effectiveness of the proposed approach.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Mulsemedia: State of the Art, Perspectives, and Challenges

Gheorghita Ghinea, +3 more

- 01 Oct 2014 -

ACM Transactions on Multimedia Computing...

TL;DR: A historic perspective on mulsemedia work is presented and current developments in the area are reviewed and standardization efforts, via the MPEG-V standard, are described.

...read moreread less

Proceedings ArticleDOI

Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model Based on BLSTM.

Szu-Wei Fu, +3 more

TL;DR: In this paper, an end-to-end, non-intrusive speech quality evaluation model, termed Quality-Net, based on bidirectional long short-term memory (LSTM) was proposed.

...read moreread less

Posted Content

Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM

Szu-Wei Fu, +3 more

- 16 Aug 2018 -

arXiv: Sound

TL;DR: In this article, an end-to-end, non-intrusive speech quality evaluation model, termed Quality-Net, based on bidirectional long short-term memory (LSTM) was proposed.

...read moreread less

Journal ArticleDOI

Long-Term Spectral Statistics for Voice Presentation Attack Detection

Hannah Muckenhirn, +3 more

- 01 Nov 2017 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: Investigations on ASVspoof 2015 challenge database and AVspoof database show that the proposed approach with a linear discriminative classifier yields a better system, irrespective of whether the spoofed signal is replayed to the microphone or is directly injected into the system software process.

...read moreread less

Proceedings ArticleDOI

Novel deep autoencoder features for non-intrusive speech quality assessment

Meet H. Soni, +1 more

TL;DR: Quantification of the experimental results suggests that proposed metric gives more accurate and correlated scores than an existing benchmark for objective, non-intrusive quality assessment metric ITU-T P.563 standard.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Low-Complexity, Nonintrusive Speech Quality Assessment

V. Grancharov, +3 more

- 01 Nov 2006 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: A low-complexity algorithm for monitoring the speech quality over a network that can be computed from commonly used speech-coding parameters without explicit distortion modeling is described.

...read moreread less

Journal ArticleDOI

ANIQUE: an auditory model for single-ended speech quality estimation

Doh-Suk Kim

- 15 Aug 2005 -

IEEE Transactions on Speech and Audio Pr...

TL;DR: The proposed auditory non-intrusive quality estimation (ANIQUE) model is based on the functional roles of human auditory systems and the characteristics of human articulation systems and demonstrates the effectiveness of the proposed model.

...read moreread less

Proceedings ArticleDOI

Detection of abrupt spectral changes using support vector machines an application to audio signal segmentation

Manuel Davy, +1 more

TL;DR: An hybrid time-frequency/support vector machine algorithm for the detection of abrupt spectral changes using a stationarity index derived from support vector novelty detection theory by using sub-images extracted from the time- frequencies as feature vectors is introduced.

...read moreread less

Journal ArticleDOI

Single-Ended Speech Quality Measurement Using Machine Learning Methods

Tiago H. Falk, +1 more

- 01 Nov 2006 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: A novel single-ended algorithm constructed from models of speech signals, including clean and degraded speech, and speech corrupted by multiplicative noise and temporal discontinuities, found to be more effective than P.563, the current "state-of-art" standard single- ended algorithm.

...read moreread less

Journal ArticleDOI

Speaker Verification Using Support Vector Machines and High-Level Features

William M. Campbell, +4 more

- 01 Sep 2007 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: A method of speaker modeling based upon support vector machines based upon linearizing a log likelihood ratio scoring system is described and generalizations of this method are shown to produce excellent results on a variety of high-level features.

...read moreread less