scispace - formally typeset
Book ChapterDOI

Non-intrusive speech quality assessment with support vector regression

TLDR
Experimental results indicate that the proposed approach outperforms the standard P.563 algorithm for non-intrusive assessment of speech quality with a total of 1792 speech files and the associated subjective scores.
Abstract
We propose a new non-intrusive speech quality assessment algorithm based on Support Vector Regression (SVR) and Mel Frequency Cepstral Coefficients (MFCCs). The basic idea is to map the MFCCs into the desired quality score using SVR. The sensitivity of the MFCCs to external noise is exploited to gauge the changes in the speech signal to evaluate its perceptual quality. The use of SVR exploits the advantages of machine learning with the ability to learn complex data patterns for an effective and generalized mapping of features into a perceptual score, in contrast with the oft-utilized feature pooling process in the existing speech quality estimators. Experimental results indicate that the proposed approach outperforms the standard P.563 algorithm for non-intrusive assessment of speech quality with a total of 1792 speech files and the associated subjective scores.

read more

Citations
More filters
Proceedings ArticleDOI

Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model Based on BLSTM.

TL;DR: In this paper, an end-to-end, non-intrusive speech quality evaluation model, termed Quality-Net, based on bidirectional long short-term memory (LSTM) was proposed.
Posted Content

Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM

TL;DR: In this article, an end-to-end, non-intrusive speech quality evaluation model, termed Quality-Net, based on bidirectional long short-term memory (LSTM) was proposed.
Proceedings ArticleDOI

Novel deep autoencoder features for non-intrusive speech quality assessment

TL;DR: Quantification of the experimental results suggests that proposed metric gives more accurate and correlated scores than an existing benchmark for objective, non-intrusive quality assessment metric ITU-T P.563 standard.
Journal ArticleDOI

Scalable image quality assessment with 2D mel-cepstrum and machine learning approach

TL;DR: This paper investigates image features based on two-dimensional mel-cepstrum for the purpose of IQA and proposes a new metric by formulating IQA as a pattern recognition problem, which helps to overcome the limitations of the existing pooling methods.
Journal ArticleDOI

Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression

TL;DR: This paper proposes a nonintrusive metric for the quality assessment of noise-suppressed speech and utilizes the sensitivity of FBEs to noise in order to obtain an effective representation of speech towards quality assessment.
References
More filters
Proceedings Article

Learning with Kernels

Journal ArticleDOI

Evaluation of Objective Quality Measures for Speech Enhancement

TL;DR: The evaluation of correlations of several objective measures with these three subjective rating scales is reported on and several new composite objective measures are also proposed by combining the individual objective measures using nonparametric and parametric regression analysis techniques.
Book

Information Retrieval for Music and Motion

TL;DR: Analysis and Retrieval Techniques for Music Data, SyncPlayer: An Advanced Audio Player, and Relational Features and Adaptive Segmentation.
Patent

Speech audio process

TL;DR: This speech processes engine adopts the Kalman filtering with the glottis information of specific first speaker to purify audio speech signal, thus realizes more effective automatic speech recognition.
Related Papers (5)