Using Discrete Probabilities With Bhattacharyya Measure for SVM-Based Speaker Verification

doi:10.1109/TASL.2010.2064308

Open AccessJournal ArticleDOI

Using Discrete Probabilities With Bhattacharyya Measure for SVM-Based Speaker Verification

Kong Aik Lee, +4 more

- 01 May 2011 -

IEEE Transactions on Audio, Speech, and ...

- Vol. 19, Iss: 4, pp 861-870

Chats0

TLDR

Experiments conducted on the NIST 2006 speaker verification task indicate that the Bhattacharyya measure outperforms the Fisher kernel, term frequency log-likelihood ratio (TFLLR) scaling, and rank normalization reported earlier in literature.

Abstract:

Support vector machines (SVMs), and kernel classifiers in general, rely on the kernel functions to measure the pairwise similarity between inputs. This paper advocates the use of discrete representation of speech signals in terms of the probabilities of discrete events as feature for speaker verification and proposes the use of Bhattacharyya coefficient as the similarity measure for this type of inputs to SVM. We analyze the effectiveness of the Bhattacharyya measure from the perspective of feature normalization and distribution warping in the SVM feature space. Experiments conducted on the NIST 2006 speaker verification task indicate that the Bhattacharyya measure outperforms the Fisher kernel, term frequency log-likelihood ratio (TFLLR) scaling, and rank normalization reported earlier in literature. Moreover, the Bhattacharyya measure is computed using a data-independent square-root operation instead of data-driven normalization, which simplifies the implementation. The effectiveness of the Bhattacharyya measure becomes more apparent when channel compensation is applied at the model and score levels. The performance of the proposed method is close to that of the popular GMM supervector with a small margin.

Using Discrete Probabilities With Bhattacharyya Measure for SVM-Based Speaker Verification

Citations

Spoken Language Recognition: From Fundamentals to Practice

Optimization Algorithms and Applications for Speech and Language Processing

Generalizing I-Vector Estimation for Rapid Speaker Recognition

Lung sound classification using local binary pattern

Influence of Individual Differences in fMRI-Based Pain Prediction Models on Between-Individual Prediction Performance.

References

LIBSVM: A library for support vector machines

Pattern Classification

Speaker Verification Using Adapted Gaussian Mixture Models

RASTA processing of speech

The Divergence and Bhattacharyya Distance Measures in Signal Selection

Related Papers (5)

An SVM Kernel With GMM-Supervector Based on the Bhattacharyya Distance for Speaker Recognition

GMM-SVM Kernel With a Bhattacharyya-Based Distance for Speaker Recognition

A GMM supervector Kernel with the Bhattacharyya distance for SVM based speaker recognition

Support vector machines using GMM supervectors for speaker verification

Front-End Factor Analysis for Speaker Verification