scispace - formally typeset
Open AccessJournal ArticleDOI

Using Discrete Probabilities With Bhattacharyya Measure for SVM-Based Speaker Verification

Reads0
Chats0
TLDR
Experiments conducted on the NIST 2006 speaker verification task indicate that the Bhattacharyya measure outperforms the Fisher kernel, term frequency log-likelihood ratio (TFLLR) scaling, and rank normalization reported earlier in literature.
Abstract
Support vector machines (SVMs), and kernel classifiers in general, rely on the kernel functions to measure the pairwise similarity between inputs. This paper advocates the use of discrete representation of speech signals in terms of the probabilities of discrete events as feature for speaker verification and proposes the use of Bhattacharyya coefficient as the similarity measure for this type of inputs to SVM. We analyze the effectiveness of the Bhattacharyya measure from the perspective of feature normalization and distribution warping in the SVM feature space. Experiments conducted on the NIST 2006 speaker verification task indicate that the Bhattacharyya measure outperforms the Fisher kernel, term frequency log-likelihood ratio (TFLLR) scaling, and rank normalization reported earlier in literature. Moreover, the Bhattacharyya measure is computed using a data-independent square-root operation instead of data-driven normalization, which simplifies the implementation. The effectiveness of the Bhattacharyya measure becomes more apparent when channel compensation is applied at the model and score levels. The performance of the proposed method is close to that of the popular GMM supervector with a small margin.

read more

Citations
More filters
Journal ArticleDOI

Spoken Language Recognition: From Fundamentals to Practice

TL;DR: This paper attempts to provide an introductory tutorial on the fundamentals of the theory and the state-of-the-art solutions of spoken language recognition, from both phonological and computational aspects.
Journal ArticleDOI

Optimization Algorithms and Applications for Speech and Language Processing

TL;DR: A range of problems in which optimization formulations and algorithms play a role are outlined, giving some additional details on certain application problems in machine translation, speaker/language recognition, and automatic speech recognition.
Journal ArticleDOI

Generalizing I-Vector Estimation for Rapid Speaker Recognition

TL;DR: This paper proposes a novel approach to i-vector extraction without the need to evaluate the full posterior covariance thereby speeding up the run-time extraction process and introduces the so-called subspace-orthogonalizing prior (SOP) to replace the standard Gaussian prior in i- vector formulation.
Posted Content

Lung sound classification using local binary pattern

TL;DR: Results show that the proposed features with SVM and also with kNN classifier outperform commonly used wavelet-based features as well as the authors' previously investigated mel-frequency cepstral coefficients (MFCCs) based statistical features, specifically in abnormal sound detection.
Journal ArticleDOI

Influence of Individual Differences in fMRI-Based Pain Prediction Models on Between-Individual Prediction Performance.

TL;DR: This study elucidates the influence of interindividual variability in pain on the between-individual prediction performance, and the results will be useful for the design of more accurate and robust fMRI-based pain prediction models.
References
More filters
Journal ArticleDOI

LIBSVM: A library for support vector machines

TL;DR: Issues such as solving SVM optimization problems theoretical convergence multiclass classification probability estimates and parameter selection are discussed in detail.
Journal ArticleDOI

Speaker Verification Using Adapted Gaussian Mixture Models

TL;DR: The major elements of MIT Lincoln Laboratory's Gaussian mixture model (GMM)-based speaker verification system used successfully in several NIST Speaker Recognition Evaluations (SREs) are described.
Journal ArticleDOI

RASTA processing of speech

TL;DR: The theoretical and experimental foundations of the RASTA method are reviewed, the relationship with human auditory perception is discussed, the original method is extended to combinations of additive noise and convolutional noise, and an application is shown to speech enhancement.
Journal ArticleDOI

The Divergence and Bhattacharyya Distance Measures in Signal Selection

TL;DR: This partly tutorial paper compares the properties of an often used measure, the divergence, with a new measure that is often easier to evaluate, called the Bhattacharyya distance, which gives results that are at least as good and often better than those given by the divergence.
Related Papers (5)