scispace - formally typeset
Proceedings ArticleDOI

Speaker recognition from coded speech and the effects of score normalization

TLDR
Both types of score normalization significantly improve performance, and can eliminate the performance loss that occurs when there is a mismatch between training and testing conditions.
Abstract
We investigate the effect of speech coding on automatic speaker recognition when training and testing conditions are matched and mismatched. Experiments used standard speech coding algorithms (GSM, G.729, G.723, MELP) and a speaker recognition system based on Gaussian mixture models adapted from a universal background model. There is little loss in recognition performance for toll quality speech coders and slightly more loss when lower quality speech coders are used. Speaker recognition from coded speech using handset-dependent score normalization and test score normalization are examined. Both types of score normalization significantly improve performance, and can eliminate the performance loss that occurs when there is a mismatch between training and testing conditions.

read more

Citations
More filters
Journal ArticleDOI

An overview of text-independent speaker recognition: From features to supervectors

TL;DR: This paper starts with the fundamentals of automatic speaker recognition, concerning feature extraction and speaker modeling and elaborate advanced computational techniques to address robustness and session variability.
Posted Content

Wavenet based low rate speech coding

TL;DR: This work describes how a WaveNet generative speech model can be used to generate high quality speech from the bit stream of a standard parametric coder operating at 2.4 kb/s and shows that the speech produced by the system is able to additionally perform implicit bandwidth extension and does not significantly impair recognition of the original speaker for the human listener.
Journal ArticleDOI

Speaker Verification from Short Utterance Perspective: A Review

TL;DR: This work is attempted to investigate speaker verification (SV), particularly focusing on short utterances for the purpose of person authentication, by bringing out a framework involving the directions discussed into a common platform for having a deployable system using short utterance-based SV.
Journal ArticleDOI

Compensating for the effects of site and equipment variation on delphinid species identification from their echolocation clicks

TL;DR: This question is examined in the context of identifying delphinid species by their echolocation clicks in the Southern California Bight in order to reduce the ambiguity between species classification performance and other confounding factors.
Proceedings ArticleDOI

Effect of speech coding on speaker identification

TL;DR: The effect of speech coding on text independent speaker identification (SI) is presented and it is observed that there is a significant reduction of performance in SI system due to coding, and effect is more prominent in case of SI system build with source features.
References
More filters
Journal ArticleDOI

A 2dvEv- bit distributed algorithm for the directed Euler trail problem

TL;DR: The algorithm can be used as a building block for solving other distributed graph problems, and can be slightly modified to run on a strongly-connected diagraph for generating the existent Euler trail or to report that no Euler trails exist.
Journal ArticleDOI

Speaker Verification Using Adapted Gaussian Mixture Models

TL;DR: The major elements of MIT Lincoln Laboratory's Gaussian mixture model (GMM)-based speaker verification system used successfully in several NIST Speaker Recognition Evaluations (SREs) are described.
Proceedings Article

Comparison of background normalization methods for text-independent speaker verification.

TL;DR: This paper compares two approaches to background model representation for a text-independent speaker verification task using Gaussian mixture models and describes how Bayesian adaptation can be used to derive claimant speaker models, providing a structure leading to significant computational savings during recognition.
Proceedings ArticleDOI

A 2.4 kbit/s MELP coder candidate for the new U.S. Federal Standard

TL;DR: The enhanced MELP speech coder is described, which is a candidate for the new U.S. Federal Standard at 2.4 kbits/s and has been optimized for performance in acoustic background noise and in channel errors, as well as for efficient real-time implementation.