Proceedings ArticleDOI
Speaker recognition from coded speech and the effects of score normalization
Robert B. Dunn,Thomas F. Quatieri,D.A. Reynolds,Joseph P. Campbell +3 more
- Vol. 2, pp 1562-1567
TLDR
Both types of score normalization significantly improve performance, and can eliminate the performance loss that occurs when there is a mismatch between training and testing conditions.Abstract:
We investigate the effect of speech coding on automatic speaker recognition when training and testing conditions are matched and mismatched. Experiments used standard speech coding algorithms (GSM, G.729, G.723, MELP) and a speaker recognition system based on Gaussian mixture models adapted from a universal background model. There is little loss in recognition performance for toll quality speech coders and slightly more loss when lower quality speech coders are used. Speaker recognition from coded speech using handset-dependent score normalization and test score normalization are examined. Both types of score normalization significantly improve performance, and can eliminate the performance loss that occurs when there is a mismatch between training and testing conditions.read more
Citations
More filters
Journal ArticleDOI
An overview of text-independent speaker recognition: From features to supervectors
Tomi Kinnunen,Haizhou Li +1 more
TL;DR: This paper starts with the fundamentals of automatic speaker recognition, concerning feature extraction and speaker modeling and elaborate advanced computational techniques to address robustness and session variability.
Posted Content
Wavenet based low rate speech coding
W. Bastiaan Kleijn,Felicia S. C. Lim,Alejandro Luebs,Jan Skoglund,Florian Stimberg,Quan Wang,Thomas C. Walters +6 more
TL;DR: This work describes how a WaveNet generative speech model can be used to generate high quality speech from the bit stream of a standard parametric coder operating at 2.4 kb/s and shows that the speech produced by the system is able to additionally perform implicit bandwidth extension and does not significantly impair recognition of the original speaker for the human listener.
Journal ArticleDOI
Speaker Verification from Short Utterance Perspective: A Review
TL;DR: This work is attempted to investigate speaker verification (SV), particularly focusing on short utterances for the purpose of person authentication, by bringing out a framework involving the directions discussed into a common platform for having a deployable system using short utterance-based SV.
Journal ArticleDOI
Compensating for the effects of site and equipment variation on delphinid species identification from their echolocation clicks
TL;DR: This question is examined in the context of identifying delphinid species by their echolocation clicks in the Southern California Bight in order to reduce the ambiguity between species classification performance and other confounding factors.
Proceedings ArticleDOI
Effect of speech coding on speaker identification
TL;DR: The effect of speech coding on text independent speaker identification (SI) is presented and it is observed that there is a significant reduction of performance in SI system due to coding, and effect is more prominent in case of SI system build with source features.
References
More filters
Journal ArticleDOI
A 2dvEv- bit distributed algorithm for the directed Euler trail problem
Wen-Huei Chen,Chuan Yi Tang +1 more
TL;DR: The algorithm can be used as a building block for solving other distributed graph problems, and can be slightly modified to run on a strongly-connected diagraph for generating the existent Euler trail or to report that no Euler trails exist.
Journal ArticleDOI
Speaker Verification Using Adapted Gaussian Mixture Models
TL;DR: The major elements of MIT Lincoln Laboratory's Gaussian mixture model (GMM)-based speaker verification system used successfully in several NIST Speaker Recognition Evaluations (SREs) are described.
Proceedings Article
Comparison of background normalization methods for text-independent speaker verification.
TL;DR: This paper compares two approaches to background model representation for a text-independent speaker verification task using Gaussian mixture models and describes how Bayesian adaptation can be used to derive claimant speaker models, providing a structure leading to significant computational savings during recognition.
Proceedings ArticleDOI
A 2.4 kbit/s MELP coder candidate for the new U.S. Federal Standard
TL;DR: The enhanced MELP speech coder is described, which is a candidate for the new U.S. Federal Standard at 2.4 kbits/s and has been optimized for performance in acoustic background noise and in channel errors, as well as for efficient real-time implementation.