Speaker recognition from coded speech and the effects of score normalization

doi:10.1109/ACSSC.2001.987749

Proceedings ArticleDOI

- Vol. 2, pp 1562-1567

TLDR

Both types of score normalization significantly improve performance, and can eliminate the performance loss that occurs when there is a mismatch between training and testing conditions.

Abstract:

We investigate the effect of speech coding on automatic speaker recognition when training and testing conditions are matched and mismatched. Experiments used standard speech coding algorithms (GSM, G.729, G.723, MELP) and a speaker recognition system based on Gaussian mixture models adapted from a universal background model. There is little loss in recognition performance for toll quality speech coders and slightly more loss when lower quality speech coders are used. Speaker recognition from coded speech is examined using handset-dependent score normalization and test score normalization. Both types of score normalization significantly improve performance, and can eliminate the performance loss that occurs when there is a mismatch between training and testing conditions.
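The two score normalizations named in the abstract can be illustrated with a minimal sketch. It assumes the commonly used definitions: test normalization rescales a raw verification score by the mean and standard deviation of scores that the same test utterance obtains against a cohort of impostor models, and handset-dependent normalization rescales it by impostor-score statistics estimated in advance for each handset type. The function names, the `"carbon"` label, and all numeric values are hypothetical and are not taken from the paper.

```python
import statistics


def t_norm(raw_score, cohort_scores):
    """Test normalization (sketch): cohort_scores are the scores of the
    SAME test utterance against a set of impostor (cohort) models."""
    mu = statistics.mean(cohort_scores)
    sigma = statistics.pstdev(cohort_scores)
    if sigma == 0:
        raise ValueError("cohort scores have zero spread")
    return (raw_score - mu) / sigma


def h_norm(raw_score, handset_label, impostor_stats):
    """Handset-dependent normalization (sketch): impostor_stats maps a
    handset label to the (mean, std) of impostor scores for the target
    model, estimated offline for that handset type."""
    mu, sigma = impostor_stats[handset_label]
    return (raw_score - mu) / sigma


# Hypothetical values, chosen only to make the arithmetic easy to follow.
t_score = t_norm(3.0, [0.0, 2.0])
h_score = h_norm(1.0, "carbon", {"carbon": (0.5, 0.25)})
```

In both cases the normalized score is compared against a single global threshold, which is what lets one threshold be used across models or channel conditions.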

Citations

Journal ArticleDOI

An overview of text-independent speaker recognition: From features to supervectors

Tomi Kinnunen, +1 more

- 01 Jan 2010 -

Speech Communication

TL;DR: This paper starts with the fundamentals of automatic speaker recognition, covering feature extraction and speaker modeling, and then elaborates on advanced computational techniques that address robustness and session variability.

Posted Content

Wavenet based low rate speech coding

W. Bastiaan Kleijn, +6 more

- 01 Dec 2017 -

arXiv: Audio and Speech Processing

TL;DR: This work describes how a WaveNet generative speech model can be used to generate high quality speech from the bit stream of a standard parametric coder operating at 2.4 kb/s and shows that the speech produced by the system is able to additionally perform implicit bandwidth extension and does not significantly impair recognition of the original speaker for the human listener.

Journal ArticleDOI

Speaker Verification from Short Utterance Perspective: A Review

Rohan Kumar Das, +1 more

- 02 Nov 2018 -

IETE Technical Review

TL;DR: This work investigates speaker verification (SV), focusing on short utterances for person authentication, and brings the directions discussed together into a common framework for a deployable system using short-utterance-based SV.

Journal ArticleDOI

Compensating for the effects of site and equipment variation on delphinid species identification from their echolocation clicks

Marie A. Roch, +3 more

- 23 Jan 2015 -

Journal of the Acoustical Society of Ame...

TL;DR: This question is examined in the context of identifying delphinid species by their echolocation clicks in the Southern California Bight in order to reduce the ambiguity between species classification performance and other confounding factors.

Proceedings ArticleDOI

Effect of speech coding on speaker identification

Anil Kumar Vuppala, +2 more

TL;DR: The effect of speech coding on text-independent speaker identification (SI) is presented. Coding significantly reduces SI performance, and the effect is more prominent for SI systems built with source features.

References

Journal ArticleDOI

A 2dvEv- bit distributed algorithm for the directed Euler trail problem

Wen-Huei Chen, +1 more

- 09 Aug 1993 -

Information Processing Letters

TL;DR: The algorithm can be used as a building block for solving other distributed graph problems, and can be slightly modified to run on a strongly connected digraph to generate an existing Euler trail or to report that no Euler trail exists.

Journal ArticleDOI

Speaker Verification Using Adapted Gaussian Mixture Models

Douglas A. Reynolds, +2 more

- 01 Jan 2000 -

Digital Signal Processing

TL;DR: The major elements of MIT Lincoln Laboratory's Gaussian mixture model (GMM)-based speaker verification system used successfully in several NIST Speaker Recognition Evaluations (SREs) are described.

Linguistic Data Consortium

Treebank Penn

Proceedings Article

Comparison of background normalization methods for text-independent speaker verification.

Douglas A. Reynolds

TL;DR: This paper compares two approaches to background model representation for a text-independent speaker verification task using Gaussian mixture models and describes how Bayesian adaptation can be used to derive claimant speaker models, providing a structure leading to significant computational savings during recognition.

Proceedings ArticleDOI

A 2.4 kbit/s MELP coder candidate for the new U.S. Federal Standard

Alan V. McCree, +4 more

TL;DR: The enhanced MELP speech coder is described, which is a candidate for the new U.S. Federal Standard at 2.4 kbits/s and has been optimized for performance in acoustic background noise and in channel errors, as well as for efficient real-time implementation.
