A model-based transformational approach to robust speaker recognition.

Open AccessProceedings Article

A model-based transformational approach to robust speaker recognition.

Remco Teunen, +2 more

- pp 495-498

Chats0

TLDR

This work proposes a novel statistical modeling and compensation method for robust speaker recognition that yields similar improvements as the HNORM score-based compensation method, but with a fraction of the training time.

Abstract:

A novel statistical modeling and compensation method for robust speaker recognition is presented. The method specifically addresses the degradation in speaker verification performance due to the mismatch in channels (e.g., telephone handsets) between enrollment and testing sessions. In mismatched conditions, the new approach uses speaker-independent channel transformations to synthesize a speaker model that corresponds to the channel of the testing session. Effectively verification is always performed in matched channel conditions. Results on the 1998 NIST Speaker Recognition Evaluation corpus show that the new approach yields performance that matches the best reported results. Specifically, our approach yields similar improvements (19.9% reduction in EER compared to CMN alone) as the HNORM score-based compensation method, but with a fraction of the training time.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

An overview of text-independent speaker recognition: From features to supervectors

Tomi Kinnunen, +1 more

- 01 Jan 2010 -

Speech Communication

TL;DR: This paper starts with the fundamentals of automatic speaker recognition, concerning feature extraction and speaker modeling and elaborate advanced computational techniques to address robustness and session variability.

...read moreread less

Journal ArticleDOI

Joint Factor Analysis Versus Eigenchannels in Speaker Recognition

Patrick Kenny, +3 more

- 01 May 2007 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: It is shown how the two approaches to the problem of session variability in Gaussian mixture model (GMM)-based speaker verification, eigenchannels, and joint factor analysis can be implemented using essentially the same software at all stages except for the enrollment of target speakers.

...read moreread less

Proceedings ArticleDOI

An overview of automatic speaker recognition technology

Douglas A. Reynolds

TL;DR: Some of the strengths and weaknesses of current speaker recognition technologies are discussed, and some potential future trends in research, development and applications are outlined.

...read moreread less

Journal ArticleDOI

Robust Speaker Recognition in Noisy Conditions

Ji Ming, +3 more

- 01 Jul 2007 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: This paper describes a method that combines multicondition model training and missing-feature theory to model noise with unknown temporal-spectral characteristics, and is found to achieve lower error rates.

...read moreread less

Journal ArticleDOI

An Overview of Speaker Identification: Accuracy and Robustness Issues

Roberto Togneri, +1 more

- 09 Jun 2011 -

IEEE Circuits and Systems Magazine

TL;DR: The main paradigms for speaker identification, and recent work on missing data methods to increase robustness are presented, and combined approaches involving bottom-up estimation and top-down processing are reviewed.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Speaker identification and verification using Gaussian mixture speaker models

Douglas A. Reynolds

- 01 Aug 1995 -

Speech Communication

TL;DR: High performance speaker identification and verification systems based on Gaussian mixture speaker models: robust, statistically based representations of speaker identity, evaluated on four publically available speech databases.

...read moreread less

Cepstrum analysis technique for automatic speaker verification

S. Furui

TL;DR: New techniques for automatic speaker verification using telephone speech based on a set of functions of time obtained from acoustic analysis of a fixed, sentence-long utterance using a new time warping method using a dynamic programming technique.

...read moreread less

Proceedings Article

Comparison of background normalization methods for text-independent speaker verification.

Douglas A. Reynolds

TL;DR: This paper compares two approaches to background model representation for a text-independent speaker verification task using Gaussian mixture models and describes how Bayesian adaptation can be used to derive claimant speaker models, providing a structure leading to significant computational savings during recognition.

...read moreread less

Journal ArticleDOI

Robust speaker recognition: a feature-based approach

Richard J. Mammone, +2 more

- 01 Jan 1996 -

IEEE Signal Processing Magazine

TL;DR: Linear predictive (LP) analysis, the first step of feature extraction, is discussed, and various robust cepstral features derived from LP coefficients are described, including the afJine transform, which is a feature transformation approach that integrates mismatch to simultaneously combat both channel and noise distortion.

...read moreread less

Proceedings Article