An Overview of Speaker Identification: Accuracy and Robustness Issues

doi:10.1109/MCAS.2011.941079

Journal ArticleDOI

An Overview of Speaker Identification: Accuracy and Robustness Issues

Roberto Togneri, +1 more

- 09 Jun 2011 -

IEEE Circuits and Systems Magazine

- Vol. 11, Iss: 2, pp 23-61

Chats0

TLDR

The main paradigms for speaker identification, and recent work on missing data methods to increase robustness are presented, and combined approaches involving bottom-up estimation and top-down processing are reviewed.

Abstract:

This paper presents the main paradigms for speaker identification, and recent work on missing data methods to increase robustness. The feature extraction, speaker modeling and system classification are discussed. Evaluations of speaker identification performance subject to environmental noise are presented. While performance is impressive in clean speech conditions, there is rapid degradation with mismatched additive noise. Missing data methods can compensate against arbitrary disturbances and remove environmental mismatches. An overview of missing data methods is provided and applications to robust speaker identification summarized. Finally combined approaches involving bottom-up estimation and top-down processing are reviewed, and their significance discussed.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Spoofing and countermeasures for speaker verification

Zhizheng Wu, +5 more

- 01 Feb 2015 -

Speech Communication

TL;DR: A survey of past work and priority research directions for the future is provided, showing that future research should address the lack of standard datasets and the over-fitting of existing countermeasures to specific, known spoofing attacks.

...read moreread less

Spoofing and countermeasures for speaker verification: a sur vey

Zhizheng Wu, +5 more

TL;DR: In this paper, the authors provide a survey of spoofing countermeasures for automatic speaker verificati on, highlighting the need for more effort in the future to ensure adequate protection against spoofing attacks.

...read moreread less

Journal ArticleDOI

Speaker identification features extraction methods: A systematic review

Sreenivas Sremath Tirumala, +3 more

- 30 Dec 2017 -

Expert Systems With Applications

TL;DR: It is identified that the current SI research trend is to develop a robust universal SI framework to address the important problems of SI such as adaptability, complexity, multi-lingual recognition, and noise robustness.

...read moreread less

Proceedings ArticleDOI

Emotion recognition from spontaneous speech using Hidden Markov models with deep belief networks

Duc Le, +1 more

TL;DR: This work proposes and evaluates a suite of hybrid classifiers based on Hidden Markov Models and Deep Belief Networks, and provides insights into important similarities and differences between speech and emotion.

...read moreread less

Posted Content

Speaker Recognition Based on Deep Learning: An Overview

Zhongxin Bai, +1 more

- 02 Dec 2020 -

arXiv: Audio and Speech Processing

TL;DR: Several major subtasks of speaker recognition are reviewed, including speaker verification, identification, diarization, and robust speaker recognition, with a focus on deep-learning-based methods.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

A Tutorial on Support Vector Machines for Pattern Recognition

Christopher John Burges

- 01 Jun 1998 -

Data Mining and Knowledge Discovery

TL;DR: There are several arguments which support the observed high accuracy of SVMs, which are reviewed and numerous examples and proofs of most of the key theorems are given.

...read moreread less

Journal ArticleDOI

Suppression of acoustic noise in speech using spectral subtraction

S. Boll

- 01 Apr 1979 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: A stand-alone noise suppression algorithm that resynthesizes a speech waveform and can be used as a pre-processor to narrow-band voice communications systems, speech recognition systems, or speaker authentication systems.

...read moreread less

Journal ArticleDOI

An introduction to biometric recognition

Anil K. Jain, +2 more

- 01 Jan 2004 -

IEEE Transactions on Circuits and System...

TL;DR: A brief overview of the field of biometrics is given and some of its advantages, disadvantages, strengths, limitations, and related privacy concerns are summarized.

...read moreread less

Journal ArticleDOI

Speaker Verification Using Adapted Gaussian Mixture Models

Douglas A. Reynolds, +2 more

- 01 Jan 2000 -

Digital Signal Processing

TL;DR: The major elements of MIT Lincoln Laboratory's Gaussian mixture model (GMM)-based speaker verification system used successfully in several NIST Speaker Recognition Evaluations (SREs) are described.

...read moreread less

Journal ArticleDOI

Robust text-independent speaker identification using Gaussian mixture speaker models

Douglas A. Reynolds, +1 more

- 01 Jan 1995 -

IEEE Transactions on Speech and Audio Pr...

TL;DR: The individual Gaussian components of a GMM are shown to represent some general speaker-dependent spectral shapes that are effective for modeling speaker identity and is shown to outperform the other speaker modeling techniques on an identical 16 speaker telephone speech task.

...read moreread less