Journal ArticleDOI
An Overview of Speaker Identification: Accuracy and Robustness Issues
Roberto Togneri,Daniel Pullella +1 more
Reads0
Chats0
TLDR
The main paradigms for speaker identification, and recent work on missing data methods to increase robustness are presented, and combined approaches involving bottom-up estimation and top-down processing are reviewed.Abstract:
This paper presents the main paradigms for speaker identification, and recent work on missing data methods to increase robustness. The feature extraction, speaker modeling and system classification are discussed. Evaluations of speaker identification performance subject to environmental noise are presented. While performance is impressive in clean speech conditions, there is rapid degradation with mismatched additive noise. Missing data methods can compensate against arbitrary disturbances and remove environmental mismatches. An overview of missing data methods is provided and applications to robust speaker identification summarized. Finally combined approaches involving bottom-up estimation and top-down processing are reviewed, and their significance discussed.read more
Citations
More filters
Journal ArticleDOI
Spoofing and countermeasures for speaker verification
TL;DR: A survey of past work and priority research directions for the future is provided, showing that future research should address the lack of standard datasets and the over-fitting of existing countermeasures to specific, known spoofing attacks.
Spoofing and countermeasures for speaker verification: a sur vey
TL;DR: In this paper, the authors provide a survey of spoofing countermeasures for automatic speaker verificati on, highlighting the need for more effort in the future to ensure adequate protection against spoofing attacks.
Journal ArticleDOI
Speaker identification features extraction methods: A systematic review
TL;DR: It is identified that the current SI research trend is to develop a robust universal SI framework to address the important problems of SI such as adaptability, complexity, multi-lingual recognition, and noise robustness.
Proceedings ArticleDOI
Emotion recognition from spontaneous speech using Hidden Markov models with deep belief networks
Duc Le,Emily Mower Provost +1 more
TL;DR: This work proposes and evaluates a suite of hybrid classifiers based on Hidden Markov Models and Deep Belief Networks, and provides insights into important similarities and differences between speech and emotion.
Posted Content
Speaker Recognition Based on Deep Learning: An Overview
Zhongxin Bai,Xiao-Lei Zhang +1 more
TL;DR: Several major subtasks of speaker recognition are reviewed, including speaker verification, identification, diarization, and robust speaker recognition, with a focus on deep-learning-based methods.
References
More filters
Journal ArticleDOI
A Tutorial on Support Vector Machines for Pattern Recognition
TL;DR: There are several arguments which support the observed high accuracy of SVMs, which are reviewed and numerous examples and proofs of most of the key theorems are given.
Journal ArticleDOI
Suppression of acoustic noise in speech using spectral subtraction
TL;DR: A stand-alone noise suppression algorithm that resynthesizes a speech waveform and can be used as a pre-processor to narrow-band voice communications systems, speech recognition systems, or speaker authentication systems.
Journal ArticleDOI
An introduction to biometric recognition
TL;DR: A brief overview of the field of biometrics is given and some of its advantages, disadvantages, strengths, limitations, and related privacy concerns are summarized.
Journal ArticleDOI
Speaker Verification Using Adapted Gaussian Mixture Models
TL;DR: The major elements of MIT Lincoln Laboratory's Gaussian mixture model (GMM)-based speaker verification system used successfully in several NIST Speaker Recognition Evaluations (SREs) are described.
Journal ArticleDOI
Robust text-independent speaker identification using Gaussian mixture speaker models
Douglas A. Reynolds,Richard Rose +1 more
TL;DR: The individual Gaussian components of a GMM are shown to represent some general speaker-dependent spectral shapes that are effective for modeling speaker identity and is shown to outperform the other speaker modeling techniques on an identical 16 speaker telephone speech task.
Related Papers (5)
Robust text-independent speaker identification using Gaussian mixture speaker models
Douglas A. Reynolds,Richard Rose +1 more
An overview of text-independent speaker recognition: From features to supervectors
Tomi Kinnunen,Haizhou Li +1 more