Spoofing-Aware Speaker Verification with Unsupervised Domain Adaptation

doi:10.48550/arXiv.2203.10992

Proceedings ArticleDOI

Spoofing-Aware Speaker Verification with Unsupervised Domain Adaptation

Xuechen Liu, +2 more

- pp 85-91

Chats0

TLDR

This paper begins the concern of enhancing the spoofing robustness of the automatic speaker verification (ASV) system, without the primary presence of a separate countermeasure module by employing three unsupervised domain adaptation techniques to optimize the back-end using the audio data in the training partition of the ASVspoof 2019 dataset.

Abstract:

In this paper, we initiate the concern of enhancing the spoofing robustness of the automatic speaker verification (ASV) system, without the primary presence of a separate countermeasure module. We start from the standard ASV framework of the ASVspoof 2019 baseline and approach the problem from the back-end classifier based on probabilistic linear discriminant analysis. We employ three unsupervised domain adaptation techniques to optimize the back-end using the audio data in the training partition of the ASVspoof 2019 dataset. We demonstrate notable improvements on both logical and physical access scenarios, especially on the latter where the system is attacked by replayed audios, with a maximum of 36.1% and 5.3% relative improvement on bonafide and spoofed cases, respectively. We perform additional studies such as per-attack breakdown analysis, data composition, and integration with a countermeasure system at score-level with Gaussian back-end.

Spoofing-Aware Speaker Verification with Unsupervised Domain Adaptation

Citations

SASV 2022: The First Spoofing-Aware Speaker Verification Challenge

Voice Spoofing Countermeasures: Taxonomy, State-of-the-art, experimental analysis of generalizability, open challenges, and the way forward

Mel Spectrogram Based Automatic Speaker Verification Using GMM-UBM

References

Auto-Encoding Variational Bayes

Matrix analysis: Frontmatter

WaveNet: A Generative Model for Raw Audio

X-Vectors: Robust DNN Embeddings for Speaker Recognition

Return of frustratingly easy domain adaptation