Audio Replay Attack Detection Using High-Frequency Features.

doi:10.21437/INTERSPEECH.2017-776

Proceedings ArticleDOI

Audio Replay Attack Detection Using High-Frequency Features.

Marcin Witkowski, +4 more

- pp 27-31

Chats0

TLDR

This paper addresses a replay spoofing attack against a speaker recognition system by detecting that the analysed signal has passed through multiple analogue-to-digital conversions by modelling the subband spectrum and using the proposed features derived from the linear prediction analysis.

Abstract:

This paper presents our contribution to the ASVspoof 2017 Challenge. It addresses a replay spoofing attack against a speaker recognition system by detecting that the analysed signal has passed through multiple analogue-to-digital (AD) conversions. Specifically, we show that most of the cues that enable to detect the replay attacks can be found in the high-frequency band of the replayed recordings. The described anti-spoofing countermeasures are based on (1) modelling the subband spectrum and (2) using the proposed features derived from the linear prediction (LP) analysis. The results of the investigated methods show a significant improvement in comparison to the baseline system of the ASVspoof 2017 Challenge. A relative equal error rate (EER) reduction by 70% was achieved for the development set and a reduction by 30% was obtained for the evaluation set.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Advances in anti-spoofing: from the perspective of ASVspoof challenges

Madhu R. Kamble, +3 more

TL;DR: The literature review of ASV spoof detection, novel acoustic feature representations, deep learning, end-to-end systems, etc, along with recent efforts to develop countermeasures for spoof speech detection (SSD) task are presented.

...read moreread less

Proceedings ArticleDOI

Effectiveness of Speech Demodulation-Based Features for Replay Detection.

Madhu R. Kamble, +2 more

TL;DR: This paper explores speech demodulation-based features using Hilbert transform (HT) and Teager Energy Operator (TEO) for replay detection and proposes features, namely, HT-based Instantaneous Amplitude (IA) and Instantaneous Frequency (IF) Cosine Coefficients and Energy Separation Algorithm (ESA) based features.

...read moreread less

Proceedings ArticleDOI

A Light Convolutional GRU-RNN Deep Feature Extractor for ASV Spoofing Detection.

Alejandro Gomez-Alanis, +3 more

TL;DR: This work proposes the use of a Light Convolutional Gated Recurrent Neural Network (LC-GRNN) as a deep feature extractor to robustly represent speech signals as utterance-level embeddings, which are later used by a back-end recognizer which performs the final genuine/spoofed classification.

...read moreread less

Proceedings ArticleDOI

Modulation dynamic features for the detection of replay attacks

Gajan Suthokumar, +3 more

TL;DR: This paper proposes two novel features to capture the static and dynamic characteristics of the signal from the modulation spectrum, which complement short term spectral features for use in replay detection.

...read moreread less

Proceedings ArticleDOI

Long Range Acoustic and Deep Features Perspective on ASVspoof 2019

Rohan Kumar Das, +2 more

TL;DR: A comprehensive analysis on the nature of different kinds of spoofing attacks and system development is made and the use of deep features that enhances the discriminative ability between genuine and spoofed speech is investigated.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Posted Content

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

Martín Abadi, +39 more

- 01 Jan 2015 -

arXiv: Distributed, Parallel, and Cluste...

TL;DR: The TensorFlow interface and an implementation of that interface that is built at Google are described, which has been used for conducting research and for deploying machine learning systems into production across more than a dozen areas of computer science and other fields.

...read moreread less

Journal ArticleDOI

Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences

S. Davis, +1 more

- 01 Aug 1980 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: In this article, several parametric representations of the acoustic signal were compared with regard to word recognition performance in a syllable-oriented continuous speech recognition system, and the emphasis was on the ability to retain phonetically significant acoustic information in the face of syntactic and duration variations.

...read moreread less

Proceedings Article

Deep speech 2: end-to-end speech recognition in English and mandarin

Dario Amodei, +68 more

TL;DR: In this article, an end-to-end deep learning approach was used to recognize either English or Mandarin Chinese speech-two vastly different languages-using HPC techniques, enabling experiments that previously took weeks to now run in days.

...read moreread less

Journal ArticleDOI

Calculation of a constant Q spectral transform

Judith C. Brown

- 01 Jan 1991 -

Journal of the Acoustical Society of Ame...

TL;DR: In this article, a constant Q transform with a constant ratio of center frequency to resolution has been proposed to obtain a constant pattern in the frequency domain for sounds with harmonic frequency components.

...read moreread less