Proceedings ArticleDOI

Deceiving the Protector: Fooling Face Presentation Attack Detection Algorithms

TL;DR: For the first time in the literature, this paper shows that it is possible to "fool" PAD algorithms with adversarial perturbations, using a convolutional autoencoder as the perturbation network.
Abstract: Face recognition systems are vulnerable to presentation attacks such as replay and 3D masks. In the literature, several presentation attack detection (PAD) algorithms have been developed to address this problem. However, for the first time in the literature, this paper showcases that it is possible to "fool" PAD algorithms using adversarial perturbations. The proposed perturbation approach attacks presentation attack detection at the PAD feature level, transforming features from one class (attack) to another (real). The PAD feature tampering network utilizes a convolutional autoencoder to learn the perturbations. The proposed algorithm is evaluated against CNN-based and local binary pattern (LBP)-based PAD algorithms. Experiments on three databases, Replay, SMAD, and Face Morph, showcase that the proposed approach increases the equal error rate (EER) of PAD algorithms by at least a factor of two. For instance, on the SMAD database, the PAD EER increases from 20.1% to 55.7% after attacking the PAD algorithm.
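
As a rough illustration of the feature-level tampering described above, the sketch below implements a small convolutional autoencoder that learns to map attack-class PAD feature maps toward real-class features. This is a minimal PyTorch sketch, not the authors' implementation; the feature shapes, layer widths, and the MSE objective are assumptions.

    import torch
    import torch.nn as nn

    class FeatureTamperingAE(nn.Module):
        """Perturbs PAD feature maps so attack features resemble real-class features."""
        def __init__(self, channels=64):
            super().__init__()
            self.encoder = nn.Sequential(
                nn.Conv2d(channels, 32, 3, stride=2, padding=1), nn.ReLU(),
                nn.Conv2d(32, 16, 3, stride=2, padding=1), nn.ReLU(),
            )
            self.decoder = nn.Sequential(
                nn.ConvTranspose2d(16, 32, 4, stride=2, padding=1), nn.ReLU(),
                nn.ConvTranspose2d(32, channels, 4, stride=2, padding=1),
            )

        def forward(self, attack_features):
            # Tampered features are handed to the downstream PAD classifier.
            # Spatial dimensions are assumed divisible by 4.
            return self.decoder(self.encoder(attack_features))

    def tampering_loss(tampered, real_features):
        # Assumed objective: pull tampered attack features toward paired real-class features.
        return nn.functional.mse_loss(tampered, real_features)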
Citations
Posted Content
TL;DR: In this paper, different types of attacks such as physical presentation attacks, disguise/makeup, digital adversarial attacks, and morphing/tampering using GANs have been discussed.
Abstract: Face recognition algorithms have demonstrated very high recognition performance, suggesting suitability for real-world applications. Despite the enhanced accuracies, the robustness of these algorithms against attacks and bias has been challenged. This paper summarizes the different ways in which the robustness of a face recognition algorithm is challenged, which can severely affect its intended operation. Different types of attacks, such as physical presentation attacks, disguise/makeup, digital adversarial attacks, and morphing/tampering using GANs, are discussed. We also present a discussion on the effect of bias on face recognition models and showcase that factors such as age and gender variations affect the performance of modern algorithms. The paper also presents the potential reasons for these challenges and some of the future research directions for increasing the robustness of face recognition models.

5 citations

Book ChapterDOI
01 Jan 2022
TL;DR: A comprehensive survey of existing algorithms for retouched and altered image detection is presented in this article, and multiple experiments are performed to highlight the open challenges of alteration detection; while the majority of these images are created for fun and beautification purposes, they may be used with malicious intent for negative applications such as deepnude or spreading visual fake news.
Abstract: On social media platforms, filters for digital retouching and face beautification have become a common trend. With the availability of easy-to-use image editing tools, the generation of altered images has become an effortless task. Apart from this, advancements in Generative Adversarial Networks (GANs) have led to the creation of realistic facial images and the alteration of facial images based on attributes. While the majority of these images are created for fun and beautification purposes, they may be used with malicious intent for negative applications such as deepnude or spreading visual fake news. Therefore, it is important to detect digital alterations in images and videos. This chapter presents a comprehensive survey of existing algorithms for retouched and altered image detection. Further, multiple experiments are performed to highlight the open challenges of alteration detection.

5 citations

Proceedings ArticleDOI
10 Jan 2021
TL;DR: In this article, the authors proposed a deep learning-based network termed MixNet to detect presentation attacks in cross-database and unseen attack settings; it utilizes state-of-the-art convolutional neural network architectures and learns the feature mapping for each attack category.
Abstract: The non-intrusive nature and high accuracy of face recognition algorithms have led to their successful deployment across multiple applications, ranging from border access to mobile unlocking and digital payments. However, their vulnerability to sophisticated and cost-effective presentation attack media raises essential questions regarding their reliability. In the literature, several presentation attack detection algorithms have been presented; however, they are still far from being ready for real-world use. The major problem with existing work is generalizability across multiple attacks in both seen and unseen settings. Algorithms that are useful for one kind of attack (such as print) perform unsatisfactorily on another type of attack (such as silicone masks). In this research, we propose a deep learning-based network, termed MixNet, to detect presentation attacks in cross-database and unseen attack settings. The proposed algorithm utilizes state-of-the-art convolutional neural network architectures and learns the feature mapping for each attack category. Experiments are performed using multiple challenging face presentation attack databases, such as SMAD and Spoof In the Wild (SiW-M). Extensive experiments and comparison with existing state-of-the-art algorithms show the effectiveness of the proposed algorithm.
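
As a rough sketch of the per-category idea described above (not the official MixNet code), the snippet below builds one sub-network per attack category and fuses their scores into a single attack decision; the ResNet-18 backbone, the number of categories, and the max-score fusion rule are assumptions.

    import torch
    import torch.nn as nn
    from torchvision import models

    class MixNetSketch(nn.Module):
        def __init__(self, num_categories=3):  # e.g., print, replay, mask (assumed)
            super().__init__()
            self.branches = nn.ModuleList()
            for _ in range(num_categories):
                backbone = models.resnet18(weights=None)
                backbone.fc = nn.Linear(backbone.fc.in_features, 1)  # per-category attack score
                self.branches.append(backbone)

        def forward(self, x):
            scores = torch.cat([branch(x) for branch in self.branches], dim=1)  # (B, num_categories)
            return scores.max(dim=1).values  # assumed fusion: strongest category score decides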

5 citations

Journal ArticleDOI
TL;DR: Zhang et al. design a two-branch framework that combines the global and local frequency clues of the input signal to accurately distinguish live from spoofing faces; the system employs a crafted acoustic signal as the probe to perform face liveness detection.
Abstract: 2D face presentation attacks are one of the most notorious and pervasive face spoofing types, and they have caused pressing security issues for facial authentication systems. While RGB-based face anti-spoofing (FAS) models have proven to counter face spoofing attacks effectively, most existing FAS models suffer from the overfitting problem (i.e., they lack generalization capability to data collected from an unseen environment). Recently, many models have been devoted to capturing auxiliary information (e.g., depth and infrared maps) to achieve more robust face liveness detection performance. However, these methods require expensive sensors and extra hardware to capture the specific modality information, limiting their applications in practical scenarios. To tackle these problems, we devise a novel and cost-effective FAS system based on the acoustic modality, named Echo-FAS, which employs a crafted acoustic signal as the probe to perform face liveness detection. We first build a large-scale, high-diversity, acoustic-based FAS database, Echo-Spoof. Then, based upon Echo-Spoof, we design a novel two-branch framework that combines the global and local frequency clues of input signals to accurately distinguish live from spoofing faces. The devised Echo-FAS has the following three merits: (1) it only needs one speaker and one microphone as sensors and does not require any expensive hardware; (2) it successfully captures the 3D geometric information of input queries and achieves remarkable face anti-spoofing performance; and (3) it can be readily combined with other RGB-based FAS models to mitigate the overfitting problem in the RGB modality and make the FAS model more accurate and robust. Our proposed Echo-FAS provides new insights regarding the development of FAS systems for mobile devices.
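
A minimal sketch of the two-branch idea described above (not the released Echo-FAS code): one branch consumes the global magnitude spectrum of the recorded echo, the other consumes local spectrogram structure, and their embeddings are fused for live-vs-spoof scoring. The FFT size, layer widths, and fusion are assumptions.

    import torch
    import torch.nn as nn

    class TwoBranchAcousticFAS(nn.Module):
        def __init__(self, n_fft=512):
            super().__init__()
            self.n_fft = n_fft
            # Global branch: averaged magnitude spectrum across time.
            self.global_branch = nn.Sequential(
                nn.Linear(n_fft // 2 + 1, 128), nn.ReLU(), nn.Linear(128, 64))
            # Local branch: convolution over the time-frequency map.
            self.local_branch = nn.Sequential(
                nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(),
                nn.AdaptiveAvgPool2d((4, 4)), nn.Flatten(), nn.Linear(16 * 16, 64))
            self.head = nn.Linear(128, 2)  # live vs. spoof

        def forward(self, signal):  # signal: (batch, samples) recorded echo
            window = torch.hann_window(self.n_fft, device=signal.device)
            spec = torch.stft(signal, self.n_fft, window=window, return_complex=True).abs()
            global_clue = self.global_branch(spec.mean(dim=-1))   # global frequency clue
            local_clue = self.local_branch(spec.unsqueeze(1))     # local frequency clue
            return self.head(torch.cat([global_clue, local_clue], dim=1))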

2 citations

Proceedings ArticleDOI
21 Aug 2022
TL;DR: Wang et al. propose a multi-task deep learning model with a denoising convolutional skip autoencoder and a classifier to build in robustness against noisy images.
Abstract: The vulnerability of iris recognition algorithms to presentation attacks demands a robust defense mechanism. Much research has been done in the literature to create robust attack detection algorithms; however, most of them suffer from poor generalizability, for example under inter-database testing or unseen attack types. The problem of attack detection can be further exacerbated if the images contain noise such as Gaussian or salt-and-pepper noise. In this research, we propose a multi-task deep learning model with a denoising convolutional skip autoencoder and a classifier to build in robustness against noisy images. A Gaussian noise layer is introduced as a dropout between the encoder network's hidden layers, which helps the model learn generalized features that are robust to data noise. The proposed algorithm is evaluated on multiple presentation attack databases; extensive experiments across different noise types and a comparison with other deep learning models show the generalizability and efficacy of the proposed model.
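
As a rough sketch of the multi-task design described above (architectural details are assumptions, not the authors' code): a denoising convolutional autoencoder with a skip connection, a Gaussian noise layer injected between encoder layers as a dropout-like regularizer, and a classification head trained jointly with the reconstruction objective.

    import torch
    import torch.nn as nn

    class GaussianNoise(nn.Module):
        """Adds Gaussian noise during training only, similar in spirit to dropout."""
        def __init__(self, sigma=0.1):
            super().__init__()
            self.sigma = sigma

        def forward(self, x):
            return x + self.sigma * torch.randn_like(x) if self.training else x

    class DenoisingSkipAEClassifier(nn.Module):
        def __init__(self):
            super().__init__()
            self.enc1 = nn.Sequential(nn.Conv2d(3, 32, 3, 2, 1), nn.ReLU(), GaussianNoise())
            self.enc2 = nn.Sequential(nn.Conv2d(32, 64, 3, 2, 1), nn.ReLU(), GaussianNoise())
            self.dec2 = nn.Sequential(nn.ConvTranspose2d(64, 32, 4, 2, 1), nn.ReLU())
            self.dec1 = nn.ConvTranspose2d(32, 3, 4, 2, 1)
            self.classifier = nn.Sequential(
                nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(64, 2))  # bonafide vs. attack

        def forward(self, noisy_image):  # spatial dims assumed divisible by 4
            e1 = self.enc1(noisy_image)
            e2 = self.enc2(e1)
            reconstruction = self.dec1(self.dec2(e2) + e1)  # skip connection from enc1
            return reconstruction, self.classifier(e2)

    def multitask_loss(recon, clean, logits, label, alpha=1.0):
        # Assumed weighting: denoising reconstruction loss plus classification loss.
        return nn.functional.mse_loss(recon, clean) + alpha * nn.functional.cross_entropy(logits, label)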

2 citations

References
Book ChapterDOI
06 Sep 2014
TL;DR: A novel visualization technique is introduced that gives insight into the function of intermediate feature layers and the operation of the classifier in large Convolutional Network models; used in a diagnostic role, it helps find model architectures that outperform Krizhevsky et al. on the ImageNet classification benchmark.
Abstract: Large Convolutional Network models have recently demonstrated impressive classification performance on the ImageNet benchmark (Krizhevsky et al. [18]). However, there is no clear understanding of why they perform so well, or how they might be improved. In this paper we explore both issues. We introduce a novel visualization technique that gives insight into the function of intermediate feature layers and the operation of the classifier. Used in a diagnostic role, these visualizations allow us to find model architectures that outperform Krizhevsky et al. on the ImageNet classification benchmark. We also perform an ablation study to discover the performance contribution of different model layers. We show that our ImageNet model generalizes well to other datasets: when the softmax classifier is retrained, it convincingly beats the current state-of-the-art results on the Caltech-101 and Caltech-256 datasets.
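
The paper's technique projects activations back to pixel space with a deconvnet; as a much simpler stand-in, the sketch below only captures intermediate feature maps with forward hooks so their shapes and responses can be inspected. The torchvision AlexNet backbone and layer indices are assumptions, not the paper's model.

    import torch
    from torchvision import models

    model = models.alexnet(weights=None).eval()
    activations = {}

    def save_activation(name):
        def hook(module, inputs, output):
            activations[name] = output.detach()
        return hook

    # Hook a few convolutional layers of the feature extractor.
    for idx in (0, 3, 6):
        model.features[idx].register_forward_hook(save_activation(f"conv_{idx}"))

    with torch.no_grad():
        model(torch.randn(1, 3, 224, 224))

    for name, fmap in activations.items():
        print(name, tuple(fmap.shape))  # per-layer feature-map shapes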

12,783 citations

Proceedings Article
01 Jan 2014
TL;DR: It is found that there is no distinction between individual high-level units and random linear combinations of high-level units, according to various methods of unit analysis, suggesting that it is the space, rather than the individual units, that contains the semantic information in the high layers of neural networks.
Abstract: Deep neural networks are highly expressive models that have recently achieved state-of-the-art performance on speech and visual recognition tasks. While their expressiveness is the reason they succeed, it also causes them to learn uninterpretable solutions that could have counter-intuitive properties. In this paper we report two such properties. First, we find that there is no distinction between individual high-level units and random linear combinations of high-level units, according to various methods of unit analysis. This suggests that it is the space, rather than the individual units, that contains the semantic information in the high layers of neural networks. Second, we find that deep neural networks learn input-output mappings that are fairly discontinuous to a significant extent. We can cause the network to misclassify an image by applying a certain imperceptible perturbation, which is found by maximizing the network's prediction error. In addition, the specific nature of these perturbations is not a random artifact of learning: the same perturbation can cause a different network, trained on a different subset of the dataset, to misclassify the same input.

9,561 citations


"Deceiving the Protector: Fooling Fa..." refers background in this paper

  • ...[24] observe that small manipulations in the pixel values can lead to its misclassification by deep learning algorithms....

    [...]
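
The excerpt above describes the phenomenon the sketch below reproduces in its simplest form: a single gradient step that increases the network's prediction error yields a small perturbation that can flip the predicted label. The original paper uses a box-constrained L-BFGS search rather than this one-step variant, and the model, input, and perturbation budget here are placeholders.

    import torch
    import torch.nn as nn
    from torchvision import models

    model = models.resnet18(weights=None).eval()
    image = torch.rand(1, 3, 224, 224, requires_grad=True)  # placeholder input in [0, 1]
    true_label = torch.tensor([0])

    # Maximize the prediction error with respect to the input pixels.
    loss = nn.functional.cross_entropy(model(image), true_label)
    loss.backward()

    epsilon = 2.0 / 255.0  # assumed perturbation budget
    adversarial = (image + epsilon * image.grad.sign()).clamp(0.0, 1.0).detach()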

Journal ArticleDOI
TL;DR: Support vector machines are becoming popular in a wide variety of biological applications, but how do they work and what are their most promising applications in the life sciences?
Abstract: Support vector machines (SVMs) are becoming popular in a wide variety of biological applications. But, what exactly are SVMs and how do they work? And what are their most promising applications in the life sciences?

3,801 citations


"Deceiving the Protector: Fooling Fa..." refers methods in this paper

  • ...On 2D attacks, the SVM with RBF kernel yields the lowest EER value whereas on silicone and digital morph attack linear kernel shows the best performance....

    [...]

  • ...%) on digital face morph database increased by more than 14, 10, and 16 times with linear, polynomial, and RBF kernel-based SVM with CNN features, respectively....

    [...]

  • ...Similarly, the sensitivity of different SVM kernels is also evaluated against feature tampering....

    [...]

  • ...The performance degradation across each SVM kernel on the hand-held set of Replay-Attack database is even higher under inter-attack network training in comparison to intraattack learning....

    [...]

  • ...The support vector machine (SVM) [10] classifier is trained for presentation attack detection on CNN features....

    [...]
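
The excerpts above evaluate SVMs with linear, polynomial, and RBF kernels trained on CNN features; the scikit-learn sketch below shows that setup in minimal form. The features and labels are random placeholders, and the hyperparameters are assumptions.

    import numpy as np
    from sklearn.svm import SVC

    rng = np.random.default_rng(0)
    train_features = rng.normal(size=(200, 512))  # placeholder CNN features
    train_labels = rng.integers(0, 2, size=200)   # 0 = real, 1 = attack

    for kernel in ("linear", "poly", "rbf"):      # kernels compared in the paper
        clf = SVC(kernel=kernel, probability=True).fit(train_features, train_labels)
        attack_scores = clf.predict_proba(train_features)[:, 1]  # scores from which EER is computed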

Proceedings ArticleDOI
14 May 2014
TL;DR: It is shown that the data augmentation techniques commonly applied to CNN-based methods can also be applied to shallow methods and result in an analogous performance boost; it is also identified that the dimensionality of the CNN output layer can be reduced significantly without having an adverse effect on performance.
Abstract: The latest generation of Convolutional Neural Networks (CNNs) has achieved impressive results in challenging benchmarks on image recognition and object detection, significantly raising the interest of the community in these methods. Nevertheless, it is still unclear how different CNN methods compare with each other and with previous state-of-the-art shallow representations such as the Bag-of-Visual-Words and the Improved Fisher Vector. This paper conducts a rigorous evaluation of these new techniques, exploring different deep architectures and comparing them on a common ground, identifying and disclosing important implementation details. We identify several useful properties of CNN-based representations, including the fact that the dimensionality of the CNN output layer can be reduced significantly without having an adverse effect on performance. We also identify aspects of deep and shallow methods that can be successfully shared. In particular, we show that the data augmentation techniques commonly applied to CNN-based methods can also be applied to shallow methods, and result in an analogous performance boost. Source code and models to reproduce the experiments in the paper are made publicly available.
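
One of the findings above, that the dimensionality of the CNN output layer can be reduced substantially without hurting performance, is easy to approximate with PCA on pre-extracted features; the sketch below is a placeholder illustration, not the paper's exact protocol, and the feature source and target dimensionality are assumptions.

    import numpy as np
    from sklearn.decomposition import PCA

    rng = np.random.default_rng(0)
    cnn_features = rng.normal(size=(1000, 4096))  # placeholder fc-layer features
    reduced = PCA(n_components=128).fit_transform(cnn_features)
    print(reduced.shape)  # (1000, 128)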

3,533 citations

Journal ArticleDOI
TL;DR: A comprehensive survey on adversarial attacks on deep learning in computer vision can be found in this paper, where the authors review works that design adversarial attacks, analyze the existence of such attacks, and propose defenses against them.
Abstract: Deep learning is at the heart of the current rise of artificial intelligence. In the field of computer vision, it has become the workhorse for applications ranging from self-driving cars to surveillance and security. While deep neural networks have demonstrated phenomenal success (often beyond human capabilities) in solving complex problems, recent studies show that they are vulnerable to adversarial attacks in the form of subtle perturbations to inputs that lead a model to predict incorrect outputs. For images, such perturbations are often too small to be perceptible, yet they completely fool the deep learning models. Adversarial attacks pose a serious threat to the success of deep learning in practice. This fact has recently led to a large influx of contributions in this direction. This paper presents the first comprehensive survey on adversarial attacks on deep learning in computer vision. We review the works that design adversarial attacks, analyze the existence of such attacks, and propose defenses against them. To emphasize that adversarial attacks are possible in practical conditions, we separately review the contributions that evaluate adversarial attacks in real-world scenarios. Finally, drawing on the reviewed literature, we provide a broader outlook of this research direction.

1,542 citations