DAMAD: Database, Attack, and Model Agnostic Adversarial Perturbation Detector

doi:10.1109/TNNLS.2021.3051529

Journal ArticleDOI

DAMAD: Database, Attack, and Model Agnostic Adversarial Perturbation Detector

Akshay Agarwal, +4 more

- 12 Mar 2021 -

IEEE Transactions on Neural Networks

- pp 1-13

TLDR

DAMAD as mentioned in this paper is a generalized perturbation detection algorithm which is agnostic to model architecture, training data set, and loss function used during training, which is based on the fusion of autoencoder embedding and statistical texture features extracted from convolutional neural networks.

Abstract:

Adversarial perturbations have demonstrated the vulnerabilities of deep learning algorithms to adversarial attacks. Existing adversary detection algorithms attempt to detect the singularities; however, they are in general, loss-function, database, or model dependent. To mitigate this limitation, we propose DAMAD--a generalized perturbation detection algorithm which is agnostic to model architecture, training data set, and loss function used during training. The proposed adversarial perturbation detection algorithm is based on the fusion of autoencoder embedding and statistical texture features extracted from convolutional neural networks. The performance of DAMAD is evaluated on the challenging scenarios of cross-database, cross-attack, and cross-architecture training and testing along with traditional evaluation of testing on the same database with known attack and model. Comparison with state-of-the-art perturbation detection algorithms showcase the effectiveness of the proposed algorithm on six databases: ImageNet, CIFAR-10, Multi-PIE, MEDS, point and shoot challenge (PaSC), and MNIST. Performance evaluation with nearly a quarter of a million adversarial and original images and comparison with recent algorithms show the effectiveness of the proposed algorithm.

DAMAD: Database, Attack, and Model Agnostic Adversarial Perturbation Detector

Citations

Exploring Robustness Connection between Artificial and Natural Adversarial Examples

Fast Adversarial Training with Noise Augmentation: A Unified Perspective on RandStart and GradAlign

Wavelet Regularization Benefits Adversarial Training

AN-GCN: An Anonymous Graph Convolutional Network Defense Against Edge-Perturbing Attack

Towards an Accurate and Secure Detector against Adversarial Perturbations

References

Deep Residual Learning for Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

ImageNet: A large-scale hierarchical image database

Gradient-based learning applied to document recognition

Densely Connected Convolutional Networks

Related Papers (5)

An Adversarial Attack Detection Paradigm With Swarm Optimization

Efficient Detection of Pixel-Level Adversarial Attacks

Adversarial Attacks on Face Detectors Using Neural Net Based Constrained Optimization

R2AD: Randomization and Reconstructor-based Adversarial Defense for Deep Neural Networks

Frequency Centric Defense Mechanisms against Adversarial Examples.