Adversarial Feature Selection Against Evasion Attacks

doi:10.1109/TCYB.2015.2415032

Open AccessJournal ArticleDOI

Adversarial Feature Selection Against Evasion Attacks

Fei Zhang, +4 more

- 01 Mar 2016 -

IEEE Transactions on Systems, Man, and C...

- Vol. 46, Iss: 3, pp 766-777

TLDR

This paper proposes a novel adversary-aware feature selection model that can improve classifier security against evasion attacks, by incorporating specific assumptions on the adversary's data manipulation strategy.

Abstract:

Pattern recognition and machine learning techniques have been increasingly adopted in adversarial settings such as spam, intrusion, and malware detection, although their security against well-crafted attacks that aim to evade detection by manipulating data at test time has not yet been thoroughly assessed. While previous work has been mainly focused on devising adversary-aware classification algorithms to counter evasion attempts, only few authors have considered the impact of using reduced feature sets on classifier security against the same attacks. An interesting, preliminary result is that classifier security to evasion may be even worsened by the application of feature selection. In this paper, we provide a more detailed investigation of this aspect, shedding some light on the security properties of feature selection against evasion attacks. Inspired by previous work on adversary-aware classifiers, we propose a novel adversary-aware feature selection model that can improve classifier security against evasion attacks, by incorporating specific assumptions on the adversary’s data manipulation strategy. We focus on an efficient, wrapper-based implementation of our approach, and experimentally validate its soundness on different application examples, including spam and malware detection.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Feature Squeezing: Detecting Adversarial Examples in Deep Neural Networks.

Weilin Xu, +2 more

Abstract: Although deep neural networks (DNNs) have achieved great success in many tasks, they can often be fooled by \emph{adversarial examples} that are generated by adding small but purposeful distortions to natural examples. Previous studies to defend against adversarial examples mostly focused on refining the DNN models, but have either shown limited success or required expensive computation. We propose a new strategy, \emph{feature squeezing}, that can be used to harden DNN models by detecting adversarial examples. Feature squeezing reduces the search space available to an adversary by coalescing samples that correspond to many different feature vectors in the original space into a single sample. By comparing a DNN model's prediction on the original input with that on squeezed inputs, feature squeezing detects adversarial examples with high accuracy and few false positives. This paper explores two feature squeezing methods: reducing the color bit depth of each pixel and spatial smoothing. These simple strategies are inexpensive and complementary to other defenses, and can be combined in a joint detection framework to achieve high detection rates against state-of-the-art attacks.

...read moreread less

Proceedings ArticleDOI

Feature Squeezing: Detecting Adversarial Examples in Deep Neural Networks

Weilin Xu, +2 more

- 04 Apr 2017 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: Two feature squeezing methods are explored: reducing the color bit depth of each pixel and spatial smoothing, which are inexpensive and complementary to other defenses, and can be combined in a joint detection framework to achieve high detection rates against state-of-the-art attacks.

...read moreread less

Proceedings ArticleDOI

Wild Patterns: Ten Years After the Rise of Adversarial Machine Learning

Battista Biggio, +1 more

TL;DR: A thorough overview of the evolution of this research area over the last ten years and beyond is provided, starting from pioneering, earlier work on the security of non-deep learning algorithms up to more recent work aimed to understand the security properties of deep learning algorithms, in the context of computer vision and cybersecurity tasks.

...read moreread less

Posted Content

MAD-GAN: Multivariate Anomaly Detection for Time Series Data with Generative Adversarial Networks

Dan Li, +5 more

- 15 Jan 2019 -

arXiv: Learning

TL;DR: The proposed MAD-GAN framework considers the entire variable set concurrently to capture the latent interactions amongst the variables and is effective in reporting anomalies caused by various cyber-intrusions compared in these complex real-world systems.

...read moreread less

Proceedings Article

Is Feature Selection Secure against Training Data Poisoning

Huang Xiao, +5 more

TL;DR: In this article, the authors investigate the robustness of feature selection methods, including LASSO, ridge regression and elastic net, under attack and show that they can be significantly compromised under attack, highlighting the need for specific countermeasures.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

The Nature of Statistical Learning Theory

Vladimir Vapnik

TL;DR: Setting of the learning problem consistency of learning processes bounds on the rate of convergence ofLearning processes controlling the generalization ability of learning process constructing learning algorithms what is important in learning theory?

...read moreread less

Journal ArticleDOI

An introduction to variable and feature selection

Isabelle Guyon, +1 more

- 01 Mar 2003 -

Journal of Machine Learning Research

TL;DR: The contributions of this special issue cover a wide range of aspects of variable selection: providing a better definition of the objective function, feature construction, feature ranking, multivariate feature selection, efficient search methods, and feature validity assessment methods.

...read moreread less

Journal ArticleDOI

Wrappers for feature subset selection

Ron Kohavi, +1 more

- 01 Dec 1997 -

Artificial Intelligence

TL;DR: The wrapper method searches for an optimal feature subset tailored to a particular algorithm and a domain and compares the wrapper approach to induction without feature subset selection and to Relief, a filter approach tofeature subset selection.

...read moreread less

Journal ArticleDOI

Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy

Hanchuan Peng, +2 more

- 01 Aug 2005 -

IEEE Transactions on Pattern Analysis an...

TL;DR: In this article, the maximal statistical dependency criterion based on mutual information (mRMR) was proposed to select good features according to the maximal dependency condition. But the problem of feature selection is not solved by directly implementing mRMR.

...read moreread less

Journal ArticleDOI

Gene Selection for Cancer Classification using Support Vector Machines

Isabelle Guyon, +3 more

- 11 Mar 2002 -

Machine Learning

TL;DR: In this article, a Support Vector Machine (SVM) method based on recursive feature elimination (RFE) was proposed to select a small subset of genes from broad patterns of gene expression data, recorded on DNA micro-arrays.

...read moreread less

Collapse

Adversarial Feature Selection Against Evasion Attacks

Citations

Feature Squeezing: Detecting Adversarial Examples in Deep Neural Networks.

Feature Squeezing: Detecting Adversarial Examples in Deep Neural Networks

Wild Patterns: Ten Years After the Rise of Adversarial Machine Learning

MAD-GAN: Multivariate Anomaly Detection for Time Series Data with Generative Adversarial Networks

Is Feature Selection Secure against Training Data Poisoning

References

The Nature of Statistical Learning Theory

An introduction to variable and feature selection

Wrappers for feature subset selection

Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy

Gene Selection for Cancer Classification using Support Vector Machines

Related Papers (5)

Evasion attacks against machine learning at test time

Adversarial machine learning

The Limitations of Deep Learning in Adversarial Settings

Towards Evaluating the Robustness of Neural Networks

Distillation as a Defense to Adversarial Perturbations Against Deep Neural Networks