Open Access Book Chapter (DOI)

Evasion attacks against machine learning at test time

TLDR
This work presents a simple but effective gradient-based approach that can be exploited to systematically assess the security of several widely used classification algorithms against evasion attacks.
Abstract
In security-sensitive applications, the success of machine learning depends on a thorough vetting of its resistance to adversarial data. In one pertinent, well-motivated attack scenario, an adversary may attempt to evade a deployed system at test time by carefully manipulating attack samples. In this work, we present a simple but effective gradient-based approach that can be exploited to systematically assess the security of several widely used classification algorithms against evasion attacks. Following a recently proposed framework for security evaluation, we simulate attack scenarios that exhibit different risk levels for the classifier by increasing the attacker's knowledge of the system and her ability to manipulate attack samples. This gives the classifier designer a better picture of the classifier's performance under evasion attacks, and allows him to perform a more informed model selection (or parameter setting). We evaluate our approach on the relevant security task of malware detection in PDF files, and show that such systems can be easily evaded. We also sketch some countermeasures suggested by our analysis.
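To make the gradient-based evasion idea concrete, the following is a minimal sketch, not the authors' implementation: starting from a malicious sample, it follows the negative gradient of a differentiable surrogate discriminant g(x) until the sample is classified as legitimate, while keeping the modification within a fixed budget d_max. The linear surrogate, the non-negativity constraint on features, and all parameter names are illustrative assumptions.

import numpy as np

def decision_function(x, w, b):
    # Surrogate discriminant g(x); g(x) >= 0 is treated as "malicious".
    return float(w @ x + b)

def evade(x0, w, b, step=0.1, d_max=1.0, max_iter=100):
    # Gradient descent on g(x) from a malicious sample x0, constrained to
    # an L2 ball of radius d_max around x0 and to non-negative features.
    x = x0.astype(float).copy()
    for _ in range(max_iter):
        if decision_function(x, w, b) < 0:   # already evades the surrogate
            break
        grad = w                             # gradient of a linear g(x)
        x = x - step * grad / (np.linalg.norm(grad) + 1e-12)
        delta = x - x0
        norm = np.linalg.norm(delta)
        if norm > d_max:                     # project back onto the budget
            x = x0 + delta * (d_max / norm)
        x = np.clip(x, 0.0, None)            # e.g. feature counts cannot go negative
    return x

# Usage: in practice w, b would come from a surrogate model the attacker
# trained on data she can access, not from random values as in this demo.
rng = np.random.default_rng(0)
w, b = rng.normal(size=20), 0.5
x_malicious = np.abs(rng.normal(size=20))
x_adv = evade(x_malicious, w, b)
print(decision_function(x_malicious, w, b), decision_function(x_adv, w, b))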



Citations
Posted Content

Towards Deep Learning Models Resistant to Adversarial Attacks

TL;DR: This work studies the adversarial robustness of neural networks through the lens of robust optimization, and suggests the notion of security against a first-order adversary as a natural and broad security guarantee.
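For context, the robust-optimization view referred to above is commonly written as a saddle-point problem (standard formulation, paraphrased rather than quoted from the cited paper):

\min_{\theta} \; \mathbb{E}_{(x,y)\sim\mathcal{D}} \Big[ \max_{\delta \in \mathcal{S}} L(\theta,\, x + \delta,\, y) \Big]

where \mathcal{S} is the set of allowed perturbations (for example an \ell_\infty ball of radius \epsilon) and the inner maximization models the first-order adversary.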
Proceedings Article (DOI)

The Limitations of Deep Learning in Adversarial Settings

TL;DR: This work formalizes the space of adversaries against deep neural networks (DNNs) and introduces a novel class of algorithms to craft adversarial samples based on a precise understanding of the mapping between inputs and outputs of DNNs.
Proceedings Article (DOI)

Practical Black-Box Attacks against Machine Learning

TL;DR: This work introduces the first practical demonstration of an attacker controlling a remotely hosted DNN without knowledge of its internals or training data, and finds that this black-box attack strategy is capable of evading defense strategies previously found to make adversarial example crafting harder.
Proceedings Article (DOI)

Distillation as a Defense to Adversarial Perturbations Against Deep Neural Networks

TL;DR: In this article, the authors introduce a defensive mechanism called defensive distillation to reduce the effectiveness of adversarial samples on DNNs, which increases the average minimum number of features that need to be modified to create adversarial examples by about 800%.
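As a rough illustration of the mechanism summarized above (a sketch under assumptions, not the cited paper's implementation): distillation trains a second network on the soft class probabilities produced by a first network whose softmax is evaluated at a high temperature T. The snippet shows only the temperature softmax and the resulting loss; the temperature value and function names are illustrative, and the teacher/student training loops are omitted.

import numpy as np

def softmax_T(logits, T=1.0):
    # Softmax evaluated at temperature T; larger T yields softer probabilities.
    z = logits / T
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(student_logits, teacher_probs, T=20.0):
    # Cross-entropy of the student's temperature-softened predictions against
    # the teacher's soft labels (both computed at the same temperature T).
    p = softmax_T(student_logits, T)
    return float(-(teacher_probs * np.log(p + 1e-12)).sum(axis=-1).mean())

# Usage: teacher_probs = softmax_T(teacher_logits, T) would supply the soft labels.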
References
Proceedings Article (DOI)

Adversarial machine learning

TL;DR: In this article, the authors discuss an emerging field of study: adversarial machine learning (AML), the study of effective machine learning techniques against an adversarial opponent, and give a taxonomy for classifying attacks against online machine learning algorithms.
Proceedings Article (DOI)

Adversarial classification

TL;DR: This paper views classification as a game between the classifier and the adversary, and produces a classifier that is optimal given the adversary's optimal strategy; experiments show that this approach can greatly outperform a classifier learned in the standard way.
Proceedings Article (DOI)

Can machine learning be secure?

TL;DR: A taxonomy of different types of attacks on machine learning techniques and systems, a variety of defenses against those attacks, and an analytical model giving a lower bound on the attacker's work function are provided.
Proceedings Article (DOI)

Adversarial learning

TL;DR: This paper introduces the adversarial classifier reverse engineering (ACRE) learning problem, the task of learning sufficient information about a classifier to construct adversarial attacks, and presents efficient algorithms for reverse engineering linear classifiers with either continuous or Boolean features.
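As a rough illustration of what "learning sufficient information about a classifier to construct attacks" can look like in the linear, continuous-feature case, the sketch below uses only membership queries to locate a point near the decision boundary by binary search between a legitimate and a malicious sample. It is a simplified stand-in under assumed parameters, not the ACRE algorithms from the cited paper.

import numpy as np

def query(x, w=np.array([1.0, -2.0, 0.5]), b=0.25):
    # Black-box membership oracle: True if x is classified as malicious.
    # (The attacker never sees w or b; they are fixed here only for the demo.)
    return float(w @ x + b) >= 0.0

def boundary_search(x_neg, x_pos, tol=1e-6, max_iter=100):
    # Binary search on the segment between a legitimate sample (x_neg) and a
    # malicious one (x_pos) to find a point close to the decision boundary.
    lo, hi = np.asarray(x_neg, float), np.asarray(x_pos, float)
    for _ in range(max_iter):
        mid = (lo + hi) / 2.0
        if query(mid):
            hi = mid          # mid is still on the malicious side
        else:
            lo = mid          # mid already evades
        if np.linalg.norm(hi - lo) < tol:
            break
    return (lo + hi) / 2.0

# Usage: repeating this search along several directions reveals the
# orientation of a linear boundary from queries alone.
x_legit = np.array([0.0, 1.0, 0.0])     # classified as legitimate
x_malic = np.array([2.0, 0.0, 1.0])     # classified as malicious
print(boundary_search(x_legit, x_malic))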