Open Access Book Chapter

Vulnerability of Deep Reinforcement Learning to Policy Induction Attacks

TL;DR
In this article, the transferability of adversarial examples is verified across different DQN models, and a novel class of attacks based on this vulnerability is presented to enable policy manipulation and induction in the learning process of DQNs.
Abstract
Deep learning classifiers are known to be inherently vulnerable to manipulation by intentionally perturbed inputs, named adversarial examples. In this work, we establish that reinforcement learning techniques based on Deep Q-Networks (DQNs) are also vulnerable to adversarial input perturbations, and verify the transferability of adversarial examples across different DQN models. Furthermore, we present a novel class of attacks based on this vulnerability that enable policy manipulation and induction in the learning process of DQNs. We propose an attack mechanism that exploits the transferability of adversarial examples to implement policy induction attacks on DQNs, and demonstrate its efficacy and impact through experimental study of a game-learning scenario.
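To make the attack idea in the abstract concrete, below is a minimal, illustrative sketch (not the authors' exact procedure) of crafting a transferable adversarial perturbation on a surrogate ("replica") DQN so that an attacker-chosen action becomes the greedy choice; by transferability, the perturbed state is then likely to mislead the victim DQN as well. The PyTorch setup and names such as replica_q_net and target_action are assumptions for illustration only.

```python
# Illustrative sketch only (assumed PyTorch setup); names are hypothetical.
import torch
import torch.nn.functional as F

def craft_induction_perturbation(replica_q_net, state, target_action, eps=0.01):
    state = state.clone().detach().requires_grad_(True)
    q_values = replica_q_net(state)                  # shape [1, num_actions]
    # Treat the attacker's desired action as the label and reduce its loss.
    loss = F.cross_entropy(q_values, torch.tensor([target_action]))
    loss.backward()
    # Targeted FGSM step: move against the gradient to favour target_action.
    adv_state = state - eps * state.grad.sign()
    return adv_state.clamp(0.0, 1.0).detach()        # assumes inputs in [0, 1]
```

Repeating such a perturbation at every time step is what would steer the victim's learning process; the sketch shows only the per-state crafting step.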


Citations
Journal Article

Deep Learning in Mobile and Wireless Networking: A Survey

TL;DR: This paper bridges the gap between deep learning and mobile and wireless networking research by presenting a comprehensive survey of the crossovers between the two areas, and provides an encyclopedic review of mobile and wireless networking research based on deep learning, categorized by domain.
Proceedings Article

Audio Adversarial Examples: Targeted Attacks on Speech-to-Text

TL;DR: A white-box iterative optimization-based attack on Mozilla's DeepSpeech end-to-end implementation achieves a 100% success rate, and the feasibility of this attack introduces a new domain for studying adversarial examples.
Posted Content

The Space of Transferable Adversarial Examples

TL;DR: It is found that adversarial examples span a contiguous subspace of large (~25) dimensionality, which indicates that it may be possible to design defenses against transfer-based attacks, even for models that are vulnerable to direct attacks.
Posted Content

Adversarially Robust Generalization Requires More Data

TL;DR: In this paper, the authors study adversarially robust learning from the viewpoint of generalization and show that the sample complexity of robust learning can be significantly larger than that of "standard" learning.
Journal Article

Adversarial Attacks and Defenses in Deep Learning

TL;DR: The theoretical foundations, algorithms, and applications of adversarial attack techniques are introduced, and a few research efforts on defense techniques, covering the broad frontier of the field, are described.
References
Journal Article

Human-level control through deep reinforcement learning

TL;DR: This work bridges the divide between high-dimensional sensory inputs and actions, resulting in the first artificial agent that is capable of learning to excel at a diverse array of challenging tasks.
Proceedings Article

Intriguing properties of neural networks

TL;DR: It is found that there is no distinction between individual high-level units and random linear combinations of high-level units according to various methods of unit analysis, and it is suggested that it is the space, rather than the individual units, that contains the semantic information in the high layers of neural networks.
Posted Content

Playing Atari with Deep Reinforcement Learning

TL;DR: This work presents the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning, which outperforms all previous approaches on six of the games and surpasses a human expert on three of them.
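As background for the two DQN references above, the following is a brief, assumed sketch of the standard temporal-difference loss that such agents minimise; the function and tensor names are illustrative and not taken from the cited papers.

```python
# Assumed sketch of the standard DQN TD loss (PyTorch-style), shown only as
# background for the cited DQN papers; names and shapes are illustrative.
import torch
import torch.nn.functional as F

def dqn_loss(q_net, target_q_net, batch, gamma=0.99):
    # batch: states [B, ...], actions [B], rewards [B], next_states [B, ...], dones [B]
    states, actions, rewards, next_states, dones = batch
    q_sa = q_net(states).gather(1, actions.unsqueeze(1)).squeeze(1)   # Q(s, a)
    with torch.no_grad():
        max_next_q = target_q_net(next_states).max(dim=1).values      # max_a' Q_target(s', a')
        target = rewards + gamma * (1.0 - dones) * max_next_q
    return F.smooth_l1_loss(q_sa, target)
```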
Proceedings Article

Explaining and Harnessing Adversarial Examples

TL;DR: It is argued that the primary cause of neural networks' vulnerability to adversarial perturbation is their linear nature, supported by new quantitative results while giving the first explanation of the most intriguing fact about them: their generalization across architectures and training sets.
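For reference, the fast gradient sign method proposed in this paper computes the adversarial example as

```latex
x_{\mathrm{adv}} = x + \epsilon \cdot \operatorname{sign}\big(\nabla_{x} J(\theta, x, y)\big),
```

where J is the training loss, θ the model parameters, and ε the perturbation budget.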