Cross-modal Adversarial Reprogramming.

In this article, an efficient adversarial program that maps a sequence of discrete tokens into an image which can be classified to the desired class by an image classification model is proposed, achieving competitive performance on a variety of text and sequence classification benchmarks without retraining the network.

Abstract:

With the abundance of large-scale deep learning models, it has become possible to repurpose pre-trained networks for new tasks. Recent works on adversarial reprogramming have shown that it is possible to repurpose neural networks for alternate tasks without modifying the network architecture or parameters. However these works only consider original and target tasks within the same data domain. In this work, we broaden the scope of adversarial reprogramming beyond the data modality of the original task. We analyze the feasibility of adversarially repurposing image classification neural networks for Natural Language Processing (NLP) and other sequence classification tasks. We design an efficient adversarial program that maps a sequence of discrete tokens into an image which can be classified to the desired class by an image classification model. We demonstrate that by using highly efficient adversarial programs, we can reprogram image classifiers to achieve competitive performance on a variety of text and sequence classification benchmarks without retraining the network.

Citations

PDF

Open Access

More filters

Posted Content

Why Adversarial Reprogramming Works, When It Fails, and How to Tell the Difference

Yang Zheng,Xiaoyi Feng,Xia Zhaoqiang,Xiaoyue Jiang,Ambra Demontis,Maura Pintor,Battista Biggio,Fabio Roli +7 moreUniversity of Cagliari

- 26 Aug 2021 -

arXiv: Learning

Show Less

TL;DR: In this article, the authors developed a first-order linear model of adversarial reprogramming to show that its success inherently depends on the size of the average input gradient, which grows when input gradients are more aligned, and when inputs have higher dimensionality.

...read moreread less

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 moreMicrosoft

Show Less

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

Journal ArticleDOI

Long short-term memory

Sepp Hochreiter,Jürgen Schmidhuber +1 moreTechnische Universität München,Dalle Molle Institute for Artificial Intelligence Research

- 01 Nov 1997 -

Neural Computation

Show Less

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

...read moreread less

Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

Jia Deng,Wei Dong,Richard Socher,Li-Jia Li,Kai Li,Li Fei-Fei +5 morePrinceton University

Show Less

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

...read moreread less

Journal ArticleDOI

Gradient-based learning applied to document recognition

Yann LeCun,Léon Bottou,Léon Bottou,Yoshua Bengio,Yoshua Bengio,Yoshua Bengio,Patrick Haffner +6 moreBell Labs,École Normale Supérieure,AT&T,Alcatel-Lucent,École Polytechnique de Montréal

Show Less

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.

...read moreread less

Posted Content

Adam: A Method for Stochastic Optimization

Diederik P. Kingma,Jimmy Ba +1 moreUniversity of Amsterdam,University of Toronto

- 22 Dec 2014 -

arXiv: Learning

Show Less

TL;DR: In this article, the adaptive estimates of lower-order moments are used for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimate of lowerorder moments.

...read moreread less