Open Access · Posted Content

Towards Crafting Text Adversarial Samples.

Suranjana Samanta, +1 more
10 Jul 2017
TLDR
This paper proposes a new method of crafting adversarial text samples by modifying the original samples; the method works best for datasets that have sub-categories within each class.
Abstract
Adversarial samples are strategically modified samples, crafted with the purpose of fooling a classifier at hand. An attacker introduces specially crafted adversarial samples to a deployed classifier, which misclassifies them. To a human observer, however, the samples still appear to belong to their original class, which makes the adversarial samples hard to detect. Most prior work has focused on synthesizing adversarial samples in the image domain. In this paper, we propose a new method of crafting adversarial text samples by modifying the original samples: deleting or replacing the important or salient words in the text, or introducing new words into the text sample. Our algorithm works best for datasets that have sub-categories within each class. A key constraint while crafting adversarial samples is to generate meaningful sentences that can pass as legitimate from a language (English) viewpoint. Experimental results on the IMDB movie-review dataset for sentiment analysis and a Twitter dataset for gender detection show the efficiency of our proposed method.
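The modification strategy the abstract describes — score each word's importance to the classifier, then delete or replace the most salient one — can be sketched as follows. This is a minimal illustration, not the authors' implementation: the keyword-lexicon "classifier", its scores, and the synonym table are all toy assumptions.

```python
# Minimal sketch of saliency-guided word replacement for a text sample.
# The classifier here is a toy positive-sentiment lexicon, standing in
# for a trained model's class probability.
POSITIVE = {"great": 1.0, "wonderful": 0.9, "enjoyable": 0.6, "fine": 0.3}

def sentiment_score(words):
    """Stand-in for P(positive): summed lexicon scores of the words."""
    return sum(POSITIVE.get(w, 0.0) for w in words)

def word_saliency(words):
    """Saliency of each word = score drop when that word is deleted."""
    base = sentiment_score(words)
    return [base - sentiment_score(words[:i] + words[i + 1:])
            for i in range(len(words))]

# Hypothetical synonym table used for meaning-preserving replacements.
SYNONYMS = {"great": "fine", "wonderful": "fine"}

def craft_adversarial(words):
    """Replace the most salient word with a weaker synonym, if one exists."""
    sal = word_saliency(words)
    # Try words in decreasing order of saliency.
    for i in sorted(range(len(words)), key=lambda i: -sal[i]):
        if words[i] in SYNONYMS:
            out = list(words)
            out[i] = SYNONYMS[words[i]]
            return out
    return list(words)

sample = ["a", "great", "and", "enjoyable", "movie"]
adv = craft_adversarial(sample)  # "great" is most salient, so it is replaced
```

In a real attack the lexicon would be a trained classifier queried for class probabilities, and the synonym table would come from a resource such as WordNet; the greedy saliency-first loop is the common thread.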


Citations
Proceedings Article

Synthetic and Natural Noise Both Break Neural Machine Translation

TL;DR: It is found that a model based on a character convolutional neural network is able to simultaneously learn representations robust to multiple kinds of noise, including structure-invariant word representations and robust training on noisy texts.
Proceedings ArticleDOI

Black-Box Generation of Adversarial Text Sequences to Evade Deep Learning Classifiers

TL;DR: DeepWordBug generates small text perturbations in a black-box setting that force a deep-learning classifier to misclassify a text input, using scoring strategies to find the most important words to modify.
Proceedings ArticleDOI

Generating Natural Language Adversarial Examples through Probability Weighted Word Saliency.

TL;DR: A new word replacement order determined by both the word saliency and the classification probability is introduced, and a greedy algorithm called probability-weighted word saliency (PWWS) is proposed for text adversarial attacks.
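The PWWS ordering summarized above combines two signals: how much a word matters to the classifier (saliency) and how much its best substitute shifts the classification probability. A hedged sketch of that ranking, with a toy lexicon classifier and a hypothetical substitute table in place of the real model and WordNet synonyms:

```python
# Sketch of PWWS-style ranking: score each replaceable word by
# saliency * probability drop of its best substitute, then sort greedily.
LEXICON = {"excellent": 0.9, "good": 0.5, "okay": 0.1}
SUBS = {"excellent": "okay", "good": "okay"}  # hypothetical substitutes

def prob_positive(words):
    """Stand-in for a classifier's P(positive)."""
    return sum(LEXICON.get(w, 0.0) for w in words)

def pwws_order(words):
    """Return (score, index, substitute) triples, highest score first."""
    base = prob_positive(words)
    scored = []
    for i, w in enumerate(words):
        sub = SUBS.get(w)
        if sub is None:
            continue
        # Saliency: effect of deleting the word entirely.
        saliency = base - prob_positive(words[:i] + words[i + 1:])
        # Probability drop from the substitution itself.
        delta = base - prob_positive(words[:i] + [sub] + words[i + 1:])
        scored.append((saliency * delta, i, sub))
    return sorted(scored, reverse=True)
```

Words are then replaced greedily in this order until the classifier's prediction flips, which is the loop the paper's greedy algorithm describes.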
Journal ArticleDOI

Analysis Methods in Neural Language Processing: A Survey

TL;DR: Analysis methods in neural language processing are reviewed, categorize them according to prominent research trends, highlight existing limitations, and point to potential directions for future work.
Journal ArticleDOI

Adversarial Attacks and Defenses in Images, Graphs and Text: A Review

TL;DR: A systematic and comprehensive review of the main attack threats and the success of the corresponding countermeasures against adversarial examples, covering the three most popular data types: images, graphs, and text.
References
Journal ArticleDOI

Gradient-based learning applied to document recognition

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.
Posted Content

Efficient Estimation of Word Representations in Vector Space

TL;DR: This paper proposes two novel model architectures for computing continuous vector representations of words from very large data sets; the quality of these representations is measured on a word-similarity task, and the results are compared to the previously best-performing techniques based on different types of neural networks.

Posted Content

Explaining and Harnessing Adversarial Examples

TL;DR: The authors argue that the primary cause of neural networks' vulnerability to adversarial perturbation is their linear nature, which is supported by new quantitative results while giving the first explanation of the most intriguing fact about adversarial examples: their generalization across architectures and training sets.
Proceedings Article

Learning Word Vectors for Sentiment Analysis

TL;DR: This work presents a model that uses a mix of unsupervised and supervised techniques to learn word vectors capturing semantic term--document information as well as rich sentiment content, and finds it out-performs several previously introduced methods for sentiment classification.