Open Access · Proceedings Article

Adversarial Training Methods for Semi-Supervised Text Classification

TLDR
In this article, the authors extend adversarial and virtual adversarial training to the text domain by applying perturbations to the word embeddings in a recurrent neural network rather than to the original input itself.
Abstract
Adversarial training provides a means of regularizing supervised learning algorithms, while virtual adversarial training is able to extend supervised learning algorithms to the semi-supervised setting. However, both methods require making small perturbations to numerous entries of the input vector, which is inappropriate for sparse high-dimensional inputs such as one-hot word representations. We extend adversarial and virtual adversarial training to the text domain by applying perturbations to the word embeddings in a recurrent neural network rather than to the original input itself. The proposed method achieves state-of-the-art results on multiple benchmark semi-supervised and purely supervised tasks. We provide visualizations and analysis showing that the learned word embeddings have improved in quality and that, while training, the model is less prone to overfitting.
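As a rough illustration of the method the abstract describes, here is a minimal PyTorch-style sketch of the embedding-level adversarial loss. The `model.embed`/`model.classify` interfaces, the epsilon value, and the per-token normalization are assumptions for illustration, not the authors' implementation (the paper normalizes the perturbation per sequence and also normalizes the embeddings themselves).

```python
import torch
import torch.nn.functional as F

def adversarial_loss(model, token_ids, labels, epsilon=5.0):
    """Adversarial training applied to word embeddings (sketch).

    `model.embed` maps token ids to embeddings and `model.classify`
    runs the recurrent classifier on embeddings; both interfaces are
    assumed here for illustration.
    """
    # Perturb the continuous embeddings, not the sparse one-hot inputs.
    embeds = model.embed(token_ids)
    clean_loss = F.cross_entropy(model.classify(embeds), labels)

    # Fast-gradient perturbation: one gradient step, L2-normalized
    # per token (the paper normalizes per sequence instead).
    grad, = torch.autograd.grad(clean_loss, embeds, retain_graph=True)
    r_adv = epsilon * grad / (grad.norm(dim=-1, keepdim=True) + 1e-12)

    # Also train against the perturbed embeddings.
    adv_loss = F.cross_entropy(model.classify(embeds + r_adv), labels)
    return clean_loss + adv_loss
```

Virtual adversarial training follows the same pattern but replaces the label-dependent cross-entropy with a divergence between the model's predictions on clean and perturbed embeddings, which is why it also applies to unlabeled text.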



Citations
Proceedings Article

Universal Language Model Fine-tuning for Text Classification

TL;DR: Universal Language Model Fine-tuning (ULMFiT) is an effective transfer learning method that can be applied to any task in NLP, and it introduces techniques that are key for fine-tuning a language model.
Proceedings Article

Learned in translation: contextualized word vectors

TL;DR: Adding context vectors from a deep LSTM encoder of an attentional sequence-to-sequence model trained for machine translation improves performance over using only unsupervised word and character vectors on a wide variety of common NLP tasks.
Proceedings Article

Adversarial Personalized Ranking for Recommendation

TL;DR: Adversarial Personalized Ranking (APR) enhances the pairwise ranking method BPR with adversarial training: the BPR objective is minimized while defending against an adversary that adds adversarial perturbations to the model parameters to maximize the BPR objective.
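For intuition, a minimal sketch of that minimax in PyTorch, assuming plain matrix-factorization embeddings `P` (users) and `Q` (items) held as leaf tensors with `requires_grad=True`; the names and the global-norm budget are illustrative assumptions, not the APR paper's code:

```python
import torch
import torch.nn.functional as F

def apr_loss(P, Q, users, pos, neg, eps=0.5, reg=1.0):
    """APR-style objective (sketch): BPR loss plus BPR loss under an
    adversarial perturbation of the embedding parameters."""
    def bpr(Pm, Qm):
        # BPR: positive items should outscore sampled negatives.
        x_uij = (Pm[users] * Qm[pos]).sum(-1) - (Pm[users] * Qm[neg]).sum(-1)
        return -F.logsigmoid(x_uij).sum()

    base = bpr(P, Q)

    # Adversary: fast-gradient perturbation of the parameters that
    # increases the BPR loss, under an L2 budget `eps`.
    gP, gQ = torch.autograd.grad(base, [P, Q], retain_graph=True)
    dP = eps * gP / (gP.norm() + 1e-12)
    dQ = eps * gQ / (gQ.norm() + 1e-12)

    # Defender: minimize the loss at the perturbed parameters too.
    return base + reg * bpr(P + dP, Q + dQ)
```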
Posted Content

Pseudo-Labeling and Confirmation Bias in Deep Semi-Supervised Learning

TL;DR: This work shows that naive pseudo-labeling overfits to incorrect pseudo-labels due to so-called confirmation bias, and demonstrates that mixup augmentation and enforcing a minimum number of labeled samples per mini-batch are effective regularization techniques for reducing it.
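As a quick reference, a one-function sketch of the mixup regularizer mentioned there; the function name and tensor layout are assumptions, and `y` holds one-hot or soft pseudo-labels:

```python
import torch

def mixup(x, y, alpha=0.4):
    """Mixup: train on convex combinations of examples and their
    (pseudo-)label distributions, which discourages overconfident
    fits to incorrect pseudo-labels."""
    lam = torch.distributions.Beta(alpha, alpha).sample()
    perm = torch.randperm(x.size(0))
    return lam * x + (1 - lam) * x[perm], lam * y + (1 - lam) * y[perm]
```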
Proceedings Article

Adversarial Training for Relation Extraction

TL;DR: Experimental results demonstrate that adversarial training is generally effective for both CNN and RNN models and significantly improves the precision of predicted relations.