Virtual Adversarial Training: A Regularization Method for Supervised and Semi-Supervised Learning

doi:10.1109/TPAMI.2018.2858821

Open Access, Journal Article, DOI

Takeru Miyato, +3 more

- 01 Aug 2019 -

IEEE Transactions on Pattern Analysis an...

- Vol. 41, Iss: 8, pp 1979-1993

TLDR

Virtual adversarial training (VAT) is a regularization method based on virtual adversarial loss, a measure of the local smoothness of the conditional label distribution given the input.

Abstract:

We propose a new regularization method based on virtual adversarial loss: a new measure of local smoothness of the conditional label distribution given input. Virtual adversarial loss is defined as the robustness of the conditional label distribution around each input data point against local perturbation. Unlike adversarial training, our method defines the adversarial direction without label information and is hence applicable to semi-supervised learning. Because the directions in which we smooth the model are only “virtually” adversarial, we call our method virtual adversarial training (VAT). The computational cost of VAT is relatively low. For neural networks, the approximated gradient of virtual adversarial loss can be computed with no more than two pairs of forward- and back-propagations. In our experiments, we applied VAT to supervised and semi-supervised learning tasks on multiple benchmark datasets. With a simple enhancement of the algorithm based on the entropy minimization principle, our VAT achieves state-of-the-art performance for semi-supervised learning tasks on SVHN and CIFAR-10.
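
The procedure the abstract describes can be illustrated with a small sketch. It is not the authors' implementation: it assumes a linear softmax classifier so that the gradient of the KL divergence with respect to the perturbation has a closed form and no back-propagation library is needed. The function names and the parameters `eps` (perturbation norm), `xi` (finite-difference scale) and `n_power` (number of power-iteration steps), along with their default values, are chosen here for illustration only.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict(W, b, x):
    """Class probabilities of a linear softmax model: softmax(W x + b)."""
    logits = [sum(w * xj for w, xj in zip(row, x)) + bk for row, bk in zip(W, b)]
    return softmax(logits)


def kl(p, q):
    """KL(p || q) for two discrete distributions with q > 0."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


def unit(v):
    """Scale v to unit L2 norm (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(vi * vi for vi in v))
    return [vi / norm for vi in v] if norm > 0.0 else list(v)


def conditional_entropy(p):
    """Entropy of the predicted label distribution (entropy-minimization term)."""
    return -sum(pi * math.log(pi) for pi in p if pi > 0.0)


def virtual_adversarial_loss(W, b, x, eps=0.5, xi=1e-3, n_power=1, rng=None):
    """Return (loss, r_adv) for one input x; no label is used anywhere."""
    rng = rng or random.Random(0)
    p = predict(W, b, x)  # current prediction, treated as a fixed target
    d = unit([rng.gauss(0.0, 1.0) for _ in x])  # random starting direction
    for _ in range(n_power):
        q = predict(W, b, [xj + xi * dj for xj, dj in zip(x, d)])
        # Gradient of KL(p || q) w.r.t. the perturbation. For this assumed
        # linear softmax model it equals W^T (q - p); a neural network
        # would obtain it by back-propagation instead.
        grad = [sum(W[k][j] * (q[k] - p[k]) for k in range(len(W)))
                for j in range(len(x))]
        d = unit(grad)
    r_adv = [eps * dj for dj in d]  # virtual adversarial perturbation
    loss = kl(p, predict(W, b, [xj + rj for xj, rj in zip(x, r_adv)]))
    return loss, r_adv


# Toy two-class model (hypothetical weights) and a single unlabeled point.
W = [[1.0, 0.0], [0.0, 1.0]]
b = [0.0, 0.0]
x = [0.2, -0.1]
loss, r_adv = virtual_adversarial_loss(W, b, x)
print(loss, r_adv, conditional_entropy(predict(W, b, x)))
```

In training, a term like `loss` would be averaged over labeled and unlabeled inputs and added to the supervised objective. The entropy-minimization enhancement mentioned in the abstract would add a term like `conditional_entropy` as well; the weighting of these terms is left open here.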

Citations

Proceedings Article

A Variational Approach for Learning from Positive and Unlabeled Data

Hui Chen, +4 more

TL;DR: This paper introduces a variational principle for positive-unlabeled (PU) learning that allows the modeling error of the Bayesian classifier to be evaluated quantitatively from the given data, and that yields a loss function for optimizing the classifier under general conditions.

Journal Article, DOI

CMW-Net: Learning a Class-Aware Sample Weighting Mapping for Robust Deep Learning

J. Shu, +3 more

- 11 Feb 2022 -

IEEE Transactions on Pattern Analysis an...

TL;DR: A meta-model is proposed that adaptively learns an explicit sample-weighting scheme directly from data. Each training class is treated as a separate learning task, so that different sample classes can receive different weighting schemes according to their intrinsic bias characteristics.

Journal Article, DOI

Fuzzy Overclustering: Semi-Supervised Classification of Fuzzy Labels with Overclustering and Inverse Cross-Entropy.

Lars Schmarje, +5 more

- 07 Oct 2021 -

Sensors

TL;DR: In this paper, the authors propose a framework for semi-supervised classification of fuzzy labels, based on overclustering to detect substructures in those labels.

Posted Content

Virtual Adversarial Ladder Networks For Semi-supervised Learning

Saki Shinoda, +2 more

- 20 Nov 2017 -

arXiv: Learning

TL;DR: In this article, the authors propose a class of models that fuse ladder networks and virtual adversarial training (VAT) to achieve near-supervised accuracy with high consistency on the MNIST dataset using just 5 labels per class.

Posted Content

Blending-target Domain Adaptation by Adversarial Meta-Adaptation Networks.

Ziliang Chen, +3 more

- 08 Jul 2019 -

arXiv: Learning

TL;DR: In this article, the authors propose an adversarial meta-adaptation network (AMEAN) to overcome intra-target category misalignment in a more realistic transfer scenario, where the target domain consists of multiple sub-targets implicitly blended with each other.

References

Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

Journal Article, DOI

Generative Adversarial Nets

Ian Goodfellow, +7 more

TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are trained simultaneously: a generative model G that captures the data distribution, and a discriminative model D that estimates the probability that a sample came from the training data rather than from G.

Journal Article

Dropout: a simple way to prevent neural networks from overfitting

Nitish Srivastava, +4 more

- 01 Jan 2014 -

Journal of Machine Learning Research

TL;DR: It is shown that dropout improves the performance of neural networks on supervised learning tasks in vision, speech recognition, document classification and computational biology, obtaining state-of-the-art results on many benchmark data sets.

Proceedings Article

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, +1 more

TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.

Proceedings Article, DOI

Densely Connected Convolutional Networks

Gao Huang, +3 more

TL;DR: DenseNet connects each layer to every other layer in a feed-forward fashion, which alleviates the vanishing-gradient problem, strengthens feature propagation, encourages feature reuse, and substantially reduces the number of parameters.

Related Papers (5)

Deep Residual Learning for Image Recognition

Learning Multiple Layers of Features from Tiny Images

Explaining and Harnessing Adversarial Examples

Generative Adversarial Nets

ImageNet: A large-scale hierarchical image database