Open Access Proceedings Article

Regularizing Deep Neural Networks by Noise: Its Interpretation and Optimization

TLDR
This paper interprets conventional training with noise-injection regularization as optimizing a lower bound of the true objective, and proposes a technique that achieves a tighter lower bound by using multiple noise samples per training example in each stochastic gradient descent iteration.
Abstract
Overfitting is one of the most critical challenges in deep neural networks, and various regularization methods exist to improve generalization performance. Injecting noise into hidden units during training, e.g., dropout, is known to be a successful regularizer, but it is still not well understood why such training techniques work in practice and how their benefit can be maximized in the presence of two conflicting objectives: fitting the true data distribution and preventing overfitting through regularization. This paper addresses these issues by 1) interpreting conventional training with noise-injection regularization as optimizing a lower bound of the true objective and 2) proposing a technique that achieves a tighter lower bound by using multiple noise samples per training example in a stochastic gradient descent iteration. We demonstrate the effectiveness of our idea in several computer vision applications.
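To make the multi-sample idea concrete, below is a minimal, hypothetical PyTorch-style sketch (not the authors' released code): several dropout-noise samples are drawn per example in one SGD step, and the per-sample log-likelihoods are combined with a log-mean-exp, which yields a tighter lower bound than averaging the log-likelihoods directly. The network, data, and hyperparameters are placeholders.

```python
# Hypothetical sketch: K noise (dropout) samples per example in one SGD step,
# combined via log-mean-exp of per-sample log-likelihoods (a tighter bound
# than the usual single-sample or averaged objective).
import math
import torch
import torch.nn as nn
import torch.nn.functional as F

class DropoutMLP(nn.Module):
    def __init__(self, d_in=784, d_hid=256, n_cls=10, p=0.5):
        super().__init__()
        self.fc1 = nn.Linear(d_in, d_hid)
        self.drop = nn.Dropout(p)            # the injected noise
        self.fc2 = nn.Linear(d_hid, n_cls)

    def forward(self, x):
        return self.fc2(self.drop(F.relu(self.fc1(x))))

def multi_sample_loss(model, x, y, k=5):
    # Each forward pass draws a fresh dropout mask; stack the per-sample
    # log-likelihoods and combine them with log-mean-exp.
    log_p = torch.stack(
        [-F.cross_entropy(model(x), y, reduction='none') for _ in range(k)],
        dim=0,
    )                                         # shape: (k, batch)
    tighter_bound = torch.logsumexp(log_p, dim=0) - math.log(k)
    return -tighter_bound.mean()

model = DropoutMLP()
opt = torch.optim.SGD(model.parameters(), lr=0.1)
x, y = torch.randn(32, 784), torch.randint(0, 10, (32,))
opt.zero_grad()
multi_sample_loss(model, x, y, k=5).backward()
opt.step()
```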


Citations
Book Chapter

Towards Robust Neural Networks via Random Self-ensemble

TL;DR: Random Self-Ensemble (RSE) adds random noise layers to the neural network to defend against strong gradient-based attacks, and ensembles the prediction over random noise draws to stabilize performance.
Posted Content

Towards Robust Neural Networks via Random Self-ensemble

TL;DR: This paper proposes a new defense algorithm called Random Self-Ensemble (RSE), which adds random noise layers to the neural network to defend against strong gradient-based attacks, and ensembles the prediction over random noise draws to stabilize performance.
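As a rough illustration of the RSE recipe summarized above (layer sizes, noise scales, and the number of draws are assumptions, not the paper's settings), the sketch below injects Gaussian noise before layers and averages softmax outputs over several noise draws at prediction time.

```python
# Illustrative sketch of random noise layers plus ensembled prediction.
import torch
import torch.nn as nn
import torch.nn.functional as F

class NoiseLayer(nn.Module):
    def __init__(self, std=0.1):
        super().__init__()
        self.std = std

    def forward(self, x):
        # Additive Gaussian noise, applied at both training and test time.
        return x + self.std * torch.randn_like(x)

class RSENet(nn.Module):
    def __init__(self, d_in=784, d_hid=256, n_cls=10):
        super().__init__()
        self.net = nn.Sequential(
            NoiseLayer(0.2), nn.Linear(d_in, d_hid), nn.ReLU(),
            NoiseLayer(0.1), nn.Linear(d_hid, n_cls),
        )

    def forward(self, x):
        return self.net(x)

@torch.no_grad()
def ensemble_predict(model, x, n_draws=10):
    # Average softmax outputs over independent noise draws to stabilize
    # the prediction against the injected randomness.
    probs = torch.stack([F.softmax(model(x), dim=-1) for _ in range(n_draws)])
    return probs.mean(dim=0).argmax(dim=-1)
```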
Proceedings Article

Beyond Synthetic Noise: Deep Learning on Controlled Noisy Labels

TL;DR: The authors establish the first benchmark of controlled real-world label noise collected from the web, enabling web label noise to be studied in a controlled setting, and show that their method achieves the best results on this dataset as well as on two public benchmarks (CIFAR and WebVision).
Journal Article

Big-Data Science in Porous Materials: Materials Genomics and Machine Learning

TL;DR: It is shown that the availability of so many materials allows big-data methods to be used as a powerful technique to study these materials and to discover complex correlations.
Proceedings Article

Supervised autoencoders: Improving generalization performance with unsupervised regularizers

TL;DR: This work theoretically and empirically analyzes supervised autoencoders and provides a novel generalization result for linear autoencoders, proving uniform stability based on the inclusion of the reconstruction error in a neural network that predicts both inputs (reconstruction) and targets jointly.
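A minimal sketch of the supervised-autoencoder idea, under assumed layer sizes and a single linear encoder: one head reconstructs the input, another predicts the target, and the reconstruction error is added to the supervised loss as an unsupervised regularizer.

```python
# Sketch of a supervised autoencoder: joint prediction of targets and inputs.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SupervisedAutoencoder(nn.Module):
    def __init__(self, d_in=784, d_code=64, n_cls=10):
        super().__init__()
        self.encoder = nn.Linear(d_in, d_code)
        self.decoder = nn.Linear(d_code, d_in)      # reconstruction head
        self.classifier = nn.Linear(d_code, n_cls)  # supervised head

    def forward(self, x):
        h = F.relu(self.encoder(x))
        return self.classifier(h), self.decoder(h)

def sae_loss(model, x, y, recon_weight=1.0):
    # Supervised loss plus reconstruction error as an auxiliary regularizer.
    logits, x_hat = model(x)
    return F.cross_entropy(logits, y) + recon_weight * F.mse_loss(x_hat, x)
```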
References
Proceedings Article

Deep Pyramidal Residual Networks

TL;DR: This research gradually increases the feature map dimension at all units so as to involve as many locations as possible, and proposes a novel residual unit that further improves classification accuracy with the new network architecture.
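The following is an illustrative, simplified pyramidal-style residual unit (channel widths and the exact layer ordering are assumptions based on the description above, not the reference implementation): the output has more channels than the input, and the identity shortcut is zero-padded along the channel dimension to match.

```python
# Simplified pyramidal-style residual unit with a zero-padded shortcut.
import torch
import torch.nn as nn
import torch.nn.functional as F

class PyramidalBlock(nn.Module):
    def __init__(self, c_in, c_out):
        super().__init__()
        self.bn1 = nn.BatchNorm2d(c_in)
        self.conv1 = nn.Conv2d(c_in, c_out, 3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(c_out)
        self.conv2 = nn.Conv2d(c_out, c_out, 3, padding=1, bias=False)
        self.bn3 = nn.BatchNorm2d(c_out)
        self.extra = c_out - c_in   # channels added by this block

    def forward(self, x):
        out = self.conv1(self.bn1(x))
        out = self.conv2(F.relu(self.bn2(out)))
        out = self.bn3(out)
        # Zero-pad the identity shortcut along the channel dimension.
        shortcut = F.pad(x, (0, 0, 0, 0, 0, self.extra))
        return out + shortcut
```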
Proceedings Article

Dropout Training as Adaptive Regularization

TL;DR: By casting dropout as regularization, this work develops a natural semi-supervised algorithm that uses unlabeled data to create a better adaptive regularizer; the approach consistently boosts the performance of dropout training, improving on state-of-the-art results on the IMDB reviews dataset.
Posted Content

Neural Module Networks

TL;DR: The authors decompose questions into their linguistic substructures and use these structures to dynamically instantiate modular networks (with reusable components for recognizing dogs, classifying colors, etc.) for visual question answering.
Proceedings Article

Image Question Answering Using Convolutional Neural Network with Dynamic Parameter Prediction

TL;DR: This paper proposes a joint network for ImageQA that combines a CNN with a parameter prediction network; it is trained end-to-end through back-propagation, with weights initialized from a pre-trained CNN and GRU.
Proceedings Article

Adaptive dropout for training deep neural networks

TL;DR: A method called 'standout' is described in which a binary belief network is overlaid on a neural network and used to regularize its hidden units by selectively setting activities to zero; it achieves lower classification error rates than other feature learning methods, including standard dropout, denoising auto-encoders, and restricted Boltzmann machines.
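A rough sketch of the 'standout' idea (the gate here reuses the layer's own pre-activation for brevity, which only approximates the paper's overlaid belief network): a sigmoid belief gives each hidden unit its own keep probability, units are dropped stochastically during training, and the expected activation is used at test time.

```python
# Sketch of adaptive (standout-style) dropout with unit-wise keep probabilities.
import torch
import torch.nn as nn
import torch.nn.functional as F

class StandoutLinear(nn.Module):
    def __init__(self, d_in, d_out, alpha=1.0, beta=0.0):
        super().__init__()
        self.fc = nn.Linear(d_in, d_out)
        self.alpha, self.beta = alpha, beta

    def forward(self, x):
        a = self.fc(x)
        # Unit-wise belief that the unit should stay active.
        keep_prob = torch.sigmoid(self.alpha * a + self.beta)
        if self.training:
            mask = torch.bernoulli(keep_prob)   # stochastic binary gate
            return mask * F.relu(a)
        return keep_prob * F.relu(a)            # expectation at test time
```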