Self-supervised Pretraining of Visual Features in the Wild

Open AccessPosted Content

Self-supervised Pretraining of Visual Features in the Wild

Priya Goyal, +10 more

- 02 Mar 2021 -

arXiv: Computer Vision and Pattern Recog...

Chats0

TLDR

Recently, self-supervised learning methods like MoCo, SimCLR, BYOL and SwAV have reduced the gap with supervised methods as mentioned in this paper. But self-learning cannot learn from any random image and from any unbounded dataset.

Abstract:

Recently, self-supervised learning methods like MoCo, SimCLR, BYOL and SwAV have reduced the gap with supervised methods. These results have been achieved in a control environment, that is the highly curated ImageNet dataset. However, the premise of self-supervised learning is that it can learn from any random image and from any unbounded dataset. In this work, we explore if self-supervision lives to its expectation by training large models on random, uncurated images with no supervision. Our final SElf-supERvised (SEER) model, a RegNetY with 1.3B parameters trained on 1B random images with 512 GPUs achieves 84.2% top-1 accuracy, surpassing the best self-supervised pretrained model by 1% and confirming that self-supervised learning works in a real world setting. Interestingly, we also observe that self-supervised models are good few-shot learners achieving 77.9% top-1 with access to only 10% of ImageNet. Code: this https URL

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Self-supervised Learning: Generative or Contrastive.

Xiao Liu, +6 more

- 15 Jun 2020 -

arXiv: Learning

TL;DR: This survey takes a look into new self-supervised learning methods for representation in computer vision, natural language processing, and graph learning, and comprehensively review the existing empirical methods into three main categories according to their objectives.

...read moreread less

Posted Content

Emerging Properties in Self-Supervised Vision Transformers

Mathilde Caron, +7 more

- 29 Apr 2021 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: In this paper, self-supervised learning provides new properties to Vision Transformer (ViT) that stand out compared to convolutional networks (convnets) beyond the fact that adapting selfsupervised methods to this architecture works particularly well, they make the following observations: first, self-vised ViT features contain explicit information about the semantic segmentation of an image, which does not emerge as clearly with supervised ViTs, nor with convnets.

...read moreread less

Journal ArticleDOI

Artificial intelligence and machine learning for medical imaging: A technology review.

Ana M. Barragan-Montero, +13 more

- 01 Mar 2021 -

Physica Medica

TL;DR: Artificial intelligence (AI) has recently become a very popular buzzword, as a consequence of disruptive technical advances and impressive experimental results, notably in the field of image analysis and processing as discussed by the authors.

...read moreread less

Journal ArticleDOI

Review on self-supervised image recognition using deep neural networks

Kriti Ohri, +1 more

- 19 Jul 2021 -

Knowledge Based Systems

TL;DR: Self-supervised learning as discussed by the authors is a form of unsupervised deep learning that allows the network to learn rich visual features that help in performing downstream computer vision tasks such as image classification, object detection, and image segmentation.

...read moreread less

Posted ContentDOI

Self-Supervised Deep-Learning Encodes High-Resolution Features of Protein Subcellular Localization

Hirofumi Kobayashi, +3 more

- 29 Mar 2021 -

bioRxiv

TL;DR: In this article, a deep learning-based approach for fully self-supervised protein localization profiling and clustering is presented, which does not require pre-existing knowledge, categories, or annotations.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Posted Content

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Colin Raffel, +8 more

- 23 Oct 2019 -

arXiv: Learning

TL;DR: This systematic study compares pre-training objectives, architectures, unlabeled datasets, transfer approaches, and other factors on dozens of language understanding tasks and achieves state-of-the-art results on many benchmarks covering summarization, question answering, text classification, and more.

...read moreread less

Proceedings ArticleDOI

Extracting and composing robust features with denoising autoencoders

Pascal Vincent, +3 more

TL;DR: This work introduces and motivate a new training principle for unsupervised learning of a representation based on the idea of making the learned representations robust to partial corruption of the input pattern.

...read moreread less

Posted Content

EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks

Mingxing Tan, +1 more

- 28 May 2019 -

arXiv: Learning

TL;DR: A new scaling method is proposed that uniformly scales all dimensions of depth/width/resolution using a simple yet highly effective compound coefficient and is demonstrated the effectiveness of this method on scaling up MobileNets and ResNet.

...read moreread less

Posted Content

Representation Learning with Contrastive Predictive Coding

Aaron van den Oord, +2 more

- 10 Jul 2018 -

arXiv: Learning

TL;DR: This work proposes a universal unsupervised learning approach to extract useful representations from high-dimensional data, which it calls Contrastive Predictive Coding, and demonstrates that the approach is able to learn useful representations achieving strong performance on four distinct domains: speech, images, text and reinforcement learning in 3D environments.

...read moreread less

Proceedings ArticleDOI

Dimensionality Reduction by Learning an Invariant Mapping

Raia Hadsell, +2 more

TL;DR: This work presents a method - called Dimensionality Reduction by Learning an Invariant Mapping (DrLIM) - for learning a globally coherent nonlinear function that maps the data evenly to the output manifold.

...read moreread less

Collapse

arXiv: Learning

Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning

Jean-Bastien Grill, +13 more

- 13 Jun 2020 -

arXiv: Learning

Self-supervised Pretraining of Visual Features in the Wild

Citations

Self-supervised Learning: Generative or Contrastive.

Emerging Properties in Self-Supervised Vision Transformers

Artificial intelligence and machine learning for medical imaging: A technology review.

Review on self-supervised image recognition using deep neural networks

Self-Supervised Deep-Learning Encodes High-Resolution Features of Protein Subcellular Localization

References

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Extracting and composing robust features with denoising autoencoders

EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks

Representation Learning with Contrastive Predictive Coding

Dimensionality Reduction by Learning an Invariant Mapping

Related Papers (5)

Deep Residual Learning for Image Recognition

Momentum Contrast for Unsupervised Visual Representation Learning

ImageNet: A large-scale hierarchical image database

Representation Learning with Contrastive Predictive Coding

Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning