Supervised Contrastive Learning.

Open Access, Posted Content

- 23 Apr 2020 -

TLDR

In this paper, the authors extend the self-supervised batch contrastive approach to the fully supervised setting, allowing them to effectively leverage label information. On ResNet-200 they achieve 81.4% top-1 accuracy on ImageNet, 0.8% above the best number previously reported for this architecture.

Abstract:

Contrastive learning applied to self-supervised representation learning has seen a resurgence in recent years, leading to state-of-the-art performance in the unsupervised training of deep image models. Modern batch contrastive approaches subsume or significantly outperform traditional contrastive losses such as triplet, max-margin and the N-pairs loss. In this work, we extend the self-supervised batch contrastive approach to the fully-supervised setting, allowing us to effectively leverage label information. Clusters of points belonging to the same class are pulled together in embedding space, while clusters of samples from different classes are simultaneously pushed apart. We analyze two possible versions of the supervised contrastive (SupCon) loss, identifying the best-performing formulation of the loss. On ResNet-200, we achieve top-1 accuracy of 81.4% on the ImageNet dataset, which is 0.8% above the best number reported for this architecture. We show consistent outperformance over cross-entropy on other datasets and two ResNet variants. The loss shows benefits for robustness to natural corruptions and is more stable to hyperparameter settings such as optimizers and data augmentations. Our loss function is simple to implement, and reference TensorFlow code is released at this https URL.
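To make the pull-together, push-apart idea in the abstract concrete, here is a minimal sketch of a supervised contrastive loss in plain Python. It is not the authors' reference TensorFlow code. The abstract does not spell out the formula, so this sketch assumes one commonly used form: for each anchor, the log-probability of each same-class sample (under a softmax over all other samples in the batch) is averaged outside the logarithm. The function names `l2_normalize` and `supcon_loss`, the temperature value 0.1, and the toy data are all illustrative assumptions.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length so dot products are cosine similarities."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def supcon_loss(embeddings, labels, temperature=0.1):
    """Hypothetical SupCon-style loss (assumed form, averaged over anchors).

    For anchor i, positives are all other samples sharing its label.
    Anchors with no positive in the batch are skipped.
    """
    z = [l2_normalize(v) for v in embeddings]
    n = len(z)
    total, anchors = 0.0, 0
    for i in range(n):
        # Temperature-scaled similarity of the anchor to every other sample.
        logits = {
            a: sum(x * y for x, y in zip(z[i], z[a])) / temperature
            for a in range(n)
            if a != i
        }
        positives = [a for a in logits if labels[a] == labels[i]]
        if not positives:
            continue
        # Numerically stable log-sum-exp over all non-anchor samples.
        m = max(logits.values())
        log_denom = m + math.log(sum(math.exp(s - m) for s in logits.values()))
        # Average negative log-probability of the positives.
        total += -sum(logits[p] - log_denom for p in positives) / len(positives)
        anchors += 1
    return total / anchors if anchors else 0.0


# Toy batch: two tight clusters in 2-D.
points = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
aligned = supcon_loss(points, [0, 0, 1, 1])    # labels agree with the clusters
misaligned = supcon_loss(points, [0, 1, 0, 1])  # labels cut across the clusters
```

In this toy batch, `aligned` comes out far smaller than `misaligned`: the loss is low when same-class embeddings already sit close together and high when the labels cut across the clusters, which is the behavior the abstract describes.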

Citations

Posted Content

Unsupervised Learning of Visual Features by Contrasting Cluster Assignments

Mathilde Caron, +5 more

- 17 Jun 2020 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper proposes an online algorithm, SwAV, that takes advantage of contrastive methods without requiring pairwise comparisons to be computed. It uses a swapped prediction mechanism, predicting the cluster assignment of one view from the representation of another view.

Posted Content

A Survey on Contrastive Self-supervised Learning

Ashish Jaiswal, +4 more

- 31 Oct 2020 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper provides an extensive review of self-supervised methods that follow the contrastive approach, explaining commonly used pretext tasks in a contrastive learning setup, followed by different architectures that have been proposed so far.

Journal Article (DOI)

Computer Vision and Image Understanding

Sumeet Menon, +1 more

- 01 Jan 2022 -

Social Science Research Network

TL;DR: Zhang et al. introduce methods to differentiate posed expressions from spontaneous ones by capturing the global spatial patterns embedded in each, incorporating gender and expression categories as privileged information during spatial pattern modeling.

Proceedings Article

Hard Negative Mixing for Contrastive Learning

Yannis Kalantidis, +4 more

TL;DR: The authors argue that an important aspect of contrastive learning, the effect of hard negatives, has so far been neglected, and propose hard negative mixing strategies at the feature level that can be computed on the fly with minimal computational overhead.

Proceedings Article (DOI)

Self-supervised Graph Learning for Recommendation

Jiancan Wu, +6 more

- 21 Oct 2020 -

arXiv: Information Retrieval

TL;DR: This work explores self-supervised learning on the user-item graph to improve the accuracy and robustness of GCNs for recommendation. The approach is implemented on the state-of-the-art model LightGCN and is able to mine hard negatives automatically.

References

Proceedings Article (DOI)

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

TL;DR: The authors propose a residual learning framework that eases the training of networks substantially deeper than those used previously; the resulting networks won 1st place on the ILSVRC 2015 classification task.

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: The authors achieve state-of-the-art performance with a deep convolutional neural network consisting of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

Proceedings Article (DOI)

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

Proceedings Article (DOI)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin, +3 more

TL;DR: BERT pre-trains deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers. The pre-trained model can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.

Related Papers (5)

A Simple Framework for Contrastive Learning of Visual Representations

Deep Residual Learning for Image Recognition

Representation Learning with Contrastive Predictive Coding

Momentum Contrast for Unsupervised Visual Representation Learning

Unsupervised Feature Learning via Non-parametric Instance Discrimination