Open Access · Posted Content

VICReg: Variance-Invariance-Covariance Regularization for Self-Supervised Learning

TLDR
VICReg combines an explicit variance term, which prevents representation collapse, with a decorrelation mechanism based on redundancy reduction and covariance regularization, and achieves results on par with the state of the art on several downstream tasks.
Abstract
Recent self-supervised methods for image representation learning are based on maximizing the agreement between embedding vectors from different views of the same image. A trivial solution is obtained when the encoder outputs constant vectors. This collapse problem is often avoided through implicit biases in the learning architecture, which often lack a clear justification or interpretation. In this paper, we introduce VICReg (Variance-Invariance-Covariance Regularization), a method that explicitly avoids the collapse problem with a simple regularization term on the variance of the embeddings along each dimension individually. VICReg combines the variance term with a decorrelation mechanism based on redundancy reduction and covariance regularization, and achieves results on par with the state of the art on several downstream tasks. In addition, we show that incorporating our new variance term into other methods helps stabilize the training and leads to performance improvements.
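
The three terms named in the abstract translate directly into a loss function. The snippet below is a minimal PyTorch sketch written from this description alone; the loss coefficients, the variance target of 1, and the epsilon inside the square root are illustrative assumptions rather than values quoted on this page.

```python
# Minimal sketch of a VICReg-style loss written from the abstract's description.
# Coefficients, the variance target of 1, and eps are illustrative assumptions.
import torch
import torch.nn.functional as F

def vicreg_loss(z_a, z_b, sim_coeff=25.0, std_coeff=25.0, cov_coeff=1.0, eps=1e-4):
    """z_a, z_b: (batch, dim) embeddings of two views of the same batch of images."""
    n, d = z_a.shape

    # Invariance: pull the two views' embeddings of each image together.
    inv_loss = F.mse_loss(z_a, z_b)

    # Variance: hinge loss keeping the standard deviation of every embedding
    # dimension above a target of 1, which prevents collapse to constant vectors.
    std_a = torch.sqrt(z_a.var(dim=0) + eps)
    std_b = torch.sqrt(z_b.var(dim=0) + eps)
    var_loss = F.relu(1.0 - std_a).mean() + F.relu(1.0 - std_b).mean()

    # Covariance: push off-diagonal covariance entries toward zero so that
    # embedding dimensions are decorrelated (the redundancy-reduction mechanism).
    z_a_c = z_a - z_a.mean(dim=0)
    z_b_c = z_b - z_b.mean(dim=0)
    cov_a = (z_a_c.T @ z_a_c) / (n - 1)
    cov_b = (z_b_c.T @ z_b_c) / (n - 1)

    def off_diagonal(m):
        return m - torch.diag(torch.diag(m))

    cov_loss = off_diagonal(cov_a).pow(2).sum() / d + off_diagonal(cov_b).pow(2).sum() / d

    return sim_coeff * inv_loss + std_coeff * var_loss + cov_coeff * cov_loss
```

Note that the variance term acts on each embedding dimension of each view separately, which is what lets the method avoid collapse without comparing negative pairs.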


Citations
Posted Content

Decoupled Contrastive Learning

TL;DR: The authors propose a decoupled contrastive objective for self-supervised learning (SSL), which treats two augmented views of the same image as a positive pair and views of other images as negatives to be pushed apart.
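
As a rough illustration of what a "decoupled" contrastive objective can look like, the sketch below starts from a standard InfoNCE-style loss over two batches of embeddings and drops the positive pair from the denominator; the cross-view-only negatives, cosine similarity, and temperature are assumptions made for this example, not details taken from this page.

```python
# Hedged sketch of a decoupled InfoNCE-style objective, assuming "decoupled"
# means removing the positive pair from the denominator of the contrastive loss.
import torch
import torch.nn.functional as F

def decoupled_contrastive_loss(z_a, z_b, temperature=0.1):
    """z_a, z_b: (batch, dim) embeddings of two augmented views; row i of each
    tensor comes from the same image, so (z_a[i], z_b[i]) is the positive pair."""
    z_a = F.normalize(z_a, dim=1)
    z_b = F.normalize(z_b, dim=1)
    logits = z_a @ z_b.T / temperature            # (batch, batch) similarities
    pos = logits.diagonal()                       # positive-pair similarities

    # Standard InfoNCE takes logsumexp over the full row, positives included;
    # the decoupled variant sums only over the off-diagonal (negative) entries.
    eye = torch.eye(logits.size(0), dtype=torch.bool, device=logits.device)
    neg = logits.masked_fill(eye, float("-inf")).logsumexp(dim=1)

    return (neg - pos).mean()
```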
Posted Content

3D Infomax improves GNNs for Molecular Property Prediction

TL;DR: In this article, the authors proposed pre-training a model to reason about the geometry of molecules given only their 2D molecular graphs by maximizing the mutual information between 3D summary vectors and the representations of a Graph Neural Network (GNN) such that they contain latent 3D information.
Posted Content

Can contrastive learning avoid shortcut solutions?

TL;DR: The authors propose implicit feature modification (IFM), a method for altering positive and negative samples to guide contrastive models towards capturing a wider variety of predictive features, which improves performance on vision and medical imaging tasks.
Posted Content

An Empirical Study of Graph Contrastive Learning

TL;DR: In this paper, the authors identify several critical design considerations within a general GCL paradigm, including augmentation functions, contrasting modes, contrastive objectives, and negative mining techniques, and conduct extensive, controlled experiments over a set of benchmark tasks on datasets across various domains.
Posted Content

AAVAE: Augmentation-Augmented Variational Autoencoders

TL;DR: In this article, the authors introduce augmentation-augmented variational autoencoders (AAVAE), a third approach to self-supervised learning based on autoencoding, which replaces the KL divergence regularization with data augmentations that explicitly encourage the internal representations to encode domain-specific invariances and equivariances.
References
Proceedings ArticleDOI

ClusterFit: Improving Generalization of Visual Representations

TL;DR: ClusterFit improves the robustness of visual representations learned during pre-training by clustering the pre-trained network's features with k-means and re-training the network on a new dataset using the cluster assignments as pseudo-labels.
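
The TL;DR above is essentially a two-step recipe, and the sketch below spells it out; the encoder interfaces, the scikit-learn k-means call, and the training-loop hyperparameters are illustrative placeholders rather than the paper's exact setup.

```python
# Sketch of the two-step recipe described above: cluster features from a
# pre-trained encoder with k-means, then train on the cluster assignments as
# pseudo-labels. Encoder interfaces and hyperparameters are placeholders.
import torch
import torch.nn as nn
from sklearn.cluster import KMeans

@torch.no_grad()
def extract_features(encoder, loader, device="cpu"):
    encoder.eval()
    feats = [encoder(x.to(device)).flatten(1).cpu() for x, _ in loader]
    return torch.cat(feats)

def clusterfit(pretrained_encoder, new_encoder, feat_dim, loader,
               num_clusters=1000, epochs=10, device="cpu"):
    # Step 1: k-means on the pre-trained features -> one pseudo-label per image.
    features = extract_features(pretrained_encoder, loader, device)
    pseudo_labels = torch.as_tensor(
        KMeans(n_clusters=num_clusters).fit_predict(features.numpy()),
        dtype=torch.long,
    )

    # Step 2: train a network (plus a linear head) to predict the pseudo-labels.
    head = nn.Linear(feat_dim, num_clusters).to(device)
    params = list(new_encoder.parameters()) + list(head.parameters())
    opt = torch.optim.SGD(params, lr=0.1, momentum=0.9)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        offset = 0
        for x, _ in loader:  # assumes an unshuffled loader, matching extraction order
            y = pseudo_labels[offset:offset + len(x)].to(device)
            offset += len(x)
            logits = head(new_encoder(x.to(device)).flatten(1))
            opt.zero_grad()
            loss_fn(logits, y).backward()
            opt.step()
    return new_encoder, head
```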
Proceedings Article

Prototypical Contrastive Learning of Unsupervised Representations

TL;DR: Prototypical Contrastive Learning (PCL) introduces prototypes as latent variables to help find the maximum-likelihood estimate of the network parameters in an Expectation-Maximization framework.
Proceedings Article

Unsupervised Deep Learning by Neighbourhood Discovery

TL;DR: In this article, the authors introduce a generic unsupervised deep learning approach to training deep models without the need for any manual label supervision, which progressively discovers sample-anchored/centred neighbourhoods to reason about and learn the underlying class decision boundaries iteratively and cumulatively.
Proceedings ArticleDOI

Learning Representations by Predicting Bags of Visual Words

TL;DR: In this article, a self-supervised approach based on spatially dense image descriptions that encode discrete visual concepts, here called visual words, is proposed to learn perturbation-invariant and context-aware image features.
Posted Content

Understanding self-supervised Learning Dynamics without Contrastive Pairs

TL;DR: DirectPred directly sets the linear predictor based on the statistics of its inputs, without gradient training, and outperforms a linear predictor by 2.5% in 300-epoch training.
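
As one way to make "setting the predictor from the statistics of its inputs" concrete, the sketch below builds a symmetric predictor weight from the eigendecomposition of the embedding correlation matrix; the specific square-root spectral rule is an assumption made for illustration, not a detail quoted on this page.

```python
# Hedged sketch of setting a linear predictor from the statistics of its inputs.
# The spectral rule used here (eigenbasis of the embedding correlation matrix,
# eigenvalues mapped through a square root) is an illustrative assumption.
import torch

@torch.no_grad()
def predictor_from_statistics(z):
    """z: (batch, dim) embeddings that feed the predictor; returns (dim, dim) weights."""
    corr = (z.T @ z) / z.shape[0]                 # correlation matrix of the inputs
    eigvals, eigvecs = torch.linalg.eigh(corr)    # symmetric PSD -> real spectrum
    eigvals = eigvals.clamp(min=0.0)
    # Keep the eigenbasis of the correlation matrix and set the predictor's
    # eigenvalues to the square roots of the input eigenvalues.
    return eigvecs @ torch.diag(eigvals.sqrt()) @ eigvecs.T
```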