Open Access · Posted Content

VICReg: Variance-Invariance-Covariance Regularization for Self-Supervised Learning

TLDR
VICReg, as discussed by the authors, explicitly avoids representation collapse with a variance term on the embeddings, combines it with a decorrelation mechanism based on redundancy reduction and covariance regularization, and achieves results on par with the state of the art on several downstream tasks.
Abstract
Recent self-supervised methods for image representation learning are based on maximizing the agreement between embedding vectors from different views of the same image. A trivial solution is obtained when the encoder outputs constant vectors. This collapse problem is often avoided through implicit biases in the learning architecture, which often lack a clear justification or interpretation. In this paper, we introduce VICReg (Variance-Invariance-Covariance Regularization), a method that explicitly avoids the collapse problem with a simple regularization term on the variance of the embeddings along each dimension individually. VICReg combines the variance term with a decorrelation mechanism based on redundancy reduction and covariance regularization, and achieves results on par with the state of the art on several downstream tasks. In addition, we show that incorporating our new variance term into other methods helps stabilize the training and leads to performance improvements.
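As a rough illustration of the three terms described in the abstract, here is a minimal PyTorch-style sketch of a VICReg-like loss. The coefficient values follow the paper's reported defaults, but the projector architecture and training loop are omitted, and details may differ from the official implementation.

```python
import torch
import torch.nn.functional as F

def vicreg_loss(z_a, z_b, sim_coeff=25.0, std_coeff=25.0, cov_coeff=1.0, eps=1e-4):
    """Sketch of the VICReg objective for two batches of embeddings
    z_a, z_b of shape (N, D), one per view of the same images."""
    N, D = z_a.shape

    # Invariance: mean-squared error between the two views' embeddings.
    inv_loss = F.mse_loss(z_a, z_b)

    # Variance: hinge loss keeping the std of each dimension above 1.
    std_a = torch.sqrt(z_a.var(dim=0) + eps)
    std_b = torch.sqrt(z_b.var(dim=0) + eps)
    var_loss = torch.mean(F.relu(1.0 - std_a)) + torch.mean(F.relu(1.0 - std_b))

    # Covariance: push off-diagonal covariance entries toward zero,
    # decorrelating the embedding dimensions (redundancy reduction).
    z_a = z_a - z_a.mean(dim=0)
    z_b = z_b - z_b.mean(dim=0)
    cov_a = (z_a.T @ z_a) / (N - 1)
    cov_b = (z_b.T @ z_b) / (N - 1)
    off_a = cov_a - torch.diag(torch.diag(cov_a))
    off_b = cov_b - torch.diag(torch.diag(cov_b))
    cov_loss = off_a.pow(2).sum() / D + off_b.pow(2).sum() / D

    return sim_coeff * inv_loss + std_coeff * var_loss + cov_coeff * cov_loss
```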


Citations
Posted Content

Decoupled Contrastive Learning

TL;DR: The authors proposed a decoupled contrastive objective function for self-supervised learning (SSL), which treats two augmented views of the same image as a positive pair and all other samples as negatives to be pushed further apart, removing the positive pair from the denominator of the standard contrastive loss.
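A minimal sketch of the decoupling idea, assuming L2-normalized embeddings and a simplified cross-view InfoNCE form (the paper's full loss also uses within-view negatives and a weighting function, both omitted here):

```python
import torch
import torch.nn.functional as F

def decoupled_contrastive_loss(z1, z2, temperature=0.1):
    """Sketch of a decoupled InfoNCE loss for two batches of embeddings
    z1, z2 of shape (N, D): the positive pair is excluded from the
    denominator, decoupling the positive and negative terms."""
    z1, z2 = F.normalize(z1, dim=1), F.normalize(z2, dim=1)
    sim = z1 @ z2.T / temperature              # (N, N) cross-view similarities
    pos = torch.diag(sim)                      # positive pairs on the diagonal
    # Mask out the positive term before summing over the negatives.
    diag_mask = torch.eye(len(z1), dtype=torch.bool, device=z1.device)
    neg = torch.logsumexp(sim.masked_fill(diag_mask, float('-inf')), dim=1)
    return (neg - pos).mean()
```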
Posted Content

3D Infomax improves GNNs for Molecular Property Prediction

TL;DR: In this article, the authors proposed pre-training a model to reason about the geometry of molecules given only their 2D molecular graphs, by maximizing the mutual information between 3D summary vectors and the representations of a Graph Neural Network (GNN) so that those representations contain latent 3D information.
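The mutual-information maximization can be instantiated with a contrastive objective; below is a hedged NT-Xent-style sketch in which matching 2D/3D pairs of the same molecule are positives and all other pairs in the batch are negatives. The function name and shapes are illustrative, not the paper's code.

```python
import torch
import torch.nn.functional as F

def ntxent_2d_3d(h_2d, h_3d, temperature=0.1):
    """Sketch of a contrastive MI lower bound between a 2D GNN's graph
    representations h_2d and 3D summary vectors h_3d, both (N, D)."""
    h_2d, h_3d = F.normalize(h_2d, dim=1), F.normalize(h_3d, dim=1)
    logits = h_2d @ h_3d.T / temperature          # (N, N) similarities
    targets = torch.arange(len(h_2d), device=h_2d.device)  # diagonal = positives
    return F.cross_entropy(logits, targets)
```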
Posted Content

Can contrastive learning avoid shortcut solutions?

TL;DR: This paper proposed implicit feature modification (IFM), a method for altering positive and negative samples in order to guide contrastive models towards capturing a wider variety of predictive features, which improves performance on vision and medical imaging tasks.
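For dot-product similarity, the adversarial direction for a positive or negative embedding is the anchor itself, which makes the modification available in closed form. The sketch below illustrates this idea; the `epsilon` value, shapes, and loss form are assumptions for illustration, not the paper's exact configuration.

```python
import torch
import torch.nn.functional as F

def ifm_infonce(anchor, pos, negs, epsilon=0.1, temperature=0.1):
    """Sketch of implicit-feature-modification-style InfoNCE: perturb the
    positive away from the anchor and the negatives toward it, making
    both harder. anchor: (N, D), pos: (N, D), negs: (N, K, D), all
    assumed L2-normalized."""
    pos_adv = pos - epsilon * anchor                 # harder positives
    negs_adv = negs + epsilon * anchor.unsqueeze(1)  # harder negatives
    l_pos = (anchor * pos_adv).sum(dim=1, keepdim=True) / temperature   # (N, 1)
    l_neg = torch.einsum('nd,nkd->nk', anchor, negs_adv) / temperature  # (N, K)
    logits = torch.cat([l_pos, l_neg], dim=1)
    targets = torch.zeros(len(anchor), dtype=torch.long, device=anchor.device)
    return F.cross_entropy(logits, targets)          # positive sits at index 0
```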
Posted Content

An Empirical Study of Graph Contrastive Learning

TL;DR: In this paper, the authors identify several critical design considerations within a general graph contrastive learning (GCL) paradigm, including augmentation functions, contrasting modes, contrastive objectives, and negative mining techniques, and conduct extensive, controlled experiments over a set of benchmark tasks on datasets across various domains.
Posted Content

AAVAE: Augmentation-Augmented Variational Autoencoders

TL;DR: In this article, the authors introduce augmentation-augmented variational autoencoders (AAVAE), a third approach to self-supervised learning based on autoencoding, which replaces the KL divergence regularization with data augmentations that explicitly encourage the internal representations to encode domain-specific invariances and equivariances.
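A minimal sketch of one training step under this scheme, assuming user-supplied `encoder`, `decoder`, and `augment` callables; the deterministic encoding and plain reconstruction loss stand in for the paper's specific architecture and objective.

```python
import torch
import torch.nn.functional as F

def aavae_step(encoder, decoder, x, augment):
    """Sketch of an augmentation-augmented autoencoder step: the VAE's
    KL regularizer is dropped, and the model must instead reconstruct
    the original image x from an augmented view of it, encouraging
    invariance to the chosen augmentations."""
    x_aug = augment(x)            # e.g. random crop + color jitter
    z = encoder(x_aug)            # deterministic encoding; no KL term
    x_hat = decoder(z)
    return F.mse_loss(x_hat, x)   # reconstruct the *un-augmented* input
```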
References
Proceedings Article

Signature Verification using a "Siamese" Time Delay Neural Network

TL;DR: An algorithm for verifying signatures written on a pen-input tablet, based on a novel artificial neural network called a "Siamese" neural network, which consists of two identical sub-networks joined at their outputs.
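A minimal modern sketch of the weight-sharing idea (the original paper uses a time-delay network on pen-trajectory features; the MLP here is just a placeholder):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class Siamese(nn.Module):
    """Two inputs pass through the *same* sub-network (shared weights),
    and the two outputs are compared with a distance function."""
    def __init__(self, in_dim=128, out_dim=32):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, 64), nn.ReLU(), nn.Linear(64, out_dim)
        )

    def forward(self, x1, x2):
        e1, e2 = self.net(x1), self.net(x2)   # identical weights for both inputs
        return F.cosine_similarity(e1, e2)    # high for matching pairs
```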
Proceedings Article

Learning Deep Features for Scene Recognition using Places Database

TL;DR: A new scene-centric database called Places, containing over 7 million labeled pictures of scenes, is introduced along with new methods for comparing the density and diversity of image datasets; Places is shown to be as dense as other scene datasets while offering more diversity.
Posted Content

Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

TL;DR: This paper empirically shows that on the ImageNet dataset large minibatches cause optimization difficulties, but that when these are addressed the trained networks exhibit good generalization, enabling visual recognition models to be trained on internet-scale data with high efficiency.
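The paper's main remedies are the linear learning-rate scaling rule and a gradual warmup phase, sketched below; the function name and defaults are illustrative.

```python
def scaled_lr(base_lr, batch_size, base_batch=256, step=0, warmup_steps=0):
    """Sketch of the linear scaling rule with gradual warmup: scale the
    learning rate proportionally to the minibatch size, and ramp it up
    linearly over the first few epochs of training."""
    target = base_lr * batch_size / base_batch   # linear scaling rule
    if step < warmup_steps:                      # gradual warmup
        return target * (step + 1) / warmup_steps
    return target
```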
Proceedings Article

Sinkhorn Distances: Lightspeed Computation of Optimal Transport

TL;DR: This work smooths the classic optimal transport problem with an entropic regularization term, and shows that the resulting optimum is also a distance which can be computed through Sinkhorn's matrix scaling algorithm at a speed that is several orders of magnitude faster than that of transport solvers.
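A compact NumPy sketch of Sinkhorn's matrix scaling iterations for the entropically regularized problem (no numerical-stability safeguards, which a practical implementation would add):

```python
import numpy as np

def sinkhorn(a, b, C, reg=0.1, n_iters=100):
    """Entropically regularized optimal transport between marginals
    a (n,) and b (m,) under cost matrix C (n, m); returns the plan."""
    K = np.exp(-C / reg)                 # Gibbs kernel
    u = np.ones_like(a)
    for _ in range(n_iters):             # alternate marginal corrections
        v = b / (K.T @ u)
        u = a / (K @ v)
    return u[:, None] * K * v[None, :]   # plan = diag(u) K diag(v)
```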
Proceedings ArticleDOI

Unsupervised Feature Learning via Non-parametric Instance Discrimination

TL;DR: This work formulates the instance-level discrimination intuition as a non-parametric classification problem and uses noise-contrastive estimation to tackle the computational challenges imposed by the large number of instance classes.
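A simplified sketch of the non-parametric instance classifier: each image is its own class, and the classifier's "weights" are the stored embeddings themselves. The full method replaces the exhaustive softmax with noise-contrastive estimation and maintains the memory bank with momentum updates, both omitted here.

```python
import torch
import torch.nn.functional as F

def instance_discrimination_loss(features, indices, memory_bank, temperature=0.07):
    """features: (N, D) current embeddings, indices: (N,) instance ids,
    memory_bank: (num_instances, D) stored L2-normalized embeddings.
    A plain softmax over all instances stands in for NCE for clarity."""
    features = F.normalize(features, dim=1)
    logits = features @ memory_bank.T / temperature   # (N, num_instances)
    return F.cross_entropy(logits, indices)
```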