scispace - formally typeset
Open AccessProceedings Article

ReSSL: Relational Self-Supervised Learning with Weak Augmentation

TLDR
Li et al. as mentioned in this paper proposed a relational self-supervised learning (ReSSL) framework, which employs sharpened distribution of pairwise similarities among different instances as \textit{relation} metric, which is thus utilized to match the feature embeddings of different augmentations.
Abstract
Self-supervised Learning (SSL) including the mainstream contrastive learning has achieved great success in learning visual representations without data annotations. However, most of methods mainly focus on the instance level information (\ie, the different augmented images of the same instance should have the same feature or cluster into the same class), but there is a lack of attention on the relationships between different instances. In this paper, we introduced a novel SSL paradigm, which we term as relational self-supervised learning (ReSSL) framework that learns representations by modeling the relationship between different instances. Specifically, our proposed method employs sharpened distribution of pairwise similarities among different instances as \textit{relation} metric, which is thus utilized to match the feature embeddings of different augmentations. Moreover, to boost the performance, we argue that weak augmentations matter to represent a more reliable relation, and leverage momentum strategy for practical efficiency. Experimental results show that our proposed ReSSL significantly outperforms the previous state-of-the-art algorithms in terms of both performance and training efficiency. Code is available at \url{this https URL}.

read more

Citations
More filters
Posted Content

Solo-learn: A Library of Self-supervised Methods for Visual Representation Learning

TL;DR: Solo-learn as discussed by the authors is a library of self-supervised methods for visual representation learning implemented in Python, using pytorch and Pytorch lightning, and it fits both research and industry needs by featuring distributed training pipelines with mixed-precision, faster data loading via Nvidia DALI.
Related Papers (5)