Open Access · Posted Content

Visual Representation Learning Does Not Generalize Strongly Within the Same Domain

TLDR
This paper tests whether 17 unsupervised, weakly supervised, and fully supervised representation learning approaches correctly infer the generative factors of variation in simple datasets, and observes that all of them struggle to learn the underlying mechanism regardless of supervision signal and architectural bias.
Abstract
An important component for generalization in machine learning is to uncover underlying latent factors of variation as well as the mechanism through which each factor acts in the world. In this paper, we test whether 17 unsupervised, weakly supervised, and fully supervised representation learning approaches correctly infer the generative factors of variation in simple datasets (dSprites, Shapes3D, MPI3D). In contrast to prior robustness work that introduces novel factors of variation during test time, such as blur or other (un)structured noise, we here recompose, interpolate, or extrapolate only existing factors of variation from the training dataset (e.g., small and medium-sized objects during training and large objects during testing). Models that learn the correct mechanism should be able to generalize to this benchmark. In total, we train and test 2000+ models and observe that all of them struggle to learn the underlying mechanism regardless of supervision signal and architectural bias. Moreover, the generalization capabilities of all tested models drop significantly as we move from artificial datasets towards more realistic real-world datasets. Despite their inability to identify the correct mechanism, the models are quite modular, as their ability to infer other in-distribution factors remains fairly stable provided that only a single factor is out-of-distribution. These results point to an important yet understudied problem of learning mechanistic models of observations that can facilitate generalization.
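The recompose/interpolate/extrapolate protocol described in the abstract can be sketched as a simple split over combinations of generative factors, where one factor's extreme values are held out for testing. A minimal sketch follows; the factor names, value ranges, and threshold are illustrative assumptions, not taken from the paper:

```python
# Hedged sketch (not the authors' code): an extrapolation split over
# generative factors, in the spirit of the benchmark described above.
from itertools import product

def extrapolation_split(factors, held_out, threshold):
    """Split all factor combinations so that values of `held_out`
    at or above `threshold` appear only in the test set."""
    train, test = [], []
    for combo in product(*factors.values()):
        sample = dict(zip(factors.keys(), combo))
        (test if sample[held_out] >= threshold else train).append(sample)
    return train, test

# Illustrative factors: 3 shapes x 4 object scales.
factors = {"shape": [0, 1, 2], "scale": [0.5, 0.7, 0.9, 1.1]}
train, test = extrapolation_split(factors, held_out="scale", threshold=0.9)
# Training covers small/medium scales only; test contains only large scales.
```

A model that has learned the mechanism for the "scale" factor should still infer it correctly on the held-out large-scale test set.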


Citations
Journal Article (DOI)

How we learn: why brains learn better than any machine … for now: by Stanislas Dehaene, New York, Viking, 2020, £20.15 (Hardback), £12.23 (Paperback), ISBN: 9780525559887

TL;DR: This review of Dehaene's book opens with a quote from MIT President L. Rafael Reif: "If we don't know how to learn, we can't learn how to adapt."
Proceedings Article (DOI)

Assaying Out-Of-Distribution Generalization in Transfer Learning

TL;DR: This work surveys previous research, empirically resolving discrepancies among its claims, and provides recommendations on how to measure and improve model robustness, giving broader insight into the sometimes contradictory statements on OOD robustness in prior work.
Journal Article (DOI)

Controlled Generation of Unseen Faults for Partial and Open-Partial Domain Adaptation

TL;DR: A new framework for Partial and Open-Partial domain adaptation is proposed, based on generating distinct fault signatures with a Wasserstein GAN; it is especially suited to extreme domain adaptation settings that are particularly relevant in the context of complex and safety-critical systems.
Proceedings Article (DOI)

Disentangling with Biological Constraints: A Theory of Functional Cell Types

TL;DR: This work mathematically proves that simple biological constraints on neurons, namely nonnegativity and energy constraints on both activity and weights, promote the sought-after disentangled representations by forcing neurons to become selective for single factors of task variation.
Journal Article (DOI)

Identifiability of deep generative models under mixture priors without auxiliary information

TL;DR: This work proves identifiability of a broad class of deep latent variable models that (a) have universal approximation capabilities and (b) are the decoders of variational autoencoders that are commonly used in practice.
References
Proceedings Article (DOI)

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors propose a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won 1st place in the ILSVRC 2015 classification task.
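The core idea summarized here, learning a residual function F(x) that is added back to the input (y = F(x) + x), can be illustrated with a minimal sketch. The toy linear-plus-ReLU F below is an assumption for illustration only, not the paper's convolutional architecture:

```python
# Hedged sketch of the residual-learning idea y = F(x) + x,
# not the paper's implementation; F here is a toy transform.
import numpy as np

def residual_block(x, weight):
    """Apply a toy residual block: output = F(x) + x,
    where F is a linear map followed by ReLU."""
    fx = np.maximum(weight @ x, 0.0)  # F(x): linear + ReLU
    return fx + x                     # skip connection adds the input back

x = np.ones(3)
w = np.zeros((3, 3))  # with F(x) = 0, the block reduces to the identity
y = residual_block(x, w)
# y equals x: the skip connection makes the identity easy to represent
```

The zero-weight case shows why residual blocks ease training of very deep networks: a block only needs to learn the deviation from the identity mapping, not the mapping itself.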
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: The authors achieve state-of-the-art ImageNet classification performance with a deep convolutional neural network consisting of five convolutional layers, some of which are followed by max-pooling layers, and three fully connected layers with a final 1000-way softmax.
Proceedings Article (DOI)

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Journal Article (DOI)

Deep learning

TL;DR: Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.
Book

The Nature of Statistical Learning Theory

TL;DR: Covers the setting of the learning problem, the consistency of learning processes, bounds on the rate of convergence of learning processes, controlling the generalization ability of learning processes, constructing learning algorithms, and what is important in learning theory.
Trending Questions (1)
How do visual, physical model/representation facilitate learning and understanding of the complex concept?

The paper explores the problem of learning mechanistic models of observations to facilitate generalization in machine learning.