Accuracy on the Line: On the Strong Correlation Between Out-of-Distribution and In-Distribution Generalization

Open Access | Posted Content

09 Jul 2021

TLDR

The authors empirically show that out-of-distribution performance is strongly correlated with in-distribution performance across a wide range of models and distribution shifts. They also propose a candidate theory, based on a Gaussian data model, that shows how changes in the data covariance arising from distribution shift can affect the observed correlations.

Abstract:

For machine learning systems to be reliable, we must understand their performance in unseen, out-of-distribution environments. In this paper, we empirically show that out-of-distribution performance is strongly correlated with in-distribution performance for a wide range of models and distribution shifts. Specifically, we demonstrate strong correlations between in-distribution and out-of-distribution performance on variants of CIFAR-10 & ImageNet, a synthetic pose estimation task derived from YCB objects, satellite imagery classification in FMoW-WILDS, and wildlife classification in iWildCam-WILDS. The strong correlations hold across model architectures, hyperparameters, training set size, and training duration, and are more precise than what is expected from existing domain adaptation theory. To complete the picture, we also investigate cases where the correlation is weaker, for instance some synthetic distribution shifts from CIFAR-10-C and the tissue classification dataset Camelyon17-WILDS. Finally, we provide a candidate theory based on a Gaussian data model that shows how changes in the data covariance arising from distribution shift can affect the observed correlations.
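The link between a Gaussian data model and a linear accuracy trend can be illustrated with a small simulation. The sketch below is not the authors' setup; it assumes a simplified two-class model in which inputs are drawn from N(y * mu, sigma^2 I), the distribution shift only rescales the covariance (sigma_id to sigma_ood), and each "model" is a hypothetical linear classifier sign(w . x). It further assumes that accuracies are compared after a probit (inverse normal CDF) transform. All function names and parameter values are illustrative.

```python
from statistics import NormalDist

import numpy as np

_STD_NORMAL = NormalDist()  # standard normal, used for the CDF and its inverse


def linear_classifier_accuracy(w, mu, sigma):
    """Closed-form accuracy of sign(w . x) when x ~ N(y * mu, sigma^2 I), y in {-1, +1}."""
    margin = float(w @ mu) / (sigma * float(np.linalg.norm(w)))
    return _STD_NORMAL.cdf(margin)


def simulate_models(n_models=20, dim=10, sigma_id=1.0, sigma_ood=2.0, seed=0):
    """Return (in-distribution, out-of-distribution) accuracies for random linear classifiers."""
    rng = np.random.default_rng(seed)
    mu = np.ones(dim) / np.sqrt(dim)  # unit-norm class mean (assumed)
    id_acc, ood_acc = [], []
    for _ in range(n_models):
        # Hypothetical "models": the ideal direction mu plus noise of varying strength.
        noise_scale = rng.uniform(0.1, 2.0)
        w = mu + noise_scale * rng.standard_normal(dim) / np.sqrt(dim)
        id_acc.append(linear_classifier_accuracy(w, mu, sigma_id))
        ood_acc.append(linear_classifier_accuracy(w, mu, sigma_ood))
    return id_acc, ood_acc


def fit_probit_line(id_acc, ood_acc):
    """Fit a line to probit-transformed accuracies; return (slope, intercept, Pearson r)."""
    x = np.array([_STD_NORMAL.inv_cdf(a) for a in id_acc])
    y = np.array([_STD_NORMAL.inv_cdf(a) for a in ood_acc])
    slope, intercept = np.polyfit(x, y, 1)
    r = np.corrcoef(x, y)[0, 1]
    return float(slope), float(intercept), float(r)


if __name__ == "__main__":
    id_acc, ood_acc = simulate_models()
    slope, intercept, r = fit_probit_line(id_acc, ood_acc)
    print(f"slope={slope:.4f} intercept={intercept:.4f} r={r:.6f}")
```

Under these assumptions, the probit of the shifted accuracy equals sigma_id / sigma_ood times the probit of the in-distribution accuracy, so every classifier falls on one line through the origin. Shifts that change the covariance in other ways need not preserve this relationship, which is consistent with the weaker correlations the abstract reports for some datasets.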

Citations

CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP

On a Benefit of Mask Language Modeling: Robustness to Simplicity Bias

On the Robustness of Reading Comprehension Models to Entity Renaming
