Open Access · Posted Content

Accuracy on the Line: On the Strong Correlation Between Out-of-Distribution and In-Distribution Generalization

TLDR
In this article, the authors empirically show that out-of-distribution performance is strongly correlated with in-distribution performance for a wide range of models and distribution shifts, and provide a candidate theory based on a Gaussian data model that shows how changes in the data covariance arising from distribution shift can affect the observed correlations.
Abstract
For machine learning systems to be reliable, we must understand their performance in unseen, out-of-distribution environments. In this paper, we empirically show that out-of-distribution performance is strongly correlated with in-distribution performance for a wide range of models and distribution shifts. Specifically, we demonstrate strong correlations between in-distribution and out-of-distribution performance on variants of CIFAR-10 & ImageNet, a synthetic pose estimation task derived from YCB objects, satellite imagery classification in FMoW-WILDS, and wildlife classification in iWildCam-WILDS. The strong correlations hold across model architectures, hyperparameters, training set size, and training duration, and are more precise than what is expected from existing domain adaptation theory. To complete the picture, we also investigate cases where the correlation is weaker, for instance some synthetic distribution shifts from CIFAR-10-C and the tissue classification dataset Camelyon17-WILDS. Finally, we provide a candidate theory based on a Gaussian data model that shows how changes in the data covariance arising from distribution shift can affect the observed correlations.
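
The paper's headline finding is that, once both axes are probit-scaled (mapped through the inverse Gaussian CDF), out-of-distribution accuracy is an approximately linear function of in-distribution accuracy across models. The sketch below shows how such a fit can be computed; the accuracy arrays are hypothetical placeholders, and this is an illustration rather than the authors' released code.

    # Minimal sketch of a probit-scaled linear fit between ID and OOD accuracy.
    # The accuracy values below are hypothetical, not from the paper.
    import numpy as np
    from scipy.stats import norm, pearsonr

    id_acc = np.array([0.85, 0.90, 0.93, 0.95, 0.97])   # in-distribution test accuracy
    ood_acc = np.array([0.60, 0.68, 0.74, 0.79, 0.84])  # out-of-distribution test accuracy

    # Probit scaling: inverse Gaussian CDF applied to accuracies in (0, 1).
    id_probit = norm.ppf(id_acc)
    ood_probit = norm.ppf(ood_acc)

    # Linear fit and correlation on the probit-scaled axes.
    slope, intercept = np.polyfit(id_probit, ood_probit, deg=1)
    r, _ = pearsonr(id_probit, ood_probit)
    print(f"slope={slope:.2f}, intercept={intercept:.2f}, R^2={r**2:.3f}")

Probit scaling matters because raw accuracies are bounded in [0, 1], so a fit on raw values would curve near the endpoints; on probit axes the paper reports very tight linear fits across its benchmarks.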


Citations
Posted Content

CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP

TL;DR: This paper proposes Contrastive Leave-One-Out Boost (CLOOB), which replaces the original embeddings with retrieved embeddings in the InfoLOOB objective, thereby stabilizing it.
Posted Content

On a Benefit of Mask Language Modeling: Robustness to Simplicity Bias

TL;DR: The authors theoretically and empirically show that MLM pretraining makes models robust to lexicon-level spurious features, and they also explore the efficacy of pretrained masked language models in causal settings.
Posted Content

On the Robustness of Reading Comprehension Models to Entity Renaming

TL;DR: The authors proposed a general and scalable method to replace person names with names from a variety of sources, ranging from common English names to names from other languages to arbitrary strings, and found that this can further improve the robustness of MRC models.
References
More filters
Posted Content

Do Image Classifiers Generalize Across Time?

TL;DR: This work systematically analyzes the robustness of image classifiers to temporal perturbations in videos by constructing two new datasets, ImageNet-Vid-Robust and YTBB-Robust, containing a total of 57,897 images grouped into 3,139 sets of perceptually similar images.
Proceedings Article

Learning Transferable Visual Models From Natural Language Supervision

TL;DR: In this paper, a pre-training task of predicting which caption goes with which image is used to learn SOTA image representations from scratch on a dataset of 400 million (image, text) pairs collected from the internet.
Proceedings Article

In Search of Lost Domain Generalization

TL;DR: This paper introduces DomainBed, a testbed for domain generalization comprising seven benchmarks, fourteen algorithms, and three model selection criteria; when carefully implemented and tuned, ERM outperforms the state of the art in average performance.
Proceedings Article

Cold Case: The Lost MNIST Digits

TL;DR: The authors reconstruct the MNIST dataset from its NIST source and its rich metadata, such as writer and partition identifiers, rebuild the complete MNIST test set with 60,000 samples instead of the usual 10,000, and investigate the impact of twenty-five years of MNIST experiments on the reported testing performances.
Proceedings Article

WILDS: A Benchmark of in-the-Wild Distribution Shifts

TL;DR: This paper presents WILDS, a curated collection of eight benchmark datasets reflecting distribution shifts that naturally arise in real-world applications, such as shifts across hospitals for tumor identification, across camera traps for wildlife monitoring, and across time and location in satellite imaging and poverty mapping.