Open Access Posted Content

Accuracy on the Line: On the Strong Correlation Between Out-of-Distribution and In-Distribution Generalization

TLDR
In this article, the authors empirically show that out-of-distribution performance is strongly correlated with in-distribution performance across a wide range of models and distribution shifts, and provide a candidate theory based on a Gaussian data model that shows how changes in the data covariance arising from distribution shift can affect the observed correlations.
Abstract
For machine learning systems to be reliable, we must understand their performance in unseen, out-of-distribution environments. In this paper, we empirically show that out-of-distribution performance is strongly correlated with in-distribution performance for a wide range of models and distribution shifts. Specifically, we demonstrate strong correlations between in-distribution and out-of-distribution performance on variants of CIFAR-10 & ImageNet, a synthetic pose estimation task derived from YCB objects, satellite imagery classification in FMoW-WILDS, and wildlife classification in iWildCam-WILDS. The strong correlations hold across model architectures, hyperparameters, training set size, and training duration, and are more precise than what is expected from existing domain adaptation theory. To complete the picture, we also investigate cases where the correlation is weaker, for instance some synthetic distribution shifts from CIFAR-10-C and the tissue classification dataset Camelyon17-WILDS. Finally, we provide a candidate theory based on a Gaussian data model that shows how changes in the data covariance arising from distribution shift can affect the observed correlations.
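Below is a minimal simulation sketch of the Gaussian data model idea, not the paper's exact construction: all parameters (dimension, covariances, the sklearn classifier, sample sizes, regularization values) are illustrative assumptions. Labels come from a fixed linear rule, ID and OOD inputs differ only in covariance, and "models" vary in training set size and regularization; the paper itself reports linear fits after a probit transform of accuracies, while this sketch just checks the raw correlation.

```python
# Illustrative sketch (assumptions, not the paper's setup): a Gaussian data
# model where distribution shift changes only the input covariance, and a
# family of classifiers is swept over sample size and regularization.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
d = 50
w_star = rng.normal(size=d)                      # fixed ground-truth linear rule

cov_id = np.eye(d)                               # ID: isotropic covariance
cov_ood = np.diag(rng.uniform(0.2, 3.0, size=d)) # OOD: anisotropically rescaled axes

def sample(n, cov):
    x = rng.multivariate_normal(np.zeros(d), cov, size=n)
    y = (x @ w_star > 0).astype(int)
    return x, y

x_id_test, y_id_test = sample(20_000, cov_id)
x_ood_test, y_ood_test = sample(20_000, cov_ood)

id_acc, ood_acc = [], []
for n_train in [100, 300, 1000, 3000]:           # vary training set size
    for c in [0.01, 0.1, 1.0, 10.0]:             # vary regularization strength
        x_tr, y_tr = sample(n_train, cov_id)
        clf = LogisticRegression(C=c, max_iter=2000).fit(x_tr, y_tr)
        id_acc.append(clf.score(x_id_test, y_id_test))
        ood_acc.append(clf.score(x_ood_test, y_ood_test))

# Under this kind of covariance shift, (ID, OOD) accuracy pairs across the
# model family tend to fall near a single line.
print("correlation(ID, OOD) =", np.corrcoef(id_acc, ood_acc)[0, 1])
```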


Citations
Posted Content

CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP

TL;DR: This paper proposes contrastive leave-one-out boost (CLOOB), which replaces the original embeddings with retrieved embeddings in the InfoLOOB objective, stabilizing that objective.
Posted Content

On a Benefit of Mask Language Modeling: Robustness to Simplicity Bias

TL;DR: The authors theoretically and empirically show that masked language modeling (MLM) pretraining makes models robust to lexicon-level spurious features, and they also explore the efficacy of pretrained masked language models in causal settings.
Posted Content

On the Robustness of Reading Comprehension Models to Entity Renaming

TL;DR: The authors proposed a general and scalable method to replace person names with names from a variety of sources, ranging from common English names to names from other languages to arbitrary strings, and found that this can further improve the robustness of MRC models.
References
Book

Monocular Model-Based 3D Tracking of Rigid Objects: A Survey

TL;DR: This survey reviews the techniques and approaches developed in industry and research for monocular model-based 3D tracking of rigid objects, together with a comprehensive study of the extensive literature on the subject.
Posted Content

The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization

TL;DR: It is found that using larger models and artificial data augmentations can improve robustness on real-world distribution shifts, contrary to claims in prior work.
Proceedings Article

Benchmarking Neural Network Robustness to Common Corruptions and Perturbations

TL;DR: This paper standardizes and expands the corruption robustness topic, while showing which classifiers are preferable in safety-critical applications, and proposes a new dataset called ImageNet-P which enables researchers to benchmark a classifier's robustness to common perturbations.
Proceedings Article

Certified Adversarial Robustness via Randomized Smoothing

TL;DR: In this paper, randomized smoothing is used to obtain an ImageNet classifier with a certified top-1 accuracy of 49% under adversarial perturbations with ℓ2 norm less than 0.5.
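A minimal sketch of the randomized-smoothing prediction rule is shown below. It is illustrative only: the noise level, sample count, and toy base classifier are assumptions, and the statistical certification of an ℓ2 radius from the paper is omitted.

```python
# Sketch of randomized smoothing's prediction step: the smoothed classifier
# g(x) returns the class that the base classifier f most often assigns to
# Gaussian-perturbed copies of x. (Certifying an l2 robustness radius also
# requires a confidence bound on the top class probability; omitted here.)
import numpy as np

def smoothed_predict(f, x, sigma=0.5, n_samples=1000, rng=None):
    """Majority vote of f over n_samples Gaussian perturbations of x."""
    rng = rng or np.random.default_rng(0)
    noise = rng.normal(scale=sigma, size=(n_samples,) + x.shape)
    votes = np.array([f(x + eps) for eps in noise])   # predicted class labels
    classes, counts = np.unique(votes, return_counts=True)
    return classes[np.argmax(counts)]

# Toy base classifier: a fixed linear rule on 2-D inputs.
w = np.array([1.0, -2.0])
f = lambda z: int(z @ w > 0)
print(smoothed_predict(f, np.array([0.3, -0.1])))
```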
Proceedings ArticleDOI

Wild Patterns: Ten Years After the Rise of Adversarial Machine Learning

TL;DR: A thorough overview of the evolution of this research area over the last ten years and beyond, starting from pioneering early work on the security of non-deep-learning algorithms and continuing to more recent work aimed at understanding the security properties of deep learning algorithms, in the context of computer vision and cybersecurity tasks.