Open Access Posted Content

Towards Out-Of-Distribution Generalization: A Survey

TLDR
A comprehensive survey of OOD generalization methods, including a formal definition of the OOD problem; the summary of reviewed methods is available at http://out-of-distribution-generalization.com.
Abstract
Classic machine learning methods are built on the $i.i.d.$ assumption that training and testing data are independent and identically distributed. However, in real scenarios the $i.i.d.$ assumption can hardly be satisfied, and the performance of classic machine learning algorithms drops sharply under distribution shifts, which highlights the importance of investigating the Out-of-Distribution (OOD) generalization problem. The OOD generalization problem addresses the challenging setting where the testing distribution is unknown and different from the training distribution. This paper serves as the first effort to systematically and comprehensively discuss the OOD generalization problem, from its definition, methodology, and evaluation to its implications and future directions. Firstly, we provide a formal definition of the OOD generalization problem. Secondly, existing methods are categorized into three parts based on their positions in the whole learning pipeline, namely unsupervised representation learning, supervised model learning, and optimization, and typical methods for each category are discussed in detail. We then demonstrate the theoretical connections between the different categories, and introduce the commonly used datasets and evaluation metrics. Finally, we summarize the literature and raise some future directions for the OOD generalization problem. The summary of OOD generalization methods reviewed in this survey can be found at http://out-of-distribution-generalization.com.
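The formal definition referenced in the abstract is commonly stated as a worst-case risk minimization. A minimal sketch of that formulation, with assumed notation ($\mathcal{F}$ a hypothesis space, $\ell$ a loss function, $\mathcal{P}$ a set of possible test distributions):

```latex
% Worst-case formulation of OOD generalization (sketch; notation assumed):
%   \mathcal{F} : hypothesis space
%   \ell        : loss function
%   \mathcal{P} : set of possible (unseen) test distributions
f^{*} \;=\; \arg\min_{f \in \mathcal{F}} \;
       \max_{P_{te} \in \mathcal{P}} \;
       \mathbb{E}_{(X, Y) \sim P_{te}}\!\left[ \ell\!\left(f(X), Y\right) \right]
```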


Citations
Posted Content

OoD-Bench: Benchmarking and Understanding Out-of-Distribution Generalization Datasets and Algorithms

TL;DR: In this article, the authors identify and measure two distinct kinds of distribution shifts that are ubiquitous in various datasets, and compare various OoD generalization algorithms on a new benchmark dominated by these two distribution shifts.
Posted Content

Deep Long-Tailed Learning: A Survey

TL;DR: A comprehensive survey on deep long-tailed learning can be found in this paper, where the authors group existing long-tailed learning studies into three main categories (i.e., class re-balancing, information augmentation, and module improvement).
Posted Content DOI

Constructing benchmark test sets for biological sequence analysis using independent set algorithms

TL;DR: In this paper, independent set graph algorithms are used to split sequence data into dissimilar training and test sets, such that each test sequence is less than p% identical to any individual training sequence.
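To illustrate the cross-set dissimilarity constraint described above (this is a toy greedy sketch, not the paper's independent-set algorithm; the percent-identity function is a hypothetical stand-in for proper sequence alignment):

```python
def identity(a: str, b: str) -> float:
    """Toy percent identity over aligned positions (illustrative only;
    real pipelines compute identity from proper sequence alignments)."""
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / max(min(len(a), len(b)), 1)

def dissimilar_split(seqs, p=25.0, test_fraction=0.3):
    """Greedy sketch: a sequence joins the test set only if it is < p% identical
    to every sequence already in the training set, and vice versa; sequences
    too similar to both sides are dropped so the cross-set constraint holds."""
    train, test = [], []
    target_test = int(test_fraction * len(seqs))
    for s in seqs:
        if len(test) < target_test and all(identity(s, t) < p for t in train):
            test.append(s)
        elif all(identity(s, t) < p for t in test):
            train.append(s)
    return train, test
```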
Posted Content

Confounder Identification-free Causal Visual Feature Learning

TL;DR: In this article, a Confounder Identification-free Causal Visual Feature Learning (CICF) method is proposed to learn causal features that are free of interference from confounders.
Posted Content

A benchmark with decomposed distribution shifts for 360 monocular depth estimation

TL;DR: In this paper, a distribution shift benchmark for monocular depth estimation is proposed, which decomposes the wider distribution shift of uncontrolled testing on in-the-wild data, to three distinct distribution shifts.
References
Proceedings Article DOI

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Journal Article DOI

Gradient-based learning applied to document recognition

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.
Journal Article DOI

Regression Shrinkage and Selection via the Lasso

TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant, is proposed.
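A short scikit-learn example of the lasso on synthetic data (the alpha value and data are illustrative; scikit-learn solves the penalized form equivalent to the constrained problem described above):

```python
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 20))
true_coef = np.zeros(20)
true_coef[:3] = [2.0, -1.5, 0.5]          # only three informative features
y = X @ true_coef + 0.1 * rng.normal(size=200)

# Penalized form solved by scikit-learn (equivalent to the constrained form):
#   min_w  (1 / (2 * n_samples)) * ||y - Xw||_2^2 + alpha * ||w||_1
model = Lasso(alpha=0.1).fit(X, y)
print(model.coef_)    # most coefficients are shrunk exactly to zero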
Proceedings Article

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.
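A minimal PyTorch sketch of where batch normalization typically sits in a convolutional block (layer sizes here are illustrative, not taken from the paper):

```python
import torch
import torch.nn as nn

# Batch normalization placed after the convolution and before the nonlinearity.
block = nn.Sequential(
    nn.Conv2d(3, 16, kernel_size=3, padding=1, bias=False),
    nn.BatchNorm2d(16),   # normalizes each channel over the mini-batch
    nn.ReLU(),
)

x = torch.randn(8, 3, 32, 32)   # mini-batch of 8 RGB images
print(block(x).shape)           # torch.Size([8, 16, 32, 32])
```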
Proceedings ArticleDOI

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

TL;DR: BERT as mentioned in this paper pre-trains deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.
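A minimal sketch of the fine-tuning setup, using the Hugging Face transformers library (the library and checkpoint name are assumptions for illustration, not part of the original paper): the pre-trained bidirectional encoder plus a single task-specific output layer.

```python
from transformers import BertForSequenceClassification, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
# One additional output layer (a classification head) on top of the pre-trained encoder.
model = BertForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)

inputs = tokenizer("Out-of-distribution generalization is hard.", return_tensors="pt")
logits = model(**inputs).logits
print(logits.shape)   # torch.Size([1, 2])
```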