scispace - formally typeset
Open AccessPosted Content

Invariant Risk Minimization

Reads0
Chats0
TLDR
This work introduces Invariant Risk Minimization, a learning paradigm to estimate invariant correlations across multiple training distributions and shows how the invariances learned by IRM relate to the causal structures governing the data and enable out-of-distribution generalization.
Abstract
We introduce Invariant Risk Minimization (IRM), a learning paradigm to estimate invariant correlations across multiple training distributions. To achieve this goal, IRM learns a data representation such that the optimal classifier, on top of that data representation, matches for all training distributions. Through theory and experiments, we show how the invariances learned by IRM relate to the causal structures governing the data and enable out-of-distribution generalization.

read more

Citations
More filters
Proceedings ArticleDOI

Regularization Penalty Optimization for Addressing Data Quality Variance in OoD Algorithms

TL;DR: This paper theoretically reveals the relationship between training data quality and algorithm performance, and analyzes the optimal regularization scheme for Lipschitz regularized invariant risk minimization for Out-of-Distribution generalization.
Journal ArticleDOI

When Neural Networks Fail to Generalize? A Model Sensitivity Perspective

TL;DR: In this article , the authors propose a novel strategy of Spectral Adversarial Data Augmentation (SADA) to generate augmented images targeted at the highly sensitive frequencies, which can effectively suppress the sensitivity in the frequency space.
Journal ArticleDOI

Causal Interventional Training for Image Recognition

TL;DR: In this article , a structural causal model (SCM) is proposed for the key variables involved in dataset collection and recognition model: object, common sense, bias, context, and label prediction.
Journal ArticleDOI

Counterfactual Learning on Graphs: A Survey

TL;DR: Recently, counterfactual learning on graphs has shown promising results in alleviating the drawbacks of GNNs as mentioned in this paper , such as lacking interpretability, can inherit the bias of the training data and cannot model the casual relations.
Posted Content

Causal Analysis of Agent Behavior for AI Safety.

TL;DR: In this article, a methodology for investigating the causal mechanisms that drive the behavior of artificial agents is presented, and six use cases are covered, each addressing a typical question an analyst might ask about an agent, and each question cannot be addressed by pure observation alone, but instead requires conducting experiments with systematically chosen manipulations so as to generate the correct causal evidence.
References
More filters
Posted Content

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

TL;DR: A new language representation model, BERT, designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.

Statistical learning theory

TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of learning process, the author covers function estimates from small data pools, applying these estimations to real-life problems, and much more.
MonographDOI

Causality: models, reasoning, and inference

TL;DR: The art and science of cause and effect have been studied in the social sciences for a long time as mentioned in this paper, see, e.g., the theory of inferred causation, causal diagrams and the identification of causal effects.
Journal ArticleDOI

Estimating causal effects of treatments in randomized and nonrandomized studies.

TL;DR: A discussion of matching, randomization, random sampling, and other methods of controlling extraneous variation is presented in this paper, where the objective is to specify the benefits of randomization in estimating causal effects of treatments.
Book

Introduction to Smooth Manifolds

TL;DR: In this paper, a review of topology, linear algebra, algebraic geometry, and differential equations is presented, along with an overview of the de Rham Theorem and its application in calculus.
Related Papers (5)