Counterfactual reasoning and learning systems: the example of computational advertising

Open AccessJournal Article

Counterfactual reasoning and learning systems: the example of computational advertising

Léon Bottou, +8 more

- 01 Jan 2013 -

Journal of Machine Learning Research

- Vol. 14, Iss: 1, pp 3207-3260

Chats0

TLDR

This work shows how to leverage causal inference to understand the behavior of complex learning systems interacting with their environment and predict the consequences of changes to the system and allow both humans and algorithms to select the changes that would have improved the system performance.

Abstract:

This work shows how to leverage causal inference to understand the behavior of complex learning systems interacting with their environment and predict the consequences of changes to the system. Such predictions allow both humans and algorithms to select the changes that would have improved the system performance. This work is illustrated by experiments on the ad placement system associated with the Bing search engine.

Citations

PDF

Open Access

More filters

Posted Content

Concrete Problems in AI Safety

Dario Amodei, +5 more

- 21 Jun 2016 -

arXiv: Artificial Intelligence

TL;DR: A list of five practical research problems related to accident risk, categorized according to whether the problem originates from having the wrong objective function, an objective function that is too expensive to evaluate frequently, or undesirable behavior during the learning process, are presented.

...read moreread less

Posted Content

Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems

Sergey Levine, +3 more

- 04 May 2020 -

arXiv: Learning

TL;DR: This tutorial article aims to provide the reader with the conceptual tools needed to get started on research on offline reinforcement learning algorithms: reinforcementlearning algorithms that utilize previously collected data, without additional online data collection.

...read moreread less

Proceedings Article

Hidden technical debt in Machine learning systems

D. Sculley, +9 more

TL;DR: It is found it is common to incur massive ongoing maintenance costs in real-world ML systems, and several ML-specific risk factors to account for in system design are explored.

...read moreread less

Journal ArticleDOI

Toward Causal Representation Learning

Bernhard Schölkopf, +6 more

TL;DR: The authors reviewed fundamental concepts of causal inference and related them to crucial open problems of machine learning, including transfer and generalization, thereby assaying how causality can contribute to modern machine learning research.

...read moreread less

Posted Content

WILDS: A Benchmark of in-the-Wild Distribution Shifts

Pang Wei Koh, +22 more

- 14 Dec 2020 -

arXiv: Learning

TL;DR: WILDS is presented, a benchmark of in-the-wild distribution shifts spanning diverse data modalities and applications, and is hoped to encourage the development of general-purpose methods that are anchored to real-world distribution shifts and that work well across different applications and problem settings.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book ChapterDOI

Efficient ranking in sponsored search

Sébastien Lahaie, +1 more

TL;DR: A sufficient condition under which applying an exponent--strictly less than one--to the quality score improves expected efficiency is provided, and this condition holds for a large class of distributions known as natural exponential families, and for the lognormal distribution.

...read moreread less

Posted Content

Sample-efficient Nonstationary Policy Evaluation for Contextual Bandits

Miroslav Dudík, +3 more

- 16 Oct 2012 -

arXiv: Learning

TL;DR: This work presents and proves properties of a new offline policy evaluator for an exploration learning setting which is superior to previous evaluators and simultaneously and correctly incorporates techniques from importance weighting, doubly robust evaluation, and nonstationary policy evaluation approaches.

...read moreread less

Proceedings Article

Sample-efficient nonstationary policy evaluation for contextual bandits

Miroslav Dudík, +3 more

TL;DR: In this paper, a new offline policy evaluator for exploration learning is presented, which simultaneously and correctly incorporates techniques from importance weighting, doubly robust evaluation, and nonstationary policy evaluation approaches.

...read moreread less

Proceedings Article

The do -calculus revisited

Judea Pearl

TL;DR: This talk surveys the usefulness of the do-calculus in three additional areas: mediation analysis, transportability and metasynthesis and surveys these results with emphasis on the challenges posed by meta-synthesis.

...read moreread less

Collapse

Counterfactual reasoning and learning systems: the example of computational advertising

Citations

Concrete Problems in AI Safety

Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems

Hidden technical debt in Machine learning systems

Toward Causal Representation Learning

WILDS: A Benchmark of in-the-Wild Distribution Shifts

References

Efficient ranking in sponsored search

Sample-efficient Nonstationary Policy Evaluation for Contextual Bandits

Sample-efficient nonstationary policy evaluation for contextual bandits

The do -calculus revisited

Related Papers (5)

Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms

The self-normalized estimator for counterfactual learning

The central role of the propensity score in observational studies for causal effects

A contextual-bandit approach to personalized news article recommendation

Eligibility Traces for Off-Policy Policy Evaluation