Counterfactual reasoning and learning systems: the example of computational advertising

Open AccessJournal Article

Counterfactual reasoning and learning systems: the example of computational advertising

Léon Bottou, +8 more

- 01 Jan 2013 -

Journal of Machine Learning Research

- Vol. 14, Iss: 1, pp 3207-3260

Chats0

TLDR

This work shows how to leverage causal inference to understand the behavior of complex learning systems interacting with their environment and predict the consequences of changes to the system and allow both humans and algorithms to select the changes that would have improved the system performance.

Abstract:

This work shows how to leverage causal inference to understand the behavior of complex learning systems interacting with their environment and predict the consequences of changes to the system. Such predictions allow both humans and algorithms to select the changes that would have improved the system performance. This work is illustrated by experiments on the ad placement system associated with the Bing search engine.

Citations

PDF

Open Access

More filters

Posted Content

Concrete Problems in AI Safety

Dario Amodei, +5 more

- 21 Jun 2016 -

arXiv: Artificial Intelligence

TL;DR: A list of five practical research problems related to accident risk, categorized according to whether the problem originates from having the wrong objective function, an objective function that is too expensive to evaluate frequently, or undesirable behavior during the learning process, are presented.

...read moreread less

Posted Content

Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems

Sergey Levine, +3 more

- 04 May 2020 -

arXiv: Learning

TL;DR: This tutorial article aims to provide the reader with the conceptual tools needed to get started on research on offline reinforcement learning algorithms: reinforcementlearning algorithms that utilize previously collected data, without additional online data collection.

...read moreread less

Proceedings Article

Hidden technical debt in Machine learning systems

D. Sculley, +9 more

TL;DR: It is found it is common to incur massive ongoing maintenance costs in real-world ML systems, and several ML-specific risk factors to account for in system design are explored.

...read moreread less

Journal ArticleDOI

Toward Causal Representation Learning

Bernhard Schölkopf, +6 more

TL;DR: The authors reviewed fundamental concepts of causal inference and related them to crucial open problems of machine learning, including transfer and generalization, thereby assaying how causality can contribute to modern machine learning research.

...read moreread less

Posted Content

WILDS: A Benchmark of in-the-Wild Distribution Shifts

Pang Wei Koh, +22 more

- 14 Dec 2020 -

arXiv: Learning

TL;DR: WILDS is presented, a benchmark of in-the-wild distribution shifts spanning diverse data modalities and applications, and is hoped to encourage the development of general-purpose methods that are anchored to real-world distribution shifts and that work well across different applications and problem settings.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms

Lihong Li, +3 more

TL;DR: In this paper, the authors introduce a replay methodology for contextual bandit algorithm evaluation, which is completely data-driven and very easy to adapt to different applications, and provide provably unbiased evaluations.

...read moreread less

Unbiased Offline Evaluation of Contextual-bandit-based News Article Recommendation Algorithms

Lihong Li, +3 more

TL;DR: This paper introduces a replay methodology for contextual bandit algorithm evaluation that is completely data-driven and very easy to adapt to different applications and can provide provably unbiased evaluations.

...read moreread less

Journal Article

Contextual bandits with similarity information

Aleksandrs Slivkins

- 01 Jan 2014 -

Journal of Machine Learning Research

TL;DR: In this paper, the authors consider similarity information in the setting of contextual bandits, a natural extension of the basic MAB problem where before each round an algorithm is given the context--a hint about the payoffs in this round.

...read moreread less

Book

Cybernetics, Second Edition: or the Control and Communication in the Animal and the Machine

Norbert Wiener

Proceedings ArticleDOI

Overlapping experiment infrastructure: more, better, faster experimentation

Diane Tang, +3 more

TL;DR: Google's overlapping experiment infrastructure is described, and the associated tools and educational processes required to use it effectively are discussed, which can be generalized and applied by any entity interested in using experimentation to improve search engines and other web applications.

...read moreread less

Collapse

Counterfactual reasoning and learning systems: the example of computational advertising

Citations

Concrete Problems in AI Safety

Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems

Hidden technical debt in Machine learning systems

Toward Causal Representation Learning

WILDS: A Benchmark of in-the-Wild Distribution Shifts

References

Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms

Unbiased Offline Evaluation of Contextual-bandit-based News Article Recommendation Algorithms

Contextual bandits with similarity information

Cybernetics, Second Edition: or the Control and Communication in the Animal and the Machine

Overlapping experiment infrastructure: more, better, faster experimentation

Related Papers (5)

Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms

The self-normalized estimator for counterfactual learning

The central role of the propensity score in observational studies for causal effects

A contextual-bandit approach to personalized news article recommendation

Eligibility Traces for Off-Policy Policy Evaluation