Open AccessJournal Article
Counterfactual reasoning and learning systems: the example of computational advertising
Léon Bottou,Jonas Peters,Joaquin Quiñonero-Candela,Denis X. Charles,D. Max Chickering,Elon Portugaly,Dipankar Ray,Patrice Y. Simard,Ed Snelson +8 more
Reads0
Chats0
TLDR
This work shows how to leverage causal inference to understand the behavior of complex learning systems interacting with their environment and predict the consequences of changes to the system and allow both humans and algorithms to select the changes that would have improved the system performance.Abstract:
This work shows how to leverage causal inference to understand the behavior of complex learning systems interacting with their environment and predict the consequences of changes to the system. Such predictions allow both humans and algorithms to select the changes that would have improved the system performance. This work is illustrated by experiments on the ad placement system associated with the Bing search engine.read more
Citations
More filters
Posted Content
Fair Decisions Despite Imperfect Predictions
TL;DR: In this article, the authors propose to directly learn decision policies that maximize utility under fairness constraints and take into account how decisions affect which data is observed in the future, thus, the observed data distribution depends on how decisions are being made.
Proceedings Article
Challenges in the evaluation of conversational search systems
Gustavo Penha,Claudia Hauff +1 more
TL;DR: It is argued that the currently in-use evaluation schemes have critical limitations and simplify the conversational search tasks to a degree that makes it questionable whether the authors can trust the findings they deliver.
Posted Content
Counterfactual Learning from Human Proofreading Feedback for Semantic Parsing.
Carolin Lawrence,Stefan Riezler +1 more
TL;DR: This work is the first to show that semantic parsers can be improved significantly by counterfactual learning from logged human feedback data, and introduces new estimators which can effectively leverage the given feedback.
Proceedings ArticleDOI
Offline Evaluation and Optimization for Interactive Systems
TL;DR: This tutorial gives a review of the basic theory and representative techniques of offline evaluation, to build a user model that simulates user behavior under various contexts, and then evaluate metrics of a system with this simulator.
Proceedings ArticleDOI
Deep Bayesian Bandits: Exploring in Online Personalized Recommendations
Dalin Guo,Sofia Ira Ktena,Pranay Kumar Myana,Ferenc Huszar,Wenzhe Shi,Alykhan Tejani,Michael Kneier,Sourav Das +7 more
TL;DR: In this article, the authors formulate a display advertising recommender as a contextual bandit and implement exploration techniques that require sampling from the posterior distribution of click-through-rates in a computationally tractable manner.
References
More filters
Book
Reinforcement Learning: An Introduction
TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.
MonographDOI
Causality: models, reasoning, and inference
TL;DR: The art and science of cause and effect have been studied in the social sciences for a long time as mentioned in this paper, see, e.g., the theory of inferred causation, causal diagrams and the identification of causal effects.
Journal ArticleDOI
Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning
TL;DR: This article presents a general class of associative reinforcement learning algorithms for connectionist networks containing stochastic units that are shown to make weight adjustments in a direction that lies along the gradient of expected reinforcement in both immediate-reinforcement tasks and certain limited forms of delayed-reInforcement tasks, and they do this without explicitly computing gradient estimates.
Book
Introduction to Reinforcement Learning
TL;DR: In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning.