Open AccessJournal Article
Counterfactual reasoning and learning systems: the example of computational advertising
Léon Bottou,Jonas Peters,Joaquin Quiñonero-Candela,Denis X. Charles,D. Max Chickering,Elon Portugaly,Dipankar Ray,Patrice Y. Simard,Ed Snelson +8 more
Reads0
Chats0
TLDR
This work shows how to leverage causal inference to understand the behavior of complex learning systems interacting with their environment and predict the consequences of changes to the system and allow both humans and algorithms to select the changes that would have improved the system performance.Abstract:
This work shows how to leverage causal inference to understand the behavior of complex learning systems interacting with their environment and predict the consequences of changes to the system. Such predictions allow both humans and algorithms to select the changes that would have improved the system performance. This work is illustrated by experiments on the ad placement system associated with the Bing search engine.read more
Citations
More filters
Proceedings Article
Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search
Proceedings Article
Learning Disentangled Representations for CounterFactual Regression
Negar Hassanpour,Russell Greiner +1 more
TL;DR: This work proposes an algorithm to identify disentangled representations of the above-mentioned underlying factors from any given observational dataset D and leverage this knowledge to reduce, as well as account for, the negative impact of selection bias on estimating the treatment effects from D.
Proceedings Article
Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling
TL;DR: In this paper, a marginalized importance sampling (MIS) estimator is proposed to evaluate a new policy using the historical data obtained by different behavior policies under the model of nonstationary episodic Markov Decision Processes (MDP) with a long horizon and a large action space.
Posted Content
Striving for Simplicity in Off-policy Deep Reinforcement Learning
TL;DR: A simple and novel variant of ensemble Q-learning called Random Ensemble Mixture (REM), which enforces optimal Bellman consistency on random convex combinations of the Q-heads of a multi-head Q-network, is presented.
Proceedings Article
A Survey on Semantic Parsing
Aishwarya Kamath,Rajarshi Das +1 more
TL;DR: This survey examines the various components of a semantic parsing system and discusses prominent work ranging from the initial rule based methods to the current neural approaches to program synthesis.
References
More filters
Book
Reinforcement Learning: An Introduction
TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.
MonographDOI
Causality: models, reasoning, and inference
TL;DR: The art and science of cause and effect have been studied in the social sciences for a long time as mentioned in this paper, see, e.g., the theory of inferred causation, causal diagrams and the identification of causal effects.
Journal ArticleDOI
Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning
TL;DR: This article presents a general class of associative reinforcement learning algorithms for connectionist networks containing stochastic units that are shown to make weight adjustments in a direction that lies along the gradient of expected reinforcement in both immediate-reinforcement tasks and certain limited forms of delayed-reInforcement tasks, and they do this without explicitly computing gradient estimates.
Book
Introduction to Reinforcement Learning
TL;DR: In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning.