Bayesian regression tree models for causal inference: regularization, confounding, and heterogeneous effects

Open Access Posted Content

- 29 Jun 2017 -

TLDR

This article proposes a Bayesian causal forest model for estimating heterogeneous treatment effects from observational data, geared specifically towards situations with small effect sizes, heterogeneous effects, and strong confounding.

Abstract:

This paper presents a novel nonlinear regression model for estimating heterogeneous treatment effects from observational data, geared specifically towards situations with small effect sizes, heterogeneous effects, and strong confounding. Standard nonlinear regression models, which may work quite well for prediction, have two notable weaknesses when used to estimate heterogeneous treatment effects. First, they can yield badly biased estimates of treatment effects when fit to data with strong confounding. The Bayesian causal forest model presented in this paper avoids this problem by directly incorporating an estimate of the propensity function in the specification of the response model, implicitly inducing a covariate-dependent prior on the regression function. Second, standard approaches to response surface modeling do not provide adequate control over the strength of regularization over effect heterogeneity. The Bayesian causal forest model permits treatment effect heterogeneity to be regularized separately from the prognostic effect of control variables, making it possible to informatively "shrink to homogeneity". We illustrate these benefits via the reanalysis of an observational study assessing the causal effects of smoking on medical expenditures as well as extensive simulation studies.
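
As a sketch of the structure the abstract describes, the response model can be written as below. The notation is introduced here for illustration and does not appear in the abstract: $Y_i$ is the outcome, $z_i \in \{0, 1\}$ the treatment indicator, $x_i$ the covariates, $\hat{\pi}(x_i)$ the estimated propensity score, $\mu$ the prognostic function, and $\tau$ the treatment effect function. The additive Gaussian error term is an assumption made for concreteness.

```latex
\begin{aligned}
Y_i &= \mu\bigl(x_i, \hat{\pi}(x_i)\bigr) + \tau(x_i)\, z_i + \varepsilon_i,
  \qquad \varepsilon_i \sim \mathcal{N}(0, \sigma^2), \\
\hat{\pi}(x_i) &\approx \Pr(Z_i = 1 \mid x_i).
\end{aligned}
```

Under this form, the estimated propensity score enters only the prognostic term $\mu$, and $\mu$ and $\tau$ can be given separate regularization priors, so that $\tau$ can be shrunk toward a constant (a homogeneous effect) without restricting how flexibly $\mu$ is fit.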

Citations

Proceedings Article

What is Causal Inference

Judea Pearl

Abstract: This paper reviews a theory of causal inference based on the Structural Causal Model (SCM) described in (Pearl, 2000a). The theory unifies the graphical, potential-outcome (Neyman-Rubin), decision analytical, and structural equation approaches to causation, and provides both a mathematical foundation and a friendly calculus for the analysis of causes and counterfactuals. In particular, the paper establishes a methodology for inferring (from a combination of data and assumptions) the answers to three types of causal queries: (1) queries about the effect of potential interventions, (2) queries about counterfactuals, and (3) queries about the direct (or indirect) effect of one event on another.

Journal Article

A Survey of Learning Causality with Data: Problems and Methods

Ruocheng Guo, +4 more

- 25 Sep 2018 -

arXiv: Artificial Intelligence

TL;DR: This paper presents a comprehensive and structured review of both traditional and frontier methods for learning causality and relations, along with the connections between causal effects and machine learning. The authors also point out, on a case-by-case basis, how big data facilitates, complicates, or motivates each approach.

Posted Content

Estimating Treatment Effects with Causal Forests: An Application

Susan Athey, +1 more

- 20 Feb 2019 -

arXiv: Methodology

TL;DR: The authors apply causal forests to a dataset derived from the National Study of Learning Mindsets and consider the resulting practical and conceptual challenges. They discuss how causal forests use estimated propensity scores to be more robust to confounding and how they handle data with clustered errors.

Posted Content

A Survey on Causal Inference

Liuyi Yao, +5 more

- 05 Feb 2020 -

arXiv: Methodology

TL;DR: This survey provides a comprehensive review of causal inference methods under the potential outcome framework, one of the well-known causal inference frameworks, and presents plausible applications of these methods in areas such as advertising, recommendation, and medicine.

Journal Article

Causal inference and counterfactual prediction in machine learning for actionable healthcare

Mattia Prosperi, +11 more

- 13 Jul 2020 -

Nature Machine Intelligence

TL;DR: This paper discusses how target trials, transportability, and prediction invariance are linchpins for developing and testing intervention models, and how a true causal model is contained in the set of all prediction models whose accuracy does not vary across different settings.

References

Journal Article

The central role of the propensity score in observational studies for causal effects

Paul R. Rosenbaum, +1 more

- 01 Apr 1983 -

Biometrika

TL;DR: The authors discuss the central role of propensity scores and balancing scores in the analysis of observational studies and show that adjustment for the scalar propensity score is sufficient to remove bias due to all observed covariates.

Book Chapter

Domain-adversarial training of neural networks

Yaroslav Ganin, +7 more

- 01 Jan 2016 -

Journal of Machine Learning Research

TL;DR: This article proposes a new representation learning approach for domain adaptation, in which data at training and test time come from similar but different distributions. The approach promotes the emergence of features that are discriminative for the main learning task on the source domain but cannot discriminate between the training (source) and test (target) domains.

Journal Article

Prior distributions for variance parameters in hierarchical models (comment on article by Browne and Draper)

Andrew Gelman

- 01 Sep 2006 -

Bayesian Analysis

TL;DR: In this paper, a folded-noncentral-$t$ family of conditionally conjugate priors for hierarchical standard deviation parameters is proposed, and weakly informative priors in this family are considered.

Journal Article

Doubly robust estimation in missing data and causal inference models

Heejung Bang, +1 more

- 01 Dec 2005 -

Biometrics

TL;DR: Simulation results are presented which demonstrate that the finite-sample performance of doubly robust (DR) estimators is as impressive as theory would predict, and the proposed method is applied to a cardiovascular clinical trial.

Journal Article

BART: Bayesian additive regression trees

Hugh A. Chipman, +2 more

- 01 Mar 2010 -

The Annals of Applied Statistics

TL;DR: The authors develop a Bayesian "sum-of-trees" model in which each tree is constrained by a regularization prior to be a weak learner, and fitting and inference are accomplished via an iterative Bayesian backfitting MCMC algorithm that generates samples from a posterior.
