Recursive partitioning for heterogeneous causal effects
Citations
1,156 citations
1,055 citations
Cites background or methods from "Recursive partitioning for heteroge..."
...A carefully constructed heterogeneity tree provides valid estimates of treatment effects in every leaf (Athey and Imbens 2016)....
[...]
...Athey and Imbens (2016) use sample-splitting to obtain valid (conditional) inference on 10 In particular, we have to avoid “forbidden regressions” (Angrist and Pischke 2008) in which correlation between first-stage residuals and fitted values exists and creates bias in the second stage....
[...]
816 citations
Cites background or methods from "Recursive partitioning for heteroge..."
...In growing trees to build our forest, we follow most closely the approach of Athey and Imbens [2016], who propose honest, causal trees, and obtain valid confidence intervals for average treatment effects for each of the subpopulations (leaves) identified by the algorithm. (Instead of personalizing predictions for each individual, this approach only provides treatment effect estimates for leaf-wise subgroups whose size must grow to infinity.) Other related approaches include those of Su et al. [2009] and Zeileis et al. [2008], which build a tree for treatment effects in subgroups and use statistical tests to determine splits; however, these papers do not analyze bias or consistency properties. Finally, we note a growing literature on estimating heterogeneous treatment effects using different machine learning methods. Imai and Ratkovic [2013], Signorovitch [2007], Tian et al. [2014] and Weisberg and Pontes [2015] develop lasso-like methods for causal inference in a sparse high-dimensional linear setting. Beygelzimer and Langford [2009], Dud́ık et al. [2011], and others discuss procedures for transforming outcomes that enable off-the-shelf loss minimization methods to be used for optimal treatment policy estimation....
[...]
...Following Athey and Imbens (2016), our proposed forest is composed of causal trees that estimate the effect of the treatment at the leaves of the trees; we thus refer to our algorithm as a causal forest....
[...]
...In growing trees to build our forest, we follow most closely the approach of Athey and Imbens [2016], who propose honest, causal trees, and obtain valid confidence intervals for average treatment effects for each of the subpopulations (leaves) identified by the algorithm. (Instead of personalizing predictions for each individual, this approach only provides treatment effect estimates for leaf-wise subgroups whose size must grow to infinity.) Other related approaches include those of Su et al. [2009] and Zeileis et al. [2008], which build a tree for treatment effects in subgroups and use statistical tests to determine splits; however, these papers do not analyze bias or consistency properties. Finally, we note a growing literature on estimating heterogeneous treatment effects using different machine learning methods. Imai and Ratkovic [2013], Signorovitch [2007], Tian et al. [2014] and Weisberg and Pontes [2015] develop lasso-like methods for causal inference in a sparse high-dimensional linear setting....
[...]
...For completeness, we briefly outline the motivation for the splitting rule of Athey and Imbens (2016) we use for our double-sample trees....
[...]
...We implemented our simulations in R, using the packages causalTree (Athey and Imbens 2016) for building individual trees, randomForestCI (Wager, Hastie, and Efron 2014) for computing V̂IJ , and FNN (Beygelzimer et al. 2013) for k-NN regression....
[...]
664 citations
546 citations
References
[...]
79,257 citations
"Recursive partitioning for heteroge..." refers background in this paper
...ethods for a closely related problem, the problem of predicting outcomes as a function of covariates in similar environments. The most popular approaches (e.g. regression trees ([4]), random forests ([3]), LASSO ([24]), support vector machines ([26]), etc.) entail building a model of the relationship between attributes and outcomes, with a penalty parameter that penalizes model complexity. Cross-vali...
[...]
...geable, and that there is no interference (the stable unit treatment value assumption, or sutva [20]). This assumption may be violated in settings where some units are connected through networks. Let [3] p = pr(Wi = 1) be the marginal treatment probability, and let e(x) = pr(Wi = 1|Xi = x) be the conditional treatment probability (the “propensity score” as defined by [17]). In a randomized experiment ...
[...]
...ve the same prediction. In this paper, we focus on the analogous goal of deriving a partition of the population according to treatment effect heterogeneity, building on standard regression trees ([4], [3]). Whether the ultimate goal in an application is to derive a partition or fully personalized treatment effect estimates depends on the setting; settings where partitions may be desirable include those...
[...]
40,785 citations
"Recursive partitioning for heteroge..." refers background in this paper
...Using Bayesian nonparametric methods, they project estimates of heterogeneous treatment effects onto the feature space using LASSO-type regularization methods to get low-dimensional summaries of heterogeneity....
[...]
...Beyond those previously discussed, Tian et al. (23) transform the features rather than the outcomes and then apply LASSO to the model with the original outcome and the transformed features....
[...]
...Imai and Ratkovic (25) use LASSO to estimate the effects of both treatments and attributes, but with different penalty terms for the two types of features to allow for the possibility that the treatment effects are present but the magnitudes of the interactions are small....
[...]
..., regression trees (5), random forests (6), LASSO (7), support vector machines (8), etc....
[...]
...The most popular approaches [e.g., regression trees (5), random forests (6), LASSO (7), support vector machines (8), etc.] entail building a model of the relationship between attributes and outcomes, with a penalty parameter that penalizes model complexity....
[...]
40,147 citations
26,531 citations
"Recursive partitioning for heteroge..." refers methods in this paper
...blem of predicting outcomes as a function of covariates in similar environments. The most popular approaches (e.g. regression trees ([4]), random forests ([3]), LASSO ([24]), support vector machines ([26]), etc.) entail building a model of the relationship between attributes and outcomes, with a penalty parameter that penalizes model complexity. Cross-validation is often used to select the optimal lev...
[...]
23,744 citations