Recursive partitioning for heterogeneous causal effects.

Susan Athey, Guido Imbens.

Abstract

In this paper we propose methods for estimating heterogeneity in causal effects in experimental and observational studies and for conducting hypothesis tests about the magnitude of differences in treatment effects across subsets of the population. We provide a data-driven approach to partition the data into subpopulations that differ in the magnitude of their treatment effects. The approach enables the construction of valid confidence intervals for treatment effects, even with many covariates relative to the sample size, and without "sparsity" assumptions. We propose an "honest" approach to estimation, whereby one sample is used to construct the partition and another to estimate treatment effects for each subpopulation. Our approach builds on regression tree methods, modified to optimize for goodness of fit in treatment effects and to account for honest estimation. Our model selection criterion anticipates that bias will be eliminated by honest estimation and also accounts for the effect of making additional splits on the variance of treatment effect estimates within each subpopulation. We address the challenge that the "ground truth" for a causal effect is not observed for any individual unit, so that standard approaches to cross-validation must be modified. Through a simulation study, we show that for our preferred method, honest estimation results in nominal coverage for 90% confidence intervals, whereas coverage ranges between 74% and 84% for nonhonest approaches. Honest estimation requires estimating the model with a smaller sample size; the cost in terms of mean squared error of treatment effects for our preferred method ranges between 7% and 22%.
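
The "honest" idea described in the abstract (one sample builds the partition, a separate sample estimates the effect in each subpopulation) can be illustrated with a small sketch. This is not the authors' algorithm or splitting criterion: it assumes a simulated randomized experiment with a single covariate, a single candidate split chosen from a fixed grid, and a simplified goodness-of-fit score (leaf size times squared estimated effect). All function names, the data-generating process, and the grid are hypothetical choices made for illustration.

```python
import random
import statistics


def simulate(n, rng):
    """Simulated randomized experiment (assumed): effect is 0 if x < 0.5, else 2."""
    rows = []
    for _ in range(n):
        x = rng.random()
        w = 1 if rng.random() < 0.5 else 0      # treatment indicator, P(w = 1) = 0.5
        tau = 2.0 if x >= 0.5 else 0.0          # true effect, never observed per unit
        y = x + tau * w + rng.gauss(0.0, 1.0)
        rows.append((x, w, y))
    return rows


def effect(rows):
    """Treated-minus-control difference in mean outcomes and its standard error."""
    treated = [y for _, w, y in rows if w == 1]
    control = [y for _, w, y in rows if w == 0]
    est = statistics.fmean(treated) - statistics.fmean(control)
    se = (statistics.variance(treated) / len(treated)
          + statistics.variance(control) / len(control)) ** 0.5
    return est, se


def choose_split(train, grid):
    """Pick the threshold maximizing the sum over leaves of n_leaf * tau_hat**2."""
    def score(c):
        left = [r for r in train if r[0] < c]
        right = [r for r in train if r[0] >= c]
        return sum(len(leaf) * effect(leaf)[0] ** 2 for leaf in (left, right))
    return max(grid, key=score)


def honest_estimates(rows, grid, z=1.645):
    """Build the partition on one half; estimate leaf effects on the other half."""
    half = len(rows) // 2
    train, est_sample = rows[:half], rows[half:]
    cut = choose_split(train, grid)             # uses the training half only
    leaves = {
        "x < cut": [r for r in est_sample if r[0] < cut],
        "x >= cut": [r for r in est_sample if r[0] >= cut],
    }
    out = {}
    for name, leaf in leaves.items():
        tau_hat, se = effect(leaf)              # uses the estimation half only
        out[name] = (tau_hat, tau_hat - z * se, tau_hat + z * se)  # 90% interval
    return cut, out


rng = random.Random(0)
data = simulate(4000, rng)
grid = [i / 10 for i in range(2, 9)]            # candidate thresholds 0.2 ... 0.8
cut, results = honest_estimates(data, grid)
print("chosen cut:", cut)
for name, (tau_hat, lo, hi) in results.items():
    print(f"{name}: tau_hat={tau_hat:.2f}, 90% CI=({lo:.2f}, {hi:.2f})")
```

Because the estimation half plays no role in choosing the split, the leaf estimates are not biased by the search over thresholds. The price is that each stage sees only half of the data, which is the sample-size cost the abstract mentions.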

Keywords:  causal inference; cross-validation; heterogeneous treatment effects; potential outcomes; supervised machine learning

Year:  2016        PMID: 27382149      PMCID: PMC4941430          DOI: 10.1073/pnas.1510489113

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  4 in total

1.  A Simple Method for Estimating Interactions between a Treatment and a Large Number of Covariates.

Authors:  Lu Tian; Ash A Alizadeh; Andrew J Gentles; Robert Tibshirani
Journal:  J Am Stat Assoc       Date:  2014-10       Impact factor: 5.033

2.  Subgroup identification from randomized clinical trial data.

Authors:  Jared C Foster; Jeremy M G Taylor; Stephen J Ruberg
Journal:  Stat Med       Date:  2011-08-04       Impact factor: 2.373

3.  Optimizing randomized trial designs to distinguish which subpopulations benefit from treatment.

Authors:  M Rosenblum; M J Van der Laan
Journal:  Biometrika       Date:  2011-12       Impact factor: 2.445

4.  Post hoc subgroups in clinical trials: Anathema or analytics?

Authors:  Herbert I Weisberg; Victor P Pontes
Journal:  Clin Trials       Date:  2015-06-10       Impact factor: 2.486

  56 in total

1.  Approaches to treatment effect heterogeneity in the presence of confounding.

Authors:  Sarah C Anoke; Sharon-Lise Normand; Corwin M Zigler
Journal:  Stat Med       Date:  2019-03-31       Impact factor: 2.373

2.  Personalized evidence based medicine: predictive approaches to heterogeneous treatment effects. (Review)

Authors:  David M Kent; Ewout Steyerberg; David van Klaveren
Journal:  BMJ       Date:  2018-12-10

3.  Targeting weight loss interventions to reduce cardiovascular complications of type 2 diabetes: a machine learning-based post-hoc analysis of heterogeneous treatment effects in the Look AHEAD trial.

Authors:  Aaron Baum; Joseph Scarpa; Emilie Bruzelius; Ronald Tamler; Sanjay Basu; James Faghmous
Journal:  Lancet Diabetes Endocrinol       Date:  2017-07-12       Impact factor: 32.069

4.  High-dimensional regression adjustments in randomized experiments.

Authors:  Stefan Wager; Wenfei Du; Jonathan Taylor; Robert J Tibshirani
Journal:  Proc Natl Acad Sci U S A       Date:  2016-10-25       Impact factor: 11.205

5.  Improving massive experiments with threshold blocking.

Authors:  Michael J Higgins; Fredrik Sävje; Jasjeet S Sekhon
Journal:  Proc Natl Acad Sci U S A       Date:  2016-07-05       Impact factor: 11.205

6.  Drawing causal inference from Big Data.

Authors:  Richard M Shiffrin
Journal:  Proc Natl Acad Sci U S A       Date:  2016-07-05       Impact factor: 11.205

7.  Integrating explanation and prediction in computational social science. (Review)

Authors:  Jake M Hofman; Duncan J Watts; Susan Athey; Filiz Garip; Thomas L Griffiths; Jon Kleinberg; Helen Margetts; Sendhil Mullainathan; Matthew J Salganik; Simine Vazire; Alessandro Vespignani; Tal Yarkoni
Journal:  Nature       Date:  2021-06-30       Impact factor: 49.962

8.  Determinants of the population health distribution: an illustration examining body mass index.

Authors:  David Bann; Emla Fitzsimons; William Johnson
Journal:  Int J Epidemiol       Date:  2020-06-01       Impact factor: 7.196

9.  Performing an Informatics Consult: Methods and Challenges.

Authors:  Alejandro Schuler; Alison Callahan; Kenneth Jung; Nigam H Shah
Journal:  J Am Coll Radiol       Date:  2018-02-13       Impact factor: 5.532

10.  Human Decisions and Machine Predictions.

Authors:  Jon Kleinberg; Himabindu Lakkaraju; Jure Leskovec; Jens Ludwig; Sendhil Mullainathan
Journal:  Q J Econ       Date:  2017-08-26