| Literature DB >> 32450775 |
Liangyuan Hu1,2,3, Chenyang Gu4, Michael Lopez5, Jiayi Ji1,2,3, Juan Wisnivesky6.
Abstract
There is a dearth of robust methods to estimate the causal effects of multiple treatments when the outcome is binary. This paper uses two unique sets of simulations to propose and evaluate the use of Bayesian additive regression trees in such settings. First, we compare Bayesian additive regression trees to several approaches that have been proposed for continuous outcomes, including inverse probability of treatment weighting, targeted maximum likelihood estimator, vector matching, and regression adjustment. Results suggest that under conditions of non-linearity and non-additivity of both the treatment assignment and outcome generating mechanisms, Bayesian additive regression trees, targeted maximum likelihood estimator, and inverse probability of treatment weighting using generalized boosted models provide better bias reduction and smaller root mean squared error. Bayesian additive regression trees and targeted maximum likelihood estimator provide more consistent 95% confidence interval coverage and better large-sample convergence property. Second, we supply Bayesian additive regression trees with a strategy to identify a common support region for retaining inferential units and for avoiding extrapolating over areas of the covariate space where common support does not exist. Bayesian additive regression trees retain more inferential units than the generalized propensity score-based strategy, and shows lower bias, compared to targeted maximum likelihood estimator or generalized boosted model, in a variety of scenarios differing by the degree of covariate overlap. A case study examining the effects of three surgical approaches for non-small cell lung cancer demonstrates the methods.Entities:
Keywords: Causal inference; generalized propensity score; inverse probability of treatment weighting; machine learning; matching
Mesh:
Year: 2020 PMID: 32450775 PMCID: PMC7534201 DOI: 10.1177/0962280220921909
Source DB: PubMed Journal: Stat Methods Med Res ISSN: 0962-2802 Impact factor: 3.021
The outcome rates in three surgical groups: robotic-assisted surgery, VATS, and open thoracotomy.
| Outcomes | Robotic-assisted surgery | VATS | Open thoracotomy | Overall |
|---|---|---|---|---|
| Respiratory complication | 30.1% | 33.6% | 33.3% | 33.3% |
| Prolonged LOS | 5.3% | 10.4% | 5.5% | 8.2% |
| ICU stay | 60.2% | 75.3% | 59.1% | 67.9% |
| Readmission | 8.8% | 9.8% | 8.0% | 9.0% |
ICU: intensive care unit; LOS: length of stay; VATS: video-assisted thoracic surgery.
Figure 1.Overlap assessment for the scenarios of (a) weak, (b) moderate, and (c) strong covariate overlap. Each panel presents boxplots by treatment group of the estimated GPSs for one of the treatments, , for every unit in the sample. The left panel presents treatment 1 (W = 1), the middle panel presents treatment 2 (W = 2), and the right panel presents treatment 3 (W = 3).
Comparison of the estimated average treatment effects on the treated in terms of mean absolute bias (MAB), root mean square error (RMSE), and coverage probability (CP) across 200 replications in Simulation 1. The causal estimand is based on risk difference.
|
|
|
|
|
| ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Scenario | Method | MAB | RMSE | CP | MAB | RMSE | CP | MAB | RMSE | CP | MAB | RMSE | CP | MAB | RMSE | CP |
| RA | 0.01 | 0.02 | 0.99 | 0.03 | 0.04 | 0.99 | 0.01 | 0.02 | 0.99 | 0.03 | 0.04 | 0.98 | 0.02 | 0.02 | 0.99 | |
| IPTW-MLR | 0.06 | 0.07 | 1 | 0.04 | 0.05 | 1 | 0.07 | 0.08 | 1 | 0.04 | 0.05 | 1 | 0.09 | 0.10 | 1 | |
| IPTW-MLR-Trim | 0.06 | 0.07 | 1 | 0.04 | 0.05 | 1 | 0.07 | 0.08 | 1 | 0.04 | 0.05 | 1 | 0.09 | 0.10 | 1 | |
| IPTW-GBM | 0.05 | 0.06 | 0.99 | 0.05 | 0.06 | 0.98 | 0.07 | 0.07 | 1 | 0.06 | 0.07 | 0.98 | 0.13 | 0.13 | 0.96 | |
| I | IPTW-GBM-Trim | 0.06 | 0.07 | 0.99 | 0.05 | 0.06 | 0.98 | 0.06 | 0.06 | 0.98 | 0.06 | 0.07 | 1 | 0.11 | 0.12 | 0.96 |
| IPTW-SL | 0.06 | 0.07 | 1 | 0.05 | 0.06 | 1 | 0.07 | 0.08 | 1 | 0.05 | 0.06 | 1 | 0.12 | 0.13 | 1 | |
| IPTW-SL-Trim | 0.06 | 0.07 | 1 | 0.06 | 0.08 | 1 | 0.06 | 0.07 | 1 | 0.05 | 0.05 | 1 | 0.10 | 0.11 | 1 | |
| VM | 0.05 | 0.07 | 0.99 | 0.06 | 0.08 | 0.93 | – | – | – | – | – | – | – | – | – | |
| BART | 0.03 | 0.04 | 0.88 | 0.04 | 0.05 | 0.80 | 0.03 | 0.03 | 0.96 | 0.03 | 0.03 | 0.95 | 0.03 | 0.04 | 0.95 | |
| TMLE | – | – | – | – | – | – | 0.04 | 0.05 | 1 | 0.02 | 0.03 | 1 | 0.05 | 0.06 | 1 | |
| RA | 0.02 | 0.02 | 1 | 0.05 | 0.05 | 0.92 | 0.02 | 0.02 | 0.80 | 0.05 | 0.05 | 0.60 | 0.03 | 0.03 | 0.67 | |
| IPTW-MLR | 0.05 | 0.06 | 1 | 0.03 | 0.03 | 0.99 | 0.06 | 0.08 | 1 | 0.04 | 0.05 | 1 | 0.07 | 0.07 | 1 | |
| IPTW-MLR-Trim | 0.06 | 0.06 | 1 | 0.03 | 0.03 | 0.99 | 0.06 | 0.07 | 1 | 0.03 | 0.04 | 1 | 0.08 | 0.08 | 1 | |
| IPTW-GBM | 0.03 | 0.04 | 0.98 | 0.03 | 0.04 | 0.99 | 0.05 | 0.05 | 0.98 | 0.05 | 0.06 | 1 | 0.09 | 0.09 | 0.94 | |
| II | IPTW-GBM-Trim | 0.05 | 0.06 | 0.98 | 0.04 | 0.04 | 0.99 | 0.05 | 0.05 | 0.98 | 0.05 | 0.06 | 1 | 0.09 | 0.09 | 1 |
| IPTW-SL | 0.06 | 0.06 | 1 | 0.04 | 0.04 | 0.99 | 0.06 | 0.07 | 1 | 0.05 | 0.05 | 1 | 0.11 | 0.11 | 1 | |
| IPTW-SL-Trim | 0.06 | 0.07 | 1 | 0.06 | 0.06 | 0.99 | 0.06 | 0.07 | 1 | 0.05 | 0.05 | 1 | 0.10 | 0.10 | 1 | |
| VM | 0.04 | 0.05 | 0.86 | 0.05 | 0.07 | 0.88 | – | – | – | – | – | – | – | – | – | |
| BART | 0.02 | 0.03 | 0.80 | 0.03 | 0.04 | 0.75 | 0.02 | 0.02 | 0.96 | 0.01 | 0.02 | 0.98 | 0.01 | 0.02 | 0.94 | |
| TMLE | – | – | – | – | – | – | 0.04 | 0.04 | 1 | 0.02 | 0.02 | 1 | 0.04 | 0.04 | 0.96 | |
| RA | 0.03 | 0.03 | 1 | 0.07 | 0.07 | 0.44 | 0.03 | 0.03 | 0.06 | 0.07 | 0.07 | 0.03 | 0.04 | 0.04 | 0.03 | |
| IPTW-MLR | 0.06 | 0.06 | 1 | 0.02 | 0.03 | 0.73 | 0.07 | 0.08 | 1 | 0.05 | 0.06 | 1 | 0.07 | 0.07 | 1 | |
| IPTW-MLR-Trim | 0.06 | 0.07 | 1 | 0.02 | 0.03 | 1 | 0.06 | 0.07 | 1 | 0.03 | 0.04 | 1 | 0.07 | 0.08 | 1 | |
| IPTW-GBM | 0.03 | 0.04 | 1 | 0.02 | 0.03 | 0.98 | 0.04 | 0.05 | 0.98 | 0.04 | 0.05 | 1 | 0.06 | 0.06 | 0.98 | |
| III | IPTW-GBM-Trim | 0.06 | 0.06 | 1 | 0.02 | 0.03 | 0.98 | 0.04 | 0.05 | 1 | 0.05 | 0.05 | 1 | 0.06 | 0.06 | 0.10 |
| IPTW-SL | 0.06 | 0.06 | 0.99 | 0.03 | 0.03 | 1 | 0.06 | 0.07 | 1 | 0.04 | 0.05 | 1 | 0.10 | 0.10 | 1 | |
| IPTW-SL-Trim | 0.06 | 0.07 | 0.99 | 0.04 | 0.05 | 1 | 0.06 | 0.07 | 1 | 0.04 | 0.05 | 1 | 0.10 | 0.10 | 0.99 | |
| VM | 0.03 | 0.04 | 0.80 | 0.05 | 0.06 | 0.78 | – | – | – | – | – | – | – | – | – | |
| BART | 0.02 | 0.03 | 0.76 | 0.03 | 0.04 | 0.74 | 0.02 | 0.03 | 0.95 | 0.02 | 0.03 | 0.96 | 0.01 | 0.01 | 0.94 | |
| TMLE | – | – | – | – | – | – | 0.03 | 0.03 | 0.98 | 0.01 | 0.02 | 0.97 | 0.03 | 0.03 | 0.96 | |
ATT: Average treatment effect on the treated; BART: Bayesian additive regression trees; IPTW-GBM: IPTW with weights estimated using generalized boosted models; IPTW-MLR: IPTW with weights estimated using multinomial logistic regression; IPTW-SL: IPTW with weights estimated using super learner; RA: regression adjustment; TMLE: Targeted maximum likelihood estimation; VM: vector matching.
Figure 2.The large-sample convergence rate of each of six methods for the estimates of two treatment effects, and . BART and IPTW-GBM converged the fastest, approximately at a rate of . RA converged the slowest, approximately at a rate of . ATT: Average treatment effect on the treated; BART: Bayesian additive regression trees; IPTW-GBM; IPTW with weights estimated using generalized boosted models; IPTW-MLR: IPTW with weights estimated using multinomial logistic regression; IPTW-SL: IPTW with weights estimated using super learner; RA: regression adjustment; RMSE: root mean squared error; VM: vector matching.
Figure 3.Biases among 200 replications under scenarios of differing covariate overlap for IPTW-GBM versus BART and two treatment effects and ; and for TMLE versus BART and three treatment effects , and . (a) BART-discard versus GBM for ATT estimates and (b) BART-discard versus TMLE for ATE estimates. ATE: Average treatment effects; ATT: Average treatment effect on the treated; BART: Bayesian additive regression trees; IPTW-GBM: IPTW with weights estimated using generalized boosted models; TMLE: targeted maximum likelihood estimation.
Baseline characteristics of patients in three surgical groups in SEER-Medicare data.
| Robotic-assisted surgery | VATS | Open thoracotomy | |
|---|---|---|---|
| Characteristics | |||
| Age (years), mean (SD) | 74.3 (5.7) | 73.9 (5.4) | 74.5 (5.7) |
| Female, N (%) | 223 (56.3) | 3446 (52.4) | 2941 (58.8) |
| Married, N (%) | 227 (57.3) | 3753 (57.0) | 2802 (56.0) |
| Race, N (%) | |||
| White | 320 (80.8) | 5694 (86.5) | 4369 (87.3) |
| Black | 21 (5.3) | 364 (5.5) | 248 (5.0) |
| Hispanic | 15 (3.8) | 218 (3.3) | 139 (2.8) |
| Other | 40 (10.1) | 306 (4.6) | 246 (4.9) |
| Median household annual income, N (%) | |||
| 1st quartile | 97 (24.5) | 2132 (32.4) | 1009 (20.2) |
| 2nd quartile | 88 (22.2) | 1729 (26.3) | 1193 (23.9) |
| 3rd quartile | 98 (24.7) | 1345 (20.4) | 1143 (22.9) |
| 4th quartile | 113 (28.5) | 1376 (20.9) | 1657 (33.1) |
| Charlson comorbidity score, N (%) | |||
| | 154 (38.9) | 2163 (32.9) | 1810 (36.2) |
| | 113 (28.5) | 1944 (29.5) | 1379 (27.6) |
| >2 | 129 (32.6) | 2475 (37.6) | 1813 (36.2) |
| Year of diagnosis, N (%) | |||
| 2008–2009 | 14 (3.5) | 2686 (40.8) | 1484 (29.7) |
| 2010 | 33 (8.3) | 1123 (17.1) | 857 (17.1) |
| 2011 | 85 (21.5) | 1033 (15.7) | 866 (17.3) |
| 2012 | 131 (33.1) | 899 (13.7) | 821 (16.4) |
| 2013 | 133 (33.6) | 841 (12.8) | 974 (19.5) |
| Cancer stage, N (%) | |||
| Stage I | 295 (74.5) | 4195 (63.7) | 3884 (77.6) |
| Stage II | 63 (15.9) | 1504 (22.9) | 709 (14.2) |
| Stage IIIA | 38 (9.6) | 883 (13.4) | 409 (8.2) |
| Tumor size, in mm, N (%) | |||
| | 160 (40.4) | 1967 (29.9) | 2232 (44.6) |
| | 98 (24.7) | 1696 (25.8) | 1388 (27.7) |
| | 109 (27.5) | 1804 (27.4) | 987 (19.7) |
| | 29 (7.3) | 1084 (16.5) | 367 (7.3) |
| Histology, N (%) | |||
| Adenocarcinoma | 255 (64.4) | 3757 (57.1) | 3348 (66.9) |
| Squamous cell carcinoma | 107 (27.0) | 2165 (32.9) | 1167 (23.3) |
| Other histology | 34 (8.6) | 660 (10.0) | 487 (9.7) |
| Tumor site, N (%) | |||
| Upper lobe | 215 (54.3) | 3829 (58.2) | 2859 (57.2) |
| Middle lobe | 27 (6.8) | 308 (4.7) | 335 (6.7) |
| Lower lobe | 141 (35.6) | 2195 (33.3) | 1720 (34.4) |
| Other site | 13 (3.3) | 250 (3.8) | 88 (1.8) |
| PET scan, N (%) | 302 (76.3) | 5004 (76.0) | 3410 (68.2) |
| Chest CT, N (%) | 263 (66.4) | 4525 (68.7) | 3148 (62.9) |
| Mediastinoscopy, N (%) | 62 (15.7) | 715 (10.9) | 420 (8.4) |
CT: computed tomography; PET: positron emission tomography; SD: standard deviation; VATS: video-assisted thoracic surgery.