| Literature DB >> 33983641 |
Ruth H Keogh1, Shaun R Seaman2, Jon Michael Gran3, Stijn Vansteelandt1,4.
Abstract
Observational longitudinal data on treatments and covariates are increasingly used to investigate treatment effects, but are often subject to time-dependent confounding. Marginal structural models (MSMs), estimated using inverse probability of treatment weighting or the g-formula, are popular for handling this problem. With increasing development of advanced causal inference methods, it is important to be able to assess their performance in different scenarios to guide their application. Simulation studies are a key tool for this, but their use to evaluate causal inference methods has been limited. This paper focuses on the use of simulations for evaluations involving MSMs in studies with a time-to-event outcome. In a simulation, it is important to be able to generate the data in such a way that the correct forms of any models to be fitted to those data are known. However, this is not straightforward in the longitudinal setting because it is natural for data to be generated in a sequential conditional manner, whereas MSMs involve fitting marginal rather than conditional hazard models. We provide general results that enable the form of the correctly specified MSM to be derived based on a conditional data generating procedure, and show how the results can be applied when the conditional hazard model is an Aalen additive hazard or Cox model. Using conditional additive hazard models is advantageous because they imply additive MSMs that can be fitted using standard software. We describe and illustrate a simulation algorithm. Our results will help researchers to effectively evaluate causal inference methods via simulation.Entities:
Keywords: additive hazard model; causal inference; congenial models; longitudinal data; marginal structural model; simulation study; survival analysis; time-dependent confounding
Mesh:
Year: 2021 PMID: 33983641 PMCID: PMC7612178 DOI: 10.1002/bimj.202000040
Source DB: PubMed Journal: Biom J ISSN: 0323-3847 Impact factor: 2.207
Figure 1Causal directed acyclic graph (DAG) illustrating relationships between treatment A, time-dependent covariates L, an unmeasured frailty term U and time-to-event, illustrated for a discrete-time setting where Y = I(T > K)
Cumulative coefficients at times 1–5: true values, mean of the estimates (and empirical SE) obtained using MSM-IPTW from 1000 simulations, and bias in the estimates (and Monte Carlo SE) obtained using MSM-IPTW
| Time | True value | MSM-IPTW | |
|---|---|---|---|
| Mean estimate (Empirical SE) | Bias (Monte Carlo SE) | ||
| Cumulative coefficient | |||
| 1 | 0.700 (0.009) | 0.699 (0.016) | −0.001 (0.000) |
| 2 | 1.408 (0.016) | 1.407 (0.028) | −0.000 (0.001) |
| 3 | 2.128 (0.026) | 2.129 (0.045) | 0.002 (0.001) |
| 4 | 2.863 (0.040) | 2.867 (0.070) | 0.003 (0.002) |
| 5 | 3.623 (0.058) | 3.630 (0.110) | 0.007 (0.003) |
| Cumulative coefficient | |||
| 1 | −0.198 (0.010) | −0.199 (0.037) | −0.001 (0.001) |
| 2 | −0.396 (0.017) | −0.397 (0.065) | −0.000 (0.002) |
| 3 | −0.594 (0.023) | −0.592 (0.100) | 0.001 (0.003) |
| 4 | −0.790 (0.033) | −0.788 (0.150) | 0.002 (0.005) |
| 5 | −0.987 (0.042) | −0.968 (0.231) | 0.018 (0.007) |
| Cumulative coefficient | |||
| 2 | −0.098 (0.013) | −0.102 (0.057) | −0.004 (0.002) |
| 3 | −0.195 (0.021) | −0.206 (0.096) | −0.011 (0.003) |
| 4 | −0.291 (0.030) | −0.303 (0.155) | −0.013 (0.005) |
| 5 | −0.386 (0.039) | −0.390 (0.245) | −0.005 (0.008) |
| Cumulative coefficient | |||
| 3 | −0.077 (0.017) | −0.076 (0.078) | 0.001 (0.002) |
| 4 | −0.153 (0.027) | −0.153 (0.139) | 0.000 (0.004) |
| 5 | −0.228 (0.039) | −0.222 (0.232) | 0.006 (0.007) |
| Cumulative coefficient | |||
| 4 | −0.060 (0.021) | −0.061 (0.115) | −0.001 (0.004) |
| 5 | −0.121 (0.035) | −0.128 (0.211) | −0.007 (0.007) |
| Cumulative coefficient | |||
| 5 | −0.047 (0.028) | −0.039 (0.176) | 0.008 (0.006) |
Survival probabilities for the treatment regimes ‘never treated’ and ‘always treated’ at times 1–5: true values, mean of the estimates (and empirical SE) obtained using MSM-IPTW from 1000 simulations, and bias in the estimates (and Monte Carlo SE) obtained using MSM-IPTW
| Time | True value | MSM-IPTW | |
|---|---|---|---|
| Mean estimate (Empirical SE) | Bias (Monte Carlo SE) | ||
| Never treated: | |||
| 1 | 0.497 | 0.497 (0.008) | 0.000 (0.000) |
| 2 | 0.245 | 0.245 (0.007) | 0.000 (0.000) |
| 3 | 0.119 | 0.119(0.005) | −0.000 (0.000) |
| 4 | 0.057 | 0.057 (0.004) | −0.000 (0.000) |
| 5 | 0.027 | 0.027 (0.003) | −0.000 (0.000) |
| Always treated: | |||
| 1 | 0.606 | 0.607 (0.021) | 0.001 (0.001) |
| 2 | 0.401 | 0.404 (0.031) | 0.003 (0.001) |
| 3 | 0.283 | 0.288 (0.040) | 0.005 (0.001) |
| 4 | 0.208 | 0.216 (0.051) | 0.007 (0.002) |
| 5 | 0.157 | 0.165 (0.066) | 0.009 (0.002) |
Figure 2Cumulative coefficients: true values, estimates obtained using MSM-IPTW from 1000 simulated data sets (faded grey lines), and mean estimated cumulative coefficients using MSM-IPTW
Figure 3Survival curves for the treatment regimes ‘never treated’ and ‘always treated’: true survival curves, estimated survival curves obtained using MSM-IPTW from 1000 simulated data sets (faded grey lines), and the mean estimated survival curves using MSM-IPTW