| Literature DB >> 31162707 |
Tobias Bluhmki1, Hein Putter2, Arthur Allignol3, Jan Beyersmann1.
Abstract
We consider nonparametric and semiparametric resampling of multistate event histories by simulating multistate trajectories from an empirical multivariate hazard measure. One advantage of our approach is that it does not necessarily require individual patient data, but may be based on published information. This is also attractive for both study planning and simulating realistic real-world event history data in general. The concept extends to left-truncation and right-censoring mechanisms, nondegenerate initial distributions, and nonproportional as well as non-Markov settings. A special focus is on its connection to simulating survival data with time-dependent covariates. For the case of qualitative time-dependent exposures, we demonstrate that our proposal gives a more natural interpretation of how such data evolve over the course of time than many of the competing approaches. The multistate perspective avoids any latent failure time structure and sampling spaces impossible in real life, whereas its parsimony follows the principle of Occam's razor. We also suggest empirical simulation as a novel bootstrap procedure to assess estimation uncertainty in the absence of individual patient data. This is not possible for established procedures such as Efron's bootstrap. A simulation study investigating the effect of liver functionality on survival in patients with liver cirrhosis serves as a proof of concept. Example code is provided.Entities:
Keywords: bootstrap; internal time-dependent covariates; joint model; simulation; survival analysis
Mesh:
Year: 2019 PMID: 31162707 PMCID: PMC6771611 DOI: 10.1002/sim.8177
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Figure 1A joint model to assess the effect of a time‐dependent covariate Y(t) ∈ {0,1} on survival: illness‐death multistate model without recovery treating the two covariate levels as separate transient states. Transition hazards α 01(t),α 02(t), and α 12(t) are included
Figure 2Relation between the multistate process X and covariate process Y(t). Both T 0 and T are random variables. X is right‐continuous, but Y is left‐continuous
Figure 3Comparison of the survival hazards for t 0 = 2 specified a priori (dashed line) and the population survival hazard α(t) resulting from the illness‐death model without recovery of Figure 1 (solid line) in the presence of a time‐dependent covariate with, at most, one change over time. Details are given in Appendix B
Figure 4Simulation algorithm—proof of concept. Solid black lines are the “true” (study‐based) state occupation probabilities, and solid gray lines are 300 randomly selected state occupation probabilities from the simulation. The averages of the simulated state occupation probabilities are drawn as black dots. PI, prothrombin index
Simulation results in terms of coverage probabilities for the proof of concept in Section 5.2. Coverages are computed for each (study‐based) transition probability evaluated at different timepoints and sample sizes
| Coverage Probability, % | |||||
|---|---|---|---|---|---|
|
|
|
|
|
|
|
| 50 | 378 | 94.9 | 92.5 | 93.8 | 94.0 |
| 1800 | 92.6 | 92.6 | 93.6 | 92.8 | |
| 2700 | 71.5 | 90.7 | 92.7 | 91.5 | |
| 3200 | 14.8 | 90.0 | 92.0 | 90.5 | |
| 100 | 378 | 94.6 | 94.8 | 95.1 | 94.3 |
| 1800 | 93.3 | 94.2 | 93.7 | 93.0 | |
| 2700 | 87.6 | 94.2 | 94.3 | 93.8 | |
| 3200 | 28.5 | 93.0 | 94.0 | 93.1 | |
| 200 | 378 | 95.2 | 95.0 | 94.5 | 94.9 |
| 1800 | 94.6 | 93.8 | 94.8 | 93.9 | |
| 2700 | 92.4 | 94.1 | 94.0 | 94.0 | |
| 3200 | 50.6 | 94.5 | 94.1 | 94.4 | |
| 251 | 378 | 95.3 | 95.2 | 94.7 | 95.0 |
| 1800 | 94.0 | 94.4 | 94.6 | 94.2 | |
| 2700 | 92.5 | 94.4 | 94.3 | 94.6 | |
| 3200 | 59.0 | 93.3 | 94.9 | 93.4 | |
| 500 | 378 | 96.0 | 93.9 | 94.8 | 93.5 |
| 1800 | 95.1 | 94.9 | 94.7 | 94.4 | |
| 2700 | 94.0 | 94.4 | 94.4 | 94.2 | |
| 3200 | 82.4 | 94.5 | 94.5 | 94.1 | |
| 1000 | 378 | 95.2 | 94.6 | 94.5 | 94.7 |
| 1800 | 95.5 | 94.9 | 94.5 | 93.9 | |
| 2700 | 94.2 | 94.2 | 95.2 | 95.2 | |
| 3200 | 94.1 | 94.0 | 94.7 | 94.6 | |
Coverage probabilities for the simulation study in Section 5.3
| Coverage Probability, % | ||||
|---|---|---|---|---|
|
|
|
|
|
|
| 50 | 378 | 97.4 | 95.1 | 94.0 |
| 500 | 97.4 | 94.5 | 94.9 | |
| 1000 | 95.5 | 93.1 | 94.8 | |
| 1800 | 92.9 | 88.8 | 92.3 | |
| 2700 | 90.3 | 66.8 | 90.9 | |
| 3200 | 89.1 | 68.7 | 91.2 | |
| 100 | 378 | 97.2 | 94.7 | 94.2 |
| 500 | 96.5 | 94.3 | 94.9 | |
| 1000 | 94.1 | 94.3 | 94.6 | |
| 1800 | 93.6 | 92.5 | 94.1 | |
| 2700 | 92.1 | 87.2 | 92.8 | |
| 3200 | 90.3 | 74.8 | 91.9 | |
| 200 | 378 | 96.8 | 95.4 | 94.9 |
| 500 | 95.7 | 95.0 | 95.0 | |
| 1000 | 96.1 | 95.2 | 95.4 | |
| 1800 | 93.8 | 91.9 | 94.2 | |
| 2700 | 92.6 | 89.5 | 92.9 | |
| 3200 | 94.1 | 89.1 | 93.7 | |
| 251 | 378 | 97.3 | 94.9 | 94.4 |
| 500 | 95.4 | 94.5 | 95.9 | |
| 1000 | 96.7 | 95.0 | 96.4 | |
| 1800 | 94.9 | 92.1 | 95.0 | |
| 2700 | 92.8 | 89.8 | 94.0 | |
| 3200 | 92.3 | 87.6 | 93.2 | |
| 500 | 378 | 96.8 | 95.3 | 96.4 |
| 500 | 96.7 | 95.0 | 95.6 | |
| 1000 | 94.9 | 95.7 | 95.0 | |
| 1800 | 94.9 | 93.9 | 95.8 | |
| 2700 | 92.1 | 91.4 | 93.3 | |
| 3200 | 91.8 | 89.4 | 91.6 | |