| Literature DB >> 25060703 |
Dan Jackson1, Ian R White, Shaun Seaman, Hannah Evans, Kathy Baisley, James Carpenter.
Abstract
The Cox proportional hazards model is frequently used in medical statistics. The standard methods for fitting this model rely on the assumption of independent censoring. Although this is sometimes plausible, we often wish to explore how robust our inferences are as this untestable assumption is relaxed. We describe how this can be carried out in a way that makes the assumptions accessible to all those involved in a research project. Estimation proceeds via multiple imputation, where censored failure times are imputed under user-specified departures from independent censoring. A novel aspect of our method is the use of bootstrapping to generate proper imputations from the Cox model. We illustrate our approach using data from an HIV-prevention trial and discuss how it can be readily adapted and applied in other settings.Entities:
Keywords: Schoenfeld residuals; bootstrapping; informative censoring; multiple imputation; sensitivity analysis; survival analysis
Mesh:
Substances:
Year: 2014 PMID: 25060703 PMCID: PMC4282781 DOI: 10.1002/sim.6274
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Summary statistics for the HIV infection data; there are 821 women in the sample.
| Variable | Summary |
|---|---|
| Lost to follow-up (censored) | 142 (17%) |
| Pregnant (censored) | 165 (20%) |
| Completed follow-up without HIV infection | 459 (56%) |
| or pregnancy (censored) | |
| HIV infection (event) | 55 (7%) |
| Age, mean (SD) | 27.4 (5.1) |
| Drinks per week=0 | 411 (50%) |
| Drinks per week=1–9 | 272 (33%) |
| Drinks per week=10–29 | 110 (13%) |
| Drinks per week=30+ | 28 (3%) |
| Lived at site <2years | 129 (16%) |
Parameter estimates (log hazard ratios) from a proportional hazards model fit to the HIV infection data assuming independent censoring.
| Parameter | Baseline covariate | Estimate | Standard error |
|---|---|---|---|
| Age | −0.084 | 0.030 | |
| Drinks per week=1–9 | 0.684 | 0.342 | |
| Drinks per week=10–29 | 1.261 | 0.362 | |
| Drinks per week=30+ | 1.118 | 0.568 | |
| Lived at site <2years | 0.687 | 0.301 |
Figure 1Left: Schoenfeld residuals for the five parameters in Table 2, from fitting model (2) to the incidence of HIV data. Right: Schoenfeld residuals for the five parameters in Table 2, from fitting model (9) to the first imputed dataset using γ = 5. LOWESS smoothers are also shown, together with ± 2 standard deviation confidence bands.
Figure 2Results from the sensitivity analysis for the HIV incidence data. The curves show the point estimates and 95% confidence intervals for the log hazard ratios associated with the covariates indicated. Inferences from the standard analysis assuming independent censoring from Table 2 are shown at γ = 0. Inferences are also shown from imputing all censored observations as ‘never infected’, and all censored observations as ‘immediate failures’, at the left-hand and right-hand sides of the plots, respectively.
Figure 3Results from the sensitivity analysis for the HIV incidence data assuming independent censoring unless censored due to pregnancy. The curves show the point estimates and 95% confidence intervals for the log hazard ratios associated with the covariates indicated. Inferences from the standard analysis assuming independent censoring from Table 2 are shown at γ = 0. Inferences are also shown from imputing all censored (due to pregnancy) observations as ‘never infected’, and all censored (due to pregnancy) observations as ‘immediate failures’, at the left-hand and right-hand sides of the plots, respectively.
Results from the simulation study.
| First parameter: | First parameter: | Second parameter: | Second parameter: | |||||
|---|---|---|---|---|---|---|---|---|
| bias (MCSE) | coverage | bias (MCSE) | coverage | |||||
| MI | IC | MI | IC | MI | IC | MI | IC | |
| −2 | 0.007 (0.012) | 0.012 (0.012) | 0.960 | 0.958 | 0.011 (0.011) | 0.020 (0.011) | 0.948 | 0.952 |
| −1 | 0.003 (0.012) | 0.007 (0.012) | 0.947 | 0.948 | 0.006 (0.011) | 0.011 (0.011) | 0.955 | 0.950 |
| 0 | 0.018 (0.009) | 0.019 (0.009) | 0.953 | 0.959 | −0.001 (0.008) | 0.000 (0.008) | 0.960 | 0.961 |
| 1 | 0.017 (0.011) | 0.025 (0.010) | 0.937 | 0.950 | 0.015 (0.010) | 0.032 (0.010) | 0.941 | 0.939 |
| 2 | −0.013 (0.009) | 0.036 (0.010) | 0.930 | 0.953 | −0.014 (0.009) | 0.120 (0.009) | 0.931 | 0.936 |
| 3 | −0.008 (0.008) | 0.152 (0.009) | 0.920 | 0.915 | −0.003 (0.007) | 0.397 (0.009) | 0.926 | 0.655 |
| 4 | −0.006 (0.005) | 0.301 (0.009) | 0.928 | 0.800 | 0.008 (0.006) | 0.704 (0.009) | 0.930 | 0.222 |
| 5 | 0.004 (0.004) | 0.409 (0.009) | 0.941 | 0.665 | 0.012 (0.005) | 0.861 (0.009) | 0.935 | 0.096 |
‘MI’ indicates that the proposed multiple imputation procedure (using the correct value of γ) is used, and ‘IC’ indicates that a standard Cox proportional hazards model assuming independent censoring is used. ‘First parameter’ indicates that the results are for the log hazard ratio of Z = 1 relative to Z = 0; ‘second parameter’ indicates that the results are for the log hazard ratio of Z = 2 relative to Z = 0. Monte Carlo standard errors (MCSE) of the estimated biases are provided in parentheses, and ‘coverage’ denotes the estimated coverage probabilities of 95% confidence intervals. A total of 1000 simulated datasets were produced for each value of γ, and ‘MI’ and ‘IC’ were applied to the same sets of simulated datasets.