| Literature DB >> 34996366 |
James H McVittie1, David B Wolfson2, Vittorio Addona3, Zhaoheng Li3.
Abstract
When modelling the survival distribution of a disease for which the symptomatic progression of the associated condition is insidious, it is not always clear how to measure the failure/censoring times from some true date of disease onset. In a prevalent cohort study with follow-up, one approach for removing any potential influence from the uncertainty in the measurement of the true onset dates is through the utilization of only the residual lifetimes. As the residual lifetimes are measured from a well-defined screening date (prevalence day) to failure/censoring, these observed time durations are essentially error free. Using residual lifetime data, the nonparametric maximum likelihood estimator (NPMLE) may be used to estimate the underlying survival function. However, the resulting estimator can yield exceptionally wide confidence intervals. Alternatively, while parametric maximum likelihood estimation can yield narrower confidence intervals, it may not be robust to model misspecification. Using only right-censored residual lifetime data, we propose a stacking procedure to overcome the non-robustness of model misspecification; our proposed estimator comprises a linear combination of individual nonparametric/parametric survival function estimators, with optimal stacking weights obtained by minimizing a Brier Score loss function.Entities:
Keywords: Nonparametric estimation; Residual lifetime data; Stacking; Survival analysis
Mesh:
Year: 2022 PMID: 34996366 PMCID: PMC8742399 DOI: 10.1186/s12874-021-01496-3
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Fig. 1Corrected non-parametric maximum likelihood survival function estimates with 95% bootstrapped pointwise confidence limits (black) for varying dementia subgroups in the Canadian Study of Health and Aging along with stacked survival function estimates with 95% bootstrapped pointwise confidence limits (red)
Fig. 2A depiction of a sample of right-censored residual lifetime data. The open circles represent the calendar dates of censoring and the crosses represent the calendar dates of failure. Brackets represent the earliest confirmed time for the start of the failure/censoring time durations. Dashed lines represent uncertainty in the measurement of the underlying onset dates (represented by open squares)
A summary of the simulation studies examining the performance of the proposed stacked survival model estimation procedure
| Simulation Number | Simulation Study Description |
|---|---|
| Simulation 1 | Weibull distributed failure times with various amounts of administrative censoring (10%, 20%, 30%, 40%) acting on the residual failure time data. |
| Two stacked models fitted: All submodels, All submodels except Weibull | |
| Simulation 2 | Mixture model distributed failure times with 30% random censoring. |
| One stacked model fitted: All submodels |
Average discrete integrated squared survival errors for individual and stacked models for a Weibull (2,2) failure time distribution with varying amounts of administrative censoring for samples of size 125 over 100 simulation runs
| Proportion of Administrative Censoring | ||||
|---|---|---|---|---|
| Model | 10% | 20% | 30% | 40% |
| NPMLE | 0.09594 | 0.1639 | 0.2414 | 0.3645 |
| Weibull | 0.02776 | 0.03843 | 0.06635 | 0.09601 |
| Log-Logistic | 0.03244 | 0.03350 | 0.05696 | 0.08661 |
| Log-Normal | 0.03009 | 0.03660 | 0.06222 | 0.07615 |
| Gamma | 0.03056 | 0.04181 | 0.06025 | 0.08460 |
| Stacked Model (all) | 0.02877 | 0.04203 | 0.06680 | 0.09865 |
| Stacked Model (w/o Weibull) | 0.03049 | 0.04303 | 0.06546 | 0.09010 |
Mean weights for a stacked model including all submodels or including all submodels except Weibull. The failure time data were generated according to a Weibull (2, 2) distribution with varying amounts of administrative censoring for samples of size 125 over 100 simulation runs
| Individual submodel type of stacked estimator | |||||
|---|---|---|---|---|---|
| Administrative Censoring Proportion | NPMLE | Weibull | Log-Logistic | Log-Normal | Gamma |
| 10% | 0.01394 | 0.08208 | 0.02781 | 0.04167 | 0.09579 |
| 0.01607 | N/A | 0.05645 | 0.08028 | 0.8472 | |
| 20% | 5.586×10−9 | 0.9061 | 0.01439 | 0.04962 | 0.02992 |
| 8.419×10−9 | N/A | 0.04088 | 0.08901 | 0.8701 | |
| 30% | 3.900×10−9 | 0.8503 | 0.03581 | 0.04999 | 0.06389 |
| 5.520×10−9 | N/A | 0.07797 | 0.08018 | 0.8418 | |
| 40% | 2.169×10−9 | 0.7888 | 0.02656 | 0.01465 | 0.1700 |
| 3.463×10−9 | N/A | 0.1319 | 0.04835 | 0.8197 | |
Fig. 3Graphical comparison of the NPMLE estimate (blue lines), stacked model estimate (without the Weibull submodel included) (red lines) relative to the underlying Weibull failure time survival function (black line) using samples of size 125 over 100 simulation runs with varying amounts of administrative censoring
Fig. 4Graphical comparison of the individual model mean survival estimates (solid red line) with bootstrapped 95% pointwise confidence intervals (dotted red lines) relative to the underlying mixture failure time survival function (solid black line) using samples of size 125 over 100 simulation runs with 30% random censoring (Panel a - NPMLE, Panel b - Stacked Estimator)
Average discrete integrated squared survival errors (DISSE) for individual and stacked models for a mixture failure time distribution with 30% random censoring for samples of size 125 over 100 simulation runs
| Model | Average DISSE |
|---|---|
| NPMLE | 1.478 |
| Weibull | 2.755 |
| Log-Logistic | 5.612 |
| Log-Normal | 4.655 |
| Gamma | 3.075 |
| Stacked Model (all) | 2.273 |
Weights of stacked survival models applied to the three dementia type strata of the Canadian Study of Health and Aging
| Individual submodel type of stacked estimator | ||||
|---|---|---|---|---|
| CSHA Strata | NPMLE | Weibull | Log-Normal | Gamma |
| Probable Alzheimer’s Disease | 1.282×10−8 | 0.9928 | 2.523×10−7 | 0.007240 |
| Possible Alzheimer’s Disease | 1.000×10−8 | 6.556×10−7 | 1.353×10−7 | 0.9999 |
| Vascular Dementia | 1.126×10−8 | 0.7179 | 2.022×10−7 | 0.2821 |