| Literature DB >> 28543626 |
Anh Nguyen Duc1, Marcel Wolbers1,2.
Abstract
This paper presents a novel approach to estimation of the cumulative incidence function in the presence of competing risks. The underlying statistical model is specified via a mixture factorization of the joint distribution of the event type and the time to the event. The time to event distributions conditional on the event type are modeled using smooth semi-nonparametric densities. One strength of this approach is that it can handle arbitrary censoring and truncation while relying on mild parametric assumptions. A stepwise forward algorithm for model estimation and adaptive selection of smooth semi-nonparametric polynomial degrees is presented, implemented in the statistical software R, evaluated in a sequence of simulation studies, and applied to data from a clinical trial in cryptococcal meningitis. The simulations demonstrate that the proposed method frequently outperforms both parametric and nonparametric alternatives. They also support the use of 'ad hoc' asymptotic inference to derive confidence intervals. An extension to regression modeling is also presented, and its potential and challenges are discussed.Entities:
Keywords: competing risks; cumulative incidence function; interval censoring; mixture factorization; smooth semi-nonparametric (SNP) estimation
Mesh:
Substances:
Year: 2017 PMID: 28543626 PMCID: PMC5518232 DOI: 10.1002/sim.7331
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Specification of the distribution of uncensored competing risks data for the scenarios investigated in the simulation study.
| Mixture representation based scenario | |||
|---|---|---|---|
| Scenario |
|
|
|
|
| |||
|
|
| 25 | |
|
| |||
|
|
| 50 | |
| Latent failure times‐based scenario | |||
| Scenario |
|
|
|
|
| |||
|
|
| 66 | |
Note: T|D=j is the time to event distribution for event type j conditional on the occurrence of that event type. P 1 is the marginal probability of event type 1. T is the independent latent failure time distribution associated with event type j used in the latent failure times‐based scenario. W(σ −1,e ) refers to a Weibull distribution with shape σ −1 and scale e and to a log‐normal distribution with mean μ and variance σ 2 for the log‐transformed data. S N(μ,σ,ϕ) is a random variable T satisfying , where Z has a smooth semi‐nonparametric distribution with a standard normal base density and polynomial degree K=1 with a corresponding spherical coordinate ϕ.
Frequency with which the proposed smooth semi‐nonparametric estimators based on AIC, BICn, or HQCn, respectively, chose the correct base density and correct polynomial degrees for mixture representation based scenarios (first four rows).
| AIC | BIC | HQC | |||||
|---|---|---|---|---|---|---|---|
| Scenario |
| 100 | 500 | 100 | 500 | 100 | 500 |
| Frequency of correct base density and polynomial degrees | |||||||
|
| 343 | 650 | 649 | 977 | 488 | 888 | |
|
| 162 | 496 | 447 | 769 | 286 | 657 | |
|
| 471 | 657 | 677 | 970 | 597 | 875 | |
|
| 305 | 585 | 528 | 915 | 436 | 787 | |
| Frequency that the selected model chose | |||||||
|
| 250 | 243 | 25 | 50 | 102 | 136 | |
|
| 12 | 45 | 1 | 0 | 2 | 5 | |
AIC, AkaikeŠ's information criterion; BICn, Bayesian information criterion; HQCn, Hannan–Quinn's criterion; LFT, latent failure times.
For the other scenarios, the frequency with which the maximal allowed polynomial degree was chosen (i.e., K 1=3 or K 2=3) is reported (rows 5 and 6). All results are for scenarios with interval censoring.
All frequencies are based on 1000 simulated data sets per scenario.
IC‐1 refers to interval censoring with 10 sub‐intervals and right‐censoring at the 90% quantile.
IC‐2 refers to interval censoring with seven sub‐intervals and right‐censoring at the 50% quantile.
IMSE for different estimation methods for all scenarios with interval censoring.
|
|
| |||
|---|---|---|---|---|
| Scenario |
|
|
|
|
|
| ||||
| LN/SB‐HQC |
|
|
|
|
| WB/SB‐HQC |
|
|
|
|
| NP/SB‐HQC |
|
|
|
|
| SB‐AIC ×104 | 8.71 | 3.72 | 1.53 | 0.62 |
| SB‐BIC | 8.33 | 3.16 | 1.47 | 0.56 |
| SB‐HQC | 8.54 | 3.43 | 1.51 | 0.58 |
|
| ||||
| LN/SB‐HQC |
|
|
|
|
| WB/SB‐HQC |
|
|
|
|
| NP/SB‐HQC |
|
|
|
|
| SB‐AIC ×104 | 9.05 | 4.78 | 1.84 | 1.03 |
| SB‐BIC | 8.77 | 4.31 | 1.79 | 0.81 |
| SB‐HQC | 8.89 | 4.54 | 1.81 | 0.90 |
|
| ||||
| LN/SB‐HQC |
|
|
|
|
| WB/SB‐HQC |
|
|
|
|
| NP/SB‐HQC |
|
|
|
|
| SB‐AIC ×104 | 17.61 | 11.18 | 3.12 | 2.17 |
| SB‐BIC | 16.73 | 11.57 | 2.98 | 2.10 |
| SB‐HQC | 17.19 | 11.31 | 3.04 | 2.13 |
|
| ||||
| LN/SB‐HQC |
|
|
|
|
| WB/SB‐HQC |
|
|
|
|
| NP/SB‐HQC |
|
|
|
|
| SB‐AIC ×104 | 21.25 | 12.52 | 4.10 | 2.58 |
| SB‐BIC | 20.16 | 12.69 | 3.95 | 2.49 |
| SB‐HQC | 20.89 | 12.52 | 4.01 | 2.53 |
|
| ||||
| LN/SB‐HQC |
|
|
|
|
| WB/SB‐HQC |
|
|
|
|
| NP/SB‐HQC |
|
|
|
|
| SB‐AIC ×104 | 17.06 | 14.88 | 3.62 | 2.75 |
| SB‐BIC | 16.86 | 14.09 | 3.77 | 2.69 |
| SB‐HQC | 16.94 | 14.46 | 3.64 | 2.67 |
|
| ||||
| LN/SB‐ |
|
|
|
|
| WB/SB‐HQC |
|
|
|
|
| NP/SB‐HQC |
|
|
|
|
| SB‐AIC ×104 | 20.23 | 15.67 | 3.74 | 2.77 |
| SB‐BIC | 16.86 | 14.09 | 3.77 | 2.69 |
| SB‐HQC | 16.94 | 14.46 | 3.64 | 2.67 |
Note: LN/SB‐HQCn, WB/SB‐HQCn, and NP/SB‐HQCn are the ratios (in bold) of the integrated mean squared error (IMSE) of the parametric log‐normal, Weibull, and the nonparametric models, respectively, versus the SB‐HQCn model with corresponding bootstrap standard errors. For each scenario, the last three rows give IMSE values of smooth semi‐nonparametric‐based models for all information criteria. IC‐1 (IC‐2) refers to interval censoring with 10 (7) sub‐intervals and right‐censoring at the 90% (50%) quantile.
Figure 1Simulation results for SB‐HQCn models for all scenarios with interval censoring and a sample size of n=500. Note: Bold lines show the true C I F 1 for event type 1, and 1−C I F 2 for event type 2 from time 0 to t . Bold dashed lines show the corresponding point‐wise averaged fitted cumulative incidence functions across 1000 simulation runs. Light dashed lines show curves, resulting from the two simulation runs leading to the minimal and maximum average residual from the true curve based on 300 equally spaced time point from 0 to t , that is, they display the worst observed under‐estimation and over‐estimation. From left to right are the scenarios: 2 weibull, 2 SNP stdnorm, logmixturenormal + Weibull (LFT). Top and bottom rows correspond to ‘IC‐1’ and ‘IC‐2’ scenarios, respectively.
Observed coverage probabilities of nominal 95% CI for the cumulative incidence functions at times 0.5t and t for interval‐censored scenarios.
|
|
| |||||||
|---|---|---|---|---|---|---|---|---|
| Time point | 0.5 |
| 0.5 |
| ||||
| Scenario |
|
|
|
|
|
|
|
|
|
| ||||||||
| LN | 94.1 | 63.1 | 95.2 | 81.6 | 92.3 | 8.5 | 95.0 | 39.0 |
| WB | 94.6 | 95.9 | 94.5 | 94.3 | 95.5 | 96.1 | 94.7 | 94.6 |
| SB‐AIC | 94.6 | 82.6 | 93.6 | 92.4 | 94.8 | 90.8 | 95.1 | 93.4 |
| SB‐BIC | 94.0 | 86.3 | 94.2 | 92.9 | 95.5 | 95.1 | 94.7 | 94.5 |
| SB‐HQC | 94.4 | 83.9 | 93.8 | 93.1 | 95.2 | 93.2 | 94.8 | 94.2 |
|
| ||||||||
| LN | 95.3 | 95.5 | 95.8 | 94.9 | 94.6 | 79.9 | 95.2 | 94.1 |
| WB | 95.2 | 95.6 | 95.7 | 94.4 | 94.4 | 95.7 | 95.5 | 94.7 |
| SB‐AIC | 94.3 | 89.4 | 95.5 | 93.6 | 94.4 | 86.7 | 95.6 | 94.7 |
| SB‐BIC | 94.5 | 94.5 | 95.8 | 94.2 | 94.6 | 90.8 | 95.5 | 94.5 |
| SB‐HQC | 94.3 | 93.2 | 95.7 | 94.2 | 94.8 | 89.5 | 95.5 | 94.7 |
|
| ||||||||
| LN | 94.5 | 91.3 | 93.9 | 64.4 | 95.1 | 69.5 | 94.3 | 5.4 |
| WB | 95.0 | 85.2 | 94.0 | 80.5 | 92.3 | 37.0 | 95.1 | 42.0 |
| SB‐AIC | 94.0 | 96.1 | 93.0 | 94.3 | 95.3 | 94.0 | 95.3 | 95.8 |
| SB‐BIC | 94.5 | 94.9 | 93.6 | 93.5 | 95.8 | 94.0 | 95.7 | 95.6 |
| SB‐HQC | 94.1 | 95.8 | 93.4 | 94.1 | 95.7 | 93.9 | 95.6 | 95.6 |
|
| ||||||||
| LN | 94.9 | 94.7 | 94.3 | 95.6 | 92.3 | 82.1 | 94.3 | 94.1 |
| WB | 94.4 | 91.3 | 93.9 | 95.2 | 86.9 | 48.6 | 94.7 | 93.8 |
| SB‐AIC | 93.7 | 94.5 | 94.5 | 95.2 | 92.5 | 95.0 | 95.8 | 94.7 |
| SB‐BIC | 94.0 | 94.4 | 94.2 | 95.4 | 93.3 | 94.9 | 95.5 | 94.5 |
| SB‐HQC | 93.5 | 94.4 | 94.4 | 95.5 | 92.6 | 94.9 | 95.5 | 94.5 |
|
| ||||||||
| LN | 89.5 | 85.6 | 33.3 | 96.5 | 44.5 | 72.9 | 0.1 | 94.2 |
| WB | 84.4 | 93.6 | 56.0 | 95.6 | 22.6 | 93.1 | 3.8 | 91.9 |
| SB‐AIC | 92.4 | 94.1 | 94.8 | 94.5 | 86.8 | 93.9 | 94.2 | 93.7 |
| SB‐BICn | 92.3 | 93.5 | 94.1 | 95.0 | 83.0 | 94.4 | 92.9 | 93.1 |
| SB‐HQCn | 92.1 | 93.7 | 94.3 | 94.3 | 86.4 | 94.5 | 93.4 | 93.3 |
|
| ||||||||
| LN | 94.7 | 93.7 | 95.4 | 95.4 | 96.6 | 91.6 | 95.6 | 95.7 |
| WB | 96.9 | 96.6 | 95.4 | 94.5 | 93.0 | 95.9 | 95.6 | 94.8 |
| SB‐AIC | 93.3 | 94.3 | 95.3 | 94.1 | 93.0 | 93.7 | 95.6 | 95.0 |
| SB‐BIC | 94.2 | 94.5 | 95.3 | 94.5 | 95.1 | 92.5 | 95.6 | 95.2 |
| SB‐HQC | 93.7 | 94.4 | 95.3 | 94.2 | 94.2 | 92.8 | 95.6 | 95.1 |
Note: LN and WB refer to parametric log‐normal and Weibull models. SB‐AIC, SB‐BICn, and SB‐HQCn refer to smooth semi‐nonparametric‐based models using different information criteria. Estimated Monte Carlo standard error of observed coverage ≈0.69%. IC‐1 refers to interval censoring with 10 sub‐intervals and right‐censoring at the 90% quantile. IC‐2 refers to interval censoring with seven sub‐intervals and right‐censoring at the 50% quantile.
Figure 2Estimated cumulative incidence function for the time to fungal clearance (increasing lines) and one minus cumulative incidence function for prior death (decreasing lines) by treatment arm for the smooth semi‐nonparametric‐based estimator using HQCn and the nonparametric estimator. Black lines correspond to amphotericin B monotherapy, gray lines to amphotericin B plus flucytosine. Solid lines refer to smooth semi‐nonparametric estimates, dashed lines to nonparametric estimates. As the nonparametric maximum likelihood estimator for interval‐censored outcomes can only assign mass to a finite number of disjoint intervals, we chose a unique representation by distributing the assigned mass uniformly across the respective intervals.