| Literature DB >> 26467904 |
Sylvie Scolas1, Anouar El Ghouch1, Catherine Legrand1, Abderrahim Oulhaj2.
Abstract
In standard survival analysis, it is generally assumed that every individual will experience someday the event of interest. However, this is not always the case, as some individuals may not be susceptible to this event. Also, in medical studies, it is frequent that patients come to scheduled interviews and that the time to the event is only known to occur between two visits. That is, the data are interval-censored with a cure fraction. Variable selection in such a setting is of outstanding interest. Covariates impacting the survival are not necessarily the same as those impacting the probability to experience the event. The objective of this paper is to develop a parametric but flexible statistical model to analyze data that are interval-censored and include a fraction of cured individuals when the number of potential covariates may be large. We use the parametric mixture cure model with an accelerated failure time regression model for the survival, along with the extended generalized gamma for the error term. To overcome the issue of non-stable and non-continuous variable selection procedures, we extend the adaptive LASSO to our model. By means of simulation studies, we show good performance of our method and discuss the behavior of estimates with varying cure and censoring proportion. Lastly, our proposed method is illustrated with a real dataset studying the time until conversion to mild cognitive impairment, a possible precursor of Alzheimer's disease.Entities:
Keywords: accelerated failure time; adaptive LASSO; cure model; extended generalized gamma; interval-censoring
Mesh:
Year: 2015 PMID: 26467904 PMCID: PMC5057324 DOI: 10.1002/sim.6767
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Parameter values for three levels of cure proportion and right‐censoring.
| Scenario 1 | Scenario 2 | Scenario 3 | |
|---|---|---|---|
| Cure proportion | 20% | 30% | 40% |
| Right‐Censoring | 40% | 40% | 60% |
|
| 0 | 0,5 | 1 |
|
| 2 | 1 | 0,85 |
|
| −1 | −0,2 | −0,85 |
|
| 14 | 14 | 12 |
Results of 2000 simulations.
| Sample size : | ||||||
|---|---|---|---|---|---|---|
| (20% Cure, 40% RC) | (30% Cure, 40% RC) | (40% Cure, 60% RC) | ||||
| Bias | MSE | Bias | MSE | Bias | MSE | |
| EGG‐AFT mixture cure model | ||||||
|
| −0,023 | 0,107 | 0,010 | 0,098 | 0,214 | 0,811 |
|
| −0,001 | 0,001 | −0,000 | 0,001 | 0,003 | 0,002 |
|
| −0,000 | 0,001 | −0,000 | 0,002 | 0,004 | 0,003 |
|
| −0,001 | 0,002 | 0,003 | 0,002 | 0,004 | 0,005 |
|
| 0,000 | 0,001 | −0,001 | 0,002 | −0,001 | 0,003 |
|
| −0,049 | 0,016 | −0,053 | 0,022 | −0,135 | 0,123 |
|
| 0,014 | 0,025 | 0,007 | 0,026 | 0,013 | 0,045 |
|
| 0,320 | 5,286 | 0,051 | 0,105 | 0,063 | 0,158 |
|
| −0,209 | 1,668 | −0,020 | 0,170 | −0,047 | 0,226 |
|
| 0,066 | 7,085 | 0,002 | 0,246 | 0,020 | 0,284 |
|
| −0,073 | 1,357 | −0,016 | 0,171 | −0,014 | 0,169 |
| Likelihood ratio test | ||||||
| True value of |
|
|
| |||
| Cov. | 6,15% | 41,85% | 74,95% | |||
| Cov. | 93,25% | 39,30% | 9,35% | |||
| EGG‐AFT model without cure | ||||||
|
| −1,577 | 2,622 | −2,210 | 5,038 | −2,315 | 5,675 |
|
| −0,068 | 0,005 | −0,068 | 0,006 | −0,057 | 0,008 |
|
| 0,023 | 0,003 | −0,029 | 0,004 | 0,147 | 0,034 |
|
| −0,002 | 0,003 | 0,002 | 0,005 | 0,006 | 0,010 |
|
| −0,000 | 0,002 | −0,003 | 0,003 | −0,006 | 0,007 |
|
| 0,118 | 0,038 | 0,512 | 0,295 | 0,785 | 0,656 |
|
| 0,243 | 0,096 | 0,024 | 0,050 | 0,226 | 0,118 |
RC, right‐censored; MSE, mean squared error; EGG‐AFT, extended generalized gamma accelarated failure time.
Bias and MSE of the EGG‐AFT mixture cure model in the upper part of the Table; rejection percentage of the likelihood ratio test in the middle; bias and MSE of the EGG‐AFT model in the lower part.
Results of 2000 simulations.
| Sample size : | ||||||
|---|---|---|---|---|---|---|
| (20% Cure, 40% RC) | (30% Cure, 40% RC) | (40% Cure, 60% RC) | ||||
| Bias | MSE | Bias | MSE | Bias | MSE | |
| EGG‐AFT mixture cure model | ||||||
|
| −0,048 | 0,096 | −0,013 | 0,077 | 0,050 | 0,210 |
|
| −0,002 | 0,000 | −0,001 | 0,001 | 0,002 | 0,001 |
|
| −0,000 | 0,001 | −0,000 | 0,001 | 0,001 | 0,003 |
|
| 0,003 | 0,006 | 0,007 | 0,009 | 0,004 | 0,016 |
|
| −0,003 | 0,003 | −0,003 | 0,004 | −0,005 | 0,008 |
|
| −0,025 | 0,012 | −0,029 | 0,016 | −0,058 | 0,054 |
|
| 0,011 | 0,019 | 0,006 | 0,019 | 0,008 | 0,034 |
|
| 0,398 | 2,435 | 0,052 | 0,104 | 0,124 | 0,518 |
|
| −0,298 | 2,111 | −0,031 | 0,148 | −0,100 | 0,492 |
|
| 0,024 | 0,028 | 0,006 | 0,008 | 0,014 | 0,012 |
|
| −0,054 | 1,007 | −0,034 | 0,433 | −0,059 | 0,650 |
| Likelihood ratio test | ||||||
| True value of |
|
|
| |||
| Cov. | 6,50% | 48,40% | 79,70% | |||
| Cov. | 98,30% | 55,00% | 7,60% | |||
| EGG‐AFT model without cure | ||||||
|
| −1,514 | 2,375 | −2,125 | 4,604 | −2,195 | 5,010 |
|
| −0,066 | 0,005 | −0,063 | 0,005 | −0,044 | 0,005 |
|
| 0,028 | 0,002 | −0,024 | 0,003 | 0,161 | 0,035 |
|
| 0,001 | 0,002 | 0,004 | 0,003 | 0,007 | 0,006 |
|
| 0,000 | 0,001 | −0,003 | 0,002 | −0,005 | 0,004 |
|
| 0,142 | 0,035 | 0,549 | 0,321 | 0,822 | 0,702 |
|
| 0,241 | 0,083 | 0,020 | 0,033 | 0,220 | 0,092 |
RC, right‐censored; MSE, mean squared error; EGG‐AFT, extended generalized gamma accelarated failure time.
Bias and MSE of the EGG‐AFT mixture cure model in the upper part of the Table; rejection percentage of the likelihood ratio test in the middle; bias and MSE of the EGG‐AFT model in the lower part.
Results of 2000 simulations.
| Sample size : | ||||||
|---|---|---|---|---|---|---|
| (20% Cure, 40% RC) | (30% Cure, 40% RC) | (40% Cure, 60% RC) | ||||
| Bias | MSE | Bias | MSE | Bias | MSE | |
| EGG‐AFT mixture cure model | ||||||
|
| −0,025 | 0,050 | −0,006 | 0,036 | 0,045 | 0,089 |
|
| −0,001 | 0,000 | −0,001 | 0,000 | 0,002 | 0,001 |
|
| −0,001 | 0,001 | 0,001 | 0,001 | 0,001 | 0,001 |
|
| 0,001 | 0,004 | 0,004 | 0,005 | 0,005 | 0,008 |
|
| 0,000 | 0,002 | −0,002 | 0,002 | 0,000 | 0,004 |
|
| −0,016 | 0,007 | −0,016 | 0,009 | −0,040 | 0,026 |
|
| 0,003 | 0,011 | 0,007 | 0,011 | 0,002 | 0,017 |
|
| 0,122 | 0,397 | 0,022 | 0,050 | 0,030 | 0,085 |
|
| −0,081 | 0,385 | −0,002 | 0,073 | −0,020 | 0,105 |
|
| 0,009 | 0,007 | 0,004 | 0,004 | 0,006 | 0,004 |
|
| −0,025 | 0,399 | −0,007 | 0,237 | 0,016 | 0,268 |
| Likelihood ratio test | ||||||
| True value of |
|
|
| |||
|
| 5,75% | 70,95% | 95,95% | |||
|
| 100% | 78,10% | 6,95% | |||
| EGG‐AFT model without cure | ||||||
|
| −1,474 | 2,218 | −1,974 | 3,943 | −2,474 | 6,167 |
|
| −0,063 | 0,004 | −0,063 | 0,004 | −0,063 | 0,004 |
|
| 0,028 | 0,002 | 0,028 | 0,002 | 0,028 | 0,002 |
|
| 0,000 | 0,001 | 0,000 | 0,001 | 0,000 | 0,001 |
|
| 0,000 | 0,001 | 0,000 | 0,001 | 0,000 | 0,001 |
|
| 0,170 | 0,038 | 0,170 | 0,038 | 0,170 | 0,038 |
|
| 0,222 | 0,064 | 0,222 | 0,064 | 0,222 | 0,064 |
RC, right‐censored; MSE, mean squared error; EGG‐AFT, extended generalized gamma accelarated failure time.
Bias and MSE of the EGG‐AFT mixture cure model in the upper part of the Table; rejection percentage of the likelihood ratio test in the middle; bias and MSE of the EGG‐AFT model in the lower part.
Results of 2000 simulations, with adaptive LASSO.
| Sample size : | ||||||
|---|---|---|---|---|---|---|
| (20% Cure, 40% RC) | (30% Cure, 40% RC) | (40% Cure, 60% RC) | ||||
| Parameter | Bias | MSE | Bias | MSE | Bias | MSE |
|
| 0,007 | 0,207 | 0,109 | 0,271 | 0,007 | 0,504 |
|
| 0,000 | 0,001 | 0,002 | 0,001 | −0,009 | 0,003 |
|
| 0,016 | 0,002 | 0,020 | 0,003 | 0,031 | 0,008 |
|
| −0,011 | 0,002 | −0,012 | 0,003 | −0,010 | 0,006 |
|
| 0,009 | 0,001 | 0,009 | 0,002 | 0,003 | 0,005 |
|
| −0,139 | 0,041 | −0,184 | 0,079 | −0,217 | 0,156 |
|
| 0,071 | 0,047 | 0,086 | 0,056 | 0,129 | 0,127 |
|
| 0,874 | 5,767 | 0,176 | 0,216 | 0,312 | 0,498 |
|
| −0,562 | 5,272 | 0,002 | 0,153 | 0,009 | 0,525 |
|
| −0,138 | 0,415 | −0,182 | 0,059 | −0,185 | 0,125 |
|
| 0,274 | 0,472 | 0,332 | 0,173 | 0,325 | 0,189 |
| Average correct/incorrect number of zero's | ||||||
| Latency | ||||||
| Correct | 9,202 | 9,283 | 8,685 | |||
| Incorrect | 0,006 | 0,010 | 0,086 | |||
| Incidence | ||||||
| Correct | 9,176 | 9,805 | 9,707 | |||
| Incorrect | 1,966 | 2,633 | 2,138 | |||
RC, right‐censored; MSE, mean squared error.
Results of 2000 simulations, with adaptive LASSO.
| Sample size : | ||||||
|---|---|---|---|---|---|---|
| (20% Cure, 40% RC) | (30% Cure, 40% RC) | (40% Cure, 60% RC) | ||||
| Parametere | Biais | MSE | Biais | MSE | Biais | MSE |
|
| −0,031 | 0,099 | 0,080 | 0,109 | 0,178 | 0,276 |
|
| −0,002 | 0,001 | 0,003 | 0,001 | 0,001 | 0,001 |
|
| 0,008 | 0,001 | 0,013 | 0,002 | 0,023 | 0,004 |
|
| −0,007 | 0,001 | −0,007 | 0,002 | −0,010 | 0,003 |
|
| 0,006 | 0,001 | 0,004 | 0,001 | 0,004 | 0,002 |
|
| −0,077 | 0,016 | −0,112 | 0,031 | −0,190 | 0,096 |
|
| 0,034 | 0,020 | 0,046 | 0,025 | 0,076 | 0,049 |
|
| 0,310 | 0,379 | 0,104 | 0,100 | 0,140 | 0,169 |
|
| −0,200 | 0,486 | 0,047 | 0,087 | 0,051 | 0,255 |
|
| −0,175 | 0,073 | −0,183 | 0,049 | −0,180 | 0,050 |
|
| 0,304 | 0,180 | 0,324 | 0,157 | 0,334 | 0,158 |
| Average correct/incorrect number of zero's | ||||||
| Latency | ||||||
| Correct | 9,646 | 9,640 | 9,306 | |||
| Incorrect | 0,000 | 0,001 | 0,019 | |||
| Incidence | ||||||
| Correct | 9,624 | 9,858 | 9,859 | |||
| Incorrect | 1,921 | 2,601 | 2,034 | |||
RC, right‐censored; MSE, mean squared error.
Results of 2000 simulations, with adaptive LASSO.
| Sample size : | ||||||
|---|---|---|---|---|---|---|
| (20% Cure, 40% RC) | (30% Cure, 40% RC) | (40% Cure, 60% RC) | ||||
| Parameter | Bias | MSE | Bias | MSE | Bias | MSE |
|
| −0,017 | 0,045 | 0,029 | 0,042 | 0,178 | 0,152 |
|
| −0,001 | 0,000 | 0,001 | 0,000 | 0,006 | 0,001 |
|
| 0,004 | 0,001 | 0,007 | 0,001 | 0,016 | 0,002 |
|
| −0,005 | 0,001 | −0,004 | 0,001 | −0,004 | 0,002 |
|
| 0,004 | 0,000 | 0,003 | 0,001 | 0,002 | 0,001 |
|
| −0,043 | 0,007 | −0,056 | 0,011 | −0,133 | 0,048 |
|
| 0,019 | 0,011 | 0,025 | 0,012 | 0,038 | 0,020 |
|
| 0,152 | 0,148 | 0,051 | 0,045 | 0,059 | 0,067 |
|
| −0,091 | 0,201 | 0,086 | 0,053 | 0,055 | 0,117 |
|
| −0,182 | 0,053 | −0,178 | 0,047 | −0,189 | 0,043 |
|
| 0,306 | 0,152 | 0,309 | 0,140 | 0,315 | 0,142 |
| Average correct/incorrect number of zero's | ||||||
| Latency | ||||||
| Correct | 9,859 | 9,834 | 9,705 | |||
| Incorrect | 0,000 | 0,000 | 0,002 | |||
| Incidence | ||||||
| Correct | 9,845 | 9,896 | 9,909 | |||
| Incorrect | 1,862 | 2,540 | 1,872 | |||
RC, right‐censored; MSE, mean squared error.
Results of 2000 simulations, with adaptive LASSO, for n = 500 and 25 covariates with zero coefficients.
| (30% Cure, 40% RC) | ||
|---|---|---|
| Parameter | Bias | MSE |
|
| 0,114 | 0,082 |
|
| 0,004 | 0,000 |
|
| 0,012 | 0,001 |
|
| −0,008 | 0,001 |
|
| 0,006 | 0,001 |
|
| −0,132 | 0,030 |
|
| 0,066 | 0,018 |
|
| 0,125 | 0,073 |
|
| 0,045 | 0,070 |
|
| −0,187 | 0,043 |
|
| 0,337 | 0,146 |
| Average correct/incorrect | ||
| number of zero's Latency | ||
| Correct | 24,540 | |
| Incorrect | 0,000 | |
| Incidence | ||
| Correct | 24,813 | |
| Incorrect | 2,539 | |
RC, right‐censored; MSE, mean squared error.
Result of 2000 simulations, with adaptive LASSO for n = 500.
|
|
|
|
| |||||
|---|---|---|---|---|---|---|---|---|
| Parameter | Bias | MSE | Bias | MSE | Bias | MSE | Bias | MSE |
|
| 0,036 | 0,036 | 0,033 | 0,033 | 0,052 | 0,057 | 0,042 | 0,050 |
|
| 0,001 | 0,000 | 0,001 | 0,000 | 0,002 | 0,001 | 0,001 | 0,000 |
|
| 0,008 | 0,001 | 0,007 | 0,001 | 0,010 | 0,001 | 0,009 | 0,001 |
|
| −0,002 | 0,000 | −0,003 | 0,000 | −0,000 | 0,000 | −0,001 | 0,000 |
|
| 0,002 | 0,000 | 0,003 | 0,000 | 0,001 | 0,000 | 0,001 | 0,000 |
|
| −0,060 | 0,011 | −0,054 | 0,010 | −0,081 | 0,017 | −0,071 | 0,014 |
|
| 0,028 | 0,011 | 0,027 | 0,011 | 0,038 | 0,015 | 0,034 | 0,014 |
|
| 0,050 | 0,041 | 0,044 | 0,034 | 0,062 | 0,047 | 0,056 | 0,043 |
|
| 0,086 | 0,051 | 0,103 | 0,048 | 0,082 | 0,055 | 0,087 | 0,053 |
|
| −0,177 | 0,038 | −0,185 | 0,041 | −0,080 | 0,043 | −0,130 | 0,034 |
|
| 0,237 | 0,090 | 0,284 | 0,110 | 0,095 | 0,073 | 0,116 | 0,052 |
| Average correct/incorrect number of zero's | ||||||||
| Latency | ||||||||
| Correct | 9,843 | 9,852 | 9,817 | 9,839 | ||||
| Incorrect | 0,000 | 0,000 | 0,000 | 0,000 | ||||
| Incidence | ||||||||
| Correct | 9,866 | 9,838 | 9,729 | 9,818 | ||||
| Incorrect | 2,155 | 2,345 | 1,745 | 1,736 | ||||
MSE, mean squared error.
Each column represents a different value of ρ in the correlation matrix V. The case where ρ = 0 is given in the second column of Table 7.
Figure 1Turnbull survival curve, taking interval‐censoring into account.
MCI results: adaptive LASSO estimates, standard errors, and exponentiated estimates.
| Parameter | aLASSO | SD | Exp(Estimate) | |
|---|---|---|---|---|
| Latency | Intercept (Lat.) | 2,628 | 0,155 | |
| MMSE | — | — | ||
| Expression | 0,321 | 0,077 | 1,38 | |
| Remote | — | — | ||
| Learning | — | — | ||
| Attention | −0,057 | 0,043 | 0,94 | |
| Praxis | — | — | ||
| Abstract thinking | 0,086 | 0,034 | 1,09 | |
| Perception | 0,182 | 0,052 | 1,20 | |
| ApoE4 | −0,092 | 0,025 | 0,91 | |
| Gender | −0,061 | 0,022 | 0,94 | |
| Age (5y.) | −0,321 | 0,144 | 0,73 | |
| Total education | 0,152 | 0,163 | 1,16 | |
| Incidence | Intercept (Inc.) | 2,657 | 0,985 | |
| MMSE | −2,250 | 0,802 | 0,11 | |
| Expression | — | — | ||
| Remote | — | — | ||
| Learning | −0,969 | 0,425 | 0,38 | |
| Attention | — | — | ||
| Praxis | −1,302 | 0,564 | 0,27 | |
| Abstract thinking | 0,483 | 0,456 | 1,62 | |
| Perception | — | — | ||
| ApoE | −0,556 | 0,264 | 0,57 | |
| Gender | — | — | ||
| Age (5y.) | — | — | ||
| Total education | 2,285 | 2,081 | 9,83 |
MCI, mild cognitive impairment; MMSE, Mini‐Mental State Examination; ApoE E4, Apolipoprotein E4.
Last column gives the increase in time‐to‐the‐event (for the latency) and odds ratio (for incidence).