| Literature DB >> 33019901 |
Camille Maringe1, Aurélien Belot1, Bernard Rachet1.
Abstract
Despite a large choice of models, functional forms and types of effects, the selection of excess hazard models for prediction of population cancer survival is not widespread in the literature. We propose multi-model inference based on excess hazard model(s) selected using Akaike information criteria or Bayesian information criteria for prediction and projection of cancer survival. We evaluate the properties of this approach using empirical data of patients diagnosed with breast, colon or lung cancer in 1990-2011. We artificially censor the data on 31 December 2010 and predict five-year survival for the 2010 and 2011 cohorts. We compare these predictions to the observed five-year cohort estimates of cancer survival and contrast them to predictions from an a priori selected simple model, and from the period approach. We illustrate the approach by replicating it for cohorts of patients for which stage at diagnosis and other important prognosis factors are available. We find that model-averaged predictions and projections of survival have close to minimal differences with the Pohar-Perme estimation of survival in many instances, particularly in subgroups of the population. Advantages of information-criterion based model selection include (i) transparent model-building strategy, (ii) accounting for model selection uncertainty, (iii) no a priori assumption for effects, and (iv) projections for patients outside of the sample.Entities:
Keywords: Akaike information criteria; Bayesian information criteria; Cancer survival; multi-model inference; prediction; projection
Mesh:
Year: 2020 PMID: 33019901 PMCID: PMC7543029 DOI: 10.1177/0962280220934501
Source DB: PubMed Journal: Stat Methods Med Res ISSN: 0962-2802 Impact factor: 3.021
Figure 1.Structure of the data as used in the low-resolution data setting.
Models selected following model selection algorithms, by cancer and cohort of patients used in model selection.
| RMISD at 5 years | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
| Age (A) | Year of diagnosis (Y) | Deprivation (D) | Interactions | XIC[ | XIC[ | XIC[ | 2010b | 2011[ | |
| Breast cancer | ||||||||||
|
| ||||||||||
| AIC | 2005–2010 | NL TD | NL TD | TD | A*D (TD), Y*D | 205,554.5 | 18 | 0.030 | 0.028 | |
| 2000–2010 | NL TD | NL TD | TD | A*D (TD), Y*D | 589,478.3 | 61 | 0.032 | 0.032 | ||
| 1995–2010 | NL TD | NL TD | TD | A*D (TD), Y*D (TD) | 1,072,626.0 | 69 | 0.032 | 0.033 | ||
| 1995–2010 | NL TD | NL TD | TD | A*D (TD), Y*D | 1,072,628.0 | 31 | 104 | |||
| 1990–2010 | NL TD | NL TD | TD | A*D (TD), Y*D (TD) | 1,605,374.0 | 12 | 0.039 | 0.042 | ||
| BIC | 2005–2010 | NL TD | NL TD | PH | 205,742.0 | 18.3 | 0.026 | 0.026 | ||
| 2000–2010 | NL TD | NL TD | TD | Y*D | 589,840.6 | 10.4 | 0.033 | 0.032 | ||
| 1995–2010 | NL TD | NL TD | TD | Y*D | 1,073,076.0 | 406.0 | 0.033 | 0.033 | ||
| 1990–2010 | NL TD | NL TD | TD | Y*D (TD) | 1,605,849.0 | 53.4 | 0.039 | 0.042 | ||
|
| ||||||||||
| AIC | 2005–2010 | NL TD | NL TD | TD | A*D | 205,576.9 | 0.024 | 0.027 | ||
| 2000–2010 | NL TD | NL TD | TD | A*D | 589,540.4 | 0.031 | 0.034 | |||
| 1995–2010 | NL TD | NL TD | TD | A*D | 1,072,766.0 | 0.030 | 0.035 | |||
| 1990–2010 | NL TD | NL TD | TD | A*D | 1,605,607.0 | 0.039 | 0.045 | |||
| BIC | 2005–2010 | NL TD | NL TD | PH | 205,742.0 | 18.3 | 0.026 | 0.026 | ||
| 2000–2010 | NL TD | NL TD | TD | 589,766.8 | 8.6 | 0.031 | 0.033 | |||
| 1995–2010 | NL TD | NL TD | TD | 1,073,035.0 | 13.8 | 0.030 | 0.034 | |||
| 1990–2010 | NL TD | NL TD | TD | A*D | 1,605,901.0 | 9.5 | 0.039 | 0.045 | ||
|
| ||||||||||
| 2005–2010 | L PH | L PH | PH | 208,763.6 | 0.069 | 0.075 | ||||
| 2000–2010 | L PH | L PH | PH | 598,469.7 | 0.079 | 0.087 | ||||
| 1995–2010 | L PH | L PH | PH | 1,087,460.8 | 0.088 | 0.098 | ||||
| 1990–2010 | L PH | L PH | PH | 1,625,430.8 | 0.089 | 0.098 | ||||
|
| ||||||||||
| Colon cancer | ||||||||||
|
| ||||||||||
| AIC | 2005–2010 | NL TD | L TD | TD | 97,225.4 | 65 | 0.094 | 0.117 | ||
| 2005–2010 | NL TD | NLTD | TD | 97,226.7 | 35 | 4 | ||||
| 2000–2010 | NL TD | NL TD | TD | A*D (TD), Y*D | 215,214.2 | 39 | 0.097 | 0.110 | ||
| 1995–2010 | NL TD | NL TD | TD | Y*D | 338,571.2 | 192 | 0.093 | 0.108 | ||
| 1990–2010 | NL TD | NL TD | TD | A*D (TD), Y*D | 455,678.2 | 85 | 0.095 | 0.110 | ||
| BIC | 2005–2010 | NL TD | NL TD | PH | 97,379.7 | 637.0 | 0.095 | 0.116 | ||
| 2000–2010 | NL TD | NL TD | TD | 215,438.8 | 1,483.9 | 0.093 | 0.108 | |||
| 1995–2010 | NL TD | NL TD | TD | 338,747.7 | 171.3 | 0.091 | 0.109 | |||
| 1990–2010 | NL TD | NL TD | TD | 455,968.5 | 3,315.6 | 0.093 | 0.111 | |||
|
| ||||||||||
| AIC | 2005–2010 | NL TD | NL TD | TD | A*D | 97,225.2 | 67 | 0.090 | 0.115 | |
| 2005–2010 | NL TD | NL TD | TD | 97,226.7 | 33 | |||||
| 2000–2010 | NL TD | NL TD | TD | A*D | 215,253.6 | 0.094 | 0.111 | |||
| 1995–2010 | NL TD | NL TD | TD | A*D | 338,556.8 | 0.090 | 0.109 | |||
| 1990–2010 | NL TD | NL TD | TD | A*D | 455,763.4 | 0.092 | 0.112 | |||
| BIC | 2005–2010 | NL TD | NL TD | PH | 97,379.7 | 67.3 | 0.095 | 0.116 | ||
| 2000–2010 | NL TD | NL TD | TD | 215,438.8 | 41.2 | 0.093 | 0.108 | |||
| 1995–2010 | NL TD | NL TD | TD | 338,747.7 | 72.7 | 0.091 | 0.109 | |||
| 1990–2010 | NL TD | NL TD | TD | 455,968.5 | 66.9 | 0.093 | 0.111 | |||
|
| ||||||||||
| 2005–2010 | L PH | L PH | PH | 98,608.6 | 0.175 | 0.170 | ||||
| 2000–2010 | L PH | L PH | PH | 217,925.7 | 0.155 | 0.150 | ||||
| 1995–2010 | L PH | L PH | PH | 342,308.5 | 0.147 | 0.138 | ||||
| 1990–2010 | L PH | L PH | PH | 460,688.7 | 0.141 | 0.138 | ||||
|
| ||||||||||
| Lung cancer | ||||||||||
|
| ||||||||||
| AIC | 2005–2010 | NL TD | NL TD | TD | Y*D | 110,543.9 | 410 | 0.122 | 0.142 | |
| 2000–2010 | NL TD | NL TD | TD | A*D, Y*D | 220,096.1 | 31 | 0.103 | 0.128 | ||
| 1995–2010 | NL TD | NL TD | TD | A*D, Y*D | 323,089.4 | 1,041 | 0.102 | 0.127 | ||
| 1990–2010 | NL TD | NL TD | TD | A*D, Y*D | 418,484.9 | 1,386 | 0.102 | 0.126 | ||
| BIC | 2005–2010 | NL TD | NL TD | PH | 101,805.6 | 573.1 | 0.120 | 0.148 | ||
| 2000–2010 | NL TD | NL | PH | 202,668.9 | 932.1 | 0.107 | 0.125 | |||
| 1995–2010 | NL TD | NL TD | PH | 290,457.4 | 423.2 | 0.105 | 0.129 | |||
| 1990–2010 | NL TD | NL TD | TD | 378,957.9 | 193.5 | 0.105 | 0.127 | |||
|
| ||||||||||
| AIC | 2005–2010 | NL TD | NL TD | TD | A*D | 101,644.7 | 0.112 | 0.147 | ||
| 2000–2010 | NL TD | NL TD | TD | A*D | 202,427.4 | 0.102 | 0.128 | |||
| 1995–2010 | NL TD | NL TD | TD | A*D | 290,194.5 | 0.101 | 0.125 | |||
| 1990–2010 | NL TD | NL TD | TD | A*D | 378,710.8 | 0.100 | 0.126 | |||
| BIC | 2005–2010 | NL TD | NL TD | PH | 101,805.6 | 69.8 | 0.120 | 0.148 | ||
| 2000–2010 | NL TD | NL | PH | 202,668.9 | 75.6 | 0.107 | 0.125 | |||
| 1995–2010 | NL TD | NL TD | PH | 290,457.4 | 63.1 | 0.105 | 0.129 | |||
| 1990–2010 | NL TD | NL TD | TD | 378,957.9 | 55.0 | 0.105 | 0.127 | |||
|
| ||||||||||
| 2005–2010 | L PH | L PH | PH | 102,492.2 | 0.119 | 0.113 | ||||
| 2000–2010 | L PH | L PH | PH | 203,852.5 | 0.119 | 0.113 | ||||
| 1995–2010 | L PH | L PH | PH | 292,678.4 | 0.125 | 0.117 | ||||
| 1990–2010 | L PH | L PH | PH | 381,722.1 | 0.124 | 0.117 | ||||
|
| 0.076 | |||||||||
L: Linear; NL: non-linear; TD: time-dependent; PH: proportional hazard; *: interaction; AIC: Akaike information criteria; BIC: Bayesian or Schwarz information criteria.
aXIC stands for AIC or BIC depending on model.
cRMISD are calculated by averaging the integrated square differences measured in each of the age and deprivation groups.
dPrediction of five-year survival for patients diagnosed in 2010 and 2011.
dA model with all variables modelled with a linear proportional effect.
Figure 2.Integrated Square Difference (ISD) between NS predicted by each AIC or BIC model-averaged, simple models, and the period approach, compared to the PP cohort survival for patients diagnosed in 2010, by age group, deprivation from 1990 to 2010 and from 2005 to 2010 cohorts of patients used in model selection.
Figure 3.Net survival curves: comparison between the PP estimates and estimates from 1990 to 2010 and from 2005 to 2010 cohorts of patients used in model-averaging from AIC and BIC model selection.
Models selected, and associated AIC, BIC for the prediction and projection of four-year cancer survival.
| Age (A) | Year of diagnosis (Y) | Deprivation (D) | Stage (S) | Performance status (PS, lung) | Presentation: screening (breast) emergency (lung) | Interactions | XIC[ | XIC[ | XIC[ | |
|---|---|---|---|---|---|---|---|---|---|---|
| | ||||||||||
| (adapted) | L TD | NL TD | PH | TD | PH | 9,481.6 | 23.4 | |||
| L TD | NL | PH | TD | PH | 9,480.4 | 42.3 | ||||
| L TD | NL TD | PH | TD | TD | 9,482.1 | 18.2 | ||||
| L | NL | PH | TD | TD | 9,482.4 | 16.1 | 2.2 | |||
| (mfpigen) | L TD | NL | PH | TD | PH | 9,480.4 | 17.8 | |||
| NL TD | L | PH | TD | PH | A*D | 9,481.1 | 13.1 | |||
| NL TD | L | PH | TD | PH | 9,480.7 | 15.7 | ||||
| L TD | NL TD | PH | TD | PH | 9,481.6 | 9.9 | ||||
| NL | L | PH | TD | TD | A*D | 9,482.1 | 7.8 | |||
| NL | L | PH | TD | TD | 9,482.1 | 7.7 | ||||
| L TD | NL TD | PH | TD | TD | 9,482.1 | 7.7 | ||||
| L | NL | PH | TD | TD | 9,482.4 | 6.8 | ||||
| NL | L | PH | TD | PH | A*D | 9,482.3 | 7.0 | |||
| NL | L | PH | TD | PH | 9,482.4 | 6.6 | 2.2 | |||
| BIC-selection (adapted & mfpigen) | L | NL | PH | TD | PH | 9,576.3[ | ||||
| Simple modela | L | L | PH | PH | PH | 9,619.2 | ||||
|
| ||||||||||
| (adapted) | L | NL | TD | TD | TD | TD | A*D | 7,820.2 | 15.5 | |
| L | NL TD | PH | TD | TD | TD | A*D | 7,819.0 | 29.4 | ||
| L | NL | PH | TD | TD | TD | A*D | 7,819.1 | 26.8 | ||
| L | NL TD | TD | TD | TD | TD | A*D | 7,820.1 | 16.6 | ||
| L TD | NL TD | PH | TD | TD | TD | A*D | 7,820.8 | 11.6 | 316.3 | |
| (mfpigen) | NL | L TD | PH | TD | TD | TD | A*D | 7,819.9 | 2.2 | |
| BIC-selection (adapted & mfpigen) | L | NL | PH | TD | TD | TD | 7,967.4[ | |||
| Simple modela | L | L | PH | PH | PH | PH | 8,369.8 |
L: Linear; NL: non-linear; TD: time-dependent; PH: proportional hazard; *: interaction; AIC: Akaike information criteria; BIC: Bayesian or Schwarz information criteria.
aA model with all variables modelled with a linear proportional effect.
bXIC stands for AIC or BIC depending on model.
cBIC value.
Figure 4.Up to four-year net survival for patients diagnosed with breast or lung cancer by age and stage for patients in the third deprivation quintile.