| Literature DB >> 28608396 |
Kengo Nagashima1, Yasunori Sato1.
Abstract
In the estimation of Cox regression models, maximum partial likelihood estimates might be infinite in a monotone likelihood setting, where partial likelihood converges to a finite value and parameter estimates converge to infinite values. To address monotone likelihood, previous studies have applied Firth's bias correction method to Cox regression models. However, while the model selection criteria for Firth's penalized partial likelihood approach have not yet been studied, a heuristic AIC-type information criterion can be used in a statistical package. Application of the heuristic information criterion to data obtained from a prospective observational study of patients with multiple brain metastases indicated that the heuristic information criterion selects models with many parameters and ignores the adequacy of the model. Moreover, we showed that the heuristic information criterion tends to select models with many regression parameters as the sample size increases. Thereby, in the present study, we propose an alternative AIC-type information criterion based on the risk function. A Bayesian information criterion type was also evaluated. Further, the presented simulation results confirm that the proposed criteria performed well in a monotone likelihood setting. The proposed AIC-type criterion was applied to prospective observational study data.Entities:
Keywords: Akaike's information criterion; model selection; monotone likelihood; penalized partial likelihood; survival analysis
Mesh:
Year: 2017 PMID: 28608396 PMCID: PMC6084330 DOI: 10.1002/sim.7368
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Number of events (leptomeningeal dissemination) and censored values for the study data (n=1073).
| Covariate | Group | Event | Censored | % Censored |
|---|---|---|---|---|
|
| <65 | 69 | 393 | 85.1 |
| ⩾65 | 76 | 535 | 87.6 | |
|
| Female | 62 | 370 | 85.6 |
| Male | 83 | 558 | 87.1 | |
|
| ⩾80 | 132 | 810 | 86.0 |
| ⩽70 | 13 | 118 | 90.1 | |
|
| 1 | 49 | 365 | 88.2 |
| 2–4 | 61 | 412 | 87.1 | |
| 5–10 | 35 | 151 | 81.2 | |
|
| <1.6cm | 76 | 455 | 85.7 |
| ⩾1.6 cm | 69 | 473 | 87.3 | |
|
| <1.9 mL | 75 | 459 | 86.0 |
| ⩾1.9 mL | 70 | 469 | 87.0 | |
|
| Lung | 116 | 705 | 85.9 |
| Breast | 17 | 95 | 84.8 | |
| Gastrointestinal | 9 | 66 | 88.0 | |
| Kidney | 0 | 32 | 100.0 | |
| Other | 3 | 30 | 90.9 | |
|
| Not controlled | 103 | 634 | 86.0 |
| Controlled | 42 | 294 | 87.5 | |
|
| No | 105 | 656 | 86.2 |
| Yes | 40 | 272 | 87.2 | |
| Total | 145 | 928 | 86.5 |
Note: ntumor, number of tumors; kps, Karnofsky performance status; diameter, maximum diameter of largest tumor; volume, cumulative tumor volume; ptumor, primary tumor category; status, extracerebral disease status; neuro, neurological symptoms.
The top five models based on AIC∗ for the study data.
| Model | AIC∗ | ||||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| 1731.50 |
|
|
|
|
|
|
|
|
| 1731.80 | |
|
|
|
|
|
|
|
|
| 1731.85 | |
|
|
|
|
|
|
|
|
| 1731.85 | |
|
|
|
|
|
|
|
| 1732.16 | ||
Note: ntumor, number of tumors; kps, Karnofsky performance status; diameter, maximum diameter of largest tumor; volume, cumulative tumor volume; ptumor, primary tumor category; status, extracerebral disease status; neuro, neurological symptoms; AIC, Akaike's information criterion.
Parameter estimates of the best model based on AIC∗ for the study data.
| Usual Cox | Firth's penalized partial | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Covariate | regression model | likelihood approach | |||||||||
|
| SE | 95% CI |
|
|
| SE | 95% CI |
| |||
|
| 1.01 | 0.17 | 0.72 | 1.40 | 0.98 | 0.98 | 1.00 | 0.17 | 0.72 | 1.40 | 0.98 |
|
| 1.20 | 0.19 | 0.83 | 1.73 | 0.34 | 0.33 | 1.19 | 0.19 | 0.83 | 1.72 | 0.34 |
|
| |||||||||||
| 1 vs. 2–4 | 0.72 | 0.20 | 0.49 | 1.06 | 0.09 | 0.09 | 0.72 | 0.20 | 0.49 | 1.06 | 0.09 |
| 5–10 vs. 2–4 | 1.58 | 0.22 | 1.03 | 2.41 | 0.04 | 0.04 | 1.59 | 0.22 | 1.04 | 2.43 | 0.04 |
|
| 1.04 | 0.32 | 0.55 | 1.96 | 0.90 | 0.90 | 1.07 | 0.32 | 0.57 | 2.00 | 0.83 |
|
| 0.90 | 0.33 | 0.47 | 1.69 | 0.73 | 0.73 | 0.90 | 0.33 | 0.47 | 1.70 | 0.74 |
|
| 1.10 | 0.32 | 0.59 | 2.08 | 0.76 | 0.76 | 1.11 | 0.32 | 0.59 | 2.08 | 0.76 |
|
| |||||||||||
| Breast vs. lung | 1.03 | 0.29 | 0.58 | 1.83 | 0.91 | 0.91 | 1.05 | 0.29 | 0.60 | 1.87 | 0.85 |
| GI vs. lung | 1.55 | 0.37 | 0.75 | 3.21 | 0.24 | 0.26 | 1.61 | 0.37 | 0.79 | 3.31 | 0.21 |
| Kidney vs. lung | 0.00 | 543.30 | 0.00 | – | 0.98 | <0.01 | 0.12 | 1.43 | 0.01 | 1.91 | 0.02 |
| Others vs. lung | 0.77 | 0.59 | 0.24 | 2.45 | 0.66 | 0.64 | 0.89 | 0.55 | 0.30 | 2.63 | 0.83 |
|
| 1.02 | 0.19 | 0.71 | 1.47 | 0.92 | 0.92 | 1.03 | 0.19 | 0.71 | 1.48 | 0.89 |
|
| 1.50 | 0.23 | 0.96 | 2.36 | 0.07 | 0.08 | 1.51 | 0.23 | 0.97 | 2.37 | 0.07 |
Note: HR, hazard ratio; kps, Karnofsky performance status; ntumor, number of tumors; diameter, maximum diameter of largest tumor; volume, cumulative tumor volume; ptumor, primary tumor category; status, extracerebral disease status; neuro, neurological symptoms; GI, gastrointestinal; nc., not controlled; LR, likelihood ratio; SE, standard error.
The mean of the difference between the estimated MR and the value of the information criterion in each model and its 5 and 95 percentiles, and the estimated MR for the selected model (the proportion of covariates: q=0.5; the number of simulations: R=20,000).
|
|
|
| Mean difference (5 percentile, 95 percentile) |
| ||||
|---|---|---|---|---|---|---|---|---|
| Model 6 (true model) | Model 11 (full model) | |||||||
| AICF | AIC∗ | AICF | AIC∗ | AICF | AIC∗ | |||
| 90 | 1 | 100 | −0.4 (−43.9, 45.9) | −2.7 (−44.2, 42.1) | −1.1 (−44.4, 45.5) | −4.8 (−44.7, 39.2) | 94.38 | 95.05 |
| 90 | 1 | 200 | 0.1 (−71.4, 74.6) | −4.4 (−74.8, 68.9) | −0.1 (−71.6, 74.9) | −7.7 (−77.1, 65.5) | 212.87 | 214.55 |
| 90 | 1 | 1000 | 1.6 (−206.1, 218.1) | −8.0 (−215.2, 208.0) | 1.6 (−206.9, 218.2) | −14.4 (−222.1, 201.4) | 1373.64 | 1375.61 |
| 90 | 2 | 100 | −0.4 (−43.8, 45.8) | −2.7 (−44.1, 42.0) | −1.0 (−44.2, 45.6) | −4.7 (−44.6, 39.3) | 94.40 | 95.04 |
| 90 | 2 | 200 | −0.6 (−72.0, 74.4) | −5.2 (−75.4, 68.8) | −0.9 (−72.4, 74.6) | −8.5 (−77.8, 65.2) | 212.82 | 214.48 |
| 90 | 2 | 1000 | 1.3 (−205.9, 217.1) | −8.3 (−215.1, 207.0) | 1.3 (−206.6, 217.4) | −14.8 (−221.8, 200.6) | 1375.21 | 1377.21 |
| 90 | 4 | 100 | −0.7 (−44.2, 45.7) | −3.0 (−44.5, 41.9) | −1.4 (−44.6, 45.4) | −5.1 (−45.0, 39.1) | 94.73 | 95.36 |
| 90 | 4 | 200 | 0.0 (−71.6, 74.2) | −4.6 (−74.9, 68.6) | −0.3 (−71.5, 74.3) | −7.9 (−77.1, 65.0) | 212.88 | 214.50 |
| 90 | 4 | 1000 | −1.8 (−216.9, 214.8) | −11.4 (−226.1, 204.7) | −1.9 (−215.8, 213.4) | −17.9 (−231.0, 196.7) | 1376.07 | 1378.04 |
| 90 | 16 | 100 | −0.7 (−43.4, 45.9) | −3.0 (−43.8, 42.1) | −1.4 (−44.0, 45.2) | −5.1 (−44.4, 38.9) | 94.49 | 95.15 |
| 90 | 16 | 200 | −0.6 (−71.3, 74.0) | −5.2 (−74.7, 68.3) | −0.9 (−71.5, 74.0) | −8.5 (−77.1, 64.6) | 212.88 | 214.54 |
| 90 | 16 | 1000 | 0.1 (−207.5, 217.3) | −9.5 (−216.6, 207.3) | 0.1 (−207.6, 217.4) | −15.9 (−222.8, 200.6) | 1374.48 | 1376.42 |
| 50 | 1 | 100 | 0.1 (−64.8, 62.8) | −7.4 (−71.8, 54.9) | −0.4 (−65.8, 62.3) | −12.8 (−77.4, 49.1) | 432.37 | 434.56 |
| 50 | 1 | 200 | −1.1 (−110.0, 104.1) | −10.7 (−119.2, 94.2) | −1.3 (−109.7, 104.7) | −17.2 (−125.1, 88.2) | 1000.80 | 1002.75 |
| 50 | 1 | 1000 | 3.0 (−322.5, 325.7) | −11.5 (−336.8, 311.1) | 2.9 (−322.5, 325.9) | −21.2 (−346.3, 301.5) | 6602.19 | 6603.65 |
| 50 | 2 | 100 | −0.4 (−64.5, 63.2) | −7.8 (−71.4, 55.2) | −0.9 (−65.6, 62.8) | −13.3 (−77.1, 49.7) | 432.28 | 434.21 |
| 50 | 2 | 200 | 0.9 (−108.1, 107.4) | −8.7 (−117.3, 97.4) | 0.6 (−108.1, 106.9) | −15.3 (−123.5, 90.4) | 999.82 | 1001.35 |
| 50 | 2 | 1000 | −2.9 (−322.7, 319.4) | −17.4 (−337.0, 304.8) | −3.0 (−322.5, 318.6) | −27.1 (−346.4, 294.3) | 6601.73 | 6601.85 |
| 50 | 4 | 100 | −0.9 (−66.0, 61.9) | −8.4 (−73.0, 54.0) | −1.6 (−66.9, 61.3) | −14.0 (−78.4, 48.2) | 432.62 | 434.21 |
| 50 | 4 | 200 | 0.1 (−107.2, 104.3) | −9.5 (−116.4, 94.4) | −0.3 (−107.3, 103.9) | −16.3 (−122.6, 87.4) | 999.69 | 1000.66 |
| 50 | 4 | 1000 | −3.0 (−323.1, 312.3) | −17.4 (−337.4, 297.7) | −3.2 (−323.1, 311.9) | −27.3 (−347.0, 287.6) | 6595.55 | 6595.42 |
| 50 | 16 | 100 | 0.1 (−64.3, 63.4) | −7.3 (−71.2, 55.5) | −0.7 (−65.6, 62.4) | −13.1 (−77.0, 49.3) | 431.82 | 433.14 |
| 50 | 16 | 200 | −1.1 (−107.8, 104.4) | −10.6 (−117.0, 94.5) | −1.6 (−108.1, 104.4) | −17.6 (−123.4, 87.9) | 999.22 | 999.75 |
| 50 | 16 | 1000 | 0.8 (−320.5, 319.0) | −13.7 (−334.8, 304.4) | 0.5 (−321.0, 318.5) | −23.6 (−344.8, 294.1) | 6589.13 | 6589.34 |
| 0 | 1 | 100 | −0.4 (−9.9, 5.9) | −9.8 (−19.2,−3.5) | −1.1 (−11.2, 6.0) | −16.7 (−26.7,−9.7) | 728.44 | 728.71 |
| 0 | 1 | 200 | −0.1 (−12.1, 8.8) | −11.7 (−23.7,−2.8) | −0.5 (−13.2, 9.0) | −19.8 (−32.4,−10.3) | 1722.43 | 1722.27 |
| 0 | 1 | 1000 | 0.2 (−23.8, 21.1) | −16.2 (−40.3, 4.6) | 0.1 (−24.3, 21.3) | −27.4 (−51.8,−6.3) | 11779.42 | 11780.44 |
| 0 | 2 | 100 | −0.5 (−17.2, 14.1) | −9.6 (−26.2, 4.7) | −1.2 (−18.4, 13.8) | −16.6 (−33.6,−1.7) | 705.35 | 706.37 |
| 0 | 2 | 200 | −0.2 (−23.6, 20.8) | −11.6 (−34.8, 9.3) | −0.5 (−23.9, 20.5) | −19.5 (−42.9, 1.4) | 1675.67 | 1676.76 |
| 0 | 2 | 1000 | 0.4 (−48.9, 48.0) | −15.9 (−65.2, 31.7) | 0.3 (−49.3, 48.2) | −27.0 (−76.6, 20.8) | 11548.20 | 11549.24 |
| 0 | 4 | 100 | −0.5 (−24.1, 21.7) | −9.2 (−32.6, 12.8) | −1.1 (−24.9, 21.4) | −16.0 (−39.5, 6.2) | 657.00 | 658.21 |
| 0 | 4 | 200 | −0.5 (−33.3, 30.8) | −11.4 (−44.1, 19.8) | −0.9 (−34.0, 30.7) | −19.5 (−52.4, 12.0) | 1578.08 | 1579.22 |
| 0 | 4 | 1000 | −0.4 (−72.1, 69.9) | −16.2 (−87.8, 54.0) | −0.4 (−72.1, 69.9) | −27.3 (−98.8, 43.0) | 11053.44 | 11054.48 |
| 0 | 16 | 100 | −0.3 (−26.8, 26.5) | −7.9 (−34.0, 18.4) | −0.8 (−27.7, 26.2) | −14.6 (−41.0, 12.0) | 575.12 | 576.30 |
| 0 | 16 | 200 | −0.1 (−37.4, 37.8) | −10.0 (−47.0, 27.6) | −0.4 (−37.8, 37.6) | −17.9 (−55.1, 19.8) | 1411.05 | 1412.15 |
| 0 | 16 | 1000 | 0.4 (−81.5, 84.4) | −14.4 (−96.3, 69.4) | 0.4 (−82.0, 84.4) | −25.5 (−107.8, 58.5) | 10203.57 | 10204.62 |
MR, mean risk; Mean difference (5 percentile, 95 percentile), mean of the difference between the estimated mean risk, , and the value of the information criterion in each model and its 5 and 95 percentiles; , estimated MR for the selected model based on new data; c, proportion of random censoring; θ, regression parameters; n, total sample size. The values that are superior to other are highlighted.
The selection probability (the proportion of covariates: q=0.5; the number of simulations: R=20,000).
|
|
|
| Model 6 (true model) | Model 11 (full model) | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| AICF | AIC∗ | BICF | BIC∗ | AICF | AIC∗ | BICF | BIC∗ | |||
| 90 | 1.3 | 100 | 0.060 | 0.106 | 0.050 | 0.087 | 0.008 | 0.039 | 0.007 | 0.016 |
| 90 | 1.3 | 200 | 0.058 | 0.121 | 0.026 | 0.094 | 0.005 | 0.239 | 0.001 | 0.016 |
| 90 | 1.3 | 1000 | 0.059 | 0.000 | 0.005 | 0.094 | 0.005 | 1.000 | 0.000 | 0.019 |
| 90 | 2 | 100 | 0.063 | 0.110 | 0.052 | 0.092 | 0.008 | 0.041 | 0.007 | 0.019 |
| 90 | 2 | 200 | 0.060 | 0.116 | 0.024 | 0.093 | 0.005 | 0.238 | 0.001 | 0.018 |
| 90 | 2 | 1000 | 0.062 | 0.000 | 0.006 | 0.096 | 0.004 | 1.000 | 0.000 | 0.018 |
| 90 | 4 | 100 | 0.061 | 0.106 | 0.052 | 0.086 | 0.008 | 0.042 | 0.007 | 0.018 |
| 90 | 4 | 200 | 0.061 | 0.115 | 0.026 | 0.092 | 0.006 | 0.239 | 0.001 | 0.018 |
| 90 | 4 | 1000 | 0.055 | 0.000 | 0.005 | 0.090 | 0.005 | 1.000 | 0.000 | 0.019 |
| 90 | 16 | 100 | 0.062 | 0.110 | 0.052 | 0.091 | 0.008 | 0.043 | 0.006 | 0.018 |
| 90 | 16 | 200 | 0.058 | 0.119 | 0.023 | 0.093 | 0.006 | 0.234 | 0.001 | 0.017 |
| 90 | 16 | 1000 | 0.060 | 0.000 | 0.005 | 0.098 | 0.005 | 1.000 | 0.000 | 0.020 |
| 50 | 1.3 | 100 | 0.062 | 0.000 | 0.011 | 0.095 | 0.007 | 1.000 | 0.000 | 0.021 |
| 50 | 1.3 | 200 | 0.064 | 0.000 | 0.006 | 0.104 | 0.006 | 1.000 | 0.000 | 0.022 |
| 50 | 1.3 | 1000 | 0.091 | 0.000 | 0.003 | 0.135 | 0.007 | 1.000 | 0.000 | 0.027 |
| 50 | 2 | 100 | 0.082 | 0.000 | 0.018 | 0.122 | 0.010 | 1.000 | 0.000 | 0.027 |
| 50 | 2 | 200 | 0.099 | 0.000 | 0.015 | 0.140 | 0.010 | 1.000 | 0.000 | 0.031 |
| 50 | 2 | 1000 | 0.258 | 0.000 | 0.028 | 0.294 | 0.025 | 1.000 | 0.000 | 0.070 |
| 50 | 4 | 100 | 0.109 | 0.000 | 0.030 | 0.148 | 0.017 | 1.000 | 0.000 | 0.039 |
| 50 | 4 | 200 | 0.154 | 0.000 | 0.031 | 0.194 | 0.020 | 1.000 | 0.000 | 0.052 |
| 50 | 4 | 1000 | 0.469 | 0.000 | 0.142 | 0.462 | 0.053 | 1.000 | 0.000 | 0.116 |
| 50 | 16 | 100 | 0.133 | 0.000 | 0.044 | 0.172 | 0.022 | 1.000 | 0.002 | 0.050 |
| 50 | 16 | 200 | 0.203 | 0.000 | 0.054 | 0.242 | 0.029 | 1.000 | 0.001 | 0.066 |
| 50 | 16 | 1000 | 0.596 | 0.000 | 0.305 | 0.544 | 0.072 | 1.000 | 0.000 | 0.143 |
| 0 | 1.3 | 100 | 0.270 | 0.000 | 0.087 | 0.304 | 0.033 | 1.000 | 0.001 | 0.067 |
| 0 | 1.3 | 200 | 0.468 | 0.000 | 0.188 | 0.468 | 0.054 | 1.000 | 0.001 | 0.108 |
| 0 | 1.3 | 1000 | 0.778 | 0.000 | 0.919 | 0.658 | 0.085 | 1.000 | 0.001 | 0.155 |
| 0 | 2 | 100 | 0.745 | 0.000 | 0.820 | 0.662 | 0.091 | 1.000 | 0.007 | 0.145 |
| 0 | 2 | 200 | 0.778 | 0.000 | 0.968 | 0.670 | 0.086 | 1.000 | 0.003 | 0.150 |
| 0 | 2 | 1000 | 0.786 | 0.000 | 0.991 | 0.670 | 0.081 | 1.000 | 0.001 | 0.149 |
| 0 | 4 | 100 | 0.772 | 0.000 | 0.959 | 0.676 | 0.089 | 1.000 | 0.007 | 0.144 |
| 0 | 4 | 200 | 0.773 | 0.000 | 0.971 | 0.664 | 0.085 | 1.000 | 0.004 | 0.150 |
| 0 | 4 | 1000 | 0.786 | 0.000 | 0.990 | 0.670 | 0.081 | 1.000 | 0.001 | 0.149 |
| 0 | 16 | 100 | 0.774 | 0.000 | 0.958 | 0.690 | 0.088 | 1.000 | 0.007 | 0.137 |
| 0 | 16 | 200 | 0.779 | 0.000 | 0.974 | 0.679 | 0.085 | 1.000 | 0.003 | 0.144 |
| 0 | 16 | 1000 | 0.788 | 0.000 | 0.991 | 0.668 | 0.081 | 1.000 | 0.001 | 0.151 |
Note: c, proportion of random censoring; θ, regression parameters; n, total sample size. The values that are superior to other are highlighted.
The top five models based on AICF for the study data.
| Model | AICF | ||||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
| 1753.84 | ||||||
|
|
| 1754.72 | |||||||
|
|
|
|
| 1754.90 | |||||
|
|
|
|
| 1755.83 | |||||
|
|
|
|
| 1755.83 | |||||
Note: ntumor, number of tumors; diameter, maximum diameter of largest tumor; ptumor, primary tumor category; status, extracerebral disease status; neuro, neurological symptoms; AICF, AIC for Firth's penalized partial likelihood approach.
Parameter estimates of the best model based on AICF for the study data.
| Covariate |
| SE | 95% CI |
| |
|---|---|---|---|---|---|
|
| |||||
| 1 vs. 2–4 | 0.71 | 0.19 | 0.49 | 1.05 | 0.08 |
| 5–10 vs. 2–4 | 1.57 | 0.21 | 1.03 | 2.38 | 0.04 |
|
| |||||
| Breast vs. lung | 0.94 | 0.26 | 0.57 | 1.57 | 0.82 |
| GI vs. lung | 1.60 | 0.36 | 0.79 | 3.26 | 0.21 |
| Renal cell vs. lung | 0.12 | 1.43 | 0.01 | 1.99 | 0.02 |
| Others vs. lung | 0.87 | 0.55 | 0.30 | 2.56 | 0.79 |
|
| 1.53 | 0.20 | 1.04 | 2.24 | 0.04 |
Note: HR, hazard ratio; ntumor, number of tumors; ptumor, primary tumor category; neuro, neurological symptoms; GI, gastrointestinal; LR, likelihood ratio; SE, standard error.