| Literature DB >> 24950066 |
Tjeerd van der Ploeg1, Frank Datema2, Robert Baatenburg de Jong2, Ewout W Steyerberg3.
Abstract
BACKGROUND: The use of alternative modeling techniques for predicting patient survival is complicated by the fact that some alternative techniques cannot readily deal with censoring, which is essential for analyzing survival data. In the current study, we aimed to demonstrate that pseudo values enable statistically appropriate analyses of survival outcomes when used in seven alternative modeling techniques.Entities:
Mesh:
Year: 2014 PMID: 24950066 PMCID: PMC4065009 DOI: 10.1371/journal.pone.0100234
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Parameters required for the modeling techniques.
|
|
|
| NNET | size and decay |
| RPART | cp-value |
| SVM LINEAR | cost and gamma |
| SVM POLYNOMIAL | cost, gamma and degree |
| SVM RADIAL | cost and gamma |
Figure 1Survival pattern 1282 patients with newly diagnosed HNSCC.
Figure 2Censoring pattern 1282 patients with newly diagnosed HNSCC.
Overall survival and 60-month survival.
| Overall | 60 months | ||||||
| Variable | Value | Total (n) | Events (n) | HR | 95% CI | Survival probability | 95% CI |
| Gender | Male (ref) | 1022 | 789 | 1.00 | – | 0.54 | [0.51−0.57] |
| Female | 260 | 197 | 1.12 | [0.96−1.31] | 0.48 | [0.42–0.54] | |
| Tumor location | Glottic larynx (ref) | 425 | 282 | 1.00 | − | 0.71 | [0.67−0.75] |
| Lip | 85 | 54 | 0.88 | [0.66−1.18] | 0.75 | [0.67−0.85] | |
| Oral cavity | 261 | 210 | 2.04 | [1.70−2.44] | 0.36 | [0.31−0.43] | |
| Oropharynx | 148 | 129 | 2.37 | [1.92−2.92] | 0.37 | [0.30−0.46] | |
| Nasopharynx | 39 | 23 | 1.35 | [0.88−2.06] | 0.52 | [0.37−0.74] | |
| Hypopharynx | 135 | 123 | 2.83 | [2.29−3.51] | 0.33 | [0.26−0.42] | |
| Supraglottic larynx | 189 | 165 | 1.70 | [1.40−2.06] | 0.50 | [0.43−0.57] | |
| T-class | T1 (ref) | 454 | 293 | 1.00 | − | 0.74 | [0.70−0.78] |
| T2 | 354 | 281 | 1.63 | [1.38−1.92] | 0.53 | [0.48−0.58] | |
| T3 | 200 | 170 | 2.26 | [1.87−2.73] | 0.38 | [0.32−0.45] | |
| T4 | 274 | 242 | 3.18 | [2.68−3.78] | 0.27 | [0.22−0.33] | |
| N-class | N0 (ref) | 891 | 641 | 1.00 | − | 0.64 | [0.61−0.67] |
| N1 | 138 | 125 | 2.10 | [1.73−2.54] | 0.33 | [0.26−0.42] | |
| N2 | 174 | 147 | 2.45 | [2.04−2.94] | 0.28 | [0.22−0.36] | |
| N3 | 79 | 73 | 3.82 | [2.99−4.89] | 0.11 | [0.06−0.21] | |
| M-class | M0 (ref) | 1266 | 972 | 1.00 | − | 0.53 | [0.50−0.56] |
| M1 | 16 | 14 | 8.51 | [4.97−14.58] | 0.00 | − | |
| Prior malignancies | No (ref) | 1160 | 880 | 1.00 | − | 0.54 | [0.51−0.57] |
| Yes | 122 | 106 | 1.62 | [1.32−1.98] | 0.36 | [0.28−0.45] | |
| ACE27 | Grade 0 (ref) | 782 | 574 | 1.00 | − | 0.57 | [0.54−0.61] |
| Grade 1 | 239 | 176 | 1.17 | [0.99−1.39] | 0.52 | [0.46−0.59] | |
| Grade 2 | 185 | 164 | 1.66 | [1.40−1.98] | 0.44 | [0.38−0.52] | |
| Grade 3 | 76 | 72 | 2.52 | [1.97−3.23] | 0.25 | [0.17−0.37] | |
| Age class | <50 (ref) | 173 | 100 | 1.00 | − | 0.66 | [0.59−0.74] |
| 50–59 | 339 | 234 | 1.24 | [0.98−1.57] | 0.59 | [0.54−0.65] | |
| 60–69 | 404 | 328 | 1.73 | [1.38−2.16] | 0.52 | [0.47−0.57] | |
| > = 70 | 366 | 324 | 2.53 | [2.02−3.18] | 0.40 | [0.36−0.46] | |
| Total | 1282 | 986 | 0.52 | [0.50−0.55] | |||
HR. Hazard ratio.
CI. Confidence interval.
Performance of models for the outcome ‘dead or alive at 60 months’.
| Dead or alive at 60 months | |||||
| Modeling technique | AUC bootstrap | AUC validated | AUC-apparent | Optimism | Optimism-corrected-AUC |
| LR | 0.809 | 0.797 | 0.803 | 0.012 | 0.791 |
| NNET | 0.880 | 0.810 | 0.855 | 0.070 | 0.785 |
| RPART | 0.769 | 0.741 | 0.753 | 0.028 | 0.725 |
| SVM LINEAR | 0.807 | 0.794 | 0.800 | 0.013 | 0.787 |
| SVM POLYNOMIAL | 0.861 | 0.811 | 0.821 | 0.050 | 0.771 |
| SVM RADIAL | 0.872 | 0.813 | 0.825 | 0.059 | 0.766 |
Performance of models for the outcome ‘pseudo values at 60 months’.
| Pseudo values at 60 months | |||||
| Modeling technique | RMSE bootstrap | RMSE validated | RMSE-apparent | Optimism | Optimism-corrected-RMSE |
| GLM | 0.427 | 0.433 | 0.430 | 0.006 | 0.436 |
| NNET | 0.388 | 0.457 | 0.417 | 0.069 | 0.486 |
| RPART | 0.430 | 0.448 | 0.448 | 0.018 | 0.466 |
| SVM LINEAR | 0.461 | 0.470 | 0.460 | 0.009 | 0.469 |
| SVM POLYNOMIAL | 0.409 | 0.445 | 0.446 | 0.036 | 0.482 |
| SVM RADIAL | 0.428 | 0.446 | 0.442 | 0.018 | 0.460 |
Performance of models for the outcome ‘estimated survival time’.
| Estimated survival time | |||||
| Modeling technique | RMSE bootstrap | RMSE validated | RMSE-apparent | Optimism | Optimism-corrected-RMSE |
| GLM | 76.0 | 77.1 | 76.6 | 1.1 | 77.7 |
| NNET | 80.3 | 83.0 | 81.0 | 2.7 | 83.7 |
| RPART | 76.7 | 80.1 | 79.8 | 3.4 | 83.1 |
| SVM LINEAR | 77.4 | 78.7 | 77.9 | 1.3 | 79.2 |
| SVM POLYNOMIAL | 69.7 | 76.3 | 76.3 | 6.6 | 82.9 |
| SVM RADIAL | 69.7 | 76.4 | 76.8 | 6.7 | 83.4 |
Figure 3Variable importance of the models per outcome.
Mode of the parameter settings identified as optimal in bootstrap samples.
| Outcome | |||
| Modeling technique | Dead or alive at 60 months | Pseudo values at 60 months | Estimated survival time |
| LR | − | − | − |
| NNET | size = 40 | size = 30 | size = 40 |
| RPART | cp = 0.01 | cp = 0.01 | cp = 0.01 |
| SVM LINEAR | cost = 0.5, gamma = 0.001 | cost = 0.5, gamma = 0.001 | cost = 0.5, gamma = 0.001 |
| SVM POLYNOMIAL | cost = 50, gamma = 0.05, degree = 3 | cost = 25, gamma = 0.05, degree = 3 | cost = 50, gamma = 0.05, degree = 3 |
| SVM RADIAL | cost = 50, gamma = 0.05 | cost = 0.5, gamma = 0.05 | cost = 50, gamma = 0.05 |
Logistic regression model for the outcome ‘dead or alive at 60 months’.
| Logistic regression | ||||||
| Variable | Value | B | SE | P-value | OR | 95% CI |
| Tumor location | Glottic larynx (ref) | 0.00 | − | − | 1.00 | − |
| Lip | 0.04 | 0.31 | 0.89 | 1.05 | [0.57−1.91] | |
| Oral cavity | 1.00 | 0.21 | 0.00 | 2.73 | [1.83−4.07] | |
| Oropharynx | 0.76 | 0.25 | 0.00 | 2.15 | [1.32−3.50] | |
| Nasopharynx | −0.09 | 0.41 | 0.82 | 0.91 | [0.41−2.03] | |
| Hypopharynx | 0.80 | 0.26 | 0.00 | 2.21 | [1.33−3.68] | |
| Supraglottic larynx | 0.39 | 0.22 | 0.07 | 1.48 | [0.97−2.26] | |
| ACE27 | Grade 0 (ref) | 0.00 | − | − | 1.00 | − |
| Grade 1 | 0.04 | 0.18 | 0.82 | 1.04 | [0.74−1.47] | |
| Grade 2 | 0.36 | 0.19 | 0.06 | 1.43 | [0.99−2.08] | |
| Grade 3 | 1.09 | 0.31 | 0.00 | 2.97 | [1.62−5.45] | |
| T-class | T1 (ref) | 0.00 | − | − | 1.00 | − |
| T2 | 0.67 | 0.17 | 0.00 | 1.95 | [1.38−2.74] | |
| T3 | 0.90 | 0.21 | 0.00 | 2.47 | [1.62−3.76] | |
| T4 | 1.30 | 0.21 | 0.00 | 3.68 | [2.44−5.55] | |
| N-class | N0 (ref) | 0.00 | − | − | 1.00 | − |
| N1 | 0.73 | 0.22 | 0.00 | 2.08 | [1.34−3.22] | |
| N2 | 1.02 | 0.22 | 0.00 | 2.76 | [1.81−4.22] | |
| N3 | 2.13 | 0.38 | 0.00 | 8.40 | [3.98−17.72] | |
| M-class | M0 (ref) | 0.00 | − | − | 1.00 | − |
| M1 | 1.65 | 0.85 | 0.05 | 5.23 | [0.99−27.63] | |
| Prior malignancies | No (ref) | 0.00 | − | − | 1.00 | − |
| Yes | 1.04 | 0.24 | 0.00 | 2.83 | [1.78−4.50] | |
| Gender | Male (ref) | 0.00 | − | − | 1.00 | − |
| Female | −0.05 | 0.17 | 0.77 | 0.95 | [0.68−1.33] | |
| Age at diagnosis per decade | 0.49 | 0.06 | 0.00 | 1.63 | [1.44−1.84] | |
| Constant | −4.79 | 0.44 | 0.00 | 0.01 | − | |
B: Regression coefficient.
SE: Standard error regression coefficient.
OR: Odds ratio.
CI: Confidence interval.
General linear model for the outcome ‘pseudo values at 60 months’.
| General linear model | |||||
| Variable | Value | B | SE | 95% CI | P-value |
| Tumor location | Glottic larynx (ref) | 0.00 | − | − | − |
| Lip | 0.00 | 0.05 | [−0.11−0.10] | 0.93 | |
| Oral cavity | −0.19 | 0.04 | [−0.26–−0.11] | 0.00 | |
| Oropharynx | −0.14 | 0.05 | [−0.23–−0.05] | 0.00 | |
| Nasopharynx | −0.06 | 0.08 | [−0.21−0.09] | 0.44 | |
| Hypopharynx | −0.15 | 0.05 | [−0.25–−0.06] | 0.00 | |
| Supraglottic larynx | −0.07 | 0.04 | [−0.15−0.01] | 0.08 | |
| ACE27 | Grade 0 (ref) | 0.00 | − | − | − |
| Grade 1 | 0.00 | 0.03 | [−0.06−0.06] | 0.99 | |
| Grade 2 | −0.07 | 0.04 | [−0.14−0.00] | 0.06 | |
| Grade 3 | −0.19 | 0.05 | [−0.29–−0.09] | 0.00 | |
| T-class | T1 (ref) | 0.00 | − | − | − |
| T2 | −0.13 | 0.03 | [−0.20–−0.07] | 0.00 | |
| T3 | −0.19 | 0.04 | [−0.27–−0.11] | 0.00 | |
| T4 | −0.27 | 0.04 | [−0.34–−0.19] | 0.00 | |
| N-class | N0 (ref) | 0.00 | − | − | − |
| N1 | −0.16 | 0.04 | [−0.25–−0.08] | 0.00 | |
| N2 | −0.22 | 0.04 | [−0.29–−0.14] | 0.00 | |
| N3 | −0.37 | 0.05 | [−0.47–−0.26] | 0.00 | |
| M-class | M0 (ref) | 0.00 | − | − | − |
| M1 | −0.27 | 0.11 | [−0.49–−0.05] | 0.02 | |
| Prior malignancies | No (ref) | 0.00 | − | − | − |
| Yes | −0.20 | 0.04 | [−0.28–−0.12] | 0.00 | |
| Gender | Male (ref) | 0.00 | − | − | − |
| Female | 0.01 | 0.03 | [−0.05−0.07] | 0.69 | |
| Age at diagnosis per decade | −0.09 | 0.01 | [−0.11–−0.07] | 0.00 | |
| Constant | 1.38 | 0.07 | [1.24−1.52] | 0.00 | |
B: Regression coefficient.
SE: Standard error regression coefficient.
CI: Confidence interval.
General linear model for the outcome ‘estimated survival time’.
| General linear model | |||||
| Variable | Value | B | SE | 95% CI | P-value |
| Tumor location | Glottic larynx (ref) | 0.00 | − | − | − |
| Lip | −0.79 | 9.44 | [−19.30−17.72] | 0.93 | |
| Oral cavity | −31.29 | 6.79 | [−44.59–−17.98] | 0.00 | |
| Oropharynx | −38.62 | 8.26 | [−54.82–−22.42] | 0.00 | |
| Nasopharynx | −21.39 | 13.66 | [−48.17−5.38] | 0.12 | |
| Hypopharynx | −44.97 | 8.59 | [−61.81–−28.13] | 0.00 | |
| Supraglottic larynx | −23.41 | 7.23 | [−37.59–−9.24] | 0.00 | |
| ACE27 | Grade 0 (ref) | 0.00 | − | − | − |
| Grade 1 | −2.43 | 5.75 | [−13.69−8.83] | 0.67 | |
| Grade 2 | −24.39 | 6.41 | [−36.95–−11.83] | 0.00 | |
| Grade 3 | −41.36 | 9.37 | [−59.72–−23.01] | 0.00 | |
| T-class | T1 (ref) | 0.00 | − | − | − |
| T2 | −25.71 | 5.79 | [−37.06–−14.35] | 0.00 | |
| T3 | −30.65 | 7.18 | [−44.72–−16.58] | 0.00 | |
| T4 | −46.44 | 6.93 | [−60.02–−32.86] | 0.00 | |
| N0 (ref) | 0.00 | − | − | − | |
| N-class | N1 | −27.65 | 7.60 | [−42.54–−12.76] | 0.00 |
| N2 | −36.42 | 7.16 | [−50.45–−22.40] | 0.00 | |
| N3 | −56.29 | 9.70 | [−75.29–−37.28] | 0.00 | |
| M-class | M0 (ref) | 0.00 | − | − | − |
| M1 | −47.31 | 19.71 | [−85.94–−8.68] | 0.02 | |
| Prior malignancies | No (ref) | 0.00 | − | − | − |
| Yes | −38.03 | 7.52 | [−52.76–−23.29] | 0.00 | |
| Gender | Male (ref) | 0.00 | − | − | − |
| Female | 2.99 | 5.56 | [−7.91−13.90] | 0.59 | |
| Age at diagnosis per decade | −22.71 | 1.88 | [−26.39–−19.03] | 0.00 | |
| Constant | 300.47 | 12.58 | [275.82−325.12] | 0.00 | |
B. Regression coefficient.
SE. Standard error regression coefficient.
CI. Confidence interval.