| Literature DB >> 31586211 |
Joeky T Senders1, Patrick Staples2, Alireza Mehrtash3,4, David J Cote1, Martin J B Taphoorn5, David A Reardon6, William B Gormley1, Timothy R Smith1, Marike L Broekman1,7, Omar Arnaout1.
Abstract
BACKGROUND: Although survival statistics in patients with glioblastoma multiforme (GBM) are well-defined at the group level, predicting individual patient survival remains challenging because of significant variation within strata.Entities:
Keywords: Artificial intelligence; Glioblastoma; Machine learning; Predictive analytics; Survival
Year: 2020 PMID: 31586211 PMCID: PMC7061165 DOI: 10.1093/neuros/nyz403
Source DB: PubMed Journal: Neurosurgery ISSN: 0148-396X Impact factor: 4.654
FIGURE 1.Forest plot for the accelerated failure time model characterizing the association between the individual predictors and survival. In the inferential analysis, the estimates for age and tumor size were presented per 10-yr and 10-mm increase, respectively, to reflect the incremental relative survival duration of clinically meaningful intervals. The P value was corrected for multiple testing by means of the Benjamini-Hochberg procedure.
Discriminatory Performance for All Time-to-Event, Continuous, and Binary Survival Models According to the (Integrated) Concordance Index
| C-index (95%CI) | |||
|---|---|---|---|
| Overall survival | 1 yr survival status | Integrated C-index | |
| Time-to-Event Models | |||
| Accelerated failure time | 0.70 (0.70-0.70) | 0.70 (0.70-0.70) | 0.70 (0.70-0.70) |
| Cox proportional hazards regression | 0.69 (0.69-0.70) | 0.69 (0.69-0.70) | 0.69 (0.69-0.70) |
| Boosted decision trees survival | 0.69 (0.69-0.70) | 0.69 (0.69-0.70) | 0.69 (0.69-0.70) |
| Random forest survival | 0.68 (0.68-0.68) | 0.69 (0.69-0.69) | 0.68 (0.68-0.68) |
| Recursive partitioning | 0.68 (0.68-0.68) | 0.68 (0.68-0.68) | 0.68 (0.68-0.68) |
| Continuous and Binary Models | |||
| Boosted decision trees | 0.70 (0.70-0.70) | 0.70 (0.70-0.70) | NA |
| Regularized generalized linear models | 0.70 (0.70-0.70) | 0.70 (0.70-0.70) | NA |
| Generalized linear models | 0.70 (0.70-0.70) | 0.70 (0.70-0.70) | NA |
| Support vector machines | 0.70 (0.70-0.70) | 0.69 (0.69-0.69) | NA |
| Multilayer perceptron | 0.61 (0.61-0.61) | 0.69 (0.69-0.69) | NA |
| Naïve Bayes[ | NA | 0.69 (0.69-0.69) | NA |
| Random forest | 0.69 (0.69-0.69) | 0.69 (0.69-0.69) | NA |
| Extreme boosted decision trees | 0.68 (0.68-0.68) | 0.68 (0.68-0.68) | NA |
| K-nearest neighbors | 0.67 (0.67-0.67) | 0.68 (0.67-0.68) | NA |
| Bagged decision trees | 0.67 (0.66-0.67) | 0.66 (0.66-0.66) | NA |
Abbreviations: 1 yr, one year; C-index, concordance index; not available.
aNaïve Bayes fits to categorical data only.
FIGURE 2.Calibration plot demonstrating a systematic underestimation of survival by the Cox proportional hazards regression model in the 1-yr survival probability range of 0.5 to 0.75 and a well-calibrated accelerated failure time model. Abbreviations: AFT, accelerated failure time; CPHR, Cox proportional hazards regression.
Secondary Metrics for Model Performance and Deployment
| Interpretability | Predictive applicability | Computational efficiency[ | ||||||
|---|---|---|---|---|---|---|---|---|
| Model | Inference | Prediction | Binary | Continuous | Survival curves | Size (Mb) | Load time (s) | Prediction time (s) |
| AFT | X | X | X | X | X | 20 | 0.9 | 1.9 |
| Bagged decision trees | – | X | X | X | – | 16 380 | 1335 | 31.8 |
| Boosted decision trees | – | X | X | X | – | 300 | 8.2 | 2.1 |
| BDTS | – | X | X | X | X | 36 790 | 2455 | 234.3 |
| CPHR | X | X | X | X | X | 37 | 1.7 | 7.5 |
| GLM | X | X | X | X | – | 1 | 0.2 | 1.7 |
| GLMnet | X | X | X | X | – | 109 | 6.7 | 2.3 |
| K-nearest neighbors | – | X | X | X | – | 91 | 5.6 | 1.9 |
| Multilayer perceptron | – | X | X | X | – | 45 | 1.4 | 17.4 |
| Naïve Bayes | – | X | X | – | – | 82 | 2.9 | 13.0 |
| Random forest | – | X | X | X | – | 1100 | 41.4 | 10.1 |
| Random forest survival | – | X | X | X | X | 6350 | 65.7 | 139.0 |
| Recursive partitioning | – | X | X | X | X | 490 | 52.1 | 3.4 |
| Support vector machine | – | X | X | X | – | 111 | 4.8 | 4.4 |
| X-boosted decision trees | – | X | X | X | – | 92 | 2.4 | 1.5 |
Abbreviations: AFT, accelerated failure time; BDTS, boosted decision trees survival; CPHR, Cox proportional hazards regression; GLM(net), (Lasso and elastic-net regularized) generalized linear models; Mb, megabyte; s, seconds; TTE, time to event; X, extreme.
aBased on a 100-fold bootstrapped model.
FIGURE 3.Estimated survival profile of a hypothetical patient (male, 50-yr old, white, non-Hispanic, married, insured, left-sided, frontal lobe, confined to its primary location, 50 mm in size, gross-total resection), plotted per adjuvant treatment strategy. Personalized estimates of overall survival in months (left), 1-yr survival probability (middle), and 5-yr survival curves (right) as predicted by the accelerated failure time model. The boxes and whiskers in the boxplots represent the 50% and 95% CI, respectively. The ribbons in the survival curves represent the 95% CI. Abbreviations: Rx, radiotherapy; Cx, chemotherapy.