| Literature DB >> 35154743 |
Christina E Holm1, Clare F Grazal2, Mathias Raedkjaer3, Thomas Baad-Hansen3, Rajpal Nandra4, Robert Grimer4, Jonathan A Forsberg2, Michael Moerk Petersen1, Michala Skovlund Soerensen1.
Abstract
BACKGROUND: Bone sarcomas often present late with advanced stage at diagnosis and an according, varying short-term survival. In 2016, Nandra et al. generated a Bayesian belief network model for 1-year survival in patients with bone sarcomas. The purpose of this study is: (1) to externally validate the prior 1-year Bayesian belief network prediction model for survival in patients with bone sarcomas and (2) to develop a gradient boosting machine model using Nandra et al.'s cohort and evaluate whether the gradient boosting machine model outperforms the Bayesian belief network model when externally validated in an independent Danish population cohort.Entities:
Keywords: Artificial intelligence; bone sarcoma; machine learning; prediction; survival
Year: 2022 PMID: 35154743 PMCID: PMC8832594 DOI: 10.1177/20503121221076387
Source DB: PubMed Journal: SAGE Open Med ISSN: 2050-3121
Figure 1.By shuffling copies of all features, the chosen Boruta algorithm trains a Random Forest on the overall data. Features are then rejected or confirmed. Confirmed features are ranked with their relative influence in the GBM model as demonstrated.
Distribution and comparison of baseline variables between training and validation cohort.
| Variable | Level | Training cohort | Validation cohort | Total | P value |
|---|---|---|---|---|---|
| Gender | 0.22
| ||||
| Female | 1451 (42) | 338 (44) | 1789 (42) | ||
| Male | 2042 (59) | 430 (56) | 2472 (58) | ||
| Missing | 0 | 3 | 3 | ||
| Age | <0.0001
| ||||
| Median (IQR) | 23 (14–51) | 44 (22–62) | 26 (15–53) | ||
| Missing | 0 | 3 | 3 | ||
| Tumor size (cm) | <0.0001
| ||||
| Median (IQR) | 10 (7–13) | 6(3–10) | 8 (2–12) | ||
| Missing | 1796 | 0 | 1796 | ||
| Grade | <0.0001
| ||||
| High | 2641 (76) | 293 (49) | 2934 (72) | ||
| Intermediate | 374 (11) | 143 (24) | 517 (13) | ||
| Low | 478 (14) | 158 (27) | 636 (16) | ||
| Missing | 0 | 177 | 177 | ||
| Histology | <0.0001
| ||||
| Osteosarcoma | 1572 (45) | 174 (25) | 1746 (41) | ||
| Chondrosarcoma | 793 (23) | 326 (46) | 1119 (26) | ||
| Ewings | 653 (19) | 114 (16) | 767 (18) | ||
| Sarcoma | 182 (5) | 26 (3) | 191 (4) | ||
| Chordoma | 70 (2) | 34 (5) | 104 (2) | ||
| Other (19 histologic diagnoses) | 223 (6) | 36 (5) | 259 (6) | ||
| Missing | 0 | 61 | 61 | ||
| Pathologic fracture at diagnosis | <0.0001
| ||||
| No | 3035 (87) | 729 (95) | 3764 (88) | ||
| Yes | 458 (13) | 42 (5) | 500 (12) | ||
| Missing | 0 | 0 | 0 | ||
| Anatomic location | <0.0001
| ||||
| Head and neck | 20 (1) | 50 (7) | 70 (2) | ||
| Lower extremity | 2118 (61) | 355 (47) | 2473 (58) | ||
| Pelvic girdle | 642 (18) | 117 (16) | 759 (18) | ||
| Spine | 0 | 32 (4) | 32 (1) | ||
| Upper extremity | 471 (14) | 103 (14) | 574 (14) | ||
| Upper trunk | 230 (7) | 93 (12) | 323 (8) | ||
| Missing | 12 | 21 | 33 | ||
| Metastasis at diagnosis | 0.63
| ||||
| No | 3010 (86) | 651 (87) | 3661 (86) | ||
| Yes | 483 (14) | 98 (13) | 581 (14) | ||
| Missing | 0 | 22 | 22 | ||
| Status at 1 year after diagnosis | 0.009
| ||||
| Alive | 3099 (89) | 655 (85) | 3754 (88) | ||
| Dead | 394 (11) | 113 (15) | 507 (12) | ||
| Missing | 0 | 3 | 3 | ||
| Year of diagnosis | |||||
| Missing | 222 | 3 | 225 | – |
IQR: interquartile range.
Mann–Whitney U-test.
Chi-square test.
Figure 2.ROC curves of the external validation of the1-year survival BBN model. The discriminatory accuracy of the BBN model for survival yielded poor power (0.68).
Figure 3.ROC curves of the internal validation of the 1-year survival GBM model. The discriminatory accuracy of the GBM model for survival was classified as good (0.75).
Figure 4.Net benefit plotted on the decision curve analysis graph against threshold probabilities demonstrating the benefit of intervention based on decision to treat from model output. The curve demonstrates a net benefit if using the model at thresholds above 0.50 compared to assuming all patients survive. For thresholds below 0.50, the model is no better or no worse that assuming all patients will survive.
Figure 5.ROC curves of the external validation of the 1-year survival BBN model. The discriminatory accuracy of the GBM model for survival yielded poor power (AUC: 0.63.)