| Literature DB >> 35455723 |
Alexandros Laios1, Evangelos Kalampokis2,3, Racheal Johnson1, Amudha Thangavelu1, Constantine Tarabanis4, David Nugent1, Diederick De Jong1.
Abstract
Complete surgical cytoreduction (R0 resection) is the single most important prognosticator in epithelial ovarian cancer (EOC). Explainable Artificial Intelligence (XAI) could clarify the influence of static and real-time features in the R0 resection prediction. We aimed to develop an AI-based predictive model for the R0 resection outcome, apply a methodology to explain the prediction, and evaluate the interpretability by analysing feature interactions. The retrospective cohort finally assessed 571 consecutive advanced-stage EOC patients who underwent cytoreductive surgery. An eXtreme Gradient Boosting (XGBoost) algorithm was employed to develop the predictive model including mostly patient- and surgery-specific variables. The Shapley Additive explanations (SHAP) framework was used to provide global and local explainability for the predictive model. The XGBoost accurately predicted R0 resection (area under curve [AUC] = 0.866; 95% confidence interval [CI] = 0.8-0.93). We identified "turning points" that increased the probability of complete cytoreduction including Intraoperative Mapping of Ovarian Cancer Score and Peritoneal Carcinomatosis Index < 4 and <5, respectively, followed by Surgical Complexity Score > 4, patient's age < 60 years, and largest tumour bulk < 5 cm in a surgical environment of optimized infrastructural support. We demonstrated high model accuracy for the R0 resection prediction in EOC patients and provided novel global and local feature explainability that can be used for quality control and internal audit.Entities:
Keywords: Explainable Artificial Intelligence; complete cytoreduction; epithelial ovarian cancer
Year: 2022 PMID: 35455723 PMCID: PMC9030484 DOI: 10.3390/jpm12040607
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Cohort descriptive statistics: values are mean ± SD or n (%).
| Demographic Characteristics | Overall | Train Set | Test Set | Non R0 | R0 | ||
|---|---|---|---|---|---|---|---|
| Age (y) | 63.5 ± 11.2 | 63.6 ± 11 | 63.2 ± 11.7 | 0.71 | 65.6 ± 10.1 | 62.4 ± 11.6 |
|
| Histology | 0.217 |
| |||||
| Serous | 504 (0.88) | 350 (0.88) | 153 (0.89) | 186 (0.94) | 318 (0.85) | ||
| Mucinous | 13 (0.02) | 7 (0.017) | 6 (0.035) | 1 (0.005) | 12 (0.03) | ||
| Clear cell/endometrioid | 33 (0.06) | 23 (0.057) | 10 (0.058) | 6 (0.03) | 27 (0.07) | ||
| Miscellaneous | 22 (0.04) | 19 (0.047) | 3 (0.017) | 4 (0.02) | 18 (0.05) | ||
| Primary = 0/recurrent = 1 | 0.37 | ||||||
| Primary | 561 (0.98) | 370 (0.66) | 191 (0.32) | 0.7 | 192 (0.98) | 369 (0.98) | |
| Recurrent | 10 (0.02) | 7 (0.02) | 3 (0.02) | ||||
| Grade (G1 = 0/G3 = 1) | 516 (0.9) | 354 (0.89) | 162 (0.94) | 174 (0.89) | 342 (0.91) | 0.43 | |
| FIGO stage (S3 = 0/S4 = 1) | 157 (0.27) | 106 (0.27) | 51 (0.3) | 59 (0.3) | 98 (0.26) | 0.36 | |
| Performance status (WHO) | 0.74 |
| |||||
| 0 | 273 (0.48) | 192 (0.48) | 80 (0.46) | 70 (0.35) | 203 (0.54) | ||
| 1 | 212 (0.37) | 144 (0.36) | 68 (0.39) | 90 (0.46) | 122 (0.32) | ||
| 2 | 68 (0.12) | 50 (0.125) | 18 (0.1) | 28 (0.14) | 40 (0.1) | ||
| 3 | 19 (0.03) | 13 (0.276) | 6 (0.35) | 9 (0.04) | 10 (0.02) | ||
| Age of consultant (y) | 49.2 ± 6 | 49.3 ± 6 | 48.9 ± 6 | 0.46 | 49.2 ± 6.3 | 49.2 ± 5.9 | 0.99 |
| Timing of surgery | 0.6 | 0.38 | |||||
| Interval debulking surgery | 396 (0.69) | 278 (0.7) | 118 (0.69) | 141 (0.72) | 255 (0.68) | ||
| Primary debulking surgery | 175 (0.31) | 113 (0.64) | 62 (0.36) | 122 (0.7) | 53 (0.3) | ||
| Year | 0.7 |
| |||||
| 2014 | 91 (0.158) | 63 (0.158) | 27 (0.156) | 44 (0.22) | 47 (0.12) | ||
| 2015 | 93 (0.162) | 65 (0.163) | 28 (0.16) | 36 (0.18) | 57 (0.15) | ||
| 2016 | 108 (0.19) | 74 (0.185) | 34 (0.198) | 38 (0.19) | 70 (0.19) | ||
| 2017 | 96 (0.17) | 64 (0.16) | 32 (0.186) | 31 (0.16) | 65 (0.17) | ||
| 2018 | 82 (0.14) | 55 (0.138) | 27 (0.157) | 23 (0.12) | 59 (0.16) | ||
| 2019 | 102 (0.18) | 78 (0.195) | 24 (0.14) | 25 (0.13) | 77 (0.21) | ||
| Pre-treatment CA125 | 1572.7 ± 2993.2 | 1542.9 ± 3070.7 | 1641.8 ± 2812.9 | 0.7 | 1790.9 ± 3207 | 1458.7 ± 2873.1 | 0.22 |
| Pre-surgery CA125 | 411.79 ± 1170 | 387.3 ± 943.7 | 468.6 ± 1576.7 | 0.52 | 451 ± 931.7 | 391.3 ± 1277.7 | 0.52 |
| Size of largest disease bulk (cm) | 8.9 ± 5.58 | 9.0 ± 5.79 | 8.63 ± 5.09 | 0.43 | 9.79 ± 5.24 | 8.41 ± 5.7 |
|
| Peritoneal Carcinomatosis Index (PCI) | 7.39 ± 4.53 | 7.39 ± 4.52 | 7.4 ± 4.56 | 0.99 | 8.91 ± 4.31 | 6.6 ± 4.44 |
|
| Surgical Complexity Score (SCS) | 3.8 ± 2.12 | 3.79 ± 2.14 | 3.82 ± 2.09 | 0.89 | 3.1 ± 1.44 | 4.17 ± 2.32 |
|
| Time procedure (min) | 170.36 ± 77.48 | 170.1 ± 79.48 | 170.96 ± 72.85 | 0.9 | 161.84 ± 63.81 | 174.81 ± 83.47 | 0.039 |
| Site of largest tumour deposit | 0.28 |
| |||||
| Ovary | 298 (0.52) | 216 (0.54) | 82 (0.477) | 83 (0.42) | 215 (0.57) | ||
| Omentum | 258 (0.45) | 171 (0.43) | 86 (0.5) | 110 (0.56) | 148 (0.39) | ||
| Miscellaneous | 16 (0.03) | 12 (0.03) | 4 (0.023) | 4 (0.02) | 12 (0.03) | ||
| Intra Operative Mapping Score | 4.92 ± 1.99 | 4.95 ± 1.99 | 4.85 ± 2.01 | 0.58 | 5.96 ± 1.69 | 4.38 ± 1.93 |
|
| Ascites (intra-op) | 131 (0.23) | 93 (0.23) | 38 (0.22) | 49 (0.25) | 82 (0.22) | 0.458 |
Figure 1Prediction performance of the XGBoost model in the test cohort through the Receiver Operator Characteristic (ROC) (AUC = 0.866) plot demonstrating also the CI (AUC = 0.8–0.93). Note that the ROC curves depend on the order of the feature relative risk values without the need to choose a threshold.
Performance evaluation scores of the test cohort.
| Precision | Recall | f1-Score | |
|---|---|---|---|
| CC0 | 0.91 | 0.87 | 0.89 |
| Non-CC0 | 0.71 | 0.78 | 0.75 |
Figure 2A set of beeswarm plots for global explainability of CC0 resection prediction. Dots correspond to the individual EOC patients in the study.
Figure 3SHAP dependence plots of the top-six global explainability features versus their SHAP value for R0 resection prediction: (A) IMO score, (B) PCI, (C) SCS, and (D) year of surgery.
Figure 4SHAP Interaction Value Dependence plots: (A) IMO score with type of surgery (B) PCI with time of the procedure (C) PCI with ECOG status (D) Size of largest tumor deposit with SCS.
Figure 5Succinct visual summary of feature integration by the XAI model into a single risk for R0 resection prediction. The SHAP force plots illustrate examples of explained risks for individual patients. For R0 resection risk, blue features have values that increased the risk, while red features decreased the risk. The combination of impacts of all features is the predicted R0 prediction risk. (A–D) The odds for R0 resection range between 1.54 and 2.35 times higher than expected (E). The odds for incomplete cytoreduction are 3.21 times higher than expected. Each feature impact value represents the change in risk when that feature’s value is known versus unknown. These examples clearly demonstrated the complex interactions between patients and ovarian cancer specific features in a dynamic well supported surgical environment.
Figure 6Distribution of features for global and local explainability of an AI-based model for prediction of complete cytoreduction in EOC patients. B; baseline, T; transition, E; evaluation.