| Literature DB >> 33305079 |
Melek Akcay1, Durmus Etiz1, Ozer Celik2.
Abstract
PURPOSE: Radical surgery is the most important treatment modality in gastric cancer. Preoperative or postoperative radiation therapy (RT) and perioperative chemotherapy are the treatment options that should be added to surgery. This study aimed to evaluate the overall survival (OS) and recurrence patterns by machine learning in gastric cancer cases undergoing RT. METHODS AND MATERIALS: Between 2012 and 2019, the OS and recurrence patterns of 75 gastric cancer cases receiving RT ± chemotherapy at the Department of Radiation Oncology were evaluated by machine learning. Logistic regression, multilayer perceptron, XGBoost, support vector classification, random forest, and Gaussian Naive Bayes (GNB) algorithms were used to predict OS, hematogenous distant metastases, and peritoneal metastases. After the correlation analysis, the backward feature selection was performed as the variable selection method, and the variables with P values less than .005 were selected.Entities:
Year: 2020 PMID: 33305079 PMCID: PMC7718548 DOI: 10.1016/j.adro.2020.07.007
Source DB: PubMed Journal: Adv Radiat Oncol ISSN: 2452-1094
Patient and tumor characteristics
| Variable | Number of patients (%)/(min-max) |
|---|---|
| Age | Median: 60 (22-78) |
| Sex | |
| Female | 16 (21.3) |
| Male | 59 (78.7) |
| Karnofsky Performance Scale score | Median: 90 (70-100) |
| Tumor location | |
| Proximal | 17 (22.7) |
| Middle | 25 (33.3) |
| Distal | 33 (44) |
| T stage | |
| Ia | 2 (2.7) |
| Ib | 1 (1.3) |
| II | 4 (5.3) |
| III | 43 (57.3) |
| IVa | 24 (32) |
| IVb | 1 (1.3) |
| N stage | |
| N0 | 11 (14.7) |
| N1 | 17 (22) |
| N2 | 21 (28) |
| N3a | 16 (21.3) |
| N3b | 10 (13.3) |
| TNM stage | |
| IB | 1 (1.3) |
| IIA | 10 (13.3) |
| IIB | 17 (22.7) |
| IIIA | 20 (26.7) |
| IIIB | 14 (18.7) |
| IIIC | 13 (17.3) |
| Tumor grade | |
| I (well-differentiated) | 6 (8) |
| II (moderately differentiated) | 31 (41.3) |
| III (poorly differentiated) | 38 (50.7) |
| Lymphatic invasion | |
| Positive | 47 (62.7) |
| Negative | 28 (37.7) |
| Vascular invasion | |
| Positive | 44 (58.7) |
| Negative | 31 (41.3) |
| Perineural invasion | |
| Positive | 46 (61.3) |
| Negative | 29 (38.7) |
| Tumor size, mm | Median: 55 (10-150) |
Treatment characteristics
| Clinical characteristics | Number of patients (%)/(min-max) |
|---|---|
| Radiation therapy dose, Gy | Median: 45 (45-54) |
| Resection type | |
| Total gastrectomy | 56 (74.4) |
| Subtotal gastrectomy | 19 (25.3) |
| Lymph node dissection | |
| D1 | 37 (49.3) |
| D2 | 38 (50.7) |
| Number of dissected lymph nodes | Median: 25 (6-82) |
| Number of metastatic lymph nodes | Median: 4 (0-52) |
| Number of metastatic lymph nodes/number of dissected lymph nodes | Median: 3 (0-5) |
| Surgical margin | |
| R0 | 57 (76) |
| R1 | 14 (18.7) |
| R2 | 4 (5.3) |
| Neoadjuvant chemotherapy | |
| Yes | 12 (16) |
| No | 63 (84) |
| Concurrent chemotherapy | |
| Yes | 63 (84) |
| No | 12 (16) |
| Concurrent chemotherapy regime | |
| FUFA | 38 (50.7) |
| Capecitabine | 25 (33.3) |
| None | 12 (16) |
Figure 1Recurrence patterns.
Machine learning algorithms
| Algorithm | Accuracy rate | Sensitivity | Specificity | AUC | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| (CI) | ||||||||||||
| OS | DM | PR | OS | DM | PR | OS | DM | PR | OS | DM | PR | |
| Logistic regression | 0.63 | 0.83 | 0.94 | 0.71 | 0.80 | 0.94 | 0.60 | 0.87 | 0.94 | 0.64 | 0.83 | 0.95 |
| (0.43-0.93) | (0.71-0.95) | (0.87-1) | ||||||||||
| MLP | 0.45 | 0.91 | 0.92 | 0.46 | 0.89 | 0.94 | 0.42 | 0.94 | 0.90 | 0.45 | 0.92 | 0.92 |
| (0.24-0.66) | (0.82-1) | (0.83-1) | ||||||||||
| XGBoost | 0.63 | 0.86 | 0.94 | 0.71 | 0.93 | 1 | 0.60 | 0.80 | 0.90 | 0.64 | 0.86 | 0.95 |
| (0.43-0.83) | (0.74-0.97) | (0.87-1) | ||||||||||
| SVC | 0.50 | 0.72 | 0.60 | 0.50 | 0.83 | 0.55 | 0.60 | 0.66 | 1 | 0.50 | 0.72 | 0.61 |
| (0.29-0.70) | (0.57-0.86) | (0.45-0.76) | ||||||||||
| Random forest | 0.59 | 0.89 | 0.97 | 0.62 | 0.89 | 1 | 0.57 | 0.89 | 0.95 | 0.59 | 0.89 | 0.97 |
| (0.38-0.79) | (0.78-0.99) | (0.92-1) | ||||||||||
| GNB | 0.81 | 0.69 | 0.80 | 0.81 | 0.81 | 0.94 | 0.81 | 0.64 | 0.85 | 0.82 | 0.69 | 0.89 |
| (0.65-0.97) | (0.54-0.84) | (0.79-0.99) | ||||||||||
Abbreviations: AUC = area under the curve; CI = confidence interval; DM = distant metastasis; GNB = Gaussian Naive Bayes; MLP = multilayer perceptron; OS = overall survival; PR = peritoneal recurrence; SVC = support vector classification.
Confusion matrix for overall survival
| Outcome | Gaussian Naive Bayes | ||
|---|---|---|---|
| Surviving | Deceased | Accuracy, % | |
| Surviving | 9 | 2 | 81.8 |
| Deceased | 1 | 9 | 81.8 |
| Accuracy, % | 81.8 | ||
Figure 2Receiver operating characteristic curve graphs of (a) overall survival, (b) distant metastasis, and (c) peritoneal recurrence.
Confusion matrix for distant metastasis
| Distant metastasis | XGBoost | ||
|---|---|---|---|
| Absent | Present | Accuracy, % | |
| Absent | 17 | 1 | 94 |
| Present | 2 | 16 | 89 |
| Accuracy, % | 91 | ||
Confusion matrix for peritoneal recurrence
| Peritoneal recurrence | Random forest | ||
|---|---|---|---|
| Absent | Present | Accuracy, % | |
| Absent | 18 | 1 | 94 |
| Present | 0 | 19 | 100 |
| Accuracy, % | 97 | ||
Cox regression analysis: overall survival
| Variables | Univariate | Multivariate | ||||
|---|---|---|---|---|---|---|
| OR | 95% CI | OR | 95% CI | |||
| KPS score | 0.965 | 0.935-0.997 | 0.944 | 0.908-1.000 | ||
| Total number of lymph nodes removed | .650 | 1.005 | 0.983-1.028 | 1.071 | 1.024-1.120 | |
| Number of metastatic lymph nodes | .412 | 1.010 | 0.986-1.034 | 0.893 | 0.827-0.965 | |
| Lymph node ratio | .105 | 2.257 | 0.843-6.042 | 54.151 | 4.500-651.567 | |
| Surgical margin | 0.496 | 0.249-0.988 | .282 | 0.626 | 0.267-1.469 | |
| Neoadjuvant CT history | 0.266 | 0.119-0.596 | 0.223 | 0.088-0.561 | ||
| Pretreatment platelet | .271 | 0.998 | 0.995-1.002 | .072 | 0.996 | 0.993-1.000 |
Bold indicates statistical significance.