| Literature DB >> 34922474 |
Bozhi Hu1, Chao Wang1, Kewei Jiang1, Zhanlong Shen1, Xiaodong Yang1, Mujun Yin1, Bin Liang1, Qiwei Xie1, Yingjiang Ye2, Zhidong Gao3.
Abstract
INTRODUCTION: Gastrointestinal stromal tumor (GIST) is the most common gastrointestinal soft tissue tumor. Clinical diagnosis mainly relies on enhanced CT, endoscopy and endoscopic ultrasound (EUS), but the misdiagnosis rate is still high without fine needle aspiration biopsy. We aim to develop a novel diagnostic model by analyzing the preoperative data of the patients.Entities:
Keywords: Diagnostic model; Gastrointestinal stromal tumor; XGBoost
Mesh:
Year: 2021 PMID: 34922474 PMCID: PMC8684147 DOI: 10.1186/s12876-021-02048-1
Source DB: PubMed Journal: BMC Gastroenterol ISSN: 1471-230X Impact factor: 3.067
Characteristics of patients included in this study
| Characteristic | All data (n = 124) | ||
|---|---|---|---|
| GIST (n = 90) | Others (n = 34) | ||
| Tumor size* | 4.35 ± 2.87 | 3.87 ± 3.35 | 0.484 |
| Long/short diameter* | 1.26 ± 0.29 | 1.52 ± 0.61 | 0.032 |
| CT value* | 34.18 ± 5.69 | 36.79 ± 9.97 | 0.192 |
| Arterial phase enhancement** | 14.60 ± 13.29 | 17.00 ± 13.38 | 0.800 |
| Venous phase enhancement** | 34.15 ± 16.63 | 35.85 ± 19.91 | 0.423 |
| Homogeneous enhancement* | |||
| Yes | 46 | 22 | 0.218 |
| No | 33 | 8 | |
| Bleeding# | |||
| Yes | 9 | 3 | 1.000 |
| No | 66 | 28 | |
| Ulcer on the surface# | |||
| Yes | 12 | 3 | 0.545 |
| No | 63 | 28 | |
| Calcification inside## | |||
| Yes | 7 | 4 | 0.734 |
| No | 33 | 15 | |
| Liquid area inside## | |||
| Yes | 12 | 0 | 0.006 |
| No | 28 | 19 | |
| White blood cell (× 109/L)+ | 5.74 ± 2.14 | 5.55 ± 1.43 | 0.645 |
| Neutrophil (× 109/L)+ | 3.49 ± 1.93 | 3.24 ± 1.33 | 0.475 |
| Lymphocyte (× 109/L)+ | 1.62 ± 0.52 | 1.67 ± 0.57 | 0.616 |
| Platelet (× 109/L)+ | 219.5 ± 56.50 | 228.00 ± 81.77 | 0.517 |
| Haemoglobin (g/L)+ | 129.77 ± 23.78 | 128.29 ± 18.17 | 0.745 |
| ALT+ | 21.18 ± 10.86 | 17.09 ± 8.08 | 0.026 |
| AST+ | 22.78 ± 10.79 | 20.03 ± 7.30 | 0.173 |
| Albmin (g/L)+ | 40.25 ± 3.87 | 40.76 ± 3.53 | 0.501 |
| Fibrinogen+ | 302.36 ± 56.55 | 303.50 ± 70.41 | 0.925 |
| De Ritis ratio++ | 1.21 ± 0.44 | 1.30 ± 0.52 | 0.324 |
| PNI++ | 48.33 ± 5.12 | 49.12 ± 4.34 | 0.428 |
| SII++ | 527.31 ± 394.00 | 518.67 ± 400.14 | 0.914 |
| PLR++ | 147.67 ± 57.70 | 155.44 ± 93.01 | 0.577 |
| NLR++ | 2.42 ± 1.73 | 2.29 ± 1.65 | 0.712 |
PNI = serum albumin (g/L) + 0.005 × lymphocyte count (/mm3)
SII = Platelet (counts/L) × Neutrophils (counts/L)/Lymphocytes (counts/L)
De Ritis ratio = ALT/AST
ALT alanine transaminase, AST glutamic oxaloacetic transaminase, PNI prognostic nutritional index, SII systemic immune-inflammation index, PLR platelet–lymphocyte ratio, NLR neutrophillymphocyte ratio
*Tumor performance under CT scan
**Tumor performance under enhanced CT scan
#Tumor performance under endoscopy
##Tumor performance under EUS
+Counts in peripheral blood
++Calculated by indicators in peripheral blood
Fig. 1The process of predictor selection. In the initial XGBoost model, we used all clinical data before surgery to construct the model. a Shows the importance of each feature. It’s easy to find out that most hematological test features don’t have an important impact on GIST diagnosis except ALT. Similarly, we built the second model using the data only from enhanced CT, endoscopy and EUS (b). The latter three features also have little importance, and that’s why they were excluded. The first 6 predictors were determined to be the most important features and utilized in the final model development. The importance of the six predictors in the final XGBoost model are shown in c. The most important predictors of this model is the existence of liquid area inside the tumor under EUS, following by the ratio of long and short diameter under CT, the CT value of the tumor, the enhancement of the tumor in arterial period and venous period, existence calcified area inside the tumor under EUS. (The specific values and 95 CI of importance are shown in Additional file 2: Table S1.)
Validation performance of the model
| Metric | Performance (95 CI) |
|---|---|
| Accuracy | 0.73 (0.58–0.88) |
| Precision | 0.79 (0.60–0.95) |
| Recall | 0.87 (0.67–1.00) |
| f1-score | 0.82 (0.70–0.92) |
| auROC | 0.77 (0.57–0.90) |
| C-index | 0.76 (0.56–0.89) |
Accuracy: number of correct predictions/total number of all predictions
Precision: number of correct positive predictions/number of positive predictions
Recall: number of correct positive predictions/number of all positive individuals
F1-score: the harmonic mean of precision and recall
auROC: area under the receiver operating characteristic curve
C-index: concordance index
Fig. 2Partial importance of features in the final model. Round or round-like tumors (b) with a CT value of around 30 (25–37) (c) and delayed enhancement (d, e), as well as liquid (a) but not calcified (f) area inside the tumor best indicate the diagnosis of GIST
Fig. 3Prediction in individual scale. Inputting the data of two patients into this model yielded individualized results. The length of the square indicates the impact on the outcome. Green indicates “GIST”, while red indicates “Other SMT”. The final prediction result is compared with 0.684. A prediction value higher than 0.684 is predicted as “GIST”, while lower than 0.684 is diagnosed as “Other SMT”. The prediction result of patient 1 (a) in this model is “GIST”, and patient 2 (b) is “Other SMT”