| Literature DB >> 36091102 |
Yan Shi1, Ying Zou2,3, Jihua Liu2,3, Yuanyuan Wang1, Yingbin Chen4, Fang Sun1, Zhi Yang1, Guanghe Cui1, Xijun Zhu1, Xu Cui1, Feifei Liu1,5.
Abstract
Objectives: A radiomics-based explainable eXtreme Gradient Boosting (XGBoost) model was developed to predict central cervical lymph node metastasis (CCLNM) in patients with papillary thyroid carcinoma (PTC), including positive and negative effects.Entities:
Keywords: lymphatic metastasis; machine learning; papillary thyroid carcinoma; radiomics; ultrasound
Year: 2022 PMID: 36091102 PMCID: PMC9458917 DOI: 10.3389/fonc.2022.897596
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 5.738
Figure 1Flowchart of patient selection and group allocation for the study.
Figure 2Flowchart of the study.
Demographics and ultrasound image characteristics in the training and test cohorts.
| Training cohort (N = 469) | Test cohort (N = 118) | |||||
|---|---|---|---|---|---|---|
| CCLNM (-) | CCLNM (+) |
| CCLNM (-) | CCLNM (+) |
| |
| (n = 348) | (n = 121) | (n = 86) | (n = 32) | |||
| Age | 48.4 ± 15.7 | 43.4 ± 16.0 | 0.003 | 51.0 ± 14.0 | 44.2 ± 13.9 | 0.021 |
| Sex | 0.010 | 0.127 | ||||
| Female | 229 (65.8) | 63 (52.1) | 58 (67.4) | 16 (50.0) | ||
| Male | 119 (34.2) | 58 (47.9) | 28 (32.6) | 16 (50.0) | ||
| Diameter (cm) | 1.18 ± 0.73 | 2.06 ± 1.41 | <0.001 | 1.03 ± 0.55 | 2.30 ± 1.14 | <0.001 |
| Location | 0.396 | 0.910 | ||||
| Right lobe | 31 (8.91) | 16 (13.2) | 8 (9.30) | 4 (12.5) | ||
| Left lobe | 157 (45.1) | 52 (43.0) | 39 (45.3) | 14 (43.8) | ||
| Isthmus | 160 (46.0) | 53 (43.8) | 39 (45.3) | 14 (43.8) | ||
| Composition | 0.543 | 1.000 | ||||
| Mixed | 56 (16.1) | 16 (13.2) | 13 (15.1) | 4 (12.5) | ||
| Solid | 292 (83.9) | 105 (86.8) | 73 (84.9) | 28 (87.5) | ||
| Echogenicity | 0.449 | 0.929 | ||||
| Hyper/Isoechoic | 21 (6.03) | 4 (3.31) | 3 (3.49) | 1 (3.12) | ||
| Hypoechoic | 188 (54.0) | 64 (52.9) | 48 (55.8) | 17 (53.1) | ||
| Very hypoechoic | 139 (39.9) | 53 (43.8) | 35 (40.7) | 14 (43.8) | ||
| Shape | 0.744 | 0.929 | ||||
| Wider-than-tall | 136 (39.1) | 50 (41.3) | 35 (40.7) | 14 (43.8) | ||
| Taller-than-wide | 212 (60.9) | 71 (58.7) | 51 (59.3) | 18 (56.2) | ||
| Margin | 1.000 | 1.000 | ||||
| Smooth | 122 (35.1) | 42 (34.7) | 34 (39.5) | 12 (37.5) | ||
| Ill-defined | 212 (60.9) | 75 (62.0) | 49 (57.0) | 19 (59.4) | ||
| Lobulated/irregular | 14 (4.02) | 4 (3.31) | 3 (3.49) | 1 (3.12) | ||
| Calcification | <0.001 | 0.001 | ||||
| None | 125 (35.9) | 38 (31.4) | 24 (27.9) | 7 (21.9) | ||
| Macrocalcification | 86 (24.7) | 60 (49.6) | 18 (20.9) | 19 (59.4) | ||
| Rim calcification | 107 (30.7) | 19 (15.7) | 33 (38.4) | 5 (15.6) | ||
| Microcalcification | 30 (8.62) | 4 (3.31) | 11 (12.8) | 1 (3.12) | ||
| Vascularization | 0.261 | 0.666 | ||||
| No | 175 (50.3) | 53 (43.8) | 46 (53.5) | 15 (46.9) | ||
| Yes | 173 (49.7) | 68 (56.2) | 40 (46.5) | 17 (53.1) | ||
| Capsular invasion | <0.001 | <0.001 | ||||
| No | 332 (95.4) | 57 (47.1) | 81 (94.2) | 11 (34.4) | ||
| Yes | 16 (4.60) | 64 (52.9) | 5 (5.81) | 21 (65.6) | ||
| Radiomics score | 0.22 ± 0.09 | 0.36 ± 0.15 | <0.001 | 0.23 ± 0.09 | 0.35 ± 0.14 | <0.001 |
CCLNM, central cervical lymph node metastasis.
Data are shown as n (%) or mean ± standard deviation.
Figure 3The Boruta algorithm incorporating the independent clinical, ultrasound variables and radiomics score was performed to select the final predictors for CCLNM. The key features included capsular invasion, radiomics score, diameter, age, and calcification.
Performance of the XGBoost model.
| Training dataset (N = 469) | Test dataset (N = 118) | |
|---|---|---|
| BA | 84.89% | 85.21% |
|
| 73.36% | 76.06% |
| MCC | 63.68% | 66.56% |
| Precision | 63.10% | 69.23% |
| Recall | 87.60% | 84.38% |
| AUC | 91.53% | 90.88% |
| RMSE | 0.4052 | 0.3796 |
| R2 | 0.1424 | 0.2711 |
XGBoost, eXtreme Gradient Boosting; BA, balanced accuracy; MCC, Matthew’s correlation coefficient; AUC, area under the curve; RMSE, root mean square error; R2, coefficient of determination.
Figure 4ROC, learning, and DET curves of the XBoost model. (A) The ROC curves of the XGBoost model in the training and test cohorts for predicting CCLNM in PTC patients, with an AUC of 0.9153 and 0.9088, respectively. (B) The learning curve in the training cohort of the XGBoost model. The two curves finally merge near 0.85, indicating that the model is well fitted for training. (C, D) The DET curves of the XGBoost model in the training and test cohorts. They were both concentrated in the third quadrant, indicating that the false rejection rate and false acceptance rate were both low.
Figure 5SHAP plots of the XGBoost model. (A) The classified bar charts of the SHAP summary plots show the influence of each parameter on the XGBoost model. (B) The SHAP summary plot’s scatter plot shows the relationship between the characteristic value and the predicted probability through colors, including positive and negative predictive effects. (C) SHAP decision plot for all patients with PTC; (D) SHAP decision plot for 10 random patients with PTC, with one misjudgment case (dotted line).
Figure 6Two examples of correct prediction of CCLNM+ and CCLNM-. (A–D) A 28-year-old male patient was admitted to our hospital for further treatment after physical examination found a nodule in the right lobe of the thyroid. Ultrasound examination showed a nodule in the middle of the right lobe of the thyroid, 0.69 cm in diameter, with coarse calcification and capsular invasion. Postoperative pathological findings: papillary carcinoma of the right lobe of thyroid, with lymph node metastasis in the right central region. Retrospective analysis of the case showed that the radiomics score was 0.657 and the XGBoost model predicted CCLNM+ correctly. (E–H) A 44-year-old male patient was admitted to our hospital because or volume increase of the right thyroid lobe. Ultrasound examination showed a nodule in the middle of the right lobe of thyroid, 2.74 cm in diameter, with coarse calcification and without capsular invasion. Postoperative pathological findings: papillary carcinoma of the right lobe of thyroid, with no lymph node metastasis in the right central region. Retrospective analysis of the case showed that the 4adiomics score was 0.128 and the XGBoost model predicted CCLNM- correctly.