| Literature DB >> 34868967 |
Yadi Zhu1, Ling Yang1, Hailin Shen2.
Abstract
PURPOSE: To explore the value of machine learning model based on CE-MRI radiomic features in preoperative prediction of sentinel lymph node (SLN) metastasis of breast cancer.Entities:
Keywords: CE-MRI; breast cancer; machine learning; radiomics; sentinel lymph node metastasis
Year: 2021 PMID: 34868967 PMCID: PMC8640128 DOI: 10.3389/fonc.2021.757111
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1Radiomics workflow.
Figure 2The final selected feature.
Clinical and histopathological characteristics.
| Characteristics | Training set ( | Validation set ( |
| |||
|---|---|---|---|---|---|---|
| Total ( | SLN+ ( | SLN- ( |
| |||
| Age, years (Mean ± SD) | 46.30 ± 10.80 | 45.07 | 47.33 | 0.250 | 46.54 ± 9.60 | 0.890 |
| Tumor size on MRI, median (IQR), cm | 2.20 (1.70–3.10) | 2.75 (1.92–3.77) | 1.90 (1.50–2.70) | <0.01 | 2.15 (1.60–2.92) | 0.488 |
|
| 0.006 | 0.321 | ||||
| BI-RADS 4 | 51 (41.5%) | 17 (30.3%) | 34 (50.7%) | 29 (53.7%) | ||
| BI-RADS 5 | 49 (39.8%) | 31 (55.4%) | 18 (26.9%) | 17 (31.5%) | ||
| BI-RADS 6 | 23 (18.7%) | 8 (14.3%) | 15 (22.4%) | 8 (14.8%) | ||
|
| 0. 180 | 0.259 | ||||
| Invasive ductal carcinoma | 114 (92.7%) | 54 (96.4%) | 60 (89.6%) | 47 (87.0%) | ||
| Others | 9 (7.3%) | 2 (3.6%) | 7 (10.4%) | 7 (13.0%) | ||
|
| 0.689 | 0.909 | ||||
| I | 7 (5.7%) | 4 (7.1%) | 3 (4.5%) | 4 (7.4%) | ||
| II | 77 (62.6%) | 33 (59.0%) | 44 (65.7%) | 33 (61.1%) | ||
| III | 39 (31.7%) | 19 (33.9%) | 20 (29.8%) | 17 (31.5%) | ||
|
| 0.418 | 0.536 | ||||
| Luminal A | 20 (16.3%) | 7 (12.5%) | 13 (19.4%) | 11 (20.4%) | ||
| Luminal B | 60 (48.8%) | 26 (46.43%) | 34 (50.8%) | 29 (53.7) | ||
| HER-2 positive | 18 (14.6%) | 11 (19.7%) | 7 (10.4%) | 4 (7.4%) | ||
| Triple negative | 25 (20.3%) | 12 (21.4%) | 13 (19.4%) | 10 (18.5%) | ||
SLN+, patients with SLN metastasis; SLN−, patients without SLN metastasis; SD, standard deviation; IQR, interquartile range; HER2, human epidermal growth factor receptor-2.
Data are numbers of patients, with percentages in parentheses.
P-value < 0.05 indicates a significant difference between SLN+ and SLN− group in the training set.
P*-value < 0.05 indicates a significant difference between training and validation sets.
Prediction performance in training and validation sets of five machine learning models.
| Machine learning algorithm | Training set | Validation set | ||||||
|---|---|---|---|---|---|---|---|---|
| ACC | SEN | SPE | AUC | ACC | SEN | SPE | AUC | |
| SVM | 0.83 | 0.71 | 0.94 | 0.91 | 0.78 | 0.60 | 0.90 | 0.86 |
| RF | 1.00 | 1.00 | 1.00 | 1.00 | 0.81 | 0.72 | 0.90 | 0.85 |
| LR | 0.84 | 0.77 | 0.90 | 0.92 | 0.78 | 0.68 | 0.86 | 0.84 |
| GBDT | 1.00 | 1.00 | 1.00 | 1.00 | 0.72 | 0.68 | 0.76 | 0.82 |
| DT | 1.00 | 1.00 | 1.00 | 1.00 | 0.74 | 0.68 | 0.79 | 0.74 |
SVM, support vector machine; RF, Random Forest; LR, logistic regression; GBDT, Gradient Boosting Decision Tree; DT, Decision Tree; ACC, accuracy; SEN, sensitivity; SPE, specificity; AUC, area under the curve.
Figure 3ROC curves of the SVM, RF, LR, GBDT, and DT classifiers in validation set.
Prediction performance of training and validation sets of the combined model (SVM algorithm).
| Group | ACC | SEN | SPE | AUC |
|---|---|---|---|---|
| Training set | 0.83 | 0.73 | 0.91 | 0.92 |
| Validation set | 0.80 | 0.68 | 0.90 | 0.88 |
SVM, support vector machine; ACC, accuracy; SEN, sensitivity; SPE, specificity; AUC, area under the curve.
Figure 4ROC curves of the combined model incorporating CE-MRI radiomics features, tumor size, and BI-RADS classification.