| Literature DB >> 32144248 |
Xueyi Zheng1, Zhao Yao2, Yini Huang1, Yanyan Yu3, Yun Wang1, Yubo Liu1, Rushuang Mao1, Fei Li1, Yang Xiao3, Yuanyuan Wang2,4, Yixin Hu1, Jinhua Yu5,6, Jianhua Zhou7.
Abstract
Accurate identification of axillary lymph node (ALN) involvement in patients with early-stage breast cancer is important for determining appropriate axillary treatment options and therefore avoiding unnecessary axillary surgery and complications. Here, we report deep learning radiomics (DLR) of conventional ultrasound and shear wave elastography of breast cancer for predicting ALN status preoperatively in patients with early-stage breast cancer. Clinical parameter combined DLR yields the best diagnostic performance in predicting ALN status between disease-free axilla and any axillary metastasis with areas under the receiver operating characteristic curve (AUC) of 0.902 (95% confidence interval [CI]: 0.843, 0.961) in the test cohort. This clinical parameter combined DLR can also discriminate between low and heavy metastatic burden of axillary disease with AUC of 0.905 (95% CI: 0.814, 0.996) in the test cohort. Our study offers a noninvasive imaging biomarker to predict the metastatic extent of ALN for patients with early-stage breast cancer.Entities:
Mesh:
Year: 2020 PMID: 32144248 PMCID: PMC7060275 DOI: 10.1038/s41467-020-15027-z
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Patient recruitment workflow.
In total, 584 out of 1342 patients were included according to the selection criteria. The included patients were examined by conventional US and SWE, and had complete clinical information needed for the study.
The performance comparison of different models.
| AUC | ACC (%) | SENS (%) | SPEC (%) | PPV (%) | NPV (%) | PARAM (Mb) | ||
|---|---|---|---|---|---|---|---|---|
| Resnet50 | T | 0.936 [0.910, 0.962] | 85.7 [82.0, 89.4] | 79.1 [71.6, 85.3] | 93.6 [89.2, 96.5] | 90.0 [83.5, 94.6] | 85.9 [80.6, 90.2] | 98 |
| V | 0.904 [0.847, 0.961] | 81.4 [73.9, 88.2] | 74.0 [59.7, 85.5] | 88.2 [78.1, 94.8] | 82.2 [67.9, 92.0] | 82.2 [71.5, 90.2] | ||
| I-T | 0.902 [0.843, 0.961] | 81.0 [73.4, 87.7] | 81.6 [68.0, 91.2] | 83.6 [72.5, 91.5] | 78.4 [64.7, 88.7] | 86.2 [75.2, 93.5] | ||
| Resnet50+C | T | 0.945 [0.922, 0.969] | 87.5 [84.0, 90.9] | 85.8 [80.1, 90.3] | 90.1 [85.1, 93.8] | 89.4 [84.1, 93.4] | 86.7 [81.3, 91.0] | 98 |
| V | 0.864 [0.796, 0.933] | 75.9 [68.1, 83.6] | 73.5 [58.9, 85.1] | 88.1 [77.8, 94.7] | 81.8 [67.3, 91.8] | 81.9 [71.0, 90.1] | ||
| I-T | 0.842 [0.767, 0.916] | 74.6 [66.7, 82.4] | 70.0 [55.4, 82.1] | 76.5 [64.6, 85.9] | 68.6 [54.1, 80.9] | 77.6 [65.8, 86.9] | ||
| Resnet101 | T | 0.901 [0.869, 0.934] | 81.9 [77.7, 85.8] | 72.1 [64.1, 79.2] | 92.5 [88.0, 95.8] | 87.6 [80.3, 92.9] | 81.9 [76.3, 86.7] | 172 |
| V | 0.847 [0.771, 0.923] | 77.1 [69.0, 84.4] | 78.0 [64.0, 88.5] | 82.4 [71.2, 90.5] | 76.5 [62.5, 87.2] | 83.6 [72.4, 91.6] | ||
| I-T | 0.836 [0.758, 0.914] | 82.2 [74.3, 88.4] | 72.0 [57.5, 83.8] | 89.7 [79.9, 95.8] | 83.7 [69.1, 93.3] | 81.3 [70.7, 89.4] | ||
| Inception V3 | T | 0.875 [0.841, 0.910] | 79.7 [75.2, 83.7] | 82.2 [76.2, 87.3] | 79.7 [73.5, 85.0] | 79.8 [73.6, 85.1] | 82.1 [76.0, 87.2] | 253 |
| V | 0.853 [0.783, 0.924] | 78.5 [71.0, 85.9] | 81.6 [68.1, 91.2] | 79.1 [67.4, 88.1] | 74.1 [60.3, 85.0] | 85.5 [74.1, 93.2] | ||
| I-T | 0.796 [0.713, 0.878] | 73.7 [64.9, 80.9] | 76.0 [61.8, 86.9] | 73.5 [61.4, 83.5] | 67.9 [53.9, 79.8] | 80.6 [68.6, 89.6] | ||
| VGG19 | T | 0.792 [0.744, 0.840] | 74.3 [69.7, 78.9] | 62.2 [53.8, 70.0] | 80.7 [74.6, 85.9] | 70.2 [61.6, 77.9] | 74.4 [68.1, 80.1] | 636 |
| V | 0.759 [0.667, 0.851] | 73.3 [65.2, 81.3] | 67.4 [52.5, 80.1] | 74.6 [62.5, 84.5] | 66.0 [51.1, 78.9] | 75.8 [63.6, 85.5] | ||
| I-T | 0.750 [0.656, 0.844] | 69.5 [61.2, 77.8] | 66.0 [51.2, 78.8] | 79.4 [67.9, 88.3] | 70.2 [55.1, 82.7] | 76.1 [64.4, 85.5] |
95% confidence intervals are included in brackets.
AUC area under the receiver operating characteristic curve, ACC accuracy, SENS sensitivity, SPEC specificity, PPV positive predict value, NPV negative predict value, PARAM model parameters amount, T training cohort (n = 350), V validation cohort (n = 116), I–T independent test cohort (n = 118).
Patient and tumor characteristics.
| Characteristics | All patients | Training | Test | |
|---|---|---|---|---|
| Number | 584 | 466 (80%) | 118 (20%) | – |
| Age, mean ± SD, years | 50.27 ± 10.32 | 50.46 ± 10.36 | 49.52 ± 10.20 | 0.372 |
| US size, mean ± SD, mm | 18.90 ± 6.48 | 19.06 ± 6.57 | 18.13 ± 6.10 | 0.164 |
| ER | 0.567 | |||
| Positive | 471 | 372 (79.8%) | 99 (83.9%) | – |
| Negative | 113 | 94 (20.2%) | 19 (16.1%) | – |
| PR | 0.531 | |||
| Positive | 429 | 338 (72.5%) | 91 (77.1%) | – |
| Negative | 155 | 128 (27.5%) | 27 (22.9%) | – |
| HER2 | 0.381 | |||
| Positive | 135 | 104 (22.3%) | 31 (26.3%) | – |
| Negative | 449 | 362 (77.7%) | 87 (73.7%) | – |
| Ki-67 | 0.380 | |||
| Positive | 491 | 396 (85.0%) | 95 (80.5%) | – |
| Negative | 93 | 70 (15.0%) | 23 (19.5%) | – |
| BI-RADS category | 0.158 | |||
| 4A category | 31 | 24 (5.1%) | 7 (5.9%) | – |
| 4B category | 164 | 128 (27.5%) | 36 (30.5%) | – |
| 4C category | 267 | 210 (45.1%) | 57 (48.3%) | – |
| 5 category | 122 | 104 (22.3%) | 18 (15.3%) | – |
| Tumor type | 0.742 | |||
| Invasive ductal carcinoma | 516 | 412 (88.4%) | 104 (88.1%) | – |
| Invasive lobular carcinoma | 18 | 12 (2.6%) | 6 (5.1%) | – |
| Other tumor types | 50 | 42 (9.0%) | 8 (6.8%) | – |
Qualitative variables are in n (%) and quantitative variables are in mean ± SD, when appropriate. Source data are provided as a Source Data file.
The prediction of ALN status results (N0 v.s. N+(≥1)).
| Methods | AUC | ACC (%) | SENS (%) | SPEC (%) | PPV (%) | NPV (%) | |
|---|---|---|---|---|---|---|---|
| Axillary US | 0.735a [0.694, 0.775] | 63.5 [58.6, 70.4] | 72.1 [76.1, 84.8] | 57.3 [52.3, 64.9] | 72.6 [67.9, 77.2] | 57.3 [50.1, 61.2] | |
| Classification by clinicopathologic data | T | 0.771 [0.719, 0.824] | 73.6 [68.8, 78.0] | 68.0 [59.8, 75.5] | 79.6 [73.4, 84.9] | 70.9 [62.7, 78.3] | 77.3 [71.0, 82.8] |
| V | 0.755 [0.665, 0.845] | 71.6 [63.3, 79.8] | 63.3 [48.3, 76.6] | 71.6 [59.3, 82.0] | 62.0 [47.2, 75.3] | 72.7 [60.3, 83.0] | |
| I–T | 0.727b [0.630, 0.825] | 70.9 [62.1, 78.6] | 62.0 [47.2, 75.3] | 69.1 [56.7, 79.8] | 59.6 [45.1, 73.0] | 71.2 [58.7, 81.7] | |
| DLR on images only | T | 0.850 [0.813, 0.887] | 76.7 [72.1, 81.0] | 71.6 [64.7, 77.8] | 80.2 [74.0, 85.5] | 77.9 [71.1, 83.7] | 74.3 [68.0, 80.0] |
| V | 0.804 [0.717, 0.891] | 72.4 [63.3, 79.8] | 69.4 [54.6, 81.7] | 79.1 [67.4, 88.1] | 70.8 [55.9, 83.0] | 77.9 [66.2, 87.1] | |
| I–T | 0.796c [0.708, 0.883] | 71.6 [63.0, 79.4] | 67.4 [52.5, 80.1] | 79.1 [67.4, 88.1] | 70.2 [55.1, 82.7] | 76.8 [65.1, 86.1] | |
| Clinical parameter combined DLR | T | 0.936 [0.910, 0.962] | 85.7 [81.7, 89.1] | 79.1 [71.6, 85.3] | 93.6 [89.2, 96.5] | 90.0 [83.5, 94.6] | 85.9 [80.6, 90.2] |
| V | 0.904 [0.847, 0.961] | 81.4 [73.9, 88.2] | 74.0 [59.7, 85.5] | 88.2 [78.1, 94.8] | 82.2 [67.9, 92.0] | 82.2 [71.5, 90.2] | |
| I–T | 0.902 [0.843, 0.961] | 81.0 [73.4, 87.7] | 81.6 [68.0, 91.2] | 83.6 [72.5, 91.5] | 78.4 [64.7, 88.7] | 86.2 [75.2, 93.5] |
95% confidence intervals are included in brackets. Source data are provided as a Source Data file.
AUC area under the receiver operating characteristic curve, ACC accuracy, SENS sensitivity, SPEC specificity, PPV positive predict value, NPV negative predict value, DLR deep learning radiomics, T training cohort (n = 350),V validation cohort (n = 116), I–T independent test cohort (n = 118).
aindicates P < 0.001, Hanley & McNeil in comparison with clinical parameter combined DLR in independent test cohort.
bindicates P = 0.002, Delong et al. in comparison with clinical parameter combined DLR in independent test cohort.
cindicates P = 0.004, Delong et al. in comparison with clinical parameter combined DLR in independent test cohort.
Fig. 2Comparison of receiver operating characteristic (ROC) curves between different models for predicting disease-free axilla (N0) and any axillary metastasis (N+(≥1)).
DLR deep learning radiomics. Numbers in parentheses are areas under the receiver operating characteristic curves. Source data are provided as a Source Data file.
The prediction of ALN status results (N+(1–2) v.s. N+(≥3)).
| Methods | AUC | ACC (%) | SENS (%) | SPEC (%) | PPV (%) | NPV (%) | |
|---|---|---|---|---|---|---|---|
| Classification by clinicopathologic data | T | 0.756 [0.674, 0.838] | 73.9 [66.6, 80.7] | 65.5 [51.9, 77.5] | 75.6 [65.4, 84.0] | 63.3 [49.9, 75.4] | 77.3 [67.1, 85.5] |
| V | 0.701 [0.567, 0.835] | 72.0 [58.8, 84.1] | 62.0 [36.1, 80.9] | 78.0 [61.4, 92.3] | 66.7 [40.2, 87.1] | 75.0 [56.6, 88.5] | |
| I–T | 0.686a [0.528, 0.844] | 71.4 [57.3, 82.7] | 68.4 [43.4, 87.4] | 73.3 [54.1, 87.7] | 61.9 [38.4, 81.9] | 78.6 [58.6, 91.9] | |
| DLR on images only | T | 0.874 [0.814, 0.934] | 79.7 [72.5, 85.6] | 84.5 [72.6, 92.7] | 80.0 [70.2, 87.7] | 73.1 [60.8, 83.3] | 88.9 [79.9, 94.8] |
| V | 0.80 [0.671, 0.929] | 73.5 [61.1, 85.8] | 84.2 [60.4, 96.6] | 73.3 [54.1, 87.7] | 66.7 [44.7, 84.4] | 88.0 [68.8, 97.5] | |
| I–T | 0.777b [0.644, 0.911] | 69.4 [55.0, 80.9] | 79.0 [54.4, 93.9] | 66.7 [47.2, 82.7] | 60.0 [38.7, 78.9] | 83.3 [62.1, 95.4] | |
| Clinical parameter combined DLR | T | 0.956 [0.926, 0.986] | 89.2 [84.2, 94.2] | 91.4 [81.0, 97.1] | 87.8 [79.2, 93.7] | 82.8 [73.1, 91.1] | 94.0 [86.6, 98.1] |
| V | 0.925 [0.850, 0.997] | 88.0 [78.6, 96.9] | 95.0 [75.1, 98.9] | 86.7 [69.1, 95.3] | 82.6 [60.6, 95.2] | 96.3 [81.0, 98.0] | |
| I–T | 0.905 [0.814, 0.996] | 80.0 [68.9, 91.1] | 85.0 [62.1, 96.8] | 86.7 [69.3, 96.2] | 81.0 [57.4, 94.8] | 89.7 [72.6, 97.8] |
95% confidence intervals are included in brackets. Source data are provided as a Source Data file.
AUC area under the receiver operating characteristic curve, ACC accuracy, SENS sensitivity, SPEC specificity, PPV positive predict value, NPV negative predict value, DLR deep learning radiomics, T training cohort (n = 148), V validation cohort (n = 49), I–T independent test cohort (n = 50).
aIndicates P = 0.03, Delong et al. in comparison with clinical parameter combined DLR in independent test cohort.
bIndicates P = 0.04, Delong et al. in comparison with clinical parameter combined DLR in independent test cohort.
Source data are provided as a Source Data file.
Fig. 3Receiver operating characteristic (ROC) curves comparison between different models for predicting low metastatic burden of axillary disease (N+(1–2)) and heavy metastatic burden of axillary disease (N+(≥3)).
DLR deep learning radiomics. Numbers in parentheses are areas under the receiver operating characteristic curves. Source data are provided as a Source Data file.
Fig. 4The confusion matrix of predicting metastasis among disease-free axilla (N0), low metastatic burden of axillary disease (N+(1–2)) and heavy metastatic burden of axillary disease (N+(≥3)).
Source data are provided as a Source Data file.
Fig. 5Visualization of two patient examples.
Each example shows the gray-scale US image and corresponding heart map, and the red region represents a larger weight, which can be decoded by the color bar on the right. Image a shows that the low echo area inside the tumor is valuable for predicting ALN status, while it is the tumor boundary for image b.
Fig. 6The overall pipeline of the model.
The parallel pre-trained ResNet model encodes the input images to features which be combined with clinical parameters. Then the combined features be classified by an SVM model.