| Literature DB >> 34722313 |
Feng Xu1, Chuang Zhu2, Wenqi Tang2, Ying Wang3, Yu Zhang2, Jie Li1, Hongchuan Jiang1, Zhongyue Shi3, Jun Liu2, Mulan Jin3.
Abstract
OBJECTIVES: To develop and validate a deep learning (DL)-based primary tumor biopsy signature for predicting axillary lymph node (ALN) metastasis preoperatively in early breast cancer (EBC) patients with clinically negative ALN.Entities:
Keywords: axillary lymph node metastasis; breast cancer; core-needle biopsy; deep learning; whole-slide images
Year: 2021 PMID: 34722313 PMCID: PMC8551965 DOI: 10.3389/fonc.2021.759007
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1Patient recruitment workflow.
Figure 2The overall pipeline of the deep learning core-needle biopsy incorporating the clinical data (DL-CNB+C) model to predict axillary lymph node (ALN) status between N0 and N(+). (A) Multiple training bags were built based on clinical data and the cropped patches from the selected tumor regions of each core-needle biopsy (CNB) whole-slide image (WSI). (B) DL-CNB+C model training process included two phases of feature extraction and multiple-instance learning (MIL), and finally the weighted features fused with clinical features were used to predict classification probabilities and calculate the cross-entropy loss. (C) The predicted probabilities of each bag from a raw CNB WSI were merged to guide the final ALN status classification between N0 and N(+).
Figure 3Overview on interpretability methods of deep learning core-needle biopsy (DL-CNB) model based on nucleus morphometric features. (A) The selected tumor regions of each whole-slide image (WSI) was cropped into patches. (B) For each patch, we processed nucleus segmentation (a weakly supervised segmentation framework was applied to obtain the nucleus), defined multiple nucleus morphometric features (such as major axis, minor axis, area, orientation, circumference, density, circularity, and rectangularity, which are denoted as f 1, f 2, f 3, …, f n), and extracted n feature parameters correspondingly. (C) All n kinds of feature parameters from a WSI were quantized into n distribution histograms and saved to n feature matrices (m 1, m 2, m 3, …, m n). (D) The matrices from a WSI were considered as instances of a bag and served as the input of DL-CNB model; the re-trained DL-CNB model could generate scores of features (instances) in the bag, which represented the weight of each feature in pathological diagnosis.
Patient and tumor characteristics.
| Characteristics | All patients | Training | Test |
| |
|---|---|---|---|---|---|
|
| 1,058 | 840 (80%) | 218 (20%) | ||
|
| 57.58 ± 12.523 | 57.80 ± 12.481 | 56.72 ± 12.674 | 0.344 | |
|
| 2.234 ± 0.8623 | 2.228 ± 0.8516 | 2.256 ± 0.9040 | 0.898 | |
|
| 1.20 ± 2.081 | 1.20 ± 2.095 | 1.20 ± 2.033 | 0.847 | |
|
| 0.812 | ||||
| Invasive ductal carcinoma | 957 | 760 (90.5%) | 197 (90.4%) | ||
| Invasive lobular carcinoma | 101 | 80 (9.5%) | 21 (9.6%) | ||
|
| 0.327 | ||||
| T1 | 556 | 435 (51.8%) | 121 (55.5%) | ||
| T2 | 502 | 405 (48.2%) | 97 (44.5%) | ||
|
| 0.333 | ||||
| Positive | 831 | 665 (79.2%) | 166 (76.1%) | ||
| Negative | 227 | 175 (20.8%) | 52 (23.9%) | ||
|
| 0.312 | ||||
| Positive | 790 | 633 (75.4%) | 157 (72.0%) | ||
| Negative | 268 | 207 (24.6%) | 61 (28.0%) | ||
|
| 0.613 | ||||
| Positive | 277 | 217 (25.8%) | 60 (27.5%) | ||
| Negative | 781 | 623 (74.2%) | 158 (72.5%) | ||
|
| 0.880 | ||||
| Yes | 403 | 521 (62.0%) | 134 (61.5%) | ||
| No | 655 | 319 (38.0%) | 84 (38.5%) |
Qualitative variables are in n (%), and quantitative variables are in mean ± SD, when appropriate.
SD, standard deviation; ER, estrogen receptor; PR, progesterone receptor; HER-2, human epidermal growth factor receptor-2; LNM, lymph node metastasis.
The performance in prediction of ALN status (N0 vs. N(+)).
| Methods | AUC | ACC (%) | SENS (%) | SPEC (%) | PPV (%) | NPV (%) | |
|---|---|---|---|---|---|---|---|
| Clinical data only | T | 0.661 [0.622, 0.698] | 64.13 [60.24, 67.88] | 64.58 [58.17, 70.63] | 63.85 [58.86, 68.62] | 52.36 [48.32, 56.38] | 74.55 [70.85, 77.92] |
| V | 0.709 [0.643, 0.770] | 67.62 [60.84, 73.90] | 65.82 [54.29, 76.13] | 68.70 [60.02, 76.52] | 55.91 [48.46, 63.11] | 76.92 [70.62, 82.22] | |
| I−T | 0.613a,b [0.545, 0.678] | 61.93 [55.12, 68.40] | 50.00 [38.89, 61.11] | 69.40 [60.86, 77.07] | 50.60 [42.34, 58.83] | 68.89 [63.49, 73.82] | |
| DL-CNB model | T | 0.901 [0.875, 0.923] | 80.32 [76.99, 83.35] | 94.17 [90.41, 96.77] | 71.79 [67.05, 76.21] | 67.26 [63.61, 70.71] | 95.24 [92.30, 97.09] |
| V | 0.808 [0.748, 0.859] | 72.86 [66.31, 78.75] | 77.22 [66.40, 85.90] | 70.23 [61.62, 77.90] | 61.00 [53.95, 67.62] | 83.64 [77.04, 88.62] | |
| I−T | 0.816 | 74.77 [68.46, 80.39] | 80.95 [70.92, 88.70] | 70.90 [62.43, 78.42] | 63.55 [56.76, 69.84] | 85.59 [79.04, 90.34] | |
| DL-CNB+C model | T | 0.878 [0.622, 0.698] | 76.51 [73.00, 79.77] | 93.33 [89.40, 96.14] | 66.15 [61.22, 70.84] | 62.92 [59.53, 66.19] | 94.16 [90.90, 96.30] |
| V | 0.823 [0.765, 0.872] | 75.71 [69.34, 81.35] | 74.68 [63.64, 83.80] | 76.34 [68.12, 83.32] | 65.56 [57.69, 72.65] | 83.33 [77.19, 88.08] | |
| I−T | 0.831 [0.775, 0.878] | 75.69 [69.44, 81.23] | 89.29 [80.63, 94.98] | 67.16 [58.53, 75.03] | 63.03 [56.96, 68.71] | 90.91 [84.21, 94.94] |
95% confidence intervals are included in brackets.
AUC, area under the receiver operating characteristic curve; ACC, accuracy; SENS, sensitivity; SPEC, specificity; PPV, positive predictive value; NPV, negative predictive value; T, training cohort (n = 630); V, validation cohort (n = 210); I–T, independent test cohort (n = 218); ALN, axillary lymph node; DL-CNB+C, deep learning core-needle biopsy incorporating the clinical data.
Indicates p < 0.0001, Delong et al. in comparison with DL-CNB model in independent test cohort.
Indicates p < 0.0001, Delong et al. in comparison with DL-CNB+C model in independent test cohort.
Indicates p = 0.4532, Delong et al. in comparison with DL-CNB+C model in independent test cohort.
Figure 4Comparison of receiver operating characteristic (ROC) curves between different models for predicting disease-free axilla (N0) and heavy metastatic burden of axillary disease (N(+)). Numbers in parentheses are areas under the receiver operating characteristic curve (AUCs).
Figure 5The confusion matrix of predicting axillary lymph node (ALN) status between disease-free axilla (N0), low metastatic burden of axillary disease (N+(1 − 2)), and heavy metastatic burden of axillary disease (N+(≥3)).
Figure 6The interpretability of the deep learning core-needle biopsy (DL-CNB) model of two patients. (A, B) The heat maps and nuclear segmentation from core-needle biopsy (CNB) whole-slide images (WSIs) of the N0 and the N(+) separately, and the red regions show greater contribution to the final classification. (C) The statistical analysis of three nuclear characteristics most relevant to diagnosis of all patients.