| Literature DB >> 33708922 |
Lang Qian1,2, Zhikun Lv3,4, Kai Zhang1,2, Kun Wang3,4, Qian Zhu1,2, Shichong Zhou1,2, Cai Chang1,2, Jie Tian3,4,5.
Abstract
BACKGROUND: To develop an ultrasound-based deep learning model to predict postoperative upgrading of pure ductal carcinoma in situ (DCIS) diagnosed by core needle biopsy (CNB) before surgery.Entities:
Keywords: Artificial intelligence (AI); core needle biopsy (CNB); ductal carcinoma in situ (DCIS); prediction of upstaging
Year: 2021 PMID: 33708922 PMCID: PMC7944276 DOI: 10.21037/atm-20-3981
Source DB: PubMed Journal: Ann Transl Med ISSN: 2305-5839
Figure 1Complex and diverse ultrasound images of ductal carcinoma in situ. (A) Calcification is the main manifestation; (B) duct abnormalities are the main manifestation; (C) the mass is the main manifestation; (D) the structural disorder is the main manifestation.
Figure 2Schematic presentation of our proposal for classifying pure ductal carcinoma in situ (DCIS) and upgraded DCIS.
Baseline characteristics of the patients
| Pure DCIS (n=180) | Upstaged DCIS (n=180) | % | |
|---|---|---|---|
| Age | |||
| ≤50 | 66 | 93 | 44.16 |
| >50 | 114 | 87 | 56.83 |
| Tumor size on ultrasonography | |||
| ≤20 | 108 | 39 | 40.83 |
| >20 | 72 | 141 | 59.16 |
| Family history | |||
| No | 136 | 152 | 80.00 |
| Yes | 44 | 28 | 20.00 |
| Menopause | |||
| No | 94 | 114 | 57.77 |
| Yes | 86 | 66 | 42.22 |
DCIS, ductal carcinoma in situ.
Comparison of clinical features between the patients in the training set and validation set
| Training set (n=240) | Validation set (n=120) | Univariate P value | |
|---|---|---|---|
| Age | 0.178 | ||
| ≤50 years | 105 | 54 | |
| >50 years | 135 | 66 | |
| Tumor size on ultrasonography | 0.495 | ||
| ≤20 | 95 | 52 | |
| >20 | 145 | 68 | |
| Family history | 0.855 | ||
| No | 190 | 98 | |
| Yes | 50 | 22 | |
| Menopause | 0.097 | ||
| No | 146 | 62 | |
| Yes | 94 | 58 | |
Figure 3Receiver operating curve (ROC) for the four models. (A) ROC of the validation set; (B) ROC of the training set.
Diagnostic performance of the deep learning algorithms for the test dataset
| Algorithm | Validation | Sensitivity | Specificity | PPV | NPV | |
|---|---|---|---|---|---|---|
| AUROC | Accuracy | |||||
| ResNet-b0 | 0.804 | 0.742 | 0.767 | 0.716 | 0.730 | 0.753 |
| ResNet-b1 | 0.802 | 0.742 | 0.733 | 0.750 | 0.745 | 0.738 |
| ResNet-b2 | 0.737 | 0.675 | 0.667 | 0.683 | 0.678 | 0.672 |
| Vgg-change | 0.724 | 0.683 | 0.717 | 0.650 | 0.696 | 0.671 |
AUROC, area under the receiver operating characteristic curve; PPV, positive predictive value; NPV, negative predictive value.
Figure 4Comparison of robustness between the 3 deep learning models. (A) 3-fold cross-validation performance in the ResNet-b0 model; (B) 3-fold cross-validation performance in the ResNet-b1 model; (C) 3-fold cross-validation performance in the ResNet-b2 model.