| Literature DB >> 34552163 |
Sunghoon Joo1,2, Eun Sook Ko3, Soonhwan Kwon1, Eunjoo Jeon1, Hyungsik Jung1, Ji-Yeon Kim4, Myung Jin Chung, Young-Hyuck Im5,6.
Abstract
The achievement of the pathologic complete response (pCR) has been considered a metric for the success of neoadjuvant chemotherapy (NAC) and a powerful surrogate indicator of the risk of recurrence and long-term survival. This study aimed to develop a multimodal deep learning model that combined clinical information and pretreatment MR images for predicting pCR to NAC in patients with breast cancer. The retrospective study cohort consisted of 536 patients with invasive breast cancer who underwent pre-operative NAC. We developed a deep learning model to fuse high-dimensional MR image features and the clinical information for the pretreatment prediction of pCR to NAC in breast cancer. The proposed deep learning model trained on all datasets as clinical information, T1-weighted subtraction images, and T2-weighted images shows better performance with area under the curve (AUC) of 0.888 as compared to the model using only clinical information (AUC = 0.827, P < 0.05). Our results demonstrate that the multimodal fusion approach using deep learning with both clinical information and MR images achieve higher prediction performance compared to the deep learning model without the fusion approach. Deep learning could integrate pretreatment MR images with clinical information to improve pCR prediction performance.Entities:
Mesh:
Year: 2021 PMID: 34552163 PMCID: PMC8458289 DOI: 10.1038/s41598-021-98408-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
General characteristics of the study population.
| Total | pCR | Non-pCR | ||
|---|---|---|---|---|
| Mean (± standard deviation) | 45.22 (± 9.91) | 46.13 (± 10.13) | 44.92 (± 9.82) | 0.222 |
| 0.024 | ||||
| Mean (± standard deviation) | 17.52 (± 23.59) | 13.53 (± 15.41) | 18.84 (± 25.61) | |
| Positive | 266 (49.63) | 51 (19.17) | 215 (80.83) | 0.002 |
| Negative | 270 (50.37) | 82 (30.37) | 188 (69.63) | |
| Positive | 209 (38.99) | 31 (14.83) | 178 (85.17) | < 0.001 |
| Negative | 327 (61.01) | 102 (31.19) | 225 (68.81) | |
| Positive | 177 (33.02) | 65 (36.72) | 112 (63.28) | < 0.001 |
| Negative | 359 (66.98) | 68 (18.94) | 291 (81.06) | |
| IDC | 481 (89.74) | 127 (26.40) | 354 (73.60) | 0.012 |
| Others | 55 (10.26) | 6 (10.91) | 49 (89.09) | |
| 1 + | 102 (19.03) | 14 (13.73) | 88 (86.27) | 0.002 |
| 2 + | 179 (33.40) | 44 (24.58) | 135 (75.42) | |
| 3 + | 128 (23.88) | 30 (23.44) | 98 (76.56) | |
| 4 + | 127 (23.69) | 45 (35.43) | 82 (64.57) | |
| cT1 | 26 (4.85) | 12 (46.15) | 14 (53.85) | < 0.001 |
| cT2 | 288 (53.73) | 86 (29.86) | 202 (70.14) | |
| cT3 | 178 (33.21) | 25 (14.04) | 153 (85.96) | |
| cT4 | 44 (8.21) | 10 (22.73) | 34 (77.27) | |
| cN0 | 42 (7.84) | 12 (28.57) | 30 (71.43) | 0.032 |
| cN1 | 108 (20.15) | 38 (35.19) | 70 (64.81) | |
| cN2 | 230 (42.91) | 50 (21.74) | 180 (78.26) | |
| cN3 | 156 (29.10) | 33 (21.15) | 123 (78.85) | |
| AC-T | 367 (68.47) | 75 (20.44) | 292 (79.56) | < 0.001 |
| ACTH | 131 (24.44) | 51 (38.93) | 80 (61.07) | |
| AC-T & Platinum | 15 (2.80) | 4 (26.67) | 11 (73.33) | |
| AC | 23 (4.29) | 3 (13.04) | 20 (86.96) | |
*For ER and PR, the Allred scores (0–8) are used in the actual training and validation procedures. For convenience, they are dichotomized in this table.
Performance of seven deep learning models in a validation dataset in predicting pCR.
| Input modality | AUC (S.E.) | Accuracy | Sensitivity | Specificity | PPV | NPV |
|---|---|---|---|---|---|---|
| Clinical information | 0.827 (0.027) | 0.785 | 0.848 | 0.757 | 0.609 | 0.918 |
| T1W subtraction image | 0.725 (0.031) | 0.718 | 0.314 | 0.907 | 0.601 | 0.748 |
| T2W image | 0.663 (0.033) | 0.709 | 0.457 | 0.824 | 0.537 | 0.773 |
| Cropped T1W subtraction image for lesion (56 × 56 × 12) | 0.624# (0.033) | 0.700 | 0.429 | 0.813 | 0.506 | 0.762 |
| T1W subtraction + T2W images | 0.745* (0.031) | 0.736 | 0.486 | 0.853 | 0.596 | 0.788 |
| T1W subtraction image + clinical information | 0.848 (0.025) | 0.822 | 0.485 | 0.973 | 0.889 | 0.809 |
| T1W subtraction image + T2W image + clinical information | 0.888* (0.022) | 0.850 | 0.667 | 0.932 | 0.814 | 0.863 |
* Statistically significant difference (p < 0.05) compared to Clinical information. # Statistically significant difference (P < 0.05) compared to T1W subtraction image.
Figure 1Receiver operating characteristic curves showing the AUC values of different deep learning models in the validation set. (A) ROC curves for the prediction of pretreatment pCR based on different deep learning models trained on clinical information and MR images or only clinical information pCR classifiers. T1 + T2 + C: subtracted-T1W images, T2W images, and clinical information. T1 + C: T1W subtraction images and clinical information. C: clinical information. (B) ROC curves for prediction of pretreatment pCR based on different deep learning models trained on the dataset in the combinations of MR images. T1 + T2: T1W subtraction images and T2W images. T1: T1W subtraction images. T2: T2W images. T1 (lesion): cropped image of the lesion in T1W subtraction images.
Figure 2Deep learning architectures for the multimodal pCR prediction model. (A) The feature extractors for contrast-enhanced T1W subtraction MR images and T2W MR images were used in two 3D ResNet-50. The MR images for the input were subjected to isotropic transformation and cropped to a 3D form of 224 × 224 × 64. (B) FC layer was used for clinical information inputs. The outputs of each 3D ResNet-50 and FC layer for clinical information were concatenated. The final FC layer with sigmoid activation function was used in the prediction of pCR.