| Literature DB >> 29426948 |
Eun-Kyung Kim1, Hyo-Eun Kim2, Kyunghwa Han3, Bong Joo Kang4, Yu-Mee Sohn5, Ok Hee Woo6, Chan Wha Lee7.
Abstract
We assessed the feasibility of a data-driven imaging biomarker based on weakly supervised learning (DIB; an imaging biomarker derived from large-scale medical image data with deep learning technology) in mammography (DIB-MG). A total of 29,107 digital mammograms from five institutions (4,339 cancer cases and 24,768 normal cases) were included. After matching patients' age, breast density, and equipment, 1,238 and 1,238 cases were chosen as validation and test sets, respectively, and the remainder were used for training. The core algorithm of DIB-MG is a deep convolutional neural network; a deep learning algorithm specialized for images. Each sample (case) is an exam composed of 4-view images (RCC, RMLO, LCC, and LMLO). For each case in a training set, the cancer probability inferred from DIB-MG is compared with the per-case ground-truth label. Then the model parameters in DIB-MG are updated based on the error between the prediction and the ground-truth. At the operating point (threshold) of 0.5, sensitivity was 75.6% and 76.1% when specificity was 90.2% and 88.5%, and AUC was 0.903 and 0.906 for the validation and test sets, respectively. This research showed the potential of DIB-MG as a screening tool for breast cancer.Entities:
Mesh:
Year: 2018 PMID: 29426948 PMCID: PMC5807343 DOI: 10.1038/s41598-018-21215-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Demographics in cancer cases.
| Train (n = 3101) | Validation (n = 619) | Test (n = 619) | P value | |
|---|---|---|---|---|
| Density | 0.7843 | |||
| almost entire fat | 196 (6.32) | 32 (5.2) | 31 (5.0) | |
| scattered fibroglandular densities | 640 (20.6) | 136 (22.0) | 137 (22.1) | |
| heterogeneous dense | 1555 (50.2) | 312 (50.4) | 312 (50.4) | |
| extremely dense | 710 (22.9) | 139 (22.4) | 139 (22.5) | |
| Age | 0.9941 | |||
| ≥50 | 1759 (56.7) | 350 (56.5) | 350 (56.5) | |
| <50 | 1342 (43.3) | 269 (43.5) | 269 (43.5) | |
| Manufacturer | 0.9351 | |||
| GE | 1226 (39.5) | 238 (38.5) | 251 (40.6) | |
| Hologic | 1032 (33.2) | 200 (32.3) | 198 (32.0) | |
| Siemens | 843 (27.2) | 181 (29.2) | 170 (27.4) | |
| Feature | ||||
| mass | 1688 (54.4) | 339 (54.8) | 339 (54.8) | 0.9806 |
| non mass | 1413 (45.6) | 280 (45.2) | 280 (45.2) | |
| calcifications | 1402 (45.2) | 280 (45.2) | 280 (45.2) | 0.9999 |
| non calcifications | 1699 (54.8) | 339 (54.8) | 339 (54.8) | |
| Type | 0.2767 | |||
| Invasive | 2673 (86.2) | 542 (87.56) | 547 (88.37) | |
| Noninvasive | 428 (13.8) | 77 (12.44) | 72 (11.63) | |
| Size (invasive) | 0.8409 | |||
| Size ≥20 | 1216 (45.5) | 254 (46.9) | 251 (45.9) | |
| Size <20 | 1457 (54.5) | 288 (53.1) | 296 (54.1) |
Demographics in normal cases.
| Train (n = 23530) | Validation (n = 619) | Test (n = 619) | P value | |
|---|---|---|---|---|
| Density | 0.898 | |||
| almost entire fat | 837 (3.6) | 27 (4.4) | 18 (2.9) | |
| scattered fibroglandular densities | 4206 (17.9) | 106 (17.1) | 115 (18.6) | |
| heterogeneous dense | 16434 (69.8) | 432 (69.8) | 432 (69.8) | |
| extremely dense | 2053 (8.7) | 54 (8.7) | 54 (8.7) | |
| Age | 0.997 | |||
| ≥50 | 14533 (61.8) | 383 (61.9) | 383 (61.9) | |
| <50 | 8997 (38.2) | 236 (38.1) | 236 (38.1) | |
| Manufacturer | 0.4872 | |||
| GE | 11526 (49.0) | 284 (45.9) | 315 (50.9) | |
| Hologic | 10191 (43.3) | 282 (45.6) | 257 (41.5) | |
| Siemens | 1813 (7.7) | 53 (8.6) | 47 (7.6) |
Figure 1Overall architecture – 19 convolutions followed by a global-average-pooling (GPavg).
Figure 2Hierarchical feature abstraction, DIB map generation, and cancer probability generation.
Figure 3DIB example with ground-truth lesion. A 44-year-old woman with invasive ductal carcinoma of the right breast. A 22 mm-sized mass was correctly highlighted by DIB. The confidence score for cancer of DIB was 1.0 and 0.026 for the right and left breast.
Diagnostic Performances according to age and manufacturer.
| Sensitivity (%) | Specificity (%) | Accuracy (%) | AUC | |
|---|---|---|---|---|
| Validation Set | 75.6 (468/619) | 90.2 (558/619) | 82.9 (1026/1238) | 0.903 |
| Age | ||||
| ≥50 | 77.1 (270/350) | 91.9 (352/383) | 84.9 (622/733) | 0.914 |
| <50 | 73.6 (198/269) | 87.3 (206/236) | 80.0 (404/505) | 0.882 |
| p value* | 0.310 | 0.061 | 0.026 | 0.080 |
| Manufacturer | ||||
| GE | 77.5 (186/240) | 91.1 (257/282) | 84.9 (443/522) | 0.924 |
| Hologic | 63.6 (126/198) | 93.3 (265/284) | 81.1 (391/482) | 0.863 |
| Siemens | 86.2 (156/181) | 67.9 (36/53) | 82.1 (192/234) | 0.861 |
| p value* | 0.2704 | |||
| Test Set | 76.1 (471/619) | 88.5 (548/619) | 82.3 (1019/1238) | 0.906 |
| Age | ||||
| ≥50 | 76.3 (267/350) | 90.1 (345/383) | 83.5 (612/733) | 0.911 |
| <50 | 75.83 (204/269) | 86.01 (203/236) | 80.6 (407/505) | 0.897 |
| p value* | 0.897 | 0.124 | 0.189 | 0.395 |
| Manufacturer | ||||
| GE | 74.6 (188/252) | 89.1 (229/257) | 81.9 (417/509) | 0.910 |
| Hologic | 67.0 (132/197) | 92.1 (290/315) | 82.4 (422/512) | 0.880 |
| Siemens | 88.8 (151/170) | 61.7 (29/47) | 83.0 (180/217) | 0.888 |
| p value* | 0.943 | |||
*chi-square test.
Diagnostic Performances according to breast density.
| Sensitivity (%) | Specificity (%) | Accuracy (%) | AUC | |
|---|---|---|---|---|
| Validation Set | 75.6 (468/619) | 90.2 (558/619) | 82.9 (1026/1238) | 0.903 |
| Parenchymal density | ||||
| A (n = 59) | 81.3 (26/32) | 100 (27/27) | 89.8 (53/59) | 0.946 |
| B (n = 242) | 80.9 (110/136) | 96.2 (102/106) | 87.6 (212/242) | 0.950 |
| C (n = 744) | 75.0 (234/312) | 90.3 (390/432) | 83.9 (624/744) | 0.900 |
| D (n = 193) | 70.5 (98/139) | 72.2 (39/54) | 71.0 (137/193) | 0.790 |
| p value* | 0.201 | |||
| Test Set | 76.1 (471/619) | 88.5 (548/619) | 82.3 (1019/1238) | 0.906 |
| Parenchymal density | ||||
| A (n = 49) | 90.3 (28/31) | 100 (18/18) | 93.88 (46/49) | 0.960 |
| B (n = 252) | 75.9 (104/137) | 92.17 (106/115) | 83.33 (210/252) | 0.935 |
| C (n = 744) | 75.6 (236/312) | 88.43 (382/432) | 83.06 (618/744) | 0.899 |
| D (n = 193) | 74.1 (103/139) | 77.78 (42/54) | 75.13 (145/193) | 0.851 |
| p value* | 0.301 | |||
*Chi-square test.
Diagnostic Performances according to malignant characteristics.
| Sensitivity | ||
|---|---|---|
| Cancer cases | Validation set (n = 619) | Test set (n = 619) |
| Feature | ||
| mass | 84.1 (285/339) | 86.1 (292/339) |
| calcification | 77.5 (217/280) | 77.9 (218/280) |
| p value* | 0.0385 | 0.0076 |
| Type | ||
| Invasive | 77.9 (422/542) | 79.0 (432/547) |
| Noninvasive | 59.7 (46/77) | 54.2 (39/72) |
| p value** | 0.0005 | |
| Size (invasive) | ||
| ≥20 | 88.6 (225/254) | 88.5 (222/251) |
| <20 | 68.4 (197/288) | 71.0 (210/296) |
| p value** | ||
*Logistic regression using GEE, **chi-square test.