| Literature DB >> 34635726 |
Ziba Gandomkar1, Somphone Siviengphanom2, Ernest U Ekpo2, Mo'ayyad Suleiman2, Seyedamir Tavakoli Taba2, Tong Li2, Dong Xu3, Karla K Evans4, Sarah J Lewis2, Jeremy M Wolfe5,6, Patrick C Brennan2.
Abstract
The information captured by the gist signal, which refers to radiologists' first impression arising from an initial global image processing, is poorly understood. We examined whether the gist signal can provide complementary information to data captured by radiologists (experiment 1), or computer algorithms (experiment 2) based on detailed mammogram inspection. In the first experiment, 19 radiologists assessed a case set twice, once based on a half-second image presentation (i.e., gist signal) and once in the usual viewing condition. Their performances in two viewing conditions were compared using repeated measure correlation (rm-corr). The cancer cases (19 cases × 19 readers) exhibited non-significant trend with rm-corr = 0.012 (p = 0.82, CI: -0.09, 0.12). For normal cases (41 cases × 19 readers), a weak correlation of rm-corr = 0.238 (p < 0.001, CI: 0.17, 0.30) was found. In the second experiment, we combined the abnormality score from a state-of-the-art deep learning-based tool (DL) with the radiological gist signal using a support vector machine (SVM). To obtain the gist signal, 53 radiologists assessed images based on half-second image presentation. The SVM performance for each radiologist and an average reader, whose gist responses were the mean abnormality scores given by all 53 readers to each image was assessed using leave-one-out cross-validation. For the average reader, the AUC for gist, DL, and the SVM, were 0.76 (CI: 0.62-0.86), 0.79 (CI: 0.63-0.89), and 0.88 (CI: 0.79-0.94). For all readers with a gist AUC significantly better than chance-level, the SVM outperformed DL. The gist signal provided malignancy evidence with no or weak associations with the information captured by humans in normal radiologic reporting, which involves detailed mammogram inspection. Adding gist signal to a state-of-the-art deep learning-based tool improved its performance for the breast cancer detection.Entities:
Mesh:
Year: 2021 PMID: 34635726 PMCID: PMC8505651 DOI: 10.1038/s41598-021-99582-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The gist responses and ratings in the usual viewing condition for cancer cases from four radiologists. The R2 for the fitted trendlines were less than 0.02 for all readers.
Mixed linear regression models.
| Coef | SE | CI [0.025 0.975] | ||
|---|---|---|---|---|
| Intercept | 3.692 | 0.132 | 3.433–3.951 | |
| Mass: Calcification | 0.193 | 0.068 | 0.059–0.326 | |
| Density | −0.052 | 0.026 | −0.102–0.002 | |
| Size | 0.011 | 0.005 | 0.002–0.021 | |
| Gist | 0.000 | 0.001 | 0.755 | −0.002–0.003 |
| Intercept | 0.548 | 0.172 | 0.211–0.885 | |
| Density | −0.123 | 0.064 | 0.054 | −0.248–0.002 |
| Gist | 0.017 | 0.002 | 0.012–0.022 | |
The model generated separately for normal and abnormal cases. Note that although larger sample size in Normal category could result in more precision and increase power, a coefficient of zero for the “Gist” response suggests that it is not a strong predictor of ratings in the usual presentation for the Cancer Cases.
Bold values showed the significant p-values
Figure 2(a) The AUC value of the readers when the ratings in the usual viewing condition and gist responses were combined versus the AUC values of the readers in the usual viewing condition. The x = y line is also indicated (red line). As shown, the multiplication by the gist response resulted in the deterioration of the performance only in two of the readers (indicated by arrow). (b) The ROC curves of the average reader in the usual viewing and reporting condition, gist experiments, and when two scores were combined. The AUC values are 0.93 (0.86–0.98), 0.85 (CI: 0.68–0.93), and 0.96 (CI: 0.91- 0.99) respectively.
Figure 3The ROC curves for the average reader (a), the model on its own (b), and when the gist signal is aggregated with the models’ output using an SVM for the average reader. The confidence intervals (dashed lines) were calculated using bootstrap. The number of bootstrap replicas was set to 100.
Figure 4The AUC of the SVMs personalized for each radiologist versus their AUCs in the gist experiment. The dashed line shows the AUC of the model on its own. As shown in the figure, for most of the readers with an AUC above 0.635, an improvement in the model’s performance was observed after aggregating the gist signal with the deep learning’s abnormality score.
Figure 5The protocol for presenting the images to collect the gist signal.
Characteristics of cases and radiologists in the first experiment.
| Characteristics | Characteristics | Numbers | |
|---|---|---|---|
| Almost entirely fatty | 13 | Yes | 7 |
| Scattered areas of fibroglandular density | 22 | No | 12 |
| Heterogeneously dense | 12 | ||
| Extremely dense | 14 | Breast | 17 |
| Others | 1 | ||
| Left | |||
| Right | 10 | ≤ 10 h | 13 |
| > 10 h | 6 | ||
| Mean + Std (mm) | 10.5 ± 6.1 | ||
| Range (mm) | 4.0–26.0 | ≤ 100 cases | 6 |
| > 100 cases | 13 | ||
| Mean + Std (mm) | 60.5 ± 29.2 | ||
| Range (mm) | 97.0–119.0 | Yes | 16 |
| No | 3 | ||
| Calcification | 7 | ||
| Stellate | 5 | ≤ 10 years | 8 |
| Discrete Mass | 3 | > 10 years | 11 |
| Architectural Distortion | 3 | ||
| Non-specific density | 2 | ||
Bold values showed the significant p-values
Characteristics of cancer cases in the second experiment.
| Type | Location | Lesion size (mm) | |||
|---|---|---|---|---|---|
| Architectural Distortion | 2 | Central | 5 | Mean | 11.35 |
| Calcification | 1 | Lower Inner (inferior medial) | 2 | Std | 5.26 |
| Discrete Mass | 2 | Lower Outer (inferior lateral) | 1 | Min | 5 |
| Non-specific density | 6 | Retro Areolar | 1 | Max | 24 |
| Spiculated Mass | 3 | Upper Inner (superior medial) | 1 | ||
| Stellate | 6 | Upper Outer (superior lateral) | 10 | ||