| Literature DB >> 32939632 |
Garima Suman1, Ananya Panda1, Panagiotis Korfiatis1, Marie E Edwards1, Sushil Garg2, Daniel J Blezek1, Suresh T Chari3, Ajit H Goenka4.
Abstract
PURPOSE: To evaluate the performance of trained technologists vis-à-vis radiologists for volumetric pancreas segmentation and to assess the impact of supplementary training on their performance.Entities:
Keywords: Artificial intelligence; COVID-19; Data curation; Deep learning
Mesh:
Year: 2020 PMID: 32939632 PMCID: PMC7493700 DOI: 10.1007/s00261-020-02741-x
Source DB: PubMed Journal: Abdom Radiol (NY)
Fig. 1Images from training material: Color-coded depiction of abdominal organs (a) on an axial CT image (pancreas: red; liver: purple; kidneys: light green; stomach: yellow; small bowel: blue, and spleen: cyan). Depiction of pancreas outline in red with labeled subadjacent anatomical structures on axial (b) and coronal (c) CT images. Tracing of pancreas outline on enterprise custom image-viewing software using freehand tools (d). The smaller red squares are artefactually generated by the software with any outline task
Fig. 2Evaluation of technologists’ segmentation: Color-coded areas represent correct segmentation (blue), incorrect segmentation (red), and overlap between technologists’ and radiologists’ segmentation (purple). Example of accurate segmentation (a); exclusion of a portion of pancreatic head resulted in undersegmentation error or false negative (b), and inclusion of duodenum within the segmentation resulted in an oversegmentation error or false positive (c)
Summary of technologists’ performance between the first batch (before supplementary training) and the second batch (after supplementary training) for the cases that needed revision
| Performance metrics | First batch | Second batch | |
|---|---|---|---|
| DSC (mean ± SD) | 0.63 ± 0.15 | 0.63 ± 0.16 | 0.61 |
| JC (mean ± SD) | 0.48 ± 0.15 | 0.48 ± 0.15 | 0.61 |
| FP (mean ± SD) | 0.29 ± 0.21 | 0.21 ± 0.10 | 0.07 |
| FN (mean ± SD | 0.36 ± 0.20 | 0.43 ± 0.19 | 0.12 |
DSC Dice–Sorenson coefficient, JC Jaccard coefficient, FP false positive; FN − false negative
Fig. 3Bland–Altman analyses for mean pancreatic volume difference between technologists’ and radiologists’ segmentations for cases that required correction before (a) and after supplementary training (b): mean pancreatic volume difference before supplementary training (a) was − 2.74 cc (minimum: − 92.96 cc, maximum: 87.47 cc). Mean pancreatic volume difference after supplementary training (b) was − 23.57 cc (minimum: − 77.32 cc, maximum: 30.19 cc). Dotted lines indicate limits of differences (mean ± 1.96 SD)
Fig. 4Box and whisker plots of technologists’ performance during first (blue, labeled as Batch 1.0) and second batch of segmentations (orange, labeled as Batch 2.0) when compared against radiologist’ segmentations in terms of Dice-Sorenson coefficient (Dice) (a), Jaccard coefficient (Jaccard) (b), false positive rate (c), and false negative rate (d)