Literature DB >> 33840636

Quality gaps in public pancreas imaging datasets: Implications & challenges for AI applications.

Garima Suman1, Anurima Patra1, Panagiotis Korfiatis1, Shounak Majumder2, Suresh T Chari3, Mark J Truty4, Joel G Fletcher1, Ajit H Goenka5.   

Abstract

OBJECTIVE: Quality gaps in medical imaging datasets lead to profound errors in experiments. Our objective was to characterize such quality gaps in public pancreas imaging datasets (PPIDs), to evaluate their impact on previously published studies, and to provide post-hoc labels and segmentations as a value-add for these PPIDs.
METHODS: We scored the available PPIDs on the medical imaging data readiness (MIDaR) scale, and evaluated for associated metadata, image quality, acquisition phase, etiology of pancreas lesion, sources of confounders, and biases. Studies utilizing these PPIDs were evaluated for awareness of and any impact of quality gaps on their results. Volumetric pancreatic adenocarcinoma (PDA) segmentations were performed for non-annotated CTs by a junior radiologist (R1) and reviewed by a senior radiologist (R3).
RESULTS: We found three PPIDs with 560 CTs and six MRIs. NIH dataset of normal pancreas CTs (PCT) (n = 80 CTs) had optimal image quality and met MIDaR A criteria but parts of pancreas have been excluded in the provided segmentations. TCIA-PDA (n = 60 CTs; 6 MRIs) and MSD(n = 420 CTs) datasets categorized to MIDaR B due to incomplete annotations, limited metadata, and insufficient documentation. Substantial proportion of CTs from TCIA-PDA and MSD datasets were found unsuitable for AI due to biliary stents [TCIA-PDA:10 (17%); MSD:112 (27%)] or other factors (non-portal venous phase, suboptimal image quality, non-PDA etiology, or post-treatment status) [TCIA-PDA:5 (8.5%); MSD:156 (37.1%)]. These quality gaps were not accounted for in any of the 25 studies that have used these PPIDs (NIH-PCT:20; MSD:1; both: 4). PDA segmentations were done by R1 in 91 eligible CTs (TCIA-PDA:42; MSD:49). Of these, corrections were made by R3 in 16 CTs (18%) (TCIA-PDA:4; MSD:12) [mean (standard deviation) Dice: 0.72(0.21) and 0.63(0.23) respectively].
CONCLUSION: Substantial quality gaps, sources of bias, and high proportion of CTs unsuitable for AI characterize the available limited PPIDs. Published studies on these PPIDs do not account for these quality gaps. We complement these PPIDs through post-hoc labels and segmentations for public release on the TCIA portal. Collaborative efforts leading to large, well-curated PPIDs supported by adequate documentation are critically needed to translate the promise of AI to clinical practice.
Copyright © 2021 IAP and EPC. Published by Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Benchmarking; Bias; Deep learning; Metadata; Pancreatic carcinoma

Mesh:

Year:  2021        PMID: 33840636     DOI: 10.1016/j.pan.2021.03.016

Source DB:  PubMed          Journal:  Pancreatology        ISSN: 1424-3903            Impact factor:   3.996


  2 in total

1.  Radiomics for Detection of Pancreas Adenocarcinoma on CT Scans: Impact of Biliary Stents.

Authors:  Garima Suman; Anurima Patra; Sovanlal Mukherjee; Panagiotis Korffiatis; Ajit H Goenka
Journal:  Radiol Imaging Cancer       Date:  2022-01

2.  Detection of Pancreatic Cancer in CT Scan Images Using PSO SVM and Image Processing.

Authors:  Arshiya S Ansari; Abu Sarwar Zamani; Mohammad Sajid Mohammadi; Mahyudin Ritonga; Syed Sohail Ahmed; Devabalan Pounraj; Karthikeyan Kaliyaperumal
Journal:  Biomed Res Int       Date:  2022-07-26       Impact factor: 3.246

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.