Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Data valuation for medical imaging using Shapley value and application to a large-scale chest X-ray dataset.

Literature DB >> 33863957

Data valuation for medical imaging using Shapley value and application to a large-scale chest X-ray dataset.

Siyi Tang¹, Amirata Ghorbani¹, Rikiya Yamashita², Sameer Rehman³, Jared A Dunnmon⁴, James Zou^1,2,4, Daniel L Rubin^5,6.

Abstract

The reliability of machine learning models can be compromised when trained on low quality data. Many large-scale medical imaging datasets contain low quality labels extracted from sources such as medical reports. Moreover, images within a dataset may have heterogeneous quality due to artifacts and biases arising from equipment or measurement errors. Therefore, algorithms that can automatically identify low quality data are highly desired. In this study, we used data Shapley, a data valuation metric, to quantify the value of training data to the performance of a pneumonia detection algorithm in a large chest X-ray dataset. We characterized the effectiveness of data Shapley in identifying low quality versus valuable data for pneumonia detection. We found that removing training data with high Shapley values decreased the pneumonia detection performance, whereas removing data with low Shapley values improved the model performance. Furthermore, there were more mislabeled examples in low Shapley value data and more true pneumonia cases in high Shapley value data. Our results suggest that low Shapley value indicates mislabeled or poor quality images, whereas high Shapley value indicates data that are valuable for pneumonia detection. Our method can serve as a framework for using data Shapley to denoise large-scale medical imaging datasets.

Entities: Disease Gene Species

Year: 2021 PMID： 33863957 PMCID： PMC8052417 DOI： 10.1038/s41598-021-87762-2

Source DB: PubMed Journal: Sci Rep ISSN： 2045-2322 Impact factor: 4.379

19 in total

1. Wavelet-domain medical image denoising using bivariate laplacian mixture model.

Authors: Hossein Rabbani; Reza Nezafat; Saeed Gazor
Journal: IEEE Trans Biomed Eng Date: 2009-08-18 Impact factor: 4.538

2. An explainable deep-learning algorithm for the detection of acute intracranial haemorrhage from small datasets.

Authors: Hyunkwang Lee; Sehyo Yune; Mohammad Mansouri; Myeongchan Kim; Shahein H Tajmir; Claude E Guerrier; Sarah A Ebert; Stuart R Pomerantz; Javier M Romero; Shahmir Kamalian; Ramon G Gonzalez; Michael H Lev; Synho Do
Journal: Nat Biomed Eng Date: 2018-12-17 Impact factor: 25.671

3. Exploring Large-scale Public Medical Image Datasets.

Authors: Luke Oakden-Rayner
Journal: Acad Radiol Date: 2019-11-06 Impact factor: 3.173

Review 4. Deep learning with noisy labels: Exploring techniques and remedies in medical image analysis.

Authors: Davood Karimi; Haoran Dou; Simon K Warfield; Ali Gholipour
Journal: Med Image Anal Date: 2020-06-20 Impact factor: 8.545

5. Assessment of Convolutional Neural Networks for Automated Classification of Chest Radiographs.

Authors: Jared A Dunnmon; Darvin Yi; Curtis P Langlotz; Christopher Ré; Daniel L Rubin; Matthew P Lungren
Journal: Radiology Date: 2018-11-13 Impact factor: 29.146

Review 6. A Review of Denoising Medical Images Using Machine Learning  Approaches.

Authors: Prabhpreet Kaur; Gurvinder Singh; Parminder Kaur
Journal: Curr Med Imaging Rev Date: 2018-10

7. Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: A cross-sectional study.

Authors: John R Zech; Marcus A Badgeley; Manway Liu; Anthony B Costa; Joseph J Titano; Eric Karl Oermann
Journal: PLoS Med Date: 2018-11-06 Impact factor: 11.069

8. Weakly supervised classification of aortic valve malformations using unlabeled cardiac MRI sequences.

Authors: Jason A Fries; Paroma Varma; Vincent S Chen; Ke Xiao; Heliodoro Tejeda; Priyanka Saha; Jared Dunnmon; Henry Chubb; Shiraz Maskatia; Madalina Fiterau; Scott Delp; Euan Ashley; Christopher Ré; James R Priest
Journal: Nat Commun Date: 2019-07-15 Impact factor: 14.919

9. Deep learning for chest radiograph diagnosis: A retrospective comparison of the CheXNeXt algorithm to practicing radiologists.

Authors: Pranav Rajpurkar; Jeremy Irvin; Robyn L Ball; Kaylie Zhu; Brandon Yang; Hershel Mehta; Tony Duan; Daisy Ding; Aarti Bagul; Curtis P Langlotz; Bhavik N Patel; Kristen W Yeom; Katie Shpanskaya; Francis G Blankenberg; Jayne Seekins; Timothy J Amrhein; David A Mong; Safwan S Halabi; Evan J Zucker; Andrew Y Ng; Matthew P Lungren
Journal: PLoS Med Date: 2018-11-20 Impact factor: 11.069

1 in total

1. Review on COVID-19 diagnosis models based on machine learning and deep learning approaches.

Authors: Zaid Abdi Alkareem Alyasseri; Mohammed Azmi Al-Betar; Iyad Abu Doush; Mohammed A Awadallah; Ammar Kamal Abasi; Sharif Naser Makhadmeh; Osama Ahmad Alomari; Karrar Hameed Abdulkareem; Afzan Adam; Robertas Damasevicius; Mazin Abed Mohammed; Raed Abu Zitar
Journal: Expert Syst Date: 2021-07-28 Impact factor: 2.812

1 in total

Review 4. Deep learning with noisy labels: Exploring techniques and remedies in medical image analysis.

Review 6. A Review of Denoising Medical Images Using Machine Learning Approaches.

Review 6. A Review of Denoising Medical Images Using Machine Learning  Approaches.