Literature DB >> 33863957

Data valuation for medical imaging using Shapley value and application to a large-scale chest X-ray dataset.

Siyi Tang1, Amirata Ghorbani1, Rikiya Yamashita2, Sameer Rehman3, Jared A Dunnmon4, James Zou1,2,4, Daniel L Rubin5,6.   

Abstract

The reliability of machine learning models can be compromised when trained on low quality data. Many large-scale medical imaging datasets contain low quality labels extracted from sources such as medical reports. Moreover, images within a dataset may have heterogeneous quality due to artifacts and biases arising from equipment or measurement errors. Therefore, algorithms that can automatically identify low quality data are highly desired. In this study, we used data Shapley, a data valuation metric, to quantify the value of training data to the performance of a pneumonia detection algorithm in a large chest X-ray dataset. We characterized the effectiveness of data Shapley in identifying low quality versus valuable data for pneumonia detection. We found that removing training data with high Shapley values decreased the pneumonia detection performance, whereas removing data with low Shapley values improved the model performance. Furthermore, there were more mislabeled examples in low Shapley value data and more true pneumonia cases in high Shapley value data. Our results suggest that low Shapley value indicates mislabeled or poor quality images, whereas high Shapley value indicates data that are valuable for pneumonia detection. Our method can serve as a framework for using data Shapley to denoise large-scale medical imaging datasets.

Entities:  

Year:  2021        PMID: 33863957      PMCID: PMC8052417          DOI: 10.1038/s41598-021-87762-2

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


  19 in total

1.  Wavelet-domain medical image denoising using bivariate laplacian mixture model.

Authors:  Hossein Rabbani; Reza Nezafat; Saeed Gazor
Journal:  IEEE Trans Biomed Eng       Date:  2009-08-18       Impact factor: 4.538

2.  An explainable deep-learning algorithm for the detection of acute intracranial haemorrhage from small datasets.

Authors:  Hyunkwang Lee; Sehyo Yune; Mohammad Mansouri; Myeongchan Kim; Shahein H Tajmir; Claude E Guerrier; Sarah A Ebert; Stuart R Pomerantz; Javier M Romero; Shahmir Kamalian; Ramon G Gonzalez; Michael H Lev; Synho Do
Journal:  Nat Biomed Eng       Date:  2018-12-17       Impact factor: 25.671

3.  Exploring Large-scale Public Medical Image Datasets.

Authors:  Luke Oakden-Rayner
Journal:  Acad Radiol       Date:  2019-11-06       Impact factor: 3.173

Review 4.  Deep learning with noisy labels: Exploring techniques and remedies in medical image analysis.

Authors:  Davood Karimi; Haoran Dou; Simon K Warfield; Ali Gholipour
Journal:  Med Image Anal       Date:  2020-06-20       Impact factor: 8.545

5.  Assessment of Convolutional Neural Networks for Automated Classification of Chest Radiographs.

Authors:  Jared A Dunnmon; Darvin Yi; Curtis P Langlotz; Christopher Ré; Daniel L Rubin; Matthew P Lungren
Journal:  Radiology       Date:  2018-11-13       Impact factor: 29.146

Review 6.  A Review of Denoising Medical Images Using Machine Learning 
Approaches.

Authors:  Prabhpreet Kaur; Gurvinder Singh; Parminder Kaur
Journal:  Curr Med Imaging Rev       Date:  2018-10

7.  Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: A cross-sectional study.

Authors:  John R Zech; Marcus A Badgeley; Manway Liu; Anthony B Costa; Joseph J Titano; Eric Karl Oermann
Journal:  PLoS Med       Date:  2018-11-06       Impact factor: 11.069

8.  Weakly supervised classification of aortic valve malformations using unlabeled cardiac MRI sequences.

Authors:  Jason A Fries; Paroma Varma; Vincent S Chen; Ke Xiao; Heliodoro Tejeda; Priyanka Saha; Jared Dunnmon; Henry Chubb; Shiraz Maskatia; Madalina Fiterau; Scott Delp; Euan Ashley; Christopher Ré; James R Priest
Journal:  Nat Commun       Date:  2019-07-15       Impact factor: 14.919

9.  Deep learning for chest radiograph diagnosis: A retrospective comparison of the CheXNeXt algorithm to practicing radiologists.

Authors:  Pranav Rajpurkar; Jeremy Irvin; Robyn L Ball; Kaylie Zhu; Brandon Yang; Hershel Mehta; Tony Duan; Daisy Ding; Aarti Bagul; Curtis P Langlotz; Bhavik N Patel; Kristen W Yeom; Katie Shpanskaya; Francis G Blankenberg; Jayne Seekins; Timothy J Amrhein; David A Mong; Safwan S Halabi; Evan J Zucker; Andrew Y Ng; Matthew P Lungren
Journal:  PLoS Med       Date:  2018-11-20       Impact factor: 11.069

View more
  1 in total

1.  Review on COVID-19 diagnosis models based on machine learning and deep learning approaches.

Authors:  Zaid Abdi Alkareem Alyasseri; Mohammed Azmi Al-Betar; Iyad Abu Doush; Mohammed A Awadallah; Ammar Kamal Abasi; Sharif Naser Makhadmeh; Osama Ahmad Alomari; Karrar Hameed Abdulkareem; Afzan Adam; Robertas Damasevicius; Mazin Abed Mohammed; Raed Abu Zitar
Journal:  Expert Syst       Date:  2021-07-28       Impact factor: 2.812

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.