Literature DB >> 32078570

Inaccurate Labels in Weakly-Supervised Deep Learning: Automatic Identification and Correction and Their Impact on Classification Performance.

Degan Hao, Lei Zhang, Jules Sumkin, Aly Mohamed, Shandong Wu.   

Abstract

In data-driven deep learning-based modeling, data quality may substantially influence classification performance. Correct data labeling for deep learning modeling is critical. In weakly-supervised learning, a challenge lies in dealing with potentially inaccurate or mislabeled training data. In this paper, we proposed an automated methodological framework to identify mislabeled data using two metric functions, namely, Cross-entropy Loss that indicates divergence between a prediction and ground truth, and Influence function that reflects the dependence of a model on data. After correcting the identified mislabels, we measured their impact on the classification performance. We also compared the mislabeling effects in three experiments on two different real-world clinical questions. A total of 10,500 images were studied in the contexts of clinical breast density category classification and breast cancer malignancy diagnosis. We used intentionally flipped labels as mislabels to evaluate the proposed method at a varying proportion of mislabeled data included in model training. We also compared the effects of our method to two published schemes for breast density category classification. Experiment results show that when the dataset contains 10% of mislabeled data, our method can automatically identify up to 98% of these mislabeled data by examining/checking the top 30% of the full dataset. Furthermore, we show that correcting the identified mislabels leads to an improvement in the classification performance. Our method provides a feasible solution for weakly-supervised deep learning modeling in dealing with inaccurate labels.

Entities:  

Year:  2020        PMID: 32078570      PMCID: PMC7429345          DOI: 10.1109/JBHI.2020.2974425

Source DB:  PubMed          Journal:  IEEE J Biomed Health Inform        ISSN: 2168-2194            Impact factor:   5.772


  12 in total

1.  New Federal Requirements to Inform Patients About Breast Density: Will They Help Patients?

Authors:  Nancy L Keating; Lydia E Pace
Journal:  JAMA       Date:  2019-06-18       Impact factor: 56.272

2.  Inter- and intraradiologist variability in the BI-RADS assessment and breast density categories for screening mammograms.

Authors:  A Redondo; M Comas; F Macià; F Ferrer; C Murta-Nascimento; M T Maristany; E Molins; M Sala; X Castells
Journal:  Br J Radiol       Date:  2012-09-19       Impact factor: 3.039

3.  Estimation of breast percent density in raw and processed full field digital mammography images via adaptive fuzzy c-means clustering and support vector machine segmentation.

Authors:  Brad M Keller; Diane L Nathan; Yan Wang; Yuanjie Zheng; James C Gee; Emily F Conant; Despina Kontos
Journal:  Med Phys       Date:  2012-08       Impact factor: 4.071

4.  Data Programming: Creating Large Training Sets, Quickly.

Authors:  Alexander Ratner; Christopher De Sa; Sen Wu; Daniel Selsam; Christopher Ré
Journal:  Adv Neural Inf Process Syst       Date:  2016-12

5.  A deep learning method for classifying mammographic breast density categories.

Authors:  Aly A Mohamed; Wendie A Berg; Hong Peng; Yahong Luo; Rachel C Jankowitz; Shandong Wu
Journal:  Med Phys       Date:  2017-12-22       Impact factor: 4.071

Review 6.  Artificial intelligence in radiology.

Authors:  Ahmed Hosny; Chintan Parmar; John Quackenbush; Lawrence H Schwartz; Hugo J W L Aerts
Journal:  Nat Rev Cancer       Date:  2018-08       Impact factor: 60.716

7.  Deep Learning to Distinguish Recalled but Benign Mammography Images in Breast Cancer Screening.

Authors:  Sarah S Aboutalib; Aly A Mohamed; Wendie A Berg; Margarita L Zuley; Jules H Sumkin; Shandong Wu
Journal:  Clin Cancer Res       Date:  2018-10-11       Impact factor: 12.531

8.  Preliminary evaluation of the publicly available Laboratory for Breast Radiodensity Assessment (LIBRA) software tool: comparison of fully automated area and volumetric density measures in a case-control study with digital mammography.

Authors:  Brad M Keller; Jinbo Chen; Dania Daye; Emily F Conant; Despina Kontos
Journal:  Breast Cancer Res       Date:  2015-08-25       Impact factor: 6.466

9.  Histogram-based normalization technique on human brain magnetic resonance images from different acquisitions.

Authors:  Xiaofei Sun; Lin Shi; Yishan Luo; Wei Yang; Hongpeng Li; Peipeng Liang; Kuncheng Li; Vincent C T Mok; Winnie C W Chu; Defeng Wang
Journal:  Biomed Eng Online       Date:  2015-07-28       Impact factor: 2.819

10.  Deep learning for chest radiograph diagnosis: A retrospective comparison of the CheXNeXt algorithm to practicing radiologists.

Authors:  Pranav Rajpurkar; Jeremy Irvin; Robyn L Ball; Kaylie Zhu; Brandon Yang; Hershel Mehta; Tony Duan; Daisy Ding; Aarti Bagul; Curtis P Langlotz; Bhavik N Patel; Kristen W Yeom; Katie Shpanskaya; Francis G Blankenberg; Jayne Seekins; Timothy J Amrhein; David A Mong; Safwan S Halabi; Evan J Zucker; Andrew Y Ng; Matthew P Lungren
Journal:  PLoS Med       Date:  2018-11-20       Impact factor: 11.069

View more
  2 in total

1.  Weakly supervised end-to-end artificial intelligence in gastrointestinal endoscopy.

Authors:  Lukas Buendgens; Didem Cifci; Narmin Ghaffari Laleh; Marko van Treeck; Maria T Koenen; Henning W Zimmermann; Till Herbold; Thomas Joachim Lux; Alexander Hann; Christian Trautwein; Jakob Nikolas Kather
Journal:  Sci Rep       Date:  2022-03-22       Impact factor: 4.379

2.  Architectural Distortion-Based Digital Mammograms Classification Using Depth Wise Convolutional Neural Network.

Authors:  Khalil Ur Rehman; Jianqiang Li; Yan Pei; Anaa Yasin; Saqib Ali; Yousaf Saeed
Journal:  Biology (Basel)       Date:  2021-12-23
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.