Literature DB >> 32965624

Assessing and mitigating the effects of class imbalance in machine learning with application to X-ray imaging.

Wendi Qu1, Indranil Balki1, Mauro Mendez1, John Valen1, Jacob Levman2,3, Pascal N Tyrrell4,5,6.   

Abstract

PURPOSE: Machine learning (ML) algorithms are well known to exhibit variations in prediction accuracy when provided with imbalanced training sets typically seen in medical imaging (MI) due to the imbalanced ratio of pathological and normal cases. This paper presents a thorough investigation of the effects of class imbalance and methods for mitigating class imbalance in ML algorithms applied to MI.
METHODS: We first selected five classes from the Image Retrieval in Medical Applications (IRMA) dataset, performed multiclass classification using the random forest model (RFM), and then performed binary classification using convolutional neural network (CNN) on a chest X-ray dataset. An imbalanced class was created in the training set by varying the number of images in that class. Methods tested to mitigate class imbalance included oversampling, undersampling, and changing class weights of the RFM. Model performance was assessed by overall classification accuracy, overall F1 score, and specificity, recall, and precision of the imbalanced class.
RESULTS: A close-to-balanced training set resulted in the best model performance, and a large imbalance with overrepresentation was more detrimental to model performance than underrepresentation. Oversampling and undersampling methods were both effective in mitigating class imbalance, and efficacy of oversampling techniques was class specific.
CONCLUSION: This study systematically demonstrates the effect of class imbalance on two public X-ray datasets on RFM and CNN, making these findings widely applicable as a reference. Furthermore, the methods employed here can guide researchers in assessing and addressing the effects of class imbalance, while considering the data-specific characteristics to optimize imbalance mitigating methods.

Keywords:  Class imbalance; Machine learning; Medical imaging; Radiology; X-ray

Mesh:

Year:  2020        PMID: 32965624     DOI: 10.1007/s11548-020-02260-6

Source DB:  PubMed          Journal:  Int J Comput Assist Radiol Surg        ISSN: 1861-6410            Impact factor:   2.924


  7 in total

1.  Development of a convolutional neural network to detect abdominal aortic aneurysms.

Authors:  Justin R Camara; Roger T Tomihama; Andrew Pop; Matthew P Shedd; Brandon S Dobrowski; Cole J Knox; Ahmed M Abou-Zamzam; Sharon C Kiang
Journal:  J Vasc Surg Cases Innov Tech       Date:  2022-05-02

2.  Deep learning model calibration for improving performance in class-imbalanced medical image classification tasks.

Authors:  Sivaramakrishnan Rajaraman; Prasanth Ganesan; Sameer Antani
Journal:  PLoS One       Date:  2022-01-27       Impact factor: 3.240

3.  Deep Learning-Based Natural Language Processing in Radiology: The Impact of Report Complexity, Disease Prevalence, Dataset Size, and Algorithm Type on Model Performance.

Authors:  A W Olthof; P M A van Ooijen; L J Cornelissen
Journal:  J Med Syst       Date:  2021-09-04       Impact factor: 4.460

Review 4.  Pediatric chest radiograph interpretation: how far has artificial intelligence come? A systematic literature review.

Authors:  Sirwa Padash; Mohammad Reza Mohebbian; Scott J Adams; Robert D E Henderson; Paul Babyn
Journal:  Pediatr Radiol       Date:  2022-04-23

5.  Visual assessment of interactions among resuscitation activity factors in out-of-hospital cardiopulmonary arrest using a machine learning model.

Authors:  Yasuyuki Kawai; Hirozumi Okuda; Arisa Kinoshita; Koji Yamamoto; Keita Miyazaki; Keisuke Takano; Hideki Asai; Yasuyuki Urisono; Hidetada Fukushima
Journal:  PLoS One       Date:  2022-09-06       Impact factor: 3.752

6.  Can Sequential Images from the Same Object Be Used for Training Machine Learning Models? A Case Study for Detecting Liver Disease by Ultrasound Radiomics.

Authors:  Laith R Sultan; Theodore W Cary; Maryam Al-Hasani; Mrigendra B Karmacharya; Santosh S Venkatesh; Charles-Antoine Assenmacher; Enrico Radaelli; Chandra M Sehgal
Journal:  AI (Basel)       Date:  2022-09-01

7.  A Pneumonia Diagnosis Scheme Based on Hybrid Features Extracted from Chest Radiographs Using an Ensemble Learning Algorithm.

Authors:  Mehedi Masud; Anupam Kumar Bairagi; Abdullah-Al Nahid; Niloy Sikder; Saeed Rubaiee; Anas Ahmed; Divya Anand
Journal:  J Healthc Eng       Date:  2021-02-25       Impact factor: 2.682

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.