Literature DB >> 34812384

Bias Analysis on Public X-Ray Image Datasets of Pneumonia and COVID-19 Patients.

Omar Del Tejo Catala1, Ismael Salvador Igual1, Francisco Javier Perez-Benito1, David Millan Escriva1, Vicent Ortiz Castello1, Rafael Llobet1,2, Juan-Carlos Perez-Cortes1,3.   

Abstract

Chest X-ray images are useful for early COVID-19 diagnosis with the advantage that X-ray devices are already available in health centers and images are obtained immediately. Some datasets containing X-ray images with cases (pneumonia or COVID-19) and controls have been made available to develop machine-learning-based methods to aid in diagnosing the disease. However, these datasets are mainly composed of different sources coming from pre-COVID-19 datasets and COVID-19 datasets. Particularly, we have detected a significant bias in some of the released datasets used to train and test diagnostic systems, which might imply that the results published are optimistic and may overestimate the actual predictive capacity of the techniques proposed. In this article, we analyze the existing bias in some commonly used datasets and propose a series of preliminary steps to carry out before the classic machine learning pipeline in order to detect possible biases, to avoid them if possible and to report results that are more representative of the actual predictive power of the methods under analysis. This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.

Entities:  

Keywords:  COVID-19; Deep learning; bias; chest X-ray; convolutional neural networks; saliency map; segmentation

Year:  2021        PMID: 34812384      PMCID: PMC8545228          DOI: 10.1109/ACCESS.2021.3065456

Source DB:  PubMed          Journal:  IEEE Access        ISSN: 2169-3536            Impact factor:   3.476


  23 in total

Review 1.  A survey on deep learning in medical image analysis.

Authors:  Geert Litjens; Thijs Kooi; Babak Ehteshami Bejnordi; Arnaud Arindra Adiyoso Setio; Francesco Ciompi; Mohsen Ghafoorian; Jeroen A W M van der Laak; Bram van Ginneken; Clara I Sánchez
Journal:  Med Image Anal       Date:  2017-07-26       Impact factor: 8.545

2.  Applying probabilistic temporal and multisite data quality control methods to a public health mortality registry in Spain: a systematic approach to quality control of repositories.

Authors:  Carlos Sáez; Oscar Zurriaga; Jordi Pérez-Panadés; Inma Melchor; Montserrat Robles; Juan M García-Gómez
Journal:  J Am Med Inform Assoc       Date:  2016-04-23       Impact factor: 4.497

3.  CovidGAN: Data Augmentation Using Auxiliary Classifier GAN for Improved Covid-19 Detection.

Authors:  Abdul Waheed; Muskan Goyal; Deepak Gupta; Ashish Khanna; Fadi Al-Turjman; Placido Rogerio Pinheiro
Journal:  IEEE Access       Date:  2020-05-14       Impact factor: 3.367

4.  PadChest: A large chest x-ray image dataset with multi-label annotated reports.

Authors:  Aurelia Bustos; Antonio Pertusa; Jose-Maria Salinas; Maria de la Iglesia-Vayá
Journal:  Med Image Anal       Date:  2020-08-20       Impact factor: 8.545

5.  Deep Learning COVID-19 Features on CXR Using Limited Training Data Sets.

Authors:  Yujin Oh; Sangjoon Park; Jong Chul Ye
Journal:  IEEE Trans Med Imaging       Date:  2020-05-08       Impact factor: 10.048

6.  Subgrouping Factors Influencing Migraine Intensity in Women: A Semi-automatic Methodology Based on Machine Learning and Information Geometry.

Authors:  Francisco J Pérez-Benito; J Alberto Conejero; Carlos Sáez; Juan M García-Gómez; Esperanza Navarro-Pardo; Lidiane L Florencio; César Fernández-de-Las-Peñas
Journal:  Pain Pract       Date:  2019-12-02       Impact factor: 3.183

7.  Temporal variability analysis reveals biases in electronic health records due to hospital process reengineering interventions over seven years.

Authors:  Francisco Javier Pérez-Benito; Carlos Sáez; J Alberto Conejero; Salvador Tortajada; Bernardo Valdivieso; Juan M García-Gómez
Journal:  PLoS One       Date:  2019-08-07       Impact factor: 3.240

8.  Unveiling COVID-19 from CHEST X-Ray with Deep Learning: A Hurdles Race with Small Data.

Authors:  Enzo Tartaglione; Carlo Alberto Barbano; Claudio Berzovini; Marco Calandri; Marco Grangetto
Journal:  Int J Environ Res Public Health       Date:  2020-09-22       Impact factor: 3.390

View more
  2 in total

1.  Explainable artificial intelligence-based edge fuzzy images for COVID-19 detection and identification.

Authors:  Qinhua Hu; Francisco Nauber B Gois; Rafael Costa; Lijuan Zhang; Ling Yin; Naercio Magaia; Victor Hugo C de Albuquerque
Journal:  Appl Soft Comput       Date:  2022-05-13       Impact factor: 8.263

2.  Validating Automatic Concept-Based Explanations for AI-Based Digital Histopathology.

Authors:  Daniel Sauter; Georg Lodde; Felix Nensa; Dirk Schadendorf; Elisabeth Livingstone; Markus Kukuk
Journal:  Sensors (Basel)       Date:  2022-07-18       Impact factor: 3.847

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.