Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Bias Analysis on Public X-Ray Image Datasets of Pneumonia and COVID-19 Patients.

Literature DB >> 34812384

Bias Analysis on Public X-Ray Image Datasets of Pneumonia and COVID-19 Patients.

Omar Del Tejo Catala¹, Ismael Salvador Igual¹, Francisco Javier Perez-Benito¹, David Millan Escriva¹, Vicent Ortiz Castello¹, Rafael Llobet^1,2, Juan-Carlos Perez-Cortes^1,3.

Abstract

Chest X-ray images are useful for early COVID-19 diagnosis with the advantage that X-ray devices are already available in health centers and images are obtained immediately. Some datasets containing X-ray images with cases (pneumonia or COVID-19) and controls have been made available to develop machine-learning-based methods to aid in diagnosing the disease. However, these datasets are mainly composed of different sources coming from pre-COVID-19 datasets and COVID-19 datasets. Particularly, we have detected a significant bias in some of the released datasets used to train and test diagnostic systems, which might imply that the results published are optimistic and may overestimate the actual predictive capacity of the techniques proposed. In this article, we analyze the existing bias in some commonly used datasets and propose a series of preliminary steps to carry out before the classic machine learning pipeline in order to detect possible biases, to avoid them if possible and to report results that are more representative of the actual predictive power of the methods under analysis. This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.

Entities: Chemical

Keywords: COVID-19; Deep learning; bias; chest X-ray; convolutional neural networks; saliency map; segmentation

Year: 2021 PMID： 34812384 PMCID： PMC8545228 DOI： 10.1109/ACCESS.2021.3065456

Source DB: PubMed Journal: IEEE Access ISSN： 2169-3536 Impact factor: 3.476

23 in total

Review 1. A survey on deep learning in medical image analysis.

Authors: Geert Litjens; Thijs Kooi; Babak Ehteshami Bejnordi; Arnaud Arindra Adiyoso Setio; Francesco Ciompi; Mohsen Ghafoorian; Jeroen A W M van der Laak; Bram van Ginneken; Clara I Sánchez
Journal: Med Image Anal Date: 2017-07-26 Impact factor: 8.545

2. Applying probabilistic temporal and multisite data quality control methods to a public health mortality registry in Spain: a systematic approach to quality control of repositories.

Authors: Carlos Sáez; Oscar Zurriaga; Jordi Pérez-Panadés; Inma Melchor; Montserrat Robles; Juan M García-Gómez
Journal: J Am Med Inform Assoc Date: 2016-04-23 Impact factor: 4.497

3. CovidGAN: Data Augmentation Using Auxiliary Classifier GAN for Improved Covid-19 Detection.

Authors: Abdul Waheed; Muskan Goyal; Deepak Gupta; Ashish Khanna; Fadi Al-Turjman; Placido Rogerio Pinheiro
Journal: IEEE Access Date: 2020-05-14 Impact factor: 3.367

4. PadChest: A large chest x-ray image dataset with multi-label annotated reports.

Authors: Aurelia Bustos; Antonio Pertusa; Jose-Maria Salinas; Maria de la Iglesia-Vayá
Journal: Med Image Anal Date: 2020-08-20 Impact factor: 8.545

5. Deep Learning COVID-19 Features on CXR Using Limited Training Data Sets.

Authors: Yujin Oh; Sangjoon Park; Jong Chul Ye
Journal: IEEE Trans Med Imaging Date: 2020-05-08 Impact factor: 10.048

6. Subgrouping Factors Influencing Migraine Intensity in Women: A Semi-automatic Methodology Based on Machine Learning and Information Geometry.

Authors: Francisco J Pérez-Benito; J Alberto Conejero; Carlos Sáez; Juan M García-Gómez; Esperanza Navarro-Pardo; Lidiane L Florencio; César Fernández-de-Las-Peñas
Journal: Pain Pract Date: 2019-12-02 Impact factor: 3.183

7. Temporal variability analysis reveals biases in electronic health records due to hospital process reengineering interventions over seven years.

Authors: Francisco Javier Pérez-Benito; Carlos Sáez; J Alberto Conejero; Salvador Tortajada; Bernardo Valdivieso; Juan M García-Gómez
Journal: PLoS One Date: 2019-08-07 Impact factor: 3.240