| Literature DB >> 34951033 |
Jennifer Dhont1, Cecile Wolfs1, Frank Verhaegen1.
Abstract
PURPOSE: Over the last 2 years, the artificial intelligence (AI) community has presented several automatic screening tools for coronavirus disease 2019 (COVID-19) based on chest radiography (CXR), with reported accuracies often well over 90%. However, it has been noted that many of these studies have likely suffered from dataset bias, leading to overly optimistic results. The purpose of this study was to thoroughly investigate to what extent biases have influenced the performance of a range of previously proposed and promising convolutional neural networks (CNNs), and to determine what performance can be expected with current CNNs on a realistic and unbiased dataset.Entities:
Keywords: COVID-19; X-ray imaging; artificial intelligence; dataset bias
Mesh:
Year: 2022 PMID: 34951033 PMCID: PMC9015341 DOI: 10.1002/mp.15419
Source DB: PubMed Journal: Med Phys ISSN: 0094-2405 Impact factor: 4.506
FIGURE 1Illustration of the four different datasets with chest radiography (CXRs) of coronavirus disease 2019 (COVID‐19)‐positive (COVID +) and ‐negative (COVID–) patients, including the number of CXRs per class used for training and testing (left), together with a schematic overview of the methodology (right)
Coronavirus disease 2019 (COVID‐19) positive precision/recall obtained on the unseen test set of each dataset in both the internal (grey shading) and cross‐dataset evaluation. BIMCV+/COVIDx– and COVIDx+/BIMCV– were created by combining the opposing classes from the Valencian Region Medical Image Bank (BIMCV) and COVIDxB8 datasets. Numbers in bold indicate the particularly poor performance when the origin of the two classes is reversed between training and testing
|
|
|
|
|
| |
|---|---|---|---|---|---|
| 1. VGG19 | BIMCV | 0.72/0.71 | 0.58/0.72 | – | – |
| COVIDxB8 | 0.56/0.44 | 0.98/0.85 | – | – | |
| BIMCV+/COVIDx‐ | – | – | 0.98/0.99 |
| |
| COVIDx+/BIMCV‐ | – | – |
| 1.00/0.94 | |
| 2. ResNet50 | BIMCV | 0.74/0.66 | 0.58/0.62 | – | – |
| COVIDxB8 | 0.61/0.48 | 0.98/0.77 | – | – | |
| BIMCV+/COVIDx‐ | – | – | 0.96/0.98 |
| |
| COVIDx+/BIMCV‐ | – | – |
| 0.98/0.85 | |
| 3. InceptionV3 | BIMCV | 0.65/0.65 | 0.63/0.63 | – | – |
| COVIDxB8 | 0.60/0.55 | 0.98/0.84 | – | – | |
| BIMCV+/COVIDx‐ | – | – | 0.95/0.98 |
| |
| COVIDx+/BIMCV‐ | – | – |
| 0.99/0.84 | |
| 4. DenseNet201 | BIMCV | 0.76/0.59 | 0.82/0.57 | – | – |
| COVIDxB8 | 0.58/0.55 | 0.96/0.87 | – | – | |
| BIMCV+/COVIDx‐ | – | – | 0.94/1.00 |
| |
| COVIDx+/BIMCV‐ | – | – |
| 1.00/0.84 | |
| 5. COVID‐Net | COVIDxB8 | 0.55/0.41 | 0.99/0.77 | – | – |
FIGURE 2Four representative examples (two coronavirus disease 2019 [COVID‐19] positive, two COVID‐19 negative) of the saliency maps obtained for convolutional neural network (CNN) 1–4 trained on the Valencian Region Medical Image Bank (BIMCV) dataset, showing the most salient segments (top 10%, in green). All images originate from the BIMCV test set. Chest radiography (CXRs) delineated in red were misclassified
FIGURE 3Four representative examples (two coronavirus disease 2019 [COVID‐19] positive and two COVID‐19 negative) of the saliency maps obtained for convolutional neural network (CNN) 1–4 trained on the COVIDxB8 dataset, showing the most salient segments (top 10%, in green). All images originate from the COVIDxB8 test set and were correctly classified by each network
Coronavirus disease 2019 (COVID‐19) positive precision/recall obtained on the unseen test set of each dataset, in both the internal (grey shading) and cross‐dataset evaluation, when pixels outside the lungs are masked before training and testing
|
| |||
|---|---|---|---|
|
|
|
| |
| 1. VGG19 | BIMCV | 0.68/0.65 | 0.68/0.69 |
| COVIDxB8 | 0.61/0.43 | 0.97/0.80 | |
| 2. ResNet50 | BIMCV | 0.67/0.71 | 0.67/0.64 |
| COVIDxB8 | 0.63/0.64 | 0.97/0.87 | |
| 3. InceptionV3 | BIMCV | 0.65 / 0.64 | 0.66/0.60 |
| COVIDxB8 | 0.61/0.51 | 0.98/0.80 | |
| 4. DenseNet201 | BIMCV | 0.69/0.56 | 0.82/0.57 |
| COVIDxB8 | 0.63/0.71 | 0.95/0.86 |
Abbreviation: BIMCV, Valencian Region Medical Image Bank.