| Literature DB >> 32971995 |
Enzo Tartaglione1, Carlo Alberto Barbano1, Claudio Berzovini2, Marco Calandri3, Marco Grangetto1.
Abstract
The possibility to use widespread and simple chest X-ray (CXR) imaging for early screening of COVID-19 patients is attracting much interest from both the clinical and the AI community. In this study we provide insights and also raise warnings on what is reasonable to expect by applying deep learning to COVID classification of CXR images. We provide a methodological guide and critical reading of an extensive set of statistical results that can be obtained using currently available datasets. In particular, we take the challenge posed by current small size COVID data and show how significant can be the bias introduced by transfer-learning using larger public non-COVID CXR datasets. We also contribute by providing results on a medium size COVID CXR dataset, just collected by one of the major emergency hospitals in Northern Italy during the peak of the COVID pandemic. These novel data allow us to contribute to validate the generalization capacity of preliminary results circulating in the scientific community. Our conclusions shed some light into the possibility to effectively discriminate COVID using CXR.Entities:
Keywords: COVID-19; chest X-ray; classification; deep learning
Mesh:
Year: 2020 PMID: 32971995 PMCID: PMC7557723 DOI: 10.3390/ijerph17186933
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1CXR pre-processing steps proposed.
Figure 2Original image (a) and extracted lung segmented image (b). Many possible bias sources like all the writings and medical equipment is naturally removed.
Figure 3Summary of the training strategy. The feature extractor is (optionally) pre-trained on CXR pathology datasets and then fine-tuned on the COVID datasets. The presence of gears involves training/fine-tuning for the specific part, while the lock implies that part is not modified.
Composition of datasets used for training and test are shown in the rows, with details on number of images taken from different sources in the columns; the COVID-positive samples are indicated as “+” while the negative ones with “—”. For better readability, each data source is identified by a letter: CORDA (A), ChestXRay (B), RSNA (C) and COVID-ChestXRay (D).
| COMPOSED DATASET | ORIGINAL DATASETS | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| A | C | B | D | TOTAL | |||||||
| + | — | + | — | + | — | + | — | + | — | ||
|
| train | 126 | 105 | - | - | - | - | - | - | 126 | 105 |
| test | 90 | 45 | - | - | - | - | - | - | 90 | 45 | |
|
| train | 207 | 105 | - | - | - | 102 | - | - | 207 | 207 |
| test | 90 | 45 | - | - | - | 45 | - | - | 90 | 90 | |
|
| train | 207 | 105 | - | 102 | - | - | - | - | 207 | 207 |
| test | 90 | 45 | - | 45 | - | - | - | - | 90 | 90 | |
|
| train | 116 | 105 | - | - | - | - | 49 | 24 | 165 | 129 |
| test | 90 | 45 | - | - | - | - | 10 | 5 | 100 | 50 | |
|
| train | - | - | - | - | - | - | 98 | 24 | 98 | 24 |
| test | - | - | - | - | - | - | 10 | 5 | 10 | 5 | |
Summary for some results obtained over a number of architectures trained on various combinations of datasets. Dataset naming follows Table 1.
| Architecture | Pre-Trained | Training | Test | Sensitivity | Specificity | BA | AUC | DOR |
|---|---|---|---|---|---|---|---|---|
| ResNet-18 | none | AB | AB | 0.88 | 0.94 | 0.91 | 0.97 | 112.93 |
| none | AB | AD | 0.87 | 0.20 | 0.54 | 0.60 | 1.67 | |
| none | A | A | 0.56 | 0.58 | 0.57 | 0.59 | 1.71 | |
| B | AB | AB | 0.88 | 0.94 | 0.91 | 0.97 | 122.67 | |
| B | AB | A | 0.88 | 0.24 | 0.56 | 0.66 | 2.32 | |
| C | A | A | 0.54 | 0.80 | 0.67 | 0.72 | 4.78 | |
| C | AB | AB | 0.82 | 0.95 | 0.89 | 0.97 | 89.14 | |
| C | AB | A | 0.82 | 0.38 | 0.60 | 0.63 | 2.81 | |
| B | D | A | 0.91 | 0.20 | 0.56 | 0.61 | 2.56 | |
| B | D | D | 1.00 | 1.00 | 1.00 | 1.00 |
| |
| ResNet-50 | B | D | AB | 0.98 | 0.72 | 0.85 | 0.90 | 112.57 |
| B | D | AC | 0.98 | 0.11 | 0.55 | 0.61 | 5.59 | |
| B | D | AD | 0.98 | 0.20 | 0.59 | 0.65 | 12.25 | |
| COVID-Net | B | D | A | 0.12 | 0.98 | 0.55 | 0.55 | 6.68 |
| B | D | D | 0.90 | 0.80 | 0.85 | 0.85 | 36.00 | |
| DenseNet-121 | B | D | A | 0.99 | 0.07 | 0.53 | 0.61 | 6.36 |
| B | D | D | 1.00 | 0.80 | 0.90 | 0.98 |
|
Figure 4t-SNE on ChestXRay trained encoder. (a) shows ChestXRay data vs. CORDA data, (b) instead shows RSNA vs. CORDA.
Results obtained training a ResNet-18 model (no pre-trained encoder). Dataset naming follows Table 1.
| Training | Test | Sensitivity | Specificity | F-Score | Accuracy | BA | AUC | DOR |
|---|---|---|---|---|---|---|---|---|
| AC | A | 0.56 | 0.42 | 0.60 | 0.51 | 0.49 | 0.52 | 0.91 |
| AB | 0.56 | 0.22 | 0.15 | 0.26 | 0.39 | 0.33 | 0.36 | |
| AC | 0.56 | 0.96 | 0.49 | 0.95 | 0.76 | 0.95 | 34.23 | |
| AD | 0.52 | 0.48 | 0.58 | 0.51 | 0.50 | 0.53 | 1.00 | |
| A | A | 0.56 | 0.58 | 0.63 | 0.56 | 0.57 | 0.59 | 1.71 |
| AB | 0.56 | 0.37 | 0.18 | 0.39 | 0.46 | 0.43 | 0.74 | |
| AC | 0.56 | 0.38 | 0.08 | 0.39 | 0.47 | 0.46 | 0.76 | |
| AD | 0.56 | 0.58 | 0.63 | 0.57 | 0.57 | 0.59 | 1.76 | |
| AD | A | 0.58 | 0.64 | 0.66 | 0.60 | 0.61 | 0.63 | 2.48 |
| AB | 0.58 | 0.63 | 0.27 | 0.63 | 0.61 | 0.63 | 2.37 | |
| AC | 0.58 | 0.54 | 0.11 | 0.54 | 0.56 | 0.58 | 1.62 | |
| AD | 0.57 | 0.66 | 0.66 | 0.60 | 0.61 | 0.64 | 2.57 | |
| D | A | 0.91 | 0.11 | 0.77 | 0.64 | 0.51 | 0.54 | 1.28 |
| AB | 0.91 | 0.66 | 0.41 | 0.69 | 0.78 | 0.87 | 19.56 | |
| AC | 0.91 | 0.11 | 0.09 | 0.14 | 0.51 | 0.45 | 1.22 | |
| AD | 0.91 | 0.18 | 0.78 | 0.67 | 0.55 | 0.58 | 2.22 | |
| AB | A | 0.88 | 0.18 | 0.77 | 0.64 | 0.53 | 0.58 | 1.55 |
| AB | 0.88 | 0.94 | 0.76 | 0.93 | 0.91 | 0.97 | 112.93 | |
| AC | 0.88 | 0.14 | 0.09 | 0.17 | 0.51 | 0.42 | 1.14 | |
| AD | 0.87 | 0.20 | 0.77 | 0.65 | 0.54 | 0.60 | 1.67 |
Results obtained training a ResNet-18 model. Dataset naming follows Table 1.
| Pre-Trained | Training | Test | Sensitivity | Specificity | F-Score | Accuracy | BA | AUC | DOR |
|---|---|---|---|---|---|---|---|---|---|
| C | AC | A | 0.68 | 0.44 | 0.69 | 0.60 | 0.56 | 0.61 | 1.68 |
| AB | 0.68 | 0.22 | 0.18 | 0.27 | 0.45 | 0.49 | 0.59 | ||
| AC | 0.68 | 0.90 | 0.37 | 0.89 | 0.79 | 0.90 | 19.82 | ||
| AD | 0.67 | 0.50 | 0.70 | 0.61 | 0.58 | 0.63 | 2.03 | ||
| A | A | 0.54 | 0.80 | 0.66 | 0.63 | 0.67 | 0.72 | 4.78 | |
| AB | 0.54 | 0.31 | 0.16 | 0.34 | 0.43 | 0.48 | 0.54 | ||
| AC | 0.54 | 0.55 | 0.10 | 0.55 | 0.55 | 0.61 | 1.48 | ||
| AD | 0.57 | 0.76 | 0.67 | 0.63 | 0.67 | 0.72 | 4.20 | ||
| AD | A | 0.70 | 0.49 | 0.72 | 0.63 | 0.59 | 0.67 | 2.23 | |
| AB | 0.70 | 0.30 | 0.20 | 0.34 | 0.50 | 0.59 | 0.98 | ||
| AC | 0.70 | 0.37 | 0.10 | 0.39 | 0.53 | 0.61 | 1.37 | ||
| AD | 0.71 | 0.52 | 0.73 | 0.65 | 0.61 | 0.70 | 2.65 | ||
| D | A | 0.94 | 0.09 | 0.79 | 0.66 | 0.52 | 0.57 | 1.66 | |
| AB | 0.94 | 0.61 | 0.39 | 0.65 | 0.78 | 0.92 | 26.24 | ||
| AC | 0.94 | 0.08 | 0.09 | 0.12 | 0.51 | 0.58 | 1.50 | ||
| AD | 0.95 | 0.14 | 0.80 | 0.68 | 0.54 | 0.62 | 3.09 | ||
| AB | A | 0.82 | 0.38 | 0.77 | 0.67 | 0.60 | 0.63 | 2.81 | |
| AB | 0.82 | 0.95 | 0.75 | 0.94 | 0.89 | 0.97 | 89.14 | ||
| AC | 0.82 | 0.30 | 0.10 | 0.32 | 0.56 | 0.59 | 1.98 | ||
| AD | 0.83 | 0.38 | 0.78 | 0.68 | 0.60 | 0.64 | 2.99 | ||
| B | AC | A | 0.86 | 0.31 | 0.78 | 0.67 | 0.58 | 0.60 | 2.67 |
| AB | 0.86 | 0.29 | 0.24 | 0.36 | 0.58 | 0.48 | 2.47 | ||
| AC | 0.86 | 0.95 | 0.61 | 0.95 | 0.90 | 0.97 | 122.64 | ||
| AD | 0.82 | 0.38 | 0.77 | 0.67 | 0.60 | 0.61 | 2.79 | ||
| A | A | 0.54 | 0.58 | 0.62 | 0.56 | 0.56 | 0.67 | 1.64 | |
| AB | 0.54 | 0.37 | 0.17 | 0.39 | 0.46 | 0.49 | 0.70 | ||
| AC | 0.54 | 0.73 | 0.15 | 0.72 | 0.64 | 0.72 | 3.21 | ||
| AD | 0.56 | 0.62 | 0.64 | 0.58 | 0.59 | 0.70 | 2.08 | ||
| AD | A | 0.71 | 0.49 | 0.72 | 0.64 | 0.60 | 0.67 | 2.35 | |
| AB | 0.71 | 0.25 | 0.20 | 0.31 | 0.48 | 0.51 | 0.83 | ||
| AC | 0.71 | 0.47 | 0.11 | 0.48 | 0.59 | 0.64 | 2.16 | ||
| AD | 0.73 | 0.52 | 0.74 | 0.66 | 0.62 | 0.70 | 2.93 | ||
| D | A | 0.91 | 0.20 | 0.79 | 0.67 | 0.56 | 0.61 | 2.56 | |
| AB | 0.91 | 0.70 | 0.44 | 0.73 | 0.81 | 0.89 | 24.38 | ||
| AC | 0.91 | 0.15 | 0.09 | 0.19 | 0.53 | 0.55 | 1.83 | ||
| AD | 0.92 | 0.28 | 0.81 | 0.71 | 0.60 | 0.66 | 4.47 | ||
| AB | A | 0.88 | 0.24 | 0.78 | 0.67 | 0.56 | 0.66 | 2.32 | |
| AB | 0.88 | 0.94 | 0.77 | 0.94 | 0.91 | 0.97 | 122.67 | ||
| AC | 0.88 | 0.24 | 0.10 | 0.27 | 0.56 | 0.67 | 2.26 | ||
| AD | 0.88 | 0.26 | 0.78 | 0.67 | 0.57 | 0.68 | 2.58 |
Results obtained training a ResNet-50 model. Dataset naming follows Table 1.
| Pre-Trained | Training | Test | Sensitivity | Specificity | F-Score | Accuracy | BA | AUC | DOR |
|---|---|---|---|---|---|---|---|---|---|
| C | AC | A | 0.74 | 0.49 | 0.74 | 0.66 | 0.62 | 0.65 | 2.79 |
| AB | 0.74 | 0.40 | 0.24 | 0.44 | 0.57 | 0.64 | 1.92 | ||
| AC | 0.74 | 0.92 | 0.43 | 0.91 | 0.83 | 0.93 | 31.76 | ||
| AD | 0.70 | 0.54 | 0.73 | 0.65 | 0.62 | 0.66 | 2.74 | ||
| A | A | 0.61 | 0.71 | 0.70 | 0.64 | 0.66 | 0.67 | 3.87 | |
| AB | 0.61 | 0.40 | 0.20 | 0.43 | 0.51 | 0.53 | 1.06 | ||
| AC | 0.61 | 0.58 | 0.12 | 0.58 | 0.60 | 0.63 | 2.20 | ||
| AD | 0.62 | 0.74 | 0.71 | 0.66 | 0.68 | 0.69 | 4.64 | ||
| AD | A | 0.53 | 0.64 | 0.62 | 0.57 | 0.59 | 0.64 | 2.07 | |
| AB | 0.53 | 0.56 | 0.22 | 0.56 | 0.55 | 0.58 | 1.47 | ||
| AC | 0.53 | 0.57 | 0.10 | 0.57 | 0.55 | 0.58 | 1.53 | ||
| AD | 0.55 | 0.68 | 0.64 | 0.59 | 0.61 | 0.66 | 2.60 | ||
| D | A | 0.97 | 0.04 | 0.79 | 0.66 | 0.51 | 0.57 | 1.35 | |
| AB | 0.97 | 0.45 | 0.32 | 0.51 | 0.71 | 0.89 | 23.29 | ||
| AC | 0.97 | 0.09 | 0.09 | 0.13 | 0.53 | 0.56 | 2.91 | ||
| AD | 0.97 | 0.10 | 0.80 | 0.68 | 0.54 | 0.62 | 3.59 | ||
| AB | A | 0.76 | 0.33 | 0.72 | 0.61 | 0.54 | 0.65 | 1.55 | |
| AB | 0.76 | 0.95 | 0.72 | 0.93 | 0.85 | 0.97 | 63.61 | ||
| AC | 0.76 | 0.36 | 0.10 | 0.38 | 0.56 | 0.63 | 1.75 | ||
| AD | 0.76 | 0.32 | 0.72 | 0.61 | 0.54 | 0.64 | 1.49 | ||
| B | AC | A | 0.73 | 0.40 | 0.72 | 0.62 | 0.57 | 0.58 | 1.83 |
| AB | 0.73 | 0.25 | 0.20 | 0.31 | 0.49 | 0.44 | 0.92 | ||
| AC | 0.73 | 0.96 | 0.58 | 0.95 | 0.85 | 0.97 | 68.71 | ||
| AD | 0.70 | 0.46 | 0.71 | 0.62 | 0.58 | 0.60 | 1.99 | ||
| A | A | 0.64 | 0.56 | 0.69 | 0.61 | 0.60 | 0.65 | 2.27 | |
| AB | 0.64 | 0.49 | 0.24 | 0.51 | 0.57 | 0.61 | 1.72 | ||
| AC | 0.64 | 0.63 | 0.14 | 0.63 | 0.64 | 0.69 | 3.06 | ||
| AD | 0.67 | 0.60 | 0.72 | 0.65 | 0.64 | 0.69 | 3.05 | ||
| AD | A | 0.63 | 0.38 | 0.65 | 0.55 | 0.51 | 0.63 | 1.05 | |
| AB | 0.63 | 0.46 | 0.22 | 0.48 | 0.55 | 0.61 | 1.46 | ||
| AC | 0.63 | 0.62 | 0.14 | 0.62 | 0.63 | 0.70 | 2.86 | ||
| AD | 0.65 | 0.44 | 0.67 | 0.58 | 0.55 | 0.66 | 1.46 | ||
| D | A | 0.98 | 0.13 | 0.81 | 0.70 | 0.56 | 0.61 | 6.77 | |
| AB | 0.98 | 0.72 | 0.48 | 0.75 | 0.85 | 0.90 | 112.57 | ||
| AC | 0.98 | 0.11 | 0.10 | 0.15 | 0.55 | 0.61 | 5.59 | ||
| AD | 0.98 | 0.20 | 0.82 | 0.72 | 0.59 | 0.65 | 12.25 | ||
| AB | A | 0.81 | 0.29 | 0.75 | 0.64 | 0.55 | 0.64 | 1.74 | |
| AB | 0.81 | 0.94 | 0.73 | 0.93 | 0.88 | 0.97 | 73.35 | ||
| AC | 0.81 | 0.25 | 0.09 | 0.28 | 0.53 | 0.57 | 1.43 | ||
| AD | 0.80 | 0.30 | 0.74 | 0.63 | 0.55 | 0.64 | 1.71 |
Results obtained training a DenseNet-121 model. Dataset naming follows Table 1.
| Pre-Trained | Training | Test | Sensitivity | Specificity | F-Score | Accuracy | BA | AUC | DOR |
|---|---|---|---|---|---|---|---|---|---|
| C | AC | A | 0.68 | 0.51 | 0.71 | 0.62 | 0.59 | 0.64 | 2.20 |
| AB | 0.68 | 0.22 | 0.18 | 0.27 | 0.45 | 0.43 | 0.58 | ||
| AC | 0.68 | 0.93 | 0.44 | 0.92 | 0.80 | 0.93 | 27.98 | ||
| AD | 0.67 | 0.54 | 0.71 | 0.63 | 0.60 | 0.65 | 2.38 | ||
| A | A | 0.77 | 0.38 | 0.74 | 0.64 | 0.57 | 0.63 | 1.99 | |
| AB | 0.77 | 0.08 | 0.18 | 0.16 | 0.42 | 0.31 | 0.29 | ||
| AC | 0.77 | 0.37 | 0.11 | 0.39 | 0.57 | 0.62 | 1.97 | ||
| AD | 0.77 | 0.42 | 0.75 | 0.65 | 0.59 | 0.66 | 2.42 | ||
| AD | A | 0.60 | 0.64 | 0.68 | 0.61 | 0.62 | 0.68 | 2.72 | |
| AB | 0.60 | 0.36 | 0.19 | 0.39 | 0.48 | 0.51 | 0.84 | ||
| AC | 0.60 | 0.54 | 0.11 | 0.54 | 0.57 | 0.63 | 1.73 | ||
| AD | 0.61 | 0.68 | 0.69 | 0.63 | 0.65 | 0.71 | 3.32 | ||
| D | A | 0.87 | 0.11 | 0.75 | 0.61 | 0.49 | 0.62 | 0.81 | |
| AB | 0.87 | 0.37 | 0.26 | 0.43 | 0.62 | 0.70 | 3.80 | ||
| AC | 0.87 | 0.11 | 0.09 | 0.14 | 0.49 | 0.49 | 0.79 | ||
| AD | 0.88 | 0.18 | 0.77 | 0.65 | 0.53 | 0.66 | 1.61 | ||
| AB | A | 0.81 | 0.31 | 0.75 | 0.64 | 0.56 | 0.67 | 1.94 | |
| AB | 0.81 | 0.93 | 0.71 | 0.92 | 0.87 | 0.97 | 61.00 | ||
| AC | 0.81 | 0.13 | 0.08 | 0.16 | 0.47 | 0.47 | 0.62 | ||
| AD | 0.82 | 0.30 | 0.76 | 0.65 | 0.56 | 0.67 | 1.95 | ||
| B | AC | A | 0.67 | 0.56 | 0.71 | 0.63 | 0.61 | 0.66 | 2.50 |
| AB | 0.67 | 0.36 | 0.21 | 0.39 | 0.51 | 0.48 | 1.11 | ||
| AC | 0.67 | 0.98 | 0.63 | 0.96 | 0.82 | 0.98 | 90.25 | ||
| AD | 0.62 | 0.60 | 0.68 | 0.61 | 0.61 | 0.66 | 2.45 | ||
| A | A | 0.63 | 0.62 | 0.70 | 0.63 | 0.63 | 0.70 | 2.84 | |
| AB | 0.63 | 0.34 | 0.19 | 0.37 | 0.49 | 0.52 | 0.88 | ||
| AC | 0.63 | 0.45 | 0.10 | 0.46 | 0.54 | 0.59 | 1.42 | ||
| AD | 0.66 | 0.64 | 0.72 | 0.65 | 0.65 | 0.73 | 3.45 | ||
| AD | A | 0.63 | 0.62 | 0.70 | 0.63 | 0.63 | 0.68 | 2.84 | |
| AB | 0.63 | 0.47 | 0.23 | 0.49 | 0.55 | 0.63 | 1.56 | ||
| AC | 0.63 | 0.61 | 0.13 | 0.61 | 0.62 | 0.70 | 2.74 | ||
| AD | 0.65 | 0.66 | 0.71 | 0.65 | 0.66 | 0.71 | 3.61 | ||
| D | A | 0.99 | 0.07 | 0.81 | 0.68 | 0.53 | 0.61 | 6.36 | |
| AB | 0.99 | 0.62 | 0.41 | 0.66 | 0.80 | 0.91 | 142.68 | ||
| AC | 0.99 | 0.04 | 0.09 | 0.08 | 0.51 | 0.53 | 3.41 | ||
| AD | 0.99 | 0.14 | 0.82 | 0.71 | 0.56 | 0.65 | 16.12 | ||
| AB | A | 0.78 | 0.44 | 0.76 | 0.67 | 0.61 | 0.69 | 2.80 | |
| AB | 0.78 | 0.96 | 0.75 | 0.94 | 0.87 | 0.97 | 86.56 | ||
| AC | 0.78 | 0.37 | 0.11 | 0.39 | 0.57 | 0.66 | 2.02 | ||
| AD | 0.80 | 0.50 | 0.78 | 0.70 | 0.65 | 0.72 | 4.00 |