| Literature DB >> 34101042 |
Karim Hammoudi1,2, Halim Benhabiles3, Mahmoud Melkemi4,5, Fadi Dornaika6,7, Ignacio Arganda-Carreras6,7,8, Dominique Collard9,10, Arnaud Scherpereel11.
Abstract
Coronavirus disease 2019 (COVID-19) is an infectious disease with first symptoms similar to the flu. COVID-19 appeared first in China and very quickly spreads to the rest of the world, causing then the 2019-20 coronavirus pandemic. In many cases, this disease causes pneumonia. Since pulmonary infections can be observed through radiography images, this paper investigates deep learning methods for automatically analyzing query chest X-ray images with the hope to bring precision tools to health professionals towards screening the COVID-19 and diagnosing confirmed patients. In this context, training datasets, deep learning architectures and analysis strategies have been experimented from publicly open sets of chest X-ray images. Tailored deep learning models are proposed to detect pneumonia infection cases, notably viral cases. It is assumed that viral pneumonia cases detected during an epidemic COVID-19 context have a high probability to presume COVID-19 infections. Moreover, easy-to-apply health indicators are proposed for estimating infection status and predicting patient status from the detected pneumonia cases. Experimental results show possibilities of training deep learning models over publicly open sets of chest X-ray images towards screening viral pneumonia. Chest X-ray test images of COVID-19 infected patients are successfully diagnosed through detection models retained for their performances. The efficiency of proposed health indicators is highlighted through simulated scenarios of patients presenting infections and health problems by combining real and synthetic health data.Entities:
Keywords: COVID-19; Chest X-ray images (CXR); Coronavirus disease; Health scoring system; Pneumonia detection; Radiology
Mesh:
Year: 2021 PMID: 34101042 PMCID: PMC8185498 DOI: 10.1007/s10916-021-01745-4
Source DB: PubMed Journal: J Med Syst ISSN: 0148-5598 Impact factor: 4.460
Fig. 1Global workflow using deep learning for automatic detection of infection towards supporting COVID-19 screening from chest X-ray images. In a COVID-19 epidemic context, a detected viral pneumonia can particularly presume a COVID-19 infection
Fig. 2Global workflow using deep learning based for automatic estimation of a CNN-based infection rate indicator from chest X-ray images
Fig. 3Chest X-ray samples from the test datasets. Row 1 shows image categories. Row 2 shows various artifacts captured with chest X-rays such as writings (e.g.; letter “R”) and medical devices (e.g., tubes, sensors)
Scores related to age
| Age | Fatality risk ratio (Hubei, China) | Associated score ( |
|---|---|---|
| ≥ 80 | 18 | |
| 70–79 | 9.8% | |
| 60–69 | 4.6% | |
| 50–59 | 1.3% | |
| 40–49 | 0.4% | |
| 30–39 | 0.18% | |
| 20–29 | 0.09% | |
| 10–19 | 0.02% | |
| 0–9 | 0.01 |
The sources of the “Fatality risk ratio” are [19, 20]
Bold entries correspond to values of interest with respect to contextual information such as best performances
Scores related to disease
| Disease | Case fatality rate | Associated |
|---|---|---|
| (China) | score ( | |
| Cardiovascular disease | 10.5% | |
| Diabetes | 7.3% | |
| Chronic respiratory | 6.3% | |
| Hypertension | 6 % | |
| Cancer | 5.6 % | |
| No health condition | 0.9% |
The sources of the “Case fatality rate” are [21, 22]
Bold entries correspond to values of interest with respect to contextual information such as best performances
Comparison of average accuracies obtained on classification using some tailored CNN-based architectures
| Data/Network | Average accuracy (%) | ||||
|---|---|---|---|---|---|
| ResNet34 | ResNet50 | DenseNet169 | VGG-19 | Inception | |
| ResNetV2 & RNN | |||||
| Image size | (310 × 310), (310 × 273) | (310 × 310), (310 × 273) | (310 × 310), (310 × 273) | (224 × 224) | (300 × 300) |
| Raw resizing | 93.69 | 93.47 | 91.89 | 82.66 | – |
| Original ratio with padding | 90.54 | 93.92 | – | – | |
| Split in 9 & raw resizing | – | – | – | ||
Bold entries correspond to values of interest with respect to contextual information such as best performances
Classification performance obtained by testing our best trained architectures with two query image sets
| Accuracy (%) | ||||
|---|---|---|---|---|
| Query test set/Class ouput | Bacteria | Virus | Normal | Pneumonia |
| Dataset | 90.7 (Bact. vs Vir.) | 92.8 (Norm. vs Pneu.) | ||
| 3-class dataset | – | |||
| 3-class dataset | 87.84 | 81.08 | 79.05 | – |
| 3-class dataset | 86.49 | 84.46 | 63.52 | – |
| 3-class dataset | 90.54 | 83.78 | 66.89 | – |
| Sensitivity (%) | ||||
| Query test set/Class ouput | Virus | Pneumonia | ||
| 1-class blind set of varied COVID-19 images by DenseNet169 | 45.51 | 88.27 | ||
| 1-class blind set of varied COVID-19 images by Inception ResNetV2 & RNN (majority voting) | 95.12 | |||
| 1-class blind set of varied COVID-19 images by Inception ResNetV2 & RNN (by default) | 51.72 | |||
a Chest X-Ray Images (Pneumonia) dataset: https://www.kaggle.com/paultimothymooney/chest-xray-pneumonia
The first row reflects classification results of [14]
Bold entries correspond to values of interest with respect to contextual information such as best performances
Confusion matrix with DenseNet169
| Actual/Predicted | Bacteria | Normal | Virus |
|---|---|---|---|
| Bacteria | 1 | 2 | |
| Normal | 0 | 11 | |
| Virus | 4 | 1 |
Bold entries correspond to values of interest with respect to contextual information such as best performances
Fig. 4Histogram of classification accuracy obtained for each class by using varied architecture types with the Chest X-Ray Images (Pneumonia) dataset
CNN-derived infection rates estimated from real pairs of successive X-ray images for 5 COVID-19 infected patients
| Image pairs | Elapsed days | Observations | |||
|---|---|---|---|---|---|
| ( | |||||
| (1,3), Fig. 5 of [ | 9/9 | 7 | 9/9 | elderly man | |
| “improvements” | |||||
| (1,2), Fig. 1 of [ | 1/9 | 5 | 2/9 | 67-y-old woman | |
| “wires, attenuation” | |||||
| (1,3), Fig. 2 of [ | 5/9 | 8 | 5/9 | 36-y-old man | |
| “death” | |||||
| (1,2), case 1, Fig. 5 of [ | 6/9 | 1 | 9/9 | “worse status” | |
| (1,2), case 2, Fig. 5 of [ | 7/9 | 4 | 8/9 | “worse status” |
Examples of F values for 9 patients from synthetic data
| Patients | Aggravation factors: values | F | |||
|---|---|---|---|---|---|
| Patient 1 | Age: 65 | 25.5 | |||
| nb. of infected sub-image: 3 | 33.33 | 0.79415 (79.41%) | |||
| Cardiovascular disease | 100 | ||||
| Patient 2 | Age: 55 | 7.2 | |||
| nb. of infected sub-image: 4 | 44.44 | 0.7582 (75.82 %) | |||
| Cardiovascular disease | 100 | ||||
| Patient 3 | Age: 75 | 54.4 | |||
| nb. of infected sub-image: 1 | 11.11 | 0.59405 (59.40%) | |||
| Cancer | 53.3 | ||||
| Patient 4 | Age: 82 | 100 | |||
| nb. of infected sub-image: 1 | 11.11 | 1.05 (> 100%) | |||
| Cardiovascular disease | 100 | ||||
| Patient 5 | Age: 34 | 1 | |||
| nb. of infected sub-image: 5 | 55.55 | 0.63025 (63.02%) | |||
| Diabetes | 69.5 | ||||
| Patient 6 | Age: 18 | 0.1 | |||
| nb. of infected sub-image: 7 | 77.77 | 0.68935 (68.93%) | |||
| Chronic respiratory | 60 | ||||
| Patient 7 | Age: 13 | 0.1 | |||
| nb. of infected sub-image: 4 | 44.44 | 0.2657 (26.57%) | |||
| No health condition | 8.6 | ||||
| Patient 8 | Age: 56 | 7.2 | |||
| nb. of infected sub-image: 2 | 22.22 | 0.4326 (43.26%) | |||
| Hypertension | 57,1 | ||||
| Patient 9 | Age: 47 | 2.2 | |||
| nb. of infected sub-image: 3 | 33.33 | 0.2206 (22.06%) | |||
| No health condition | 8.6 |
Distribution of F values for a patient aged between 60 and 69 considering the possible diseases (rows) and rates of infection (columns from 0 to 9)
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | |
|---|---|---|---|---|---|---|---|---|---|---|
| No health condition | 17,05 | 22,60 | 28,16 | 33,71 | 39,27 | 44,82 | 50,38 | 55,93 | 61,49 | 67,04 |
| Cancer | 39,4 | 44,95 | 50,51 | 56,06 | 61,62 | 67,17 | 72,73 | 78,28 | 83,84 | 89,39 |
| Hypertension | 41,3 | 46,85 | 52,41 | 57,96 | 63,52 | 69,07 | 74,63 | 80,18 | 85,74 | 91,29 |
| Chronic respiratory | 42,75 | 48,30 | 53,86 | 59,41 | 64,97 | 70,52 | 76,08 | 81,63 | 87,19 | 92,74 |
| Diabetes | 47,5 | 53,05 | 58,61 | 64,16 | 69,72 | 75,27 | 80,83 | 86,38 | 91,94 | 97,49 |
| Cardiovascular | 62,75 | 68,30 | 73,86 | 84,97 | 90,52 | 96,08 | 101,63 | 107,19 | 112,74 |
Bold entries correspond to values of interest with respect to contextual information such as best performances
Fig. 5Graph obtained by representing the distribution of F values for a patient aged between 60 and 69 considering the possible diseases and rates of infection given in Table 8