| Literature DB >> 35788644 |
Abhibha Gupta1, Parth Sheth2, Pengtao Xie3.
Abstract
Pneumonia is one of the diseases that causes the most fatalities worldwide, especially in children. Recently, pneumonia-caused deaths have increased dramatically due to the novel Coronavirus global pandemic. Chest X-ray (CXR) images are one of the most readily available and common imaging modality for the detection and identification of pneumonia. However, the detection of pneumonia from chest radiography is a difficult task even for experienced radiologists. Artificial Intelligence (AI) based systems have great potential in assisting in quick and accurate diagnosis of pneumonia from chest X-rays. The aim of this study is to develop a Neural Architecture Search (NAS) method to find the best convolutional architecture capable of detecting pneumonia from chest X-rays. We propose a Learning by Teaching framework inspired by the teaching-driven learning methodology from humans, and conduct experiments on a pneumonia chest X-ray dataset with over 5000 images. Our proposed method yields an area under ROC curve (AUC) of 97.6% for pneumonia detection, which improves upon previous NAS methods by 5.1% (absolute).Entities:
Mesh:
Year: 2022 PMID: 35788644 PMCID: PMC9252574 DOI: 10.1038/s41598-022-15341-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Some randomly sampled X-rays (with a size of 128 × 128) containing pneumonia.
Comparison between our method and baselines.
| Model | Sensitivity (%) | Specificity (%) | F1 (%) | AUC (%) | Accuracy (%) | Model size | Training time (h) | Inference time (ms) |
|---|---|---|---|---|---|---|---|---|
| VGG19[ | 92.7 ± 0.68 | 92.4 ± 0.93 | 93.0 ± 0.59 | 93.9 ± 0.81 | 92.7 ± 0.84 | 731 | 2.3 | 69.2 |
| InceptionV3[ | 91.8 ± 0.49 | 92.2 ± 0.70 | 91.4 ± 0.76 | 92.8 ± 0.55 | 92.6 ± 0.92 | 502 | 2.1 | 38.6 |
| DenseNet121[ | 93.8 ± 0.87 | 91.7 ± 0.92 | 92.4 ± 0.96 | 93.8 ± 0.53 | 93.1 ± 0.97 | 537 | 2.1 | 87.2 |
| AlexNet[ | 92.5 ± 1.04 | 92.7 ± 0.85 | 92.1 ± 0.62 | 94.1 ± 0.62 | 92.7 ± 0.73 | 433 | 2.0 | 32.7 |
| VGG16[ | 90.9 ± 0.75 | 94.1 ± 0.68 | 91.8 ± 1.15 | 94.3 ± 0.47 | 92.5 ± 0.61 | 737 | 2.2 | 55.3 |
| Xception[ | 90.7 ± 0.59 | 92.3 ± 0.71 | 93.6 ± 0.74 | 93.4 ± 0.62 | 92.1 ± 0.73 | 241 | 1.8 | 146.9 |
| GoogLeNet[ | 90.7 ± 1.03 | 92.5 ± 0.91 | 91.8 ± 0.72 | 95.4 ± 0.37 | 93.4 ± 0.85 | 87 | 1.5 | 38.1 |
| LeNet5[ | 84.6 ± 0.72 | 85.9 ± 0.55 | 85.4 ± 0.59 | 88.7 ± 0.36 | 89.1 ± 0.42 | 11.1 | 0.2 | 28.0 |
| Kermany et al.[ | 92.8 ± 0.59 | 92.2 ± 0.57 | 92.5 ± 0.96 | 93.7 ± 0.69 | 93.0 ± 0.68 | 403 | 2.2 | 172.6 |
| Stephen et al.[ | 92.4 ± 0.71 | 92.7 ± 0.38 | 92.4 ± 0.96 | 94.2 ± 0.71 | 93.7 ± 0.62 | 61 | 1.6 | 147.0 |
| Siddiqi[ | 94.7 ± 0.42 | 93.1 ± 1.33 | 92.7 ± 0.61 | 93.9 ± 0.33 | 93.5 ± 0.74 | 274 | 1.7 | 210.6 |
| Liang et al.[ | 89.5 ± 0.62 | 91.7 ± 0.73 | 89.9 ± 1.04 | 92.2 ± 0.68 | 92.3 ± 0.95 | 1.9 | 187.3 | |
| Meta Pseudo Label[ | 90.6 ± 0.74 | 92.3 ± 0.58 | 91.7 ± 0.94 | 93.2 ± 0.63 | 91.8 ± 0.71 | 69 | 1.7 | 162.5 |
| Liu et al.[ | 92.0 ± 0.58 | 92.7 ± 0.84 | 92.4 ± 0.81 | 93.4 ± 0.45 | 92.4 ± 0.74 | 35 | 1.4 | 85.2 |
| Kundu et al.[ | 92.4 ± 1.05 | 91.6 ± 0.69 | 91.8 ± 0.95 | 93.1 ± 0.52 | 91.9 ± 0.50 | 195 | 2.1 | 141.7 |
| Cha et al.[ | 92.1 ± 0.62 | 91.3 ± 0.62 | 91.4 ± 0.77 | 93.2 ± 0.68 | 92.0 ± 0.59 | 131 | 1.7 | 196.3 |
| DARTS[ | 88.9 ± 0.71 | 89.2 ± 0.95 | 90.1 ± 0.62 | 93.0 ± 0.85 | 89.8 ± 0.75 | 11.4 | 0.9 | 28.7 |
| LBT-DARTS (ours) | 93.0 ± 0.42 | 93.2 ± 0.86 | 92.8 ± 0.75 | 94.9 ± 0.82 | 93.3 ± 0.61 | 11.2 | 0.9 | 28.5 |
| PC-DARTS[ | 93.2 ± 0.84 | 90.9 ± 0.95 | 91.8 ± 0.62 | 92.5 ± 0.60 | 91.4 ± 0.75 | 11.3 | 0.1 | 28.5 |
| LBT-PC-DARTS (ours) | 0.1 |
Model size is in MB. Training time is in GPU hours (h). Inference time is in milliseconds (ms).
Significant values are in bold.
Comparison between our method and three junior radiologists.
| Accuracy (%) | |
|---|---|
| VGG19[ | 89.7 |
| InceptionV3[ | 90.4 |
| DenseNet121[ | 91.1 |
| AlexNet[ | 89.2 |
| VGG16[ | 90.5 |
| Xception[ | 89.6 |
| GoogLeNet[ | 88.4 |
| LeNet5[ | 82.0 |
| Kermany et al.[ | 91.0 |
| Stephen et al.[ | 90.9 |
| Siddiqi[ | 91.3 |
| Liang et al.[ | 88.7 |
| Meta Pseudo Label[ | 89.1 |
| Liu et al.[ | 90.4 |
| Kundu et al.[ | 91.2 |
| Cha et al.[ | 90.7 |
| DARTS[ | 88.5 |
| LBT-DARTS (ours) | 91.8 |
| PC-DARTS[ | 89.2 |
| LBT-PC-DARTS (ours) | 94.2 |
| Radiologist 1 | 94.5 |
| Radiologist 2 | 94.7 |
| Radiologist 3 | 94.4 |
Ablation studies.
| Ablation studies | Accuracy (%) |
|---|---|
| LBT-PC-DARTS (ours) | |
| Ablation setting 1 | 94.3 |
| Ablation setting 2 | 95.1 |
Significant values are in bold.
Figure 2Top row: accuracy of LBT-PC-DARTS under different values of the tradeoff parameter and . Middle row: normal cell (left) and reduction cell (right) searched by LBT-DARTS. Bottom row: normal cell (left) and reduction cell (right) searched by LBT-PC-DARTS.
Figure 3Column (a) shows original CXR images. Column (b) shows Grad-CAM visualization of saliency maps of LBT-PCDARTS. Column (c) shows the overlay of saliency maps on original images.
Figure 4Correct and incorrect predictions made by LBT based PC-DARTS.
Figure 5Train and validation accuracy values across epochs during the training process of LBT based PC-DARTS.