| Literature DB >> 35453963 |
Sungyeup Kim1, Beanbonyka Rim1, Seongjun Choi2, Ahyoung Lee3, Sedong Min4, Min Hong5.
Abstract
Chest X-ray radiographic (CXR) imagery enables earlier and easier lung disease diagnosis. Therefore, in this paper, we propose a deep learning method using a transfer learning technique to classify lung diseases on CXR images to improve the efficiency and accuracy of computer-aided diagnostic systems' (CADs') diagnostic performance. Our proposed method is a one-step, end-to-end learning, which means that raw CXR images are directly inputted into a deep learning model (EfficientNet v2-M) to extract their meaningful features in identifying disease categories. We experimented using our proposed method on three classes of normal, pneumonia, and pneumothorax of the U.S. National Institutes of Health (NIH) data set, and achieved validation performances of loss = 0.6933, accuracy = 82.15%, sensitivity = 81.40%, and specificity = 91.65%. We also experimented on the Cheonan Soonchunhyang University Hospital (SCH) data set on four classes of normal, pneumonia, pneumothorax, and tuberculosis, and achieved validation performances of loss = 0.7658, accuracy = 82.20%, sensitivity = 81.40%, and specificity = 94.48%; testing accuracy of normal, pneumonia, pneumothorax, and tuberculosis classes was 63.60%, 82.30%, 82.80%, and 89.90%, respectively.Entities:
Keywords: EfficientNet v2; chest X-ray image; deep learning; multi-class classification; transfer learning
Year: 2022 PMID: 35453963 PMCID: PMC9025806 DOI: 10.3390/diagnostics12040915
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
NIH data set.
| Class | Training Set | Validation Set | Total |
|---|---|---|---|
| Normal | 2676 | 669 | 3345 |
| Pneumonia | 1114 | 278 | 1392 |
| Pneumothorax | 4210 | 1053 | 5263 |
| Total | 8000 | 2000 | 10,000 |
Figure 1CXR images of NIH data set: (a) image of normal class; (b) histogram of normal class; (c) image of pneumonia class; (d) histogram of pneumonia class; (e) image of pneumothorax class; (f) histogram of pneumothorax class.
SCH data set.
| Class | Training Set | Validation Set | Testing Set | Total |
|---|---|---|---|---|
| Normal | 10,203 | 500 | 1000 | 11,703 |
| Pneumonia | 7080 | 500 | 1000 | 8580 |
| Pneumothorax | 4631 | 500 | 1000 | 6131 |
| Tuberculosis | 7368 | 500 | 1000 | 8868 |
| Total | 29,282 | 2000 | 4000 | 35,282 |
Figure 2CXR images of SCH data set: (a) image of normal class; (b) histogram of normal class; (c) image of pneumonia class; (d) histogram of pneumonia class; (e) image of pneumothorax class; (f) histogram of pneumothorax class; (g) image of tuberculosis class; (h) histogram of tuberculosis class.
Figure 3Cropped CXR images of the SCH data set: (a) original image; (b) histogram of original image; (c) bottom-cropped image; (d) histogram of bottom-cropped image.
Figure 4Augmented images were generated randomly using the RandAugment technique.
Combination samples of the original and augmented data of the training set.
| Data Set | Original Set | Augmented Set | Total |
|---|---|---|---|
| NIH | 8000 | 8000 | 16,000 |
| SCH | 29,282 | 29,282 | 58,564 |
Figure 5Our proposed transfer learning pipeline.
Family models of EfficientNet v2 training on ImageNet.
| Model | Accuracy | Param # | FLOPs | Inference | Train |
|---|---|---|---|---|---|
| EfficientNet v2-S | 84.90% | 22 M | 8.8 B | 24 ms | 9 ms |
| EfficientNet v2-M | 86.20% | 54 M | 24 B | 57 ms | 15 ms |
| EfficientNet v2-L | 86.80% | 120 M | 53 B | 98 ms | 26 ms |
| EfficientNet v2-XL | 87.30% | 208 M | 94 B | - | 45 ms |
Param # refers to number of parameters.
Our target model trained on the new top layers of the NIH or SCH data set.
| Layer | Output Shape | Param # |
|---|---|---|
| Input | (None, 600, 600, 3) | 0 |
| Efficientnetv2-m | (None, 19, 19, 1280) | 53,150,388 |
| Global_average_pooling2d | (None, 1280) | 0 |
| Dropout | (None, 1280) | 0 |
| Dense | (None, 3) or (None, 4) | 3843 or 5124 |
| Total params: 53,154,231 or 53,155,512 | ||
| Trainable params: 3843 or 5124 | ||
| Non-trainable params: 53,150,388 | ||
Param # refers to number of parameters.
Our target model trained on a complete model (base model and new top layers) of the NIH or SCH data set.
| Layer | Output Shape | Param # |
|---|---|---|
| Input | (None, 600, 600, 3) | 0 |
| Efficientnetv2-m | (None, 19, 19, 1280) | 53,150,388 |
| Global_average_pooling2d | (None, 1280) | 0 |
| Dropout | (None, 1280) | 0 |
| Dense | (None, 3) or (None, 4) | 3843 or 5124 |
| Total params: 53,154,231 or 53,155,512 | ||
| Trainable params: 52,862,199 or 52,863,480 | ||
| Non-trainable params: 292,032 | ||
Param # refers to number of parameters.
Class weights of the training set of the NIH and SCH data sets.
| Data Set | Class | Training Set | Class Weight |
|---|---|---|---|
| NIH | Normal | 5352 | 0.9965 |
| Pneumonia | 2228 | 2.3937 | |
| Pneumothorax | 8420 | 0.6334 | |
| SCH | Normal | 20,406 | 0.7537 |
| Pneumonia | 14,160 | 1.0280 | |
| Pneumothorax | 9262 | 1.4387 | |
| Tuberculosis | 14,736 | 0.9946 |
Summary of our empirical hyperparameters.
| Parameters | Value |
|---|---|
| Image shape | 600 × 600 × 3 |
| Pixel normalization | [−1, +1] |
| Data augmentation | RandAugment (N = 5, M = 20) |
| Base model | EfficientNet v2-M (pre-trained weights = ImageNet) |
| Model regular | Dropout (rate = 0.4) |
| Model optimizer | Lookahead, Rectified Adam (clip norm = 1) |
| Warm-up proportion | 0.1 |
| Learning rate | 1st train = [1e-3, 1e-4], 2nd train = [1e-4, 1e-6] |
| Loss | Categorical cross entropy (label smoothing = 0.1) |
| Classifier | Softmax |
| Class | NIH data set = 3, SCH data set = 4 |
| Epoch | 1st train = 50, 2nd train = 25 |
| Batch size | 8 |
Figure 6Train and validation performances of the NIH data set on new top layers: (a) loss; (b) accuracy; (c) sensitivity; (d) specificity.
Figure 7Train and validation performances of the NIH data set on the complete model: (a) loss; (b) accuracy; (c) sensitivity; (d) specificity.
Summary of the validation performance of the NIH data set.
| Train | Loss | Accuracy | Sensitivity | Specificity |
|---|---|---|---|---|
| Train from scratch (1st train) | 0.7759 | 73.35% | 61.45% | 92.02% |
| Whole train (2nd train) | 0.6933 | 82.15% | 81.40% | 91.65% |
Figure 8Confusion matrix of the pneumonia, pneumothorax, and normal classes of the validation set of the NIH data set: (a) numerical; (b) percentage.
Figure 9Example predictions on the normal, pneumonia, and pneumothorax classes of the validation set of the NIH data set: (a) the first 25 images of the normal class; (b) 25 random images.
Figure 10Training and validation performance of the SCH data set on new top layers: (a) loss; (b) accuracy; (c) sensitivity; (d) specificity.
Figure 11Training and validating performance of the SCH data set on the complete model: (a) loss; (b) accuracy; (c) sensitivity; (d) specificity.
Summary of the validation performance of the SCH data set.
| Train | Loss | Accuracy | Sensitivity | Specificity |
|---|---|---|---|---|
| Train from scratch (1st train) | 1.1790 | 50.55% | 29.95% | 95.42% |
| Whole train (2nd train) | 0.7658 | 82.20% | 81.40% | 94.48% |
Figure 12Confusion matrix of the Pneumonia, Pneumothorax, Tuberculosis, and Normal class of testing set of the SCH data set: (a) numerical; (b) percentage.
Figure 13Example predictions for the normal, pneumonia, pneumothorax, and tuberculosis classes of the testing set of the SCH data set: (a) the first 25 images of the normal class; (b) 25 random images.
Performance comparison of three-class classification on validation set of NIH data set (%).
| Method | Model | Accuracy | Sensitivity | Specificity |
|---|---|---|---|---|
| Vanilla | VGG19 | 62.40 | 62.58 | 62.03 |
| Vanilla | DenseNet201 | 30.45 | 15.92 | 59.34 |
| Vanilla | EfficientNet B7 | 60.45 | 63.86 | 53.66 |
| Ours | EfficientNet v2-M |
|
|
|
Bold face represents higher score in column.
Performance comparison on multi-class classification.
| Paper | Data Set | Image | Class | Step # | Model | Performance |
|---|---|---|---|---|---|---|
| [ | In-house | CXR | Pneumonia, | 2 | U-Net, | |
| [ | NIH, | CXR | Pneumothorax, | 2 | U-Net, | |
| [ | In-house | CXR, | Pneumonia, | 1 | Inception-ResNetV2 | |
| [ | In-house | CXR | Pneumonia, | 1 | VGG19 | |
| [ | NIH, | CXR | Tuberculosis, | 1 | Xception | |
| [ | In-house | CXR | Tuberculosis, | 1 | DenseNet121 | |
| [ | NIH | CXR | Pneumonia, | 1 | EfficientNet B7 | |
| [ | In-house | CXR | Pneumonia, | 1 | EfficientNet B7 | |
| Ours | NIH | CXR | Pneumonia, | 1 | EfficientNet v2-M | |
| Ours | In-house | CXR | Pneumonia, | 1 | EfficientNet v2-M |
Step # refers to number of steps.
Performance comparison on multi-class classification on the NIH data set (%).
| Paper | Model | Accuracy | Sensitivity | Specificity |
|---|---|---|---|---|
| [ | EfficientNet B7 |
| 77.97 | 88.98 |
| Ours | EfficientNet v2–M | 82.15 |
|
|
Bold face represents higher score in column.