| Literature DB >> 35626436 |
Muhammad Mujahid1, Furqan Rustam2, Roberto Álvarez3,4, Juan Luis Vidal Mazón3,5, Isabel de la Torre Díez6, Imran Ashraf7.
Abstract
Pneumonia is one of the leading causes of death in both infants and elderly people, with approximately 4 million deaths each year. It may be a virus, bacterial, or fungal, depending on the contagious pathogen that damages the lung's tiny air sacs (alveoli). Patients with underlying disorders such as asthma, a weakened immune system, hospitalized babies, and older persons on ventilators are all at risk, particularly if pneumonia is not detected early. Despite the existing approaches for its diagnosis, low accuracy and efficiency require further research for more accurate systems. This study is a similar endeavor for the detection of pneumonia by the use of X-ray images. The dataset is preprocessed to make it suitable for transfer learning tasks. Different pre-trained convolutional neural network (CNN) variants are utilized, including VGG16, Inception-v3, and ResNet50. Ensembles are made by incorporating CNN with Inception-V3, VGG-16, and ResNet50. Besides the common evaluation metrics, the performance of the pre-trained and ensemble deep learning models is measured with Cohen's kappa as well as the area under the curve (AUC). Experimental results show that Inception-V3 with CNN attained the highest accuracy and recall score of 99.29% and 99.73%, respectively.Entities:
Keywords: chest X-ray; deep learning; ensemble learning; pneumonia
Year: 2022 PMID: 35626436 PMCID: PMC9140837 DOI: 10.3390/diagnostics12051280
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Summary of the related work.
| Ref. | Dataset | Training | Testing | Classifiers/Models | Year | Accuracy |
|---|---|---|---|---|---|---|
| [ | Chest X-ray images (augmented) | 9000 | 419 | Different pre-trained CNN model | 2020 | 0.98 |
| [ | Chest X-ray images | 5232 | 624 | Ensemble model | 2020 | 97.4 |
| [ | Chest X-ray images | 5216 | 640 | CNN model | 2021 | 90.78 |
| [ | Chest X-ray images (rearranged) | 3722 | 2134 | Proposed CNN model | 2019 | 0.9373 |
| [ | COVID-19 X-ray images | 90% | 10% | Ensemble learning deep | 2020 | 99 |
| [ | Chest X-ray images | 80% | 20% | Ensemble model | 2021 | 86.3 |
| [ | Chest X-ray images (rearranged) | 4686 | 1170 | Ensemble deep learning model | 2021 | 98.81 |
| [ | COVID-19 and CXR images joined dataset | 6086 | Ensemble deep learning model | 2021 | 95.05 | |
| [ | CT chest COVID-19 images dataset | Ensemble deep learning | 2021 | 93.57 | ||
| [ | Heart disease dataset | 800 records | Ensemble deep learning | 2020 | 91 | |
Figure 1The workflow of the proposed methodology.
Figure 2Samples of CXR images. (a) Normal and (b) pneumonia.
Number of samples for train and test split.
| Class | Training Set | Testing Set | Total |
|---|---|---|---|
|
| 3089 | 786 | 3875 |
|
| 3111 | 764 | 3875 |
|
| 6200 | 1550 | 7750 |
Figure 3Architecture of CNN model used in this study.
Figure 4Architecture of three ensemble models used in this study.
Trainable parameters for different deep learning models.
| Model | Trainable Parameters |
|---|---|
| CNN | 533,977 |
| VGG-16 | 9,460,737 |
| Inception-V3 | 38,537,217 |
| ResNet50 | 262,401 |
| Inception-V3 with CNN | 26,645,113 |
| ResNet50 with CNN | 28,411,357 |
| VGG-16 with CNN | 16,052,505 |
Results of fine-tuned pre-trained models.
| Model | Accuracy | Precision | Recall | F1 Score | Cohen’s Kappa | ROC AUC |
|---|---|---|---|---|---|---|
| CNN | 98.25 | 99.19 | 97.25 | 98.21 | 96.51 | 98.24 |
| VGG-16 | 97.93 | 98.41 | 97.38 | 97.89 | 95.86 | 97.92 |
| Inception-V3 | 96.58 | 97.46 | 95.54 | 96.49 | 93.15 | 96.57 |
| ResNet50 | 97.87 | 98.15 | 97.51 | 97.83 | 95.74 | 97.87 |
Figure 5Comparison of individual models for performance evaluation metrics.
Figure 6Training and testing accuracy of ensemble models. (a): Training and testing accuracy for Inception-V2 with CNN, (b): Training and testing loss for Inception-V2 with CNN, (c): Training and testing accuracy for ResNet-50 with CNN, (d): Training and testing loss for ResNet-50 with CNN, (e): Training and testing accuracy for VVG-16 with CNN and (f): Training and testing loss for VVG-16 with CNN.
Results of ensemble models for pneumonia detection.
| Models | Accuracy | Precision | Recall | F1 Score | Cohen’s Kappa | ROC AUC |
|---|---|---|---|---|---|---|
| VGG-16 + CNN | 98.06 | 96.69 | 99.47 | 98.06 | 96.12 | 98.08 |
| Inception-V3 + CNN | 99.29 | 98.83 | 99.73 | 99.28 | 98.58 | 99.30 |
| ResNet50 + CNN | 99.09 | 98.82 | 99.34 | 99.08 | 98.19 | 99.10 |
Figure 7Performance comparison of ensemble models.
Performance comparison with other approaches.
| Ref. | Techniques | Dataset | Accuracy (%) |
|---|---|---|---|
| [ | Pre-trained CNN models | Chest X-ray | 98.00 |
| [ | Ensemble model | Chest X-ray | 97.4 |
| [ | Ensemble model | Chest X-ray | 86.3 |
| [ | CNN model | Chest X-ray | 90.78 |
| [ | Ensemble model | COVID-19, chest X-ray | 95.05 |
| [ | CNN model | Chest X-ray | 93.73 |
| [ | Ensemble model | Chest X-ray | 98.81 |
| This study | VGG with CNN | Chest X-ray | 98.06 |
| Inception-V3 with CNN | Chest X-ray | 99.29 | |
| ResNet50 with CNN | Chest X-ray | 99.08 |
Results of 10-fold cross-validation for individual and ensemble models.
| Models | Accuracy | Standard Deviation |
|---|---|---|
| CNN | 0.963 | +/− 0.05 |
| ResNet50 | 0.971 | +/−0.01 |
| Inception-V3 | 0.960 | +/−0.01 |
| VGG-16 | 0.980 | +/−0.00 |
| ResNet50 with CNN | 0.972 | +/−0.03 |
| Inception-V3 with CNN | 0.981 | +/−0.01 |
| VGG-16 with CNN | 0.984 | +/−0.02 |