| Literature DB >> 35464874 |
Thiyagarajan Padmapriya1, Thiruvenkatam Kalaiselvi1, Venugopal Priyadharshini2.
Abstract
AI-based tools were developed in the existing works, which focused on one type of image data; either CXR's or computerized tomography (CT) scans for COVID-19 prediction. There is a need for an AI-based tool that predicts COVID-19 detection from chest images such as Chest X-ray (CXR) and CT scans given as inputs. This research gap is considered the core objective of the proposed work. In the proposed work, multimodal CNN architectures were developed based on the parameters and hyperparameters of neural networks. Nine experiments evaluate optimizers, learning rates, and the number of epochs. Based on the experimental results, suitable parameters are fixed for multimodal architecture development for COVID-19 detection. We have constructed a bespoke convolutional neural network (CNN) architecture named multimodal covid network (MMCOVID-NET) by varying the number of layers from two to seven, which can predict covid or normal images from both CXR's and CT scans. In the proposed work, we have experimented by constructing 24 models for COVID-19 prediction. Among them, four models named MMCOVID-NET-I, MMCOVID-NET-II, MMCOVID-NET-III, and MMCOVID-NET-IV performed well by producing an accuracy of 100%. We obtained these results from a small dataset. So we repeated these experiments in a larger dataset. We inferred that MMCOVID-NET-III outperformed all the state-of-the-art methods by producing an accuracy of 99.75%. The experiments carried out in this work conclude that the parameters and hyperparameters play a vital role in increasing or decreasing the model's performance.Entities:
Keywords: COVID‐19; CT scans; artificial intelligence; chest X‐rays; convolutional neural networks; coronavirus disease; deep neural networks
Year: 2022 PMID: 35464874 PMCID: PMC9015522 DOI: 10.1002/ima.22712
Source DB: PubMed Journal: Int J Imaging Syst Technol ISSN: 0899-9457 Impact factor: 2.177
Existing architectures for COVID‐19 detection
| S. no | Author and year | Methodology | Type of network | Imaging modality | Hyperparameters used in the network |
|---|---|---|---|---|---|
| 1 | Wang et.al (2020) | Transfer learning | Inception v3 | Chest CT | Batch normalization, dropout (0.5) |
| 2 | Chairmaine et.al (2020) | Transfer learning | ResNet23 | Chest CT | Batch normalization |
| 3 | Linda et.al (2020) | Deep learning | CNN model named COVID‐Net | Chest X‐ray | ‐ |
| 4 | Prabira and Santhi (2020) | Machine learning | SVM | Chest X‐ray | ‐ |
| 5 | Tulin et.al (2020) | Deep learning | CNN model named DarkCovidNet | Chest X‐ray | Batch normalization |
| 6 | Ali et.al (2020) | Transfer learning | ResNet50, ResNet101, ResNet152, InceptionV3 and Inception‐ResNetV2 | Chest X‐ray | ‐ |
| 7 | Arpan et.al (2020) | Transfer learning | DenseNet121 | Chest X‐ray | ‐ |
| 8 | Farooq et.al (2020) | Transfer learning | ResNet | Chest X‐ray | ‐ |
| 9 | Lawrence et.al (2020) | Transfer learning | ResNet50, VGG16 | Chest X‐ray | ‐ |
| 10 | Mukherjee et.al (2020) | Deep learning | Three Layered CNN Architecture | Chest CT + Chest X‐ray | Image size, batch size, dropout, and number of epochs |
FIGURE 1(A) Covid positive (X‐ray); (B) Covid negative (X‐ray)
FIGURE 2(A) Covid positive (CT); (B) Covid negative (CT)
FIGURE 3Sample CT images used for training the proposed models
FIGURE 4Sample X‐ray images used for training the proposed models
FIGURE 5Flow diagram for training phase
Experiments for fixing hyperparameters
| S.no | Model hyperparameters | Accuracy | Sensitivity | Specificity | Precision | F1 score |
|---|---|---|---|---|---|---|
| 1 |
|
|
|
|
|
|
| 2 |
|
|
|
|
|
|
| 3 |
|
|
|
|
|
|
| 4 |
|
|
|
|
|
|
| 5 |
|
|
|
|
|
|
| 6 |
|
|
|
|
|
|
| 7 | Two layers + SGD + Learning Rate (0.01) | 68 | 40 | 75 | 47 | 42 |
| 8 | Two layers + SGD + Learning Rate (0.001) | 70 | 43 | 78 | 49 | 45 |
| 9 | Two layers + SGD + Learning Rate (0.0001) | 73 | 46 | 82 | 52 | 48 |
Note: The bolded rows indicates the combination of hyparameters which produced good results.
FIGURE 6Architecture of five layered MMCOVID‐NET Model
FIGURE 7Experiments on N‐layered CNN with BN (A) Two‐layered CNN (B) Three‐layered CNN (C) Four layered CNN (D) Five layered CNN (E) Six layered CNN (F) Seven‐layered CNN
FIGURE 8Flow diagram of prediction phase
Experiments for proposed MMCOVID‐NET models' selection for COVID‐19 detection in Dataset 1
| S.no | Model description | Accuracy (In percentage) | Sensitivity | Specificity | Precision | F1 score |
|---|---|---|---|---|---|---|
| 1 | 2 convolution layers + RMSprop | 87.5 | 72 | 94 | 82 | 76 |
| 2 |
|
|
|
|
|
|
| 3 | 3 convolution layers + Adam + Batch Normalization | 87.5 | 72 | 94 | 82 | 76 |
| 4 | 4 convolution layers + Adam | 84 | 62 | 91 | 73 | 67 |
| 5 |
|
|
|
|
|
|
| 6 | 5 convolution layers + Adam | 87.5 | 72 | 94 | 82 | 76 |
| 7 |
|
|
|
|
|
|
| 8 | 5 convolution layers + RMSprop | 87.5 | 72 | 94 | 82 | 76 |
| 9 | 6 convolution layers + RMSprop | 87.5 | 72 | 94 | 82 | 76 |
| 10 |
|
|
|
|
|
|
Note: The bolded rows indicates the combination of hyparameters which produced good results.
Experiments for proposed MMCOVID‐NET models' selection for COVID‐19 detection in Dataset 2
| S.no | Model description | Accuracy (In percentage) | Sensitivity | Specificity | Precision | F1 score |
|---|---|---|---|---|---|---|
| 1 | 2 convolution layers + RMSprop | 85 | 71 | 94 | 81 | 72 |
| 2 | 3 convolution layers + Adam (MMCOVID‐NET‐I) | 92 | 91 | 98 | 96 | 96 |
| 3 | 3 convolution layers + RMSprop + Batch Normalization | 87.5 | 58 | 96 | 53 | 46 |
| 4 | 4 convolution layers + Adam + Batch Normalization (MMCOVID‐NET‐II) | 94 | 93 | 99 | 96 | 96 |
| 5 | 4 convolution layers + RMSprop | 87.5 | 58 | 96 | 53 | 46 |
| 6 | 5 convolution layers + Adam | 87.5 | 58 | 96 | 53 | 46 |
| 7 |
|
|
|
|
|
|
| 8 | 7 convolution layers + Adam + Batch Normalization | 93.75 | 93 | 99 | 96 | 96 |
| 9 | 7 convolution layers + RMSprop + Batch Normalization | 94 | 93 | 99 | 96 | 96 |
| 10 | 7 convolution layers + RMSprop (MMCOVID‐NET‐IV) | 92 | 90 | 98 | 94 | 94 |
Note: The bolded rows indicates the combination of hyparameters which produced good results.
FIGURE 9AUC‐ROC curve for proposed multimodal CNN's (A) MMCOVID‐NET‐I (B) MMCOVID‐NET‐II (C) MMCOVID‐NET‐III (D) MMCOVID‐NET‐IV
AUC score for the proposed multimodal covid CNN models
| S.NO | Model | AUC score |
|---|---|---|
| 1 | Proposed MMCOVID‐NET‐I | 98.13 |
| 2 | Proposed MMCOVID‐NET‐II | 97.46 |
| 3 | Proposed MMCOVID‐NET‐III | 98.90 |
| 4 | Proposed MMCOVID‐NET‐IV | 97.19 |
Comparison of proposed MMCOVID‐NET models with other AI‐based models for COVID‐19 detection
| S.no | Method/Model | Accuracy |
|---|---|---|
| 1 | ResNet50 and VGG16 |
|
| 2 | DenseNet121 and MobileNet |
|
| 3 | Xception and InceptionV3 | 98.80 |
| 4 | ResNet50 | 98.27 |
| 5 | VGG16 | 98.93 |
| 6 | DenseNet121 | 98.27 |
| 7 | MobileNet | 97.87 |
| 8 | Xception | 96 |
| 9 | InceptionV3 | 98.27 |
| 10 | 3 Layered Tailored CNN network | 93 |
| 11 |
|
|
Note: The bolded rows indicates the combination of hyparameters which produced good results.