| Literature DB >> 33230395 |
Muhammet Fatih Aslan1, Muhammed Fahri Unlersen2, Kadir Sabanci1, Akif Durdu3.
Abstract
Coronavirus disease 2019 (COVID-2019), which emerged in Wuhan, China in 2019 and has spread rapidly all over the world since the beginning of 2020, has infected millions of people and caused many deaths. For this pandemic, which is still in effect, mobilization has started all over the world, and various restrictions and precautions have been taken to prevent the spread of this disease. In addition, infected people must be identified in order to control the infection. However, due to the inadequate number of Reverse Transcription Polymerase Chain Reaction (RT-PCR) tests, Chest computed tomography (CT) becomes a popular tool to assist the diagnosis of COVID-19. In this study, two deep learning architectures have been proposed that automatically detect positive COVID-19 cases using Chest CT X-ray images. Lung segmentation (preprocessing) in CT images, which are given as input to these proposed architectures, is performed automatically with Artificial Neural Networks (ANN). Since both architectures contain AlexNet architecture, the recommended method is a transfer learning application. However, the second proposed architecture is a hybrid structure as it contains a Bidirectional Long Short-Term Memories (BiLSTM) layer, which also takes into account the temporal properties. While the COVID-19 classification accuracy of the first architecture is 98.14%, this value is 98.70% in the second hybrid architecture. The results prove that the proposed architecture shows outstanding success in infection detection and, therefore this study contributes to previous studies in terms of both deep architectural design and high classification success.Entities:
Keywords: AlexNet; BiLSTM; COVID-19; Hybrid architecture; Transfer learning
Year: 2020 PMID: 33230395 PMCID: PMC7673219 DOI: 10.1016/j.asoc.2020.106912
Source DB: PubMed Journal: Appl Soft Comput ISSN: 1568-4946 Impact factor: 6.725
Fig. 1Block diagram of the proposed algorithm.
Number of samples belonging to each class in the COVID-19 Radiography Database.
| Class | Number of samples |
|---|---|
| COVID-19 | 219 |
| Viral Pneumonia | 1345 |
| Normal | 1341 |
| Total | 2905 |
Fig. 2Sample images of COVID-19 Radiography Database.
Fig. 3Noises or irrelevant patterns in a sample X-ray Image.
Fig. 4Manually segmented X-ray images.
Fig. 5Proposed ANN-based segmentation.
Fig. 6Rotation operation.
Fig. 7An example transfer learning process.
Fig. 8Proposed mAlexnet architecture.
Layers and parameters of the proposed mAlexNet.
| Layer name | Size | Fiter size | Stride | Padding | Output channel | Activation function |
|---|---|---|---|---|---|---|
| conv1 | 55 × 55 | 11 × 11 | 4 | 0 | 96 | relu |
| maxpool1 | 27 × 27 | 3 × 3 | 2 | 0 | 96 | – |
| conv2 | 27 × 27 | 5 × 5 | 1 | 2 | 256 | relu |
| maxpool2 | 13 × 13 | 3 × 3 | 2 | 0 | 256 | – |
| conv3 | 13 × 13 | 3 × 3 | 1 | 1 | 384 | relu |
| conv4 | 13 × 13 | 3 × 3 | 1 | 1 | 384 | relu |
| conv5 | 13 × 13 | 3 × 3 | 1 | 1 | 256 | relu |
| maxpool5 | 6 × 6 | 3 × 3 | 2 | 0 | 256 | – |
| fc6 | – | – | – | – | 4096 | relu |
| fc7 | – | – | – | – | 4096 | relu |
| fc8 | - | – | – | – | 25 | relu |
| fc9 | - | – | – | – | 2 | softmax |
| Training parameters | ||||||
| Optimizer | Max. Epoch | Mini Batch Size | Initial Learning Rate ( | Momentum ( | ||
| SGDM | 100 | 60 | 0.001 | 0.95 | ||
Fig. 9Proposed mAlexNet–BiLSTM (Hybrid) architecture.
Hyperparameters of layers used for hybrid architecture and training options.
| BiLSTM-1 | BiLSTM-2 | fc9 | |||||
|---|---|---|---|---|---|---|---|
| Number of hidden units | State activation function | Gate activation function | Number of hidden units | State activation function | Gate activation function | Output size | State activation function |
| 125 | 100 | 3 | |||||
| Training parameters | |||||||
| Optimizer | Gradient decay factor ( | Squared Gradient decay factor ( | Max. Epoch | Mini batch size | Initial learning rate ( | Epsilon ( | |
| Adam | 0.9 | 0.999 | 200 | 512 | 0.001 | ||
Fig. 11Confusion matrices of proposed methods.
Performance metrics of the proposed architectures.
| Architecture | Acc. (%) | Error | Recall | Specificity | Precision | False Positive rate | F1-score | AUC | MCC | Kappa |
|---|---|---|---|---|---|---|---|---|---|---|
| mAlexNet | 98.14 | 0.0186 | 0.9826 | 0.9906 | 0.9816 | 0.0094 | 0.9820 | 0.9855 | 0.9726 | 0.9581 |
| mAlexNet + BiLSTM | 98.70 | 0.0130 | 0.9876 | 0.9933 | 0.9877 | 0.0067 | 0.9876 | 0.9900 | 0.9809 | 0.9707 |
Fig. 10Training graphics of proposed architectures.
Fig. 12ROC curves of the proposed methods.
Comparison of the proposed hybrid method with previous studies.
| Study | Method | Accuracy (%) |
|---|---|---|
| Wang and Wong | Deep Learning | 92.30 |
| Afshar, Heidarian, Naderkhani, Oikonomou, Plataniotis and Mohammadi | Capsule network | 95.70 |
| Chowdhury, Rahman, Khandakar, Mazhar, Kadir, Mahbub, Islam, Khan, Iqbal and Al-Emadi | Transfer Learning | 97.94 |
| Farooq and Hafeez | Transfer Learning | 96.20 |
| Ucar and Korkmaz | Bayes-SqueezeNet | 98.30 |
| Apostolopoulos and Mpesiana | Transfer Learning | 93.48 |
| Xu, Jiang, Ma, Du, Li, Lv, Yu, Ni, Chen and Su | ResNet + Location Attention | 86.70 |
| Ozturk, Talo, Yildirim, Baloglu, Yildirim and Acharya | DarkCovidNet | 87.02 |
| Narin, Kaya and Pamuk | Transfer Learning | 98.00 |
| Asif and Wenhui | Transfer Learning | 96.00 |
| Nour, Cömert and Polat | Deep-Machine Learning | 98.97 |
| Khan, Shah and Bhat | Transfer Learning | 95.00 |
| Gupta, Anjum, Gupta and Katarya | InstaCovNet-19 | 99.08 |
| Sethy and Behera | ResNet50 + SVM | 95.40 |
| Hemdan, Shouman and Karar | VGG19 | 90.00 |
| Rahimzadeh and Attar | Xception + ResNet50V2 | 91.40 |