| Literature DB >> 33191476 |
Parisa Gifani1, Ahmad Shalbaf2, Majid Vafaeezadeh3.
Abstract
PURPOSE: COVID-19 has infected millions of people worldwide. One of the most important hurdles in controlling the spread of this disease is the inefficiency and lack of medical tests. Computed tomography (CT) scans are promising in providing accurate and fast detection of COVID-19. However, determining COVID-19 requires highly trained radiologists and suffers from inter-observer variability. To remedy these limitations, this paper introduces an automatic methodology based on an ensemble of deep transfer learning for the detection of COVID-19.Entities:
Keywords: COVID-19; CT; Convolutional neural network; Ensemble model; Transfer learning
Mesh:
Year: 2020 PMID: 33191476 PMCID: PMC7667011 DOI: 10.1007/s11548-020-02286-w
Source DB: PubMed Journal: Int J Comput Assist Radiol Surg ISSN: 1861-6410 Impact factor: 2.924
Fig. 1Examples of positive COVID-19 CT scans
Comparison of the analyzed pre-trained CNNs
| Architecture name | Year | Main contribution | Parameters | FLOP | # trainable layers | Input shape |
|---|---|---|---|---|---|---|
| EfficientNetB0 | 2019 | Compound scaling | 4,049,564 | 0.39B | 237 | 224 × 224 |
| EfficientNetB1 | 2019 | Compound scaling | 6,575,232 | 0.70B | 338 | 240 × 240 |
| EfficientNetB2 | 2019 | Compound scaling | 7,768,562 | 1.0B | 338 | 260 × 260 |
| EfficientNetB3 | 2019 | Compound scaling | 10,783,528 | 1.8B | 383 | 300 × 300 |
| EfficientNetB4 | 2019 | Compound scaling | 17,673,816 | 4.2B | 473 | 380 × 380 |
| EfficientNetB5 | 2019 | Compound scaling | 28,513,520 | 9.9B | 575 | 456 × 456 |
| NasnetLarge | 2017 | Define blocks and cells in architectures Find the best architectures by following a search strategy that will maximize the performance | 84,916,818 | 24B | 1038 | 331 × 331 |
| NasnetMobile | 2017 | Define blocks and reduction cells in architectures Find the best architectures by following a search strategy that will maximize the performance | 4,269,716 | 0.56 B | 768 | 224 × 224 |
| Inception-V3 | 2015 | Handles the problem of a representational bottleneck Replace large size filters with small filters | 21,802,784 | 5.7B | 310 | 299 × 299 |
| Inception-ResNet | 2016 | Uses split transform merge idea and residual links | 54,336,736 | 13B | 779 | 299 × 299 |
| ResNet 50 | 2016 | Residual learning Identity mapping based skip connections | 23,546,057 | 4.1B | 189 | 224 × 224 |
| Xception | 2017 | Depth-wise convolution followed by point-wise convolution | 20,861,480 | 8.4B | 131 | 299 × 299 |
| ResNeXt | 2017 | Cardinality Homogeneous topology Grouped convolution | 23,048,137 | 4 B | 215 | 224 × 224 |
| SEResNet | 2017 | Models interdependencies between feature-maps | 26,092,144 | 4 B | 286 | 224 × 224 |
| DenseNet 121 | 2017 | Cross-layer information flow | 7,037,504 | 3 B | 426 | 224 × 224 |
Fig. 2Training accuracy for different pre-trained CNN models
Fig. 3Cross-entropy loss function for different pre-trained CNN models
Classification metrics on the test dataset using the different architecture of deep transfer learning models and also proposed ensemble method. For each model, average (± std.) performance measure is reported over the best 5 trained model checkpoints
| Model | Precision | Recall | F1-score | Accuracy | AUC |
|---|---|---|---|---|---|
| EfficientNetB0 | 0.847(± 0.03) | 0.822(± 0.11) | 0.815(± 0.05) | 0.82(± 0.02) | 0.907(± 0.02) |
| EfficientNetB1 | 0.727(± 0.06) | 0.718(± 0.09) | 0.712(± 0.03) | 0.71(± 0.02) | 0.809(± 0.02) |
| EfficientNetB2 | 0.768(± 0.03) | 0.768(± 0.12) | 0.768(± 0.05) | 0.77(± 0.03) | 0.859(± 0.03) |
| EfficientNetB3 | 0.769(± 0.03) | 0.765(± 0.07) | 0.763(± 0.03) | 0.76(± 0.03) | 0.851(± 0.01) |
| EfficientNetB4 | 0.791(± 0.02) | 0.789(± 0.05) | 0.788(± 0.01) | 0.79(± 0.01) | 0.877(± 0.01) |
| EfficientNetB5 | 0.817(± 0.03) | 0.817(± 0.11) | 0.817(± 0.05) | 0.82(± 0.03) | 0.886(± 0.01) |
| Inception_resnet_v2 | 0.773(± 0.03) | 0.774(± 0.12) | 0.773(± 0.05) | 0.77(± 0.02) | 0.856(± 0.01) |
| InceptionV3 | 0.825(± 0.03) | 0.814(± 0.07) | 0.815(± 0.03) | 0.82(± 0.02) | 0.897(± 0.02) |
| NASNetLarge | 0.772(± 0.06) | 0.770(± 0.09) | 0.768(± 0.03) | 0.77(± 0.01) | 0.836(± 0.03) |
| NASNetMobile | 0.759(± 0.03) | 0.757(± 0.12) | 0.757(± 0.05) | 0.76(± 0.04) | 0.823(± 0.02) |
| ResNet50 | 0.807(± 0.03) | 0.808(± 0.11) | 0.807(± 0.05) | 0.81(± 0.03) | 0.875(± 0.01) |
| Xception | 0.738(± 0.06) | 0.739(± 0.09) | 0.738(± 0.03) | 0.74(± 0.04) | 0.782(± 0.04) |
| DenseNet121 | 0.768(± 0.03) | 0.768(± 0.03) | 0.768(± 0.03) | 0.77(± 0.02) | 0.868(± 0.04) |
| SeResnet50 | 0.755(± 0.03) | 0.745(± 0.07) | 0.745(± 0.03) | 0.75(± 0.02) | 0.818(± 0.02) |
| ResNext50 | 0.810(± 0.03) | 0.806(± 0.12) | 0.806(± 0.05) | 0.81(± 0.02) | 0.843(± 0.02) |
| Proposed ensemble model | 0.857(± 0.02) | 0.854(± 0.05) | 0.852(± 0.01) | 0.852(± 0.01) | 0.91(± 0.01) |
Fig. 4ROC/AUC curves of different architecture of deep transfer learning models
Confusion matrix for the best ensemble model