| Literature DB >> 35228777 |
Sima Sarv Ahrabi1, Lorenzo Piazzo1, Alireza Momenzadeh1, Michele Scarpiniti1, Enzo Baccarelli1.
Abstract
We present a probabilistic method for classifying chest computed tomography (CT) scans into COVID-19 and non-COVID-19. To this end, we design and train, in an unsupervised manner, a deep convolutional autoencoder (DCAE) on a selected training data set, which is composed only of COVID-19 CT scans. Once the model is trained, the encoder can generate the compact hidden representation (the hidden feature vectors) of the training data set. Afterwards, we exploit the obtained hidden representation to build up the target probability density function (PDF) of the training data set by means of kernel density estimation (KDE). Subsequently, in the test phase, we feed a test CT into the trained encoder to produce the corresponding hidden feature vector, and then, we utilise the target PDF to compute the corresponding PDF value of the test image. Finally, this obtained value is compared to a threshold to assign the COVID-19 label or non-COVID-19 to the test image. We numerically check our approach's performance (i.e. test accuracy and training times) by comparing it with those of some state-of-the-art methods.Entities:
Keywords: COVID-19; Deep convolutional autoEencoder; Hidden representation; Kernel density estimation; Reconstruction error
Year: 2022 PMID: 35228777 PMCID: PMC8867464 DOI: 10.1007/s11227-022-04349-y
Source DB: PubMed Journal: J Supercomput ISSN: 0920-8542 Impact factor: 2.557
A synoptic overview of main related work on COVID-19 detection/classification
| Work | Algorithm specifications | Accuracy |
|---|---|---|
| Ref. [ | 0.8502 | |
| Ref. [ | 0.9849 | |
| Ref. [ | 0.9905 | |
| Ref. [ | 0.9410 | |
| 0.9410 | ||
| Ref. [ | 0.8711 | |
| Ref. [ | 0.9951 | |
| Ref. [ | 0.9943 | |
| Ref. [ | 0.9800 | |
| Ref. [ | 0.9866 | |
| Ref. [ | 0.9300 | |
| Ref. [ | 0.9500 | |
| Ref. [ | 0.9470 | |
| Ref. [ | – | |
| Ref. [ | – | |
| Ref. [ | 0.9920 | |
| Ref. [ | 1.0000 | |
BC, Binary classification; MC, Multiclassification; MLC, Multilabel classification; UC, Unary classification
Train/validation/test set compositions and related web links; WP: Web Page
| Data set | COVID-19 | Pneumonia | Cancers | Normal | Link |
|---|---|---|---|---|---|
| Training | 3200 | ||||
| Validation | 800 | WP1 | |||
| Test | 2500 | 300 | 900 | 300 | WP1 & |
1 https://kaggle.com/hgunraj/covidxct
2 https://kaggle.com/mohamedhanyyy/chest-ctscan-images
Fig. 1Samples of four lung CT scans drawn from the used training and test data sets
Fig. 2Two samples of original COVID-19 CTs and their cropped versions
Fig. 3The autoencoder architecture
The architecture of proposed DCAE; encoder input shape: ; decoder input shape: 128; number of parameters of the encoder: 31, 097, 408; number of parameters of the decoder: 31, 337, 475
| Layer | Kernel | Stride | Output shape |
|---|---|---|---|
| Conv2D | – | ||
| BatchNorm | – | – | |
| Conv2D | 3 | – | |
| MaxPool2D | 2 | – | |
| Conv2D | 3 | – | |
| BatchNorm | – | – | |
| MaxPool2D | 2 | – | |
| Flatten | – | – | 240000 |
| Dense | – | – | 128 |
| Dense | – | – | 240000 |
| Reshape | – | – | |
| Conv2DTranspose | 2 | ||
| BatchNorm | – | – | |
| Conv2DTranspose | 3 | 2 | |
| BatchNorm | – | – | |
| Conv2DTranspose | 3 | 1 | |
Main parameters of the carried out training phase
| Description | Value |
|---|---|
| Batch size | 16 |
| Number of epochs | 100 |
| Optimiser | Adam |
| Learning rate | |
| Loss function | MSE |
| Size of hidden feature vectors | 128 |
Fig. 4Numerically evaluated plots of accuracy-vs.-epochs and MSE-vs.-epochs under the training phase
Fig. 5An example of KDE estimation over data points: a copy of the kernel is placed on each data point and the copies are summed to produce the final PDF estimate
Fig. 6Estimation of the PDF of the latent space (stage 1) and classification of the test images (stage 2)
Fig. 7Evaluation of reconstruction errors of training set and the threshold (stage 1), and classification of the test images (stage 2)
Basic metrics
| Name | Description |
|---|---|
| True positive (TP) | COVID-19 image classified as COVID-19 |
| True negative (TN) | Non-COVID-19 image classified as non-COVID-19 |
| False positive (FP) | Non-COVID-19 image classified as COVID-19 |
| False negative (FN) | COVID-19 image classified as non-COVID-19 |
Utilised performance indexes
| Metrics | Formula |
|---|---|
| Recall | |
| Precision | |
| F-score | |
| Accuracy |
Fig. 8Results under the proposed approach
Performance metrics of the proposed approach and the benchmark one based on the reconstruction error
| Method | Accuracy | Precision | Recall | F1-score | Test CTs |
|---|---|---|---|---|---|
| Reconstruction error | 0.8635 | 0.8584 | 0.8531 | 0.8555 | 4000 |
| Our approach | 0.9712 | 0.9741 | 0.9659 | 0.9696 | 4000 |
Fig. 9Performance results of the reconstruction error approach
Comparison with supervised classification
| Model | Accuracy | Precision | Recall | F1-score |
|---|---|---|---|---|
| AlexNet | 0.9840 | 0.9840 | 0.9841 | 0.9840 |
| GoogleNet | 0.9960 | 0.9960 | 0.9960 | 0.9960 |
| ResNet18 | 0.9930 | 0.9931 | 0.9930 | 0.9930 |
| Unsupervised approach | ||||
| Reconstruction error | 0.9710 | 0.9710 | 0.9710 | 0.9710 |
| Our approach | 1.0000 | 1.0000 | 1.0000 | 1.0000 |
The training set is composed of 4000 images (COVID only for the proposed method and belonging to five different classes for the supervised methods). The test set is composed of 1000 images (500 COVID, 120 pneumonia, 120 normal and 260 cancer)
Robustness tests: pneumonia CTs are present in the test set, but not in the training set for the AlexNet, GoogleNet and ResNet18
| Model | Accuracy | Precision | Recall | F1-score |
|---|---|---|---|---|
| AlexNet | 0.9060 | 0.9060 | 0.9209 | 0.9052 |
| GoogleNet | 0.9140 | 0.9140 | 0.9266 | 0.9134 |
| ResNet18 | 0.9130 | 0.9130 | 0.9259 | 0.9123 |
| Unsupervised approach | ||||
| Reconstructio error | 0.9710 | 0.9710 | 0.9710 | 0.9710 |
| Our approach | 1.0000 | 1.0000 | 1.0000 | 1.0000 |
The test set is composed of 1000 images (500 COVID, 120 pneumonia, 120 normal and 260 cancer)
Fig. 10Confusion matrices: pneumonia CTs are present in test data set, but not in the train data set
Comparative analysis of binary classification of COVID-19 vs non-COVID-19 based on similar datasets
| Model | Accuracy | Precision | Recall | F1-score |
|---|---|---|---|---|
| HB-DDCAE | 1.0000 | 1.0000 | 1.0000 | 1.0000 |
| SVM-2D | 0.9340 | 0.9417 | 0.9340 | 0.9378 |
| SVM-3D | 0.9270 | 0.9363 | 0.9270 | 0.9316 |
| SVM-RBF | 0.9170 | 0.9788 | 0.9170 | 0.9469 |
| MLP-50 | 0.7600 | 0.8132 | 0.7600 | 0.7857 |
| MLP-100 | 0.7740 | 0.7874 | 0.7740 | 0.7806 |
| MLP-200 | 0.7830 | 0.8123 | 0.7830 | 0.7974 |
| RF-100 | 0.7110 | 0.7852 | 0.7110 | 0.7463 |
| RF-500 | 0.7200 | 0.7903 | 0.7200 | 0.7535 |
| RF-1000 | 0.7210 | 0.7893 | 0.7210 | 0.7536 |
The test set is composed of 1000 images (500 COVID, 500 non-COVID)
Average test times over a batch of 10 images
| Model | Test time |
|---|---|
| AlexNet | 1.3566 |
| GoogleNet | 1.2400 |
| ResNet18 | 2.4754 |
| Proposed | 1.0026 |