| Literature DB >> 33680351 |
Hao Jiang1,2, Shiming Tang3, Weihuang Liu1,4, Yang Zhang1.
Abstract
As a recent global health emergency, the quick and reliable diagnosis of COVID-19 is urgently needed. Thus, many artificial intelligence (AI)-base methods are proposed for COVID-19 chest CT (computed tomography) image analysis. However, there are very limited COVID-19 chest CT images publicly available to evaluate those deep neural networks. On the other hand, a huge amount of CT images from lung cancer are publicly available. To build a reliable deep learning model trained and tested with a larger scale dataset, the proposed model builds a public COVID-19 CT dataset, containing 1186 CT images synthesized from lung cancer CT images using CycleGAN. Additionally, various deep learning models are tested with synthesized or real chest CT images for COVID-19 and Non-COVID-19 classification. In comparison, all models achieve excellent results (over than 90%) in accuracy, precision, recall and F1 score for both synthesized and real COVID-19 CT images, demonstrating the reliable of the synthesized dataset. The public dataset and deep learning models can facilitate the development of accurate and efficient diagnostic testing for COVID-19.Entities:
Keywords: COVID-19; Chest CT image; Classification; CycleGAN; Image synthesis; Lung cancer; Style transfer
Year: 2021 PMID: 33680351 PMCID: PMC7923948 DOI: 10.1016/j.csbj.2021.02.016
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Summary of top cited studies of deep learning-based COVID-19 analysis.
| Ref | Backbone | Task | Dataset | Results |
|---|---|---|---|---|
| U-Net | Segmentation | 6,150 CT images with lung abnormalities and their lung masks. | AUC of 0.996, sensitivity of 98.2%, and specificity of 92.2%. | |
| ResNet50 | Classification | 50 abnormal thoracic CT scans of patients that were diagnosed by a radiologist as suspicious for COVID-19. Cases were annotated for each image as normal (n = 1036) vs abnormal (n = 829). | ||
| U-Net | Classification | 4352 3D chest CT images from 3,322 patients, consisting of 1,292 COVID-19, 1,735 community-acquired pneumonia, 1,325 Non-pneumonia. | AUC of 0.96, sensitivity of 90%, and specificity of 96%. | |
| ResNet50 | ||||
| ResNet | Classification | 618 CT images (219 COVID-19, 224 Influenza-A-viral-pneumonia, and 175 healthy case). | Accuracy of 86.7%. | |
| DRE-Net based on ResNet50 | Classification | 88 COVID-19 patients with 777 CT images, 100 bacterial pneumonia patients with 505 images, and 86 healthy people with 708 images. | AUC of 0.99 and recall of 0.93. | |
| VB-Net | Segmentation | 249 CT images of 249 COVID-19 patients were collected from other centers for training. 300 CT images from 300 COVID-19 patients were collected for validation. | Dice similarity coefficients of 91.6%±10.0%. | |
| DenseNet-169 | Classification | 349 CT images of COVID-19 were extracted from 760 preprints about COVID-19 from medRxiv and bioRxiv. | F1 of 0.85, AUC of 0.95, and accuracy of 0.83. | |
| Conditional GAN | COVID-19 image generating | CT images of patients were positive or suspected of COVID-19 or other viral and bacterial pneumonias (MERS, SARS, and ARDS). | Enhanced the identification and detection capacities of the classification models. | |
| GAN | COVID-19 image generating | 2143 chest CT images, containing 327 COVID-19 cases, were acquired from 12 sites across 7 countries. | Improve lung segmentation (+6.02% lesion inclusion rate) and abnormality segmentation (+2.78% dice coefficient). | |
| Conditional GAN | COVID-19 image generating | 829 lung CT slices from 9 COVID-19 patients. | Peak signal-to-noise ratio (PSNR) of 26.89, and structural similarity index (SSIM) of 0.8936. |
Fig. 1Overview of CycleGAN-base deep learning for COVID-19. (A) Representative chest CT images. Publicly available COVID-19 pneumonia images have infected areas with GGO pattern, and images of lung cancer with distinct nodules source from LUNA16. (B) COVID-19 analysis model based on style transfer. The COVID-19 dataset synthesized from lung cancer images is used to train classifiers, and synthesized or real COVID-19 chest CT images are used for testing. (C) A graphical illustration of CycleGAN based deep learning for COVID-19 CT image construction. This structure is divided into two symmetrical parts, for domain C, Generator C tries to transform the GGO style of COVID-19 into the nodule style of lung cancer. The Discriminator C is used to compare the real COVID-19 with fake COVID-19 learned from domain N. Cycle loss is used for supervising the continuity of the input and the image circulated after two generations.
Fig. 2Deep-learning enabled CT image transformation from lung cancer to COVID-19. (A) Input lung cancer CT image. (B) Reconstructed image obtained using the CycleGAN based deep learning method. (C) Input COVID-19 CT image. Zoomed-in regions of lesion in COVID-19 and lung cancer, highlights the success of generation of GGO pattern. Experiments are repeated through the whole lung cancer dataset, achieving similar results.
Fig. 3Comparison the synthetic and real COVID-19 CT images. A and B show the grayscale image of global and local distribution using the histogram method, respectively. As shown in the histogram, the pixels in both synthetic image and real COVID-19 CT reside more on the larger scale, and hence the synthetic CT image is similar to the real COVID-19 one. C. The KL divergence is computed to compare the agreement of both synthetic COVID-19 image and lung cancer image by quantifying the level of agreement relative to real COVID-19 CT image. The increased value indicated a lower level of similarity.
Test result of different classification models on real and synthetic COVID-19 CT images.
| Model | Synthetic Test | Real Test | ||||||
|---|---|---|---|---|---|---|---|---|
| Accuracy | Recall | Precision | F1 | Accuracy | Recall | Precision | F1 | |
| VGG16 | 94.19 | 88.15 | 100.00 | 93.70 | 94.80 | 88.15 | 98.52 | 93.05 |
| ResNet_50 | 94.83 | 89.47 | 100.00 | 94.44 | 94.10 | 89.47 | 95.32 | 92.30 |
| Inception_v3 | 96.55 | 96.05 | 96.90 | 96.47 | 95.32 | 96.05 | 92.40 | 94.19 |
| Inception_ResNet_v2 | 95.91 | 91.67 | 100.00 | 95.65 | 96.70 | 91.67 | 100.00 | 95.65 |
| DenseNet_169 | 98.92 | 97.80 | 100.00 | 98.89 | 98.09 | 97.80 | 97.37 | 97.92 |
| Average | 96.08 | 92.63 | 99.38 | 95.83 | 95.80 | 92.63 | 96.72 | 94.62 |
Fig. 4ROC curves of five classification models on real or synthetic dataset. The lines colored by red, green, blue, yellow, pink are the ROC curves of DenseNet_169, Inception_ResNet_v2, Inception_v3, ResNet_50 and VGG16 respectively. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)