| Literature DB >> 34934410 |
Qinghao Ye1,2, Yuan Gao3,4, Weiping Ding5, Zhangming Niu4, Chengjia Wang6, Yinghui Jiang1,7, Minhao Wang1,7, Evandro Fei Fang8, Wade Menpes-Smith4, Jun Xia9, Guang Yang10,11.
Abstract
The world is currently experiencing an ongoing pandemic of an infectious disease named coronavirus disease 2019 (i.e., COVID-19), which is caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Computed Tomography (CT) plays an important role in assessing the severity of the infection and can also be used to identify those symptomatic and asymptomatic COVID-19 carriers. With a surge of the cumulative number of COVID-19 patients, radiologists are increasingly stressed to examine the CT scans manually. Therefore, an automated 3D CT scan recognition tool is highly in demand since the manual analysis is time-consuming for radiologists and their fatigue can cause possible misjudgment. However, due to various technical specifications of CT scanners located in different hospitals, the appearance of CT images can be significantly different leading to the failure of many automated image recognition approaches. The multi-domain shift problem for the multi-center and multi-scanner studies is therefore nontrivial that is also crucial for a dependable recognition and critical for reproducible and objective diagnosis and prognosis. In this paper, we proposed a COVID-19 CT scan recognition model namely coronavirus information fusion and diagnosis network (CIFD-Net) that can efficiently handle the multi-domain shift problem via a new robust weakly supervised learning paradigm. Our model can resolve the problem of different appearance in CT scan images reliably and efficiently while attaining higher accuracy compared to other state-of-the-art methods.Entities:
Keywords: COVID-19; Medical image analysis; Multi-domain shift; Multicenter data processing; Weakly supervised learning
Year: 2021 PMID: 34934410 PMCID: PMC8667427 DOI: 10.1016/j.asoc.2021.108291
Source DB: PubMed Journal: Appl Soft Comput ISSN: 1568-4946 Impact factor: 6.725
Fig. 1(a) Samples of CT images are taken from five different hospitals and (b) The histograms of these CT images. Compared with images from Hospital A and Hospital D, it is clear that the brightness levels are distinctive. Moreover, the contrast of the data collected from the China Consortium of Chest CT Image Investigation (CC-CCII) dataset is considerably different from CT images acquired from other hospitals. The right bottom figure demonstrates the distribution of the images from different hospitals after normalization, however, these distributions still behave distinctively. It is of note that there are no visually distinctive features across CT scan images but it is easy for human radiologists to correctly classify despite CT scanner changes. On the contrary, deep learning based automated methods may fail to generalize across CT images acquired from different hospitals.
Fig. 2The architecture of our proposed CIFD-Net. It is of note that denotes the probability of the Section , and represents the probability of the patient who is tested COVID-19 positive or not. indicates the noise transaction from the probability of the true label to the probability of the noise label . In addition, is a feature embedding function. In addition, ResNet-50 [55] is adopted for backbone network.
Fig. 3(a) The workflow of the class activation mapping (CAM) scheme and (b) The proposed explainable classification module (ECM). It shows that our ECM can generate the CAM using only one forward pass, but the original method proposed by Zhou et al. [53] needs a post-processing procedure to generate the CAM. is the th feature map from the backbone network. Besides, and are the weights for the fully connected layer and the convolutional layer. and are the class activation maps for class . is the class score for class .
The number of CT samples used for training for each class collected by four different hospitals A, B, C, and D. Besides, details of the CC-CCII dataset are also listed, which was used in the independent testing stage. The ratio of positive and negative samples in training set is approximately 1:1, and 2:1 in test dataset.
| Dataset | Number of patients | Number of CT images | Subset | ||||
|---|---|---|---|---|---|---|---|
| Total | Positive | Negative | Total | Positive | Negative | ||
| Hospital A | 424 | 0 | 424 | 24,670 | 0 | 24,670 | Train |
| Hospital B | 58 | 58 | 0 | 5,512 | 5,512 | 0 | Train |
| Hospital C | 17 | 17 | 0 | 2,611 | 2,611 | 0 | Train |
| Hospital D | 305 | 305 | 0 | 12,374 | 12,374 | 0 | Train |
| CC-CCII | 2,034 | 1,320 | 714 | 130,511 | 84,629 | 45,882 | Test |
Comparison results of our CIFD-Net method vs. state-of-the-art architectures on the CC-CCII dataset.
| Annotation | Method | Patient Acc. (%) | Precision (%) | Sensitivity (%) | Specificity (%) | AUC (%) | |
|---|---|---|---|---|---|---|---|
| Patient-level | ResNet-50 | 53.70 | 61.42 | 77.13 | 10.37 | 68.38 | 46.30 |
| COVID-Net | 53.62 | 61.35 | 77.18 | 10.06 | 68.36 | 44.53 | |
| COVNet | 67.64 | 76.03 | 73.17 | 57.34 | 74.57 | 66.13 | |
| VB-Net | 76.75 | 85.25 | 77.61 | 75.22 | 81.25 | 89.48 | |
| CIFD-Net (Ours) | |||||||
| Image-level | ResNet-50 | 67.29 | 68.23 | 92.95 | 20.40 | 78.71 | 53.43 |
| COVID-Net | 64.83 | 66.28 | 12.48 | 77.46 | 51.47 | ||
| COVNet | 70.79 | 83.09 | 68.95 | 74.10 | 75.37 | 73.08 | |
| CIFD-Net (Ours) | 84.74 | ||||||
* indicates the -value , and ** represents the -value .
Fig. 4Receiver Operating Characteristic (ROC) curves and area under ROC curves (AUC) of different models trained using patient-level annotation (a) and image-level annotation (b) on the CC-CCII dataset.
Accuracy (%) of all the cases where each proposed component is applied.
| Exp. | ResNet-50 | Patient Acc. (%) | Image Acc. (%) | ||
|---|---|---|---|---|---|
| 1 | 53.72 | 67.31 | |||
| 2 | 83.97 | 78.60 | |||
| 3 | 35.10 | 35.16 | |||
| 4 |
Fig. 5Variations in classification results by changing the hyper-parameter . The light dash line represents the case when . It shows that our model achieves the best performance with .
Fig. 6Variations in classification results by changing the hyper-parameter . Our model achieves the best performance with with section size .
Fig. 7Visualization of the CAMs and bounding boxes generated by different methods on the CC-CCII dataset. The region with a deeper red color indicates discriminative regions for the prediction by the model. is the probability for being predicted as COVID-19 positive.
Fig. 8Visualizations of infected/non-infected probabilities of each section for the patients. The -axis of the plot is the section index of the patient. The right sub-figures of the probability plot are the picture sampled from the section listed above. (a) The first few sections are recognized as COVID-19 positive with high probability and when approaching the last few sections, no obvious lesions are found thus the positive probability drops drastically. (b) It shows that the probabilities of the first three sections are close to 0.5 indicating uncertainty for these sections. (c) For the last few sections, the lesions are gradually showing up in the left and right lower lobes together with increased infected probability.