| Literature DB >> 33641077 |
Fudan Zheng1, Liang Li2, Xiang Zhang3, Ying Song4, Ziwang Huang1, Yutian Chong5, Zhiguang Chen1,6, Huiling Zhu7, Jiahao Wu8, Weifeng Chen9, Yutong Lu1,6, Yuedong Yang10,11, Yunfei Zha12, Huiying Zhao13, Jun Shen14.
Abstract
Computed tomography (CT) is one of the most efficient diagnostic methods for rapid diagnosis of the widespread COVID-19. However, reading CT films brings a lot of concentration and time for doctors. Therefore, it is necessary to develop an automatic CT image diagnosis system to assist doctors in diagnosis. Previous studies devoted to COVID-19 in the past months focused mostly on discriminating COVID-19 infected patients from healthy persons and/or bacterial pneumonia patients, and have ignored typical viral pneumonia since it is hard to collect samples for viral pneumonia that is less frequent in adults. In addition, it is much more challenging to discriminate COVID-19 from typical viral pneumonia as COVID-19 is also a kind of virus. In this study, we have collected CT images of 262, 100, 219, and 78 persons for COVID-19, bacterial pneumonia, typical viral pneumonia, and healthy controls, respectively. To the best of our knowledge, this was the first study of quaternary classification to include also typical viral pneumonia. To effectively capture the subtle differences in CT images, we have constructed a new model by combining the ResNet50 backbone with SE blocks that was recently developed for fine image analysis. Our model was shown to outperform commonly used baseline models, achieving an overall accuracy of 0.94 with AUC of 0.96, recall of 0.94, precision of 0.95, and F1-score of 0.94. The model is available in https://github.com/Zhengfudan/COVID-19-Diagnosis-and-Pneumonia-Classification .Entities:
Keywords: CT image; Deep learning network; Diagnosis of COVID-19; Pneumonia classifying
Mesh:
Year: 2021 PMID: 33641077 PMCID: PMC7914048 DOI: 10.1007/s12539-021-00420-z
Source DB: PubMed Journal: Interdiscip Sci ISSN: 1867-1462 Impact factor: 2.233
Fig. 1The workflow of data preprocessing: a converting the image into a binary image with a density threshold of − 600 HU, b removing the connected regions that are in contact with the edges of the image, c keeping the two largest areas, d performing morphological erosion, e performing binary morphological closing and filling the small holes inside the binary mask of lungs, f superimposing the binary mask on the input image and detecting the smallest effective rectangle surrounding the lungs, g filling the image with 10 translational and rotational copies of the lungs on the background
The number of persons and CT slices provided by the two hospitals after preprocessing
| COVID-19 | Healthy people | Bacterial pneumonia | Typical viral pneumonia | Total | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Patients | Slices | Persons | Slices | Patients | Slices | Patients | Slices | Persons | Slices | |
| Sun Yat-sen Memorial Hospital | – | – | 22 | 295 | 22 | 113 | 61 | 647 | 105 | 1055 |
| Renmin Hospital of Wuhan University | 262 | 2119 | 56 | 287 | 78 | 391 | 158 | 1511 | 554 | 4308 |
| Total | 262 | 2119 | 78 | 582 | 100 | 504 | 219 | 2158 | 659 | 5363 |
Fig. 2The pipeline of the proposed system. The CT images were first preprocessed, and then sent to the classification network to make predictions in image level. Then, the image-level predictions of all images of each person were aggregated to provide human-level diagnosis
Fig. 3The architecture of the classification model. Channel SE blocks were introduced to emphasize important channels and suppress less important ones
The number of persons and CT slices in the training set, validation set and test set for the quaternary classification task of all the four types of persons
| COVID-19 | Healthy people | Bacterial pneumonia | Typical viral pneumonia | |||||
|---|---|---|---|---|---|---|---|---|
| Patients | Slices | Persons | Slices | Patients | Slices | Patients | Slices | |
| Training set | 205 | 1667 | 42 | 394 | 60 | 306 | 167 | 1713 |
| Validation set | 27 | 205 | 6 | 57 | 10 | 64 | 22 | 215 |
| Test set | 30 | 247 | 30 | 131 | 30 | 134 | 30 | 230 |
| Total | 262 | 2119 | 78 | 582 | 100 | 504 | 219 | 2158 |
Performance of our classification model in identifying four different types of persons all at once
| AUC | Recall | Precision | F1-score | Accuracy | |
|---|---|---|---|---|---|
| Macro average performance | 0.96 | 0.94 | 0.95 | 0.94 | 0.94 |
| Healthy | 0.999 | 0.97 | 1 | 0.98 | – |
| COVID-19 | 0.93 | 0.97 | 0.85 | 0.91 | – |
| Bacterial pneumonia | 0.97 | 1 | 0.97 | 0.98 | – |
| Typical viral pneumonia | 0.95 | 0.83 | 0.96 | 0.89 | – |
Fig. 4The receiver-operating characteristic curves of our classification model. a Is for the quaternary classification in identifying four different types of persons all at once and b is for the binary classification in identifying COVID-19 from the other types
Fig. 5Confusion matrix of our classification model. a Is for the quaternary classification in identifying four different types of persons all at once, b–d are for the binary classification in identifying COVID-19 from the other types respectively, and e is for the binary classification in identifying COVID-19 from all the others
Fig. 6CT images of COVID-19 and typical viral pneumonia patients. Row a are CT images of COVID-19 infected patients that are correctly predicted. Row b are CT images of typical viral pneumonia patients that are incorrectly predicted as COVID-19. Row c are CT images of typical viral pneumonia patients that are correctly predicted
Fig. 7The feature maps of the CT images of four types of persons extracted by our classification model
The performance of our model in the binary classification tasks
| Task | AUC | Recall | Precision | F1-score | Accuracy |
|---|---|---|---|---|---|
| COVID-19 or healthy? | 0.99 | 1 | 0.96 | 0.98 | 0.98 |
| COVID-19 or bacterial pneumonia? | 0.94 | 0.93 | 0.81 | 0.86 | 0.86 |
| COVID-19 or typical viral pneumonia? | 0.91 | 0.92 | 0.81 | 0.86 | 0.83 |
| COVID-19 or all others (healthy, bacterial pneumonia, typical viral pneumonia)? | 0.92 | 0.90 | 0.82 | 0.86 | 0.85 |
The performance of ablation study in quaternary classification. Row (1)–(4) are the performance of different data augmentation methods. Row (5) is the performance of our model without the SE blocks. Row (6) is the performance of our model in image level without aggregation into human level. Row (7) is the performance of our model
| Flipping | Translation | Flipping + translation | SE | Aggregation | AUC | Recall | Precision | F1-score | Accuracy | |
|---|---|---|---|---|---|---|---|---|---|---|
| (1) |
|
| 0.88 | 0.73 | 0.78 | 0.73 | 0.73 | |||
| (2) |
|
|
| 0.90 | 0.77 | 0.81 | 0.77 | 0.79 | ||
| (3) |
|
|
| 0.94 | 0.83 | 0.84 | 0.82 | 0.82 | ||
| (4) |
|
|
| 0.95 | 0.87 | 0.92 | 0.88 | 0.87 | ||
| (5) |
|
|
|
| 0.94 | 0.80 | 0.82 | 0.79 | 0.80 | |
| (6) |
|
|
|
| 0.92 | 0.84 | 0.82 | 0.83 | 0.84 | |
| (7) |
|
|
|
|
| 0.96 | 0.94 | 0.95 | 0.94 | 0.94 |
Performance of our classification model comparing with other existing models
| Model | AUC | Recall | Precision | F1-score | Accuracy |
|---|---|---|---|---|---|
| DenseNet | 0.90 | 0.79 | 0.78 | 0.78 | 0.79 |
| VGG | 0.93 | 0.80 | 0.81 | 0.79 | 0.80 |
| ResNet | 0.94 | 0.80 | 0.82 | 0.79 | 0.80 |
| OURS | 0.96 | 0.94 | 0.95 | 0.94 | 0.94 |