| Literature DB >> 33936577 |
Xiaoshuo Li1, Wenjun Tan1,2, Pan Liu1, Qinghua Zhou1, Jinzhu Yang2.
Abstract
Novel coronavirus pneumonia (NCP) has become a global pandemic disease, and computed tomography-based (CT) image analysis and recognition are one of the important tools for clinical diagnosis. In order to assist medical personnel to achieve an efficient and fast diagnosis of patients with new coronavirus pneumonia, this paper proposes an assisted diagnosis algorithm based on ensemble deep learning. The method combines the Stacked Generalization ensemble learning with the VGG16 deep learning to form a cascade classifier, and the information constituting the cascade classifier comes from multiple subsets of the training set, each of which is used to collect deviant information about the generalization behavior of the data set, such that this deviant information fills the cascade classifier. The algorithm was experimentally validated for classifying patients with novel coronavirus pneumonia, patients with common pneumonia (CP), and normal controls, and the algorithm achieved a prediction accuracy of 93.57%, sensitivity of 94.21%, specificity of 93.93%, precision of 89.40%, and F1-score of 91.74% for the three categories. The results show that the method proposed in this paper has good classification performance and can significantly improve the performance of deep neural networks for multicategory prediction tasks.Entities:
Year: 2021 PMID: 33936577 PMCID: PMC8061232 DOI: 10.1155/2021/5528441
Source DB: PubMed Journal: J Healthc Eng ISSN: 2040-2295 Impact factor: 2.682
Overview of methods and quantitative results toward COVID-19 classification.
| Author | Dataset | No. of images | Method | Quantitative results indicators |
|---|---|---|---|---|
| Gao [ | Internal | 791 | XGBoost | Acc = 94.34%; Sens = 83.33% |
| Wang [ | Internal | 540 | 3D CNN | Acc = 90.1%;ROC = 95.5% |
| Han [ | Internal | 460 | Attention mechanism + 3D multiple instance learning | Acc = 97.9%; AUC = 99.0% |
| J. HORRY [ | COVID-CT dataset | 746 | VGG19 | Acc = 84% |
| A. Waheed [ | COVID-19 chest X-ray dataset [ | 932 | VGG16 + ACGAN | Acc = 95%; Sens = 90% |
| Pathak [ | Chest CT images [ | 1790 | DBM | Acc = 98.37%; AUC = 98.32% |
| Y. Oh [ | JSRT [ | 502 | ResNet-18 | Acc = 88.9%; Spec = 96.4 |
| Wang [ | RSNA [ | 18567 | ResNet-101 + ResNet-102 | Acc = 96.1% |
| Ouyang [ | Internal | 2796 | Attention mechanism + 3D CNN | Acc = 87.5%; AUC = 94.4%; Sens = 86.9% |
| T. Siswantining [ | Internal | 170 | CNN + SVM + NN | Acc = 95% |
| Dong [ | Internal | 640 | DCNN | Acc = 93.64 ± 1.42% Sens = 93.28 ± 1.5% Spec = 94.0 ± 1.56% |
| Zhang [ | CC-CCI [ | 61775 | 3D Resnet-18 | Acc = 92.49%; Sens = 94.93%; Spec = 91.13% |
Internal is the nonpublic dataset.
Functions and accuracy of all classifiers.
| Classifier name | Classifier type | Discriminate type | Training set | Accuracy (%) |
|---|---|---|---|---|
| M3 | Multiclassifier | [COVID-19, CP, normal] | U | 88.12 |
| M31 | Multiclassifier | [COVID-19, CP, normal] | U1 | 85.57 |
| M21 | Binary classifier | [COVID-19, (CP, normal)] | U2 | 95.69 |
| M22 | Binary classifier | [CP, (COVID-19, normal)] | U3 | 94.07 |
| M23 | Binary classifier | [Normal, (COVID-19,CP)] | U4 | 95.91 |
| M24 | Binary classifier | [COVID-19, CP] | U5 | 96.49 |
| M25 | Binary classifier | [CP, normal] | U6 | 95.73 |
Figure 1Overall flow of the algorithm. Train multiple deep learning models by dividing subsets, integrate models by stacked idea, and finally output classifier prediction results by setting the threshold probability to 0.5.
Introduction to dataset size.
| Cohort | COVID-19 | Common pneumonia | Normal | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Patients | Scans | Slices | Patients | Scans | Slices | Patients | Scans | Slices | |
| Train | 115 | 183 | 4800 | 128 | 303 | 4800 | 85 | 108 | 4800 |
| Validate | 115 | 183 | 480 | 128 | 303 | 480 | 85 | 108 | 480 |
| Test | 675 | 1180 | 76000 | 256 | 441 | 18852 | 158 | 364 | 45000 |
| Total | 790 | 1363 | 80800 | 384 | 744 | 23652 | 243 | 472 | 49800 |
Figure 2Compare the accuracy, sensitivity, precision, F1-score, and specificity under deep learning based on VGG16 and based on the combination of ensemble learning and VGG16.
Figure 3Results of COVID-19, CP, and normal evaluated under two methods. (a) Accuracy evaluation of a triclassifier model based on VGG16. (b) Accuracy evaluation of a triclassifier model based on a combination of integrated learning and VGG16.