| Literature DB >> 35996746 |
Lu-Lu Jia1, Jian-Xin Zhao1, Ni-Ni Pan1, Liu-Yan Shi1, Lian-Ping Zhao2, Jin-Hui Tian3, Gang Huang2.
Abstract
Objectives: When diagnosing Coronavirus disease 2019(COVID-19), radiologists cannot make an accurate judgments because the image characteristics of COVID-19 and other pneumonia are similar. As machine learning advances, artificial intelligence(AI) models show promise in diagnosing COVID-19 and other pneumonias. We performed a systematic review and meta-analysis to assess the diagnostic accuracy and methodological quality of the models.Entities:
Keywords: 2D, two-dimensional; 3D, three-dimensional; AI, artificial intelligence; AUC, area under the curve; Artificial Intelligence; CNN, Convolutional neural network; COVID-19; COVID-19, Coronavirus disease 2019; CRP, C-reactive protein; CT, Computed tomography; CXR, Chest X-Ray; Diagnostic Imaging; GGO, ground-glass opacities; KNN, K-nearest neighbor; LASSO, least absolute shrinkage and selection operator; MEERS-COV, Middle East respiratory syndrome coronavirus; ML, machine learning; Machine learning; PLR, negative likelihood ratio; PLR, positive likelihood ratio; Pneumonia; ROI, regions of interest; RT-PCR, Reverse transcriptase polymerase chain reaction; SARS, severe acute respiratory syndrome; SARS-CoV-2, severe acute respiratory syndrome coronavirus 2; SROC, summary receiver operating characteristic; SVM, Support vector machine
Year: 2022 PMID: 35996746 PMCID: PMC9385733 DOI: 10.1016/j.ejro.2022.100438
Source DB: PubMed Journal: Eur J Radiol Open ISSN: 2352-0477
Fig. 1Flow diagram of the study selection process for this meta-analysis.
Summary of general study characteristics.
| Training validation/ Testing | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Study | Country of corresponding author | Study type | Index test | Date source | Eligibility criteria | Reference standard | Common type of pneumonia | Number of COVID-19 vs. other pneumonias | AUC | Type of validation | Number of COVID-19 vs. other pneumonias | SEN | SPC |
| Ardakani 2020 | Iran | R | CT | Single hospital | Yes | RT-PCR | Atypical, viral pneumonia | 86 vs. 69 | 0.999 | Random split | 22 vs.17 | 1.00 | 0.99 |
| Ardakani 2021 | Iran | R | CT | Single hospital | Yes | RT-PCR | Atypical and viral pneumonia | 244 vs.244 | 0.988 | Random split | 62 vs.62 | 0.935 | 0.903 |
| Ali 2021 | turkey | R | CXR | Single database | No | NA | Viral | 146 vs.901 | NR | 3fold CV | 73 vs.444 | 0.973 | NR |
| Han2021 | Korea | R | CT | 2datasets | No | NA | Viral pneumonia, bacterial pneumonia, fungal | 164 vs.320 | NR | External validation | 21 vs.40 | 0.997 | 0.959 |
| Di2020 | China | R | CT | 5hospitals | No | RT-PCR | CAP | 1933 vs. 1064 | NR | 10 fold CV | 215 vs.118 | 0.932 | 0.840 |
| Bai 2020 | China | R | CT | 10hospitals | Yes | RT-PCR | Pneumonia | 377 vs.453 | NR | Random split | 42 vs.77 | 0.950 | 0.960 |
| Panwar 2020 | Mexico | R | CXR | 3datasets | No | NA | Pneumonia | 133 vs.231 | NR | Random split | 29 vs.85 | 0.966 | 0.953 |
| Kang 2020 | China | R | CT | 3hospitals | No | RT-PCR | CAP | 1046 vs. 719 | NR | Random split | 449 vs.308 | 0.966 | 0.932 |
| Liu 2021 | China | R | CT | 2hospitals | Yes | RT-PCR | Viral infections, | 66 vs.313 | 1.000 | External validation | 20 vs.20 | 0.850 | 0.900 |
| Chen 2021 | China | R | CT | Single hospital | Yes | RT-PCR | Other types of pneumonia | 54 vs.60 | 0.984 | Random split | 9 vs.11 | 0.816 | 0.923 |
| Song 2020 | China | R | CT | 2hospitals | Yes | RT-PCR | CAP | 66 vs.66 | 0.979 | External validation | 15 vs.20 | 0.800 | 0.750 |
| Sun 2020 | China | R | CT | 6hospitals | No | RT-PCR | CAP | 1196 vs. 822 | NR | 5fold CV | 299 vs.205 | 0.931 | 0.899 |
| Wang 2021 | China | R | CT | 3hospitals | Yes | RT-PCR | Other types of | 74 vs.73 | 0.970 | External validation | 17 vs.17 | 0.722 | 0.751 |
| Zhou 2021 | China | R | CT | 12hospitals | Yes | RT-PCR | Influenza pneumonia | 118 vs.157 | NR | External validation | 57 vs.50 | 0.860 | 0.772 |
| Azouji2021 | Switzerland | R | CXR | 7datasets | No | NA | MERS, SARS | 338 vs.222 | NR | 5fold CV | 85 vs.56 | 0.989 | NR |
| Cardobi 2021 | Italy | R | CT | Single hospital | No | swab test | Interstitial pneumonias | 54 vs.30 | 0.830 | Random split | 14 vs.17 | 0.570 | 0.930 |
| Yang 2021 | China | R | CT | Single hospital | No | RT-PCR | Other pneumonias | 70 vs.70 | NR | 10fold CV | 20 vs.20 | 0.942 | 0.854 |
| Chikontwe 2021 | Korea | R | CT | Single hospital | No | RT-PCR | Bacterial pneumonia | 38 vs.49 | NR | Random split | 30 vs.39 | 1.000 | 0.975 |
| Zhu 2021 | China | R | CT | 6hospitals | No | RT-PCR | CAP | 1345 vs. 924 | NR | 10fold CV | 150 vs.103 | 0.913 | 0.910 |
| Xie 2020 | China | R | CT | 5hospitals | Yes | RT-PCR | Bacterial infection, | 227 vs.153 | NR | prospective RWD | 243 vs.73 | 0.810 | 0.820 |
| Qi 2021 | China | R | CT | 3hospitals+dataset | Yes | RT-PCR | CAP | 127 vs.90 | NR | 10fold CV | 14 vs.10 | 0.972 | 0.940 |
| Wang 2020 | China | R | CT | 7hospitals | Yes | RT-PCR | Bacterial pneumonia, Mycoplasma pneumonia, Viral pneumonia, Fungal pneumonia | 560 vs.149 | 0.900 | External validation | 102 vs.124 | 0.804 | 0.766 |
| Yang 2020 | China | R | CT | 8hospitals | No | RT-PCR | CAP | 960 vs.628 | 0.976 | External validation | 1605 vs. 452 | 0.869 | 0.901 |
| Wu 2020 | China | R | CT | 3hospitals | No | RT-PCR | Other pneumonia | 294 vs.101 | 0.767 | Random split | 37 vs.13 | 0.811 | 0.615 |
| Zhang 2021 | China | R | CT | 3hospitals | No | RT-PCR | CAP, influenza, mycoplasma pneumonia | 72 vs.127 | 0.987 | 5fold CV | 31 vs.62 | 0.879 | 0.887 |
| Xin 2021 | China | R | CT | 2hospitals | Yes | swab tests | CAP | 34 vs.48 | NR | 5fold CV | 9 vs.12 | 0.957 | 0.984 |
| Guo 2020 | China | R | CT | 2hospitals | No | RT-PCR | Seasonal flu,CAP | 8 vs.42 | 0.970 | External validation | 11 vs.44 | 0.889 | 0.935 |
| Fang2020 | China | R | CT | 2hospitals | Yes | nucleic acid detection | Viral pneumonia | 136 vs.103 | 0.959 | Random split | 56 vs.34 | 0.929 | 0.971 |
| Xia 2021 | China | R | CXR | 2hospitals | Yes | nucleic acid | Influenza A/B | 246 vs.44 | NR | Random split | 266 vs.62 | 0.869 | 0.742 |
| Huang2020 | China | R | CT | 15hospitals | Yes | RT-PCR | Viral pneumonia | 62 vs.64 | 0.849 | 5fold CV | 27 vs.28 | 0.778 | 0.786 |
| Wu 2021 | China | R | CT | Single hospital | Yes | nucleic acid | Other infectious | 76 vs.77 | NR | 5fold CV | 19 vs.19 | 0.809 | 0.842 |
| Chen2021 | China | R | CT | 2hospitals | No | RT-PCR | Viral pneumonia | 81 vs.81 | 0.807 | Random split | 27 vs.19 | 0.733 | 0.822 |
Abbreviations: AUC: area under the curve; CAP: community acquired pneumonia;CT: Computed tomography; CV: cross-validation;CXR: Chest X-Ray; R: retrospective; RT-PCR: Reverse transcriptase polymerase chain reaction; RWD: Real-world dataset; SEN: sensitivity;SPC: specificity
Summary of artificial intelligence-based prediction model characteristics described in included studies.
| Study | ROI | Segmentation | AI Method | Labeling | Pre-Processing | Augmentations | Model | Loss | Comparison between algorithms | AI vs. Radiologist |
|---|---|---|---|---|---|---|---|---|---|---|
| Ardakani 2020 | Regions of infections | 2D | DL | by a radiologist with more than 15 years of experience in thoracic imaging | Manual ROI | NA | Ten well-known | NA | Ten well-known | Yes |
| Ardakani 2021 | CT chest | 2D | ML | By two radiologists | feature extraction | random scaling | ensemble method | NA | DT, KNN, Naïve Bayes, SVM | Yes |
| Ali 2021 | Whole image | 2D | DL | NA | Normalization, transfer-learning | Horizontal, | ResNet50, ResNet101, Res Net 152 | NA | ResNet50, ResNet101, Res Net 152 | No |
| Han2021 | CT slices | 2D | DL | using the labeled COVID-19 dataset | both labeled and unlabeled data can be used | random scaling | a semi-supervised deep neural network | standard cross entropy loss | Supervised learning | No |
| Di2020 | Infected lesions | 2D | ML | NA | extracted both regional and radiomics features, Segmentation | NA | UVHL | cross- | SVM, MLP, iHL, tHL | No |
| Bai 2020 | Lung | 2D | DL | Lesions (COVID-19 or pneumo- | Normalized, Segmentation | flips, scaling, rotations, random brightness and | DNN | NA | No | Yes |
| Panwar 2020 | Whole image | 2D | DL | NA | Filter, dimension reduction, deep transfer learning | Shear, Rotation | A DL and Grad-CAM | binary cross-entropy loss | No | No |
| Kang 2020 | Lesion region | 3D | ML | NA | Segmentation, | NA | Structured Latent Multi-View | Ross-entropy loss | LR,SVM,GNB, KNN, NN | No |
| Liu 2021 | Each pneumonia lesion | 3D | ML | By three experienced radiologists | Feature | NA | LASSO regression | NA | No | Yes |
| Chen 2021 | Consolidation and ground- | 3D | ML | By fifteen radiologists | Feature | NA | SVM | NA | No | No |
| Song 2020 | CT images | 2D | DL | NA | semantic feature extraction | NA | BigBiGAN | NA | SVM, KNN | Yes |
| Sun 2020 | Infected lung | 3D | DL | NA | Feature | NA | AFS-DF | NA | LR, SVM, RF, NN | No |
| Wang 2021 | Pneumonia lesions | 3D/2D | ML | By four radiologists | manual segmentation, Feature | NA | Linear, LASSO, RF, KNN | NA | Linear, LASSO, RF, KNN | Yes |
| Zhou 2021 | Lesion regions | 2D | DL | annotated by 2 radiologists | Segmentation | randomly flipped, cropped | Trinary scheme(DL) | Binary | Plain scheme(DL) | Yes |
| Azouji2021 | X-ray images | 2D | DL | NA | Resizing x-ray images, Contrast limited adaptive histogram equalization, Deep feature extraction, Deep feature fusion | Rotation, translation | LMPL classifier | hinge loss function | NaiveBayes, KNN, SVM,DT, AdaBoostM2, TotalBoost,RF, SoftMax,VGG-Net | No |
| Cardobi 2021 | Lung area | 3D | ML | NA | Segmentation, features extraction | NA | LASSO model | NA | No | No |
| Yang 2021 | Pneumonia lesion | 3D | ML | artificially delineated | Segmentation, features extraction | spatially resampled | SVM | NA | Sigmoid-SVM, Poly-SVM, Linear-SVM, RBF-SVM | No |
| Chikontwe 2021 | CT slices | 3D | DL | NA | Segmentation | random transformations, | DA-CMIL | NA | DeCoVNet, MIL, DeepAttentionMIL, JointMIL | No |
| Zhu 2021 | CT images | 3D | DL | NA | Segmentation, | NA | GACDN | Binary cross entropy | SVM,KNN,NN | No |
| Xie 2020 | CT slices | 3D | DL | NA | Segmentation, | random horizontal flip, random rotation, random scale, random translation, and random elastic transformation | DNN | NA | No | Yes |
| Qi 2021 | Lung field | 3D | DL | NA | segmentation of the lung field, Extraction of deep features, Feature representation | Image rotation, reflection, and translation | DR-MIL | NA | MResNet-50-MIL, MmedicalNet, MResNet-50-MIL-max-pooling, MResNet-50-MIL-Noisy-AND-pooling, MResNet-50-Voting, MResNet-50-Montages | Yes |
| Wang 2020 | Lung area | 3D | DL | NA | fully automatic DL model to segment, normalization, convolutional filter | NA | DL | NA | No | No |
| Yang 2020 | Infection | 3D | DL | NA | Class Re-Sampling Strategies, Attention Mechanism | scaling | Dual-Sampling Attention Network | binary cross entropyloss | RN34 + US, Attention RN34 + US | No |
| Wu 2020 | CT slices | 3D | DL | NA | segmentation | NA | Multi-view deep learning | NA | Single-view model | No |
| Zhang 2021 | Major lesions | 3D | DL | NA | Segmentation | scaling | DL-MLP | NA | DL-SVM,DL-LR, DL-XGBoost | Yes |
| Xin 2021 | Lungs, lobes, and detected opacities | 2D | DL | Confirmed by 3 experienced radiologists and human auditing | Segmentation | NA | LR, MLP, | NA | LR, MLP, | No |
| Guo 2020 | NR | NA | ML | by two radiologists | Segmentation | NA | RF | NA | No | No |
| Fang2020 | Primary lesion | 3D/2D | ML | by two chest radiologists | Segmentation | NA | LASSO regression | NA | No | No |
| Xia 2021 | Lung areas | 2D | DL | NA | Segmentation | random rotation, | DNN | Categorical | No | Yes |
| Huang2020 | Pneumonia lesion | 3D | ML | by two chest radiologists | Segmentation | NA | Logistic model | NA | No | No |
| Wu 2021 | Maximal regions Involving inflammatory lesions | 2D | ML | by two radiologists | feature extraction, manually delineating | NA | RF | NA | No | No |
| Chen2021 | Lesion | 2D | ML | by two radiologists | Segmentation | NA | WSVM | NA | RF, SVM | Yes |
Abbreviations:AFS-DF:adaptive feature selection guided deep forest;AI:artificial intelligence;BigBiGAN: bi-directional generative adversarial network; CT: Computed tomography; CXR: Chest X-Ray; CNN: Convolutional neural network;DA-CMIL: Dual Attention Contrastive multiple instance learning; DT: Decision tree; DNN: Deep Neural Networks; DR-MIL: deep represented multiple instance learning; DL: deep learning; RF: Random Forests; GNB: Gaussian-Naive-Bayes; Grad-CAM: Gradient Weighted Class Activation Mapping; GACDN: generative adversarial feature completion and diagnosis network; IHL:Inductive Hypergraph Learning; KNN: K-nearest neighbor; LR: Logistic-Regression; LASSO: least absolute shrinkage and selection operator; LMPL: large margin piecewise linear; ML: machine learning; MLA: Machine learning algorithms; MLP: Multilayer Perceptron; MERS: Middle East respiratory syndrome; NN: Neural-Networks; ROI: Region of interest; SVM: Support vector machine; THL: Transductive Hypergraph Learning; 2D: two-dimensional;3D: three-dimensional;UVHL: Uncertainty Vertex-weighted Hypergraph Learning; WSVM: weighted support vector machine
Fig. 2Methodological quality evaluated by using the Radiomics Quality Score (RQS) tool. (A). Proportion of studies with different RQS percentage score. (B). Average scores of each RQS item (gray bars stand for the full points of each item, and red bars show actual points).
Fig. 3CLAIM items of the 19 included studies expressed as percentage of the ideal score according to the six key domains. CLAIM, Checklist for Artificial Intelligence in Medical Imaging.
Fig. 4Coupled forest plots of pooled sensitivity and specificity of diagnostic performance of chest imaging for distinguished COVID-19 and other pneumonias. The numbers are pooled estimates with 95 % CIs in parentheses; horizontal lines indicate 95 % CIs.
Fig. 5Diagnostic performance of SROC curve of an artificial intelligence model for distinguishing COVID-19 from other pneumonias on chest imaging. There was an obvious difference between the 95 % confidence and 95 % prediction regions, indicating a high possibility of heterogeneity across the studies.
The results of subgroup analysis.
| Subgroup | Number of study | Sensitivity | I2 | Specificity | I2 | PLR | I2 | NLR | I2 | AUC |
|---|---|---|---|---|---|---|---|---|---|---|
| Imaging modality | ||||||||||
| CRX | 4 | 0.91(0.88,0.94) | 85.6 | 0.96(0.95,0.98) | 95.3 | 26.04(3.73,181.94) | 93.3 | 0.04(0.00,0.41) | 92.6 | 0.9914 |
| CT | 28 | 0.89(0.88,0.90) | 78.9 | 0.89(0.87,0.90) | 62.1 | 6.92(5.35,8.96) | 69.5 | 0.14(0.11,0.19) | 80.0 | 0.9427 |
| Modeling methods | ||||||||||
| Radiomic algorithm | 13 | 0.92(0.90,0.94) | 78.4 | 0.90(0.87,0.92) | 36.8 | 7.16(4.96,10.33) | 53.0 | 0.15(0.08,0.28) | 85.6 | 0.9446 |
| Deep learning | 19 | 0.88(0.87,0.89) | 78.0 | 0.91(0.90,0.92) | 88.5 | 8.32(5.69,12.18) | 82.5 | 0.12(0.09,0.17) | 76.9 | 0.9702 |
| sample size | ||||||||||
| <100 | 18 | 0.87(0.83,0.90) | 65.4 | 0.89(0.86,0.92) | 47.8 | 6.50(4.42,9.58) | 49.3 | 0.18(0.12,0.28) | 59.0 | 0.9371 |
| >100 | 14 | 0.89(0.88,0.90) | 87.0 | 0.91(0.90,0.92) | 90.8 | 8.81(6.02,12.89) | 86.2 | 0.10(0.07,0.14) | 88.6 | 0.9725 |
| ROI | ||||||||||
| Infection regions | 15 | 0.89(0.88,0.90) | 81.0 | 0.89(0.88,0.91) | 48.8 | 6.89(5.20,9.12) | 58.0 | 0.14(0.09,0.20) | 81.3 | 0.9409 |
| others | 16 | 0.88(0.86,0.90) | 80.4 | 0.92(0.90,0.94) | 89.5 | 9.33(5.64,15.45) | 83.3 | 0.11(0.07,0.19) | 83.2 | 0.9691 |
| segmentation | ||||||||||
| 2D | 14 | 0.91(0.89,0.93) | 71.6 | 0.93(0.91,0.95) | 88.9 | 9.71(5.78,16.33) | 79.3 | 0.10(0.06,0.17) | 77.3 | 0.9740 |
| 3D | 15 | 0.88(0.87,0.90) | 85.1 | 0.89(0.87,0.90) | 64.8 | 6.77(4.79,9.57) | 76.6 | 0.15(0.10,0.22) | 85.9 | 0.9386 |
Abbreviations: AUC: area under the curve; CT: Computed tomography; CXR: Chest X-Ray; NLR: negative likelihood ratio; PLR:positive likelihood ratio; ROI: Region of interest;2D: two-dimensional;3D: three-dimensional
Fig. 6Effective sample size (ESS) funnel plots and the associated regression test of asymmetry, as reported by Deeks et al. A p value < 0.10 was considered evidence of asymmetry and potential publication bias.