| Literature DB >> 34025952 |
Eleni S Adamidi1, Konstantinos Mitsis1, Konstantina S Nikita1.
Abstract
The worldwide health crisis caused by the SARS-Cov-2 virus has resulted in>3 million deaths so far. Improving early screening, diagnosis and prognosis of the disease are critical steps in assisting healthcare professionals to save lives during this pandemic. Since WHO declared the COVID-19 outbreak as a pandemic, several studies have been conducted using Artificial Intelligence techniques to optimize these steps on clinical settings in terms of quality, accuracy and most importantly time. The objective of this study is to conduct a systematic literature review on published and preprint reports of Artificial Intelligence models developed and validated for screening, diagnosis and prognosis of the coronavirus disease 2019. We included 101 studies, published from January 1st, 2020 to December 30th, 2020, that developed AI prediction models which can be applied in the clinical setting. We identified in total 14 models for screening, 38 diagnostic models for detecting COVID-19 and 50 prognostic models for predicting ICU need, ventilator need, mortality risk, severity assessment or hospital length stay. Moreover, 43 studies were based on medical imaging and 58 studies on the use of clinical parameters, laboratory results or demographic features. Several heterogeneous predictors derived from multimodal data were identified. Analysis of these multimodal data, captured from various sources, in terms of prominence for each category of the included studies, was performed. Finally, Risk of Bias (RoB) analysis was also conducted to examine the applicability of the included studies in the clinical setting and assist healthcare providers, guideline developers, and policymakers.Entities:
Keywords: ABG, Arterial Blood Gas; ADA, Adenosine Deaminase; AI, Artificial Intelligence; ANN, Artificial Neural Networks; APTT, Activated Partial Thromboplastin Time; ARMED, Attribute Reduction with Multi-objective Decomposition Ensemble optimizer; AUC, Area Under the Curve; Acc, Accuracy; Adaboost, Adaptive Boosting; Apol AI, Apolipoprotein AI; Apol B, Apolipoprotein B; Artificial intelligence; BNB, Bernoulli Naïve Bayes; BUN, Blood Urea Nitrogen; CI, Confidence Interval; CK-MB, Creatine Kinase isoenzyme; CNN, Convolutional Neural Networks; COVID-19; CPP, COVID-19 Positive Patients; CRP, C-Reactive Protein; CRT, Classification and Regression Decision Tree; CoxPH, Cox Proportional Hazards; DCNN, Deep Convolutional Neural Networks; DL, Deep Learning; DLC, Density Lipoprotein Cholesterol; DNN, Deep Neural Networks; DT, Decision Tree; Diagnosis; ED, Emergency Department; ESR, Erythrocyte Sedimentation Rate; ET, Extra Trees; FCV, Fold Cross Validation; FL, Federated Learning; FiO2, Fraction of Inspiration O2; GBDT, Gradient Boost Decision Tree; GBM light, Gradient Boosting Machine light; GDCNN, Genetic Deep Learning Convolutional Neural Network; GFR, Glomerular Filtration Rate; GFS, Gradient boosted feature selection; GGT, Glutamyl Transpeptidase; GNB, Gaussian Naïve Bayes; HDLC, High Density Lipoprotein Cholesterol; INR, International Normalized Ratio; Inception Resnet, Inception Residual Neural Network; L1LR, L1 Regularized Logistic Regression; LASSO, Least Absolute Shrinkage and Selection Operator; LDA, Linear Discriminant Analysis; LDH, Lactate Dehydrogenase; LDLC, Low Density Lipoprotein Cholesterol; LR, Logistic Regression; LSTM, Long-Short Term Memory; MCHC, Mean Corpuscular Hemoglobin Concentration; MCV, Mean corpuscular volume; ML, Machine Learning; MLP, MultiLayer Perceptron; MPV, Mean Platelet Volume; MRMR, Maximum Relevance Minimum Redundancy; Multimodal data; NB, Naïve Bayes; NLP, Natural Language Processing; NPV, Negative Predictive Values; Nadam optimizer, Nesterov Accelerated Adaptive Moment optimizer; OB, Occult Blood test; PCT, Thrombocytocrit; PPV, Positive Predictive Values; PWD, Platelet Distribution Width; PaO2, Arterial Oxygen Tension; Paco2, Arterial Carbondioxide Tension; Prognosis; RBC, Red Blood Cell; RBF, Radial Basis Function; RBP, Retinol Binding Protein; RDW, Red blood cell Distribution Width; RF, Random Forest; RFE, Recursive Feature Elimination; RSV, Respiratory Syncytial Virus; SEN, Sensitivity; SG, Specific Gravity; SMOTE, Synthetic Minority Oversampling Technique; SPE, Specificity; SRLSR, Sparse Rescaled Linear Square Regression; SVM, Support Vector Machine; SaO2, Arterial Oxygen saturation; Screening; TBA, Total Bile Acid; TTS, Training Test Split; WBC, White Blood Cell count; XGB, eXtreme Gradient Boost; k-NN, K-Nearest Neighbor
Year: 2021 PMID: 34025952 PMCID: PMC8123783 DOI: 10.1016/j.csbj.2021.05.010
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1AI-based clinical prediction models.
Fig. 2PRISMA (preferred reporting items for systematic reviews and meta-analyses) flowchart.
Fig. 3Number of studies per year 2020 quarter.
Fig. 4Included AI based prediction models.
Results for screening models.
| Study, Country, Outcome | No. of CPP* | AI methods | Predictors | Val. methods | Performance (AUC, Accuracy (Acc%), Sensitivity (SEN%), Specificity (SPE%), PPV/NPV (%), (95% CI)) | Risk of Bias**: Participants/Predictors/Outcome/Analysis/Overall | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Yang et al. | 1,898 | LR, DT, RF, GBDT | age, gender, race and 27 routine laboratory tests | 5-FCV | AUC 0.854 (95% CI: 0.829–0.878) | L | U | H | H | H |
| Li et al. | 104 | DL | Imaging features | 5-FCV | AUC 0.999 (95%CI, 1670.997–1.000, SEN 98.2, SPE 97.8 | U | U | U | H | H |
| AS Soltan et al. | 437 | multivariate LR, RF, XGBoost | Presentation laboratory tests and vital signs | TTS, 10-FCV | ED model: AUC 0.939, SEN 77.4, SPE 95.7Admissions model: AUC 0.940, SEN 77.4, SPE 94.8Both models achieve high NPP (>99) | H | H | H | H | H |
| Nan et al. | 293 | DL, LR, SVM, DT, RF | 4 epidemiological features, 6 clinical manifestations (muscle soreness, dyspnea, fatigue, lymphocyte count, WBC, imaging features) | TTS | AUC 0.971, Acc 90, SPE 0.95 (LR optimal screening model) | H | U | H | H | H |
| Soares et al. | 81 | ML, SVM, SMOTE Boost, ensembling, k-NN | Hemogram: (Red blood cells, MCV, MCHC, MCH, RDW, Leukocytes, Basophils, Monocytes, Lymphocytes, Platelets, Mean platelet volume, Creatinine, Potassium, Sodium, CRP, Age | unspecified | AUC 86.78 (95%CI: 85.65–87.90), SEN 70.25 (95%CI: 66.57–73.12), SPE 85.98 (95%CI: 84.94–86.84), NPV 94.92 (95%CI: 94.37–95.37), PPV 44.96 (95%CI: 43.15–46.87) | L | U | H | H | H |
| Feng et al. | 32 | ML, LR (LASSO), DT, Adaboost | lymphopenia, elevated CRP and elevated IL-6 on admission | 10-FCV | AUC 0.841, SPE 72.7 | H | H | H | H | H |
| Wu et al. | 27 | RF | 11 key blood indices: TP, GLU, Ca, CK-MB, Mg, BA, TBIL, CREA, LDH, K, PDW | 10-FVC, Ext. Val. | Acc 95.95, SEN 95.12, SPE 96.97 | L | L | L | H | H |
| Banerjee et al. | 81 | RF, ANN | platelets, leukocytes, eosinophils, basophils, lymphocytes, monocytes. | 10-FCV | AUC 0.95 | H | H | H | H | H |
| Peng et al. | 32 | SRLSR, non-dominated radial slots-based algorithm, ARMED, GFS, RFE | 18 diagnostic factors: WBC, eosinophil count, eosinophil ratio, 2019 new Coronavirus RNA (2019n-CoV), Amyloid-A, Neutrophil ratio, basophil ratio, platelet, thrombocytocrit, monocyte count, procalcitonin, neutrophil count, lymphocyte ratio, lymphocyte count, monocyte ratio, MCHC, Urine SG | not performed | not performed | L | L | U | H | H |
*CPP = COVID-19 Positive Patients, Abbreviations of medical terms included in this Table are provided in the Appendix.
**L: Low, H: High, U: Unclear
Results for screening imaging models.
| Study, Country, Outcome | No. of CPP* | AI methods | Predictors | Val. methods | Performance (AUC, Accuracy (Acc%), Sensitivity (SEN%), Specificity (SPE%), PPV/NPV (%), (95% CI)) | Risk of Bias**: Participants/Predictors/Outcome/Analysis/Overall | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Abdani et al. | 219 | DL, CNN | Imaging features | 5-FCV | Acc 94 | H | U | H | H | H |
| Ahammed et al. | 285 | ML, DL, CNN, SVM, RF, k-NN, LR, GNB, BNB, DT, XGB, MLP, NC, perceptron. | Imaging features | 10-FCV | AUC 95.52, Acc 94.03, SEN 94.03, SPE 97.01 | H | H | H | H | H |
| Barstugan et al. | 53 | ML, SVM | Imaging features | 10-FCV | Acc 99.68, SEN 93, SPE 100 | U | U | U | H | H |
| Wu et al. | 368 | DL | Imaging features | TTS | AUC 0.905, Acc 83.3, SEN 82.3 | L | U | U | H | H |
| Wang et al. | 1647 | DL | Imaging features | Ext. val. | AUC 0.953 (95% CI 0.949–0.959), SEN 92.3 (95% CI 91.4–93.2), SPE 85.1 (84.2–86.0), PPV 79 (77.7–80.3), NPV 94.8 (94.1–95.4) | L | U | U | H | H |
*CPP = COVID-19 Positive Patients, Abbreviations of medical terms included in this Table are provided in the Appendix.
**L: Low, H: High, U: Unclear
Results for diagnostic models.
| Study, Country, Outcome | No. of CPP* | AI methods | Predictors | Val. methods | Performance (AUC, Accuracy (Acc%), Sensitivity (SEN%), Specificity (SPE%), PPV/NPV (%), (95% CI)) | Risk of Bias**: Participants/Predictors/Outcome/Analysis/Overall | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Diagnostic | ||||||||||
| Cabitza et al. | 845 | ML | LDH, AST, CRP, calcium, WBC, age | Int.-ext. val. | AUC 0.83–0.90 | L | H | L | L | H |
| Batista et al. | 102 | ML, NN, RF, GB trees, LR, SVM | lymphocytes, leukocytes, eosinophils | 10-FVC | AUC 0.85, SEN 68, SPE 85, PPV 78, NPV 77 | H | L | H | H | H |
| Cai et al. | 81 | DL | 9 CT quantitative features and radiomic features | TTS | AUC 0.811–0.812, SEN 76.5, SPE 62.5 | H | H | H | H | H |
| Mei et al. | 419 | DCNN | Imaging features, age, exposure to SARS-CoV-2, fever, cough, cough with sputum, WBC | TTS | AUC 0.92, SEN 84.3 | H | H | H | L | H |
| Ren et al. | 58 | AI | unclear | unspecified | AUC 0.740, SEN 91.2, SPE 58.8 | L | U | U | H | H |
*CPP = COVID-19 Positive Patients, Abbreviations of medical terms included in this Table are provided in the Appendix.
**L: Low, H: High, U: Unclear
Results for diagnostic imaging models – part 1.
| Study, Country, Outcome | No. of CPP* | AI methods | Predictors | Val. methods | Performance (AUC, Accuracy (Acc%), Sensitivity (SEN%), Specificity (SPE%), PPV/NPV (%), (95% CI)) | Risk of Bias**: Participants/Predictors/Outcome/Analysis/Overall | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Chen et al. | 51 | DL | Imaging features | TTS | Acc 95.24, SEN 100, SPE 93.55, PPV 84.62, NPV 100 | H | U | L | H | H |
| Rahimzadeh et al. | 118 | DNN, Nadam optimizer | Imaging features | TTS | Acc 99.50 | L | U | L | H | H |
| Roy et al. | 17 | DL | Imaging biomarkers | 5-FCV | F1-score 65.9 | H | U | Η | Η | Η |
| Zhou et al. | 35 | DL | Imaging features | Ext. val. | AUC > 0.93 | H | U | U | H | H |
| Ter-Sarkisov et al. | 150 | DL, CNN | Imaging features | TTS | Acc 91.66, SEN 90.80 | H | U | U | H | H |
| Qjidaa et al. | unclear | DL, CNN | Imaging features | Int.- ext. val. | Acc 92.5, SEN 92 | U | U | U | H | H |
| Babukarthik et al. | 102 | GDCNN | Imaging features | unclear | Acc 98.84, SEN 100, SPE 97.0 | H | H | H | H | H |
| Minaee et al. | unclear | CNN | Raw images without feature extraction | TTS | SEN 98, SPE 92 | U | U | U | H | H |
| Yan et al. | 206 | CNN | Imaging features | TTS | SEN 99.5 (95%CI: 99.3–99.7), SPE 95.6 (95%CI: 94.9–96.2) | L | H | H | H | H |
| Lokwani et al. | 55 | NN | Imaging features | TTS | SEN 96.4 (95% CI: 88–100), SPE 88.4 (95% CI: 82–94) | U | U | H | H | H |
| Jin et al. | 751 | DL, DNN | Imaging features | TTS | AUC 0.97, SEN 90.19, SPE 95.76 | H | U | U | H | H |
| Ko et al. | 20 | 2D DL | Imaging features | TTS, ext.val. | Acc 99.87, SEN 99.58, SPE 100.00 | U | U | U | H | H |
| Ezzat et al. | 99 | Hybrid CNN | Not applicable | TTS | Acc 98 | H | U | H | L | H |
| Ouchicha et al. | 43 | DCNN | Imaging features | 5-FCV | Acc 97.20 | U | U | L | H | H |
| Xiong et al. | 521 | DL, CNN | Imaging features | TTS, ext. val. | AUC 0.95, Acc 96 (95% CI: 90–98), SEN 95 (95% CI: 83–100), SPE 96 (95% CI: 88–99) | H | U | H | H | H |
| Li et al. | 468 | DL, CNN | Imaging features | TTS | AUC 0.96, SEN 90 (95% CI: 83–94), SPE 96 (95% CI: 93–98) | L | U | H | H | H |
| Mahmud et al. | unclear | DCNN | Imaging features | 5- FCV | Acc 97.4 | U | U | U | H | H |
| Li et al. | 305 | NN | Imaging features | TTS | Precision 93% | U | U | L | H | H |
| Sun et al. | 1495 | LR, SVM, RF, NN | 30 Imaging features: Volume features, Infected lesion number, Histogram distribution, Surface area, Radiomics features | 5-FCV | ACC 91.79, SEN 93.05, SPE 89.95 | L | U | H | H | H |
*CPP = COVID-19 Positive Patients, Abbreviations of medical terms included in this Table are provided in the Appendix.
**L: Low, H: High, U: Unclear
Results for prognostic models – part 1.
| Study, Country, Outcome | No. of CPP* | AI methods | Predictors | Val. methods | Performance (AUC, Accuracy (Acc%), Sensitivity (SEN%), Specificity (SPE%), PPV/NPV (%), (95% CI)) | Risk of Bias**: Participants/Predictors/Outcome/Analysis/Overall | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Muhammad et al. | unclear | DT, SVM, NB, LR, RF, K-NN | unclear | 5-FCV | Acc 99.85 (Decision Tree) | H | U | U | U | H |
| Cheng et al. | 1987 | RF | respiratory failure, shock, inflammation, renal failure | TTS, 10-FCV | AUC 79.9 (95% CI: 75.2–84.6), Acc 76.2 (95% CI: 74.6–77.7), SEN 72.8 (95% CI: 63.2–81.1), SPE 76.3% (95% CI: 74.7–77.9) | H | H | H | H | H |
| Kim et al. | 4787 | 55 ML models developed, (XGBoost model revealed the highest discrimination perf.) | age, sex, smoking history, body temperature, underlying comorbidities, activities of daily living (ADL), symptoms | TTS | AUC 0.897, (95% CI 0.877–0.917) | H | U | H | U | H |
| Yadaw et al. | 4802 | ML, RF, LR, SVM, XGBoost | age, minimum oxygen saturation over the course of their medical encounter, type of patient encounter (inpatient vs outpatient and telehealth visits) | TTS | AUC 91 | L | H | H | H | H |
| Klann et al. | 4227 | ML | PaCO2, PaO2, ARDS, sedatives, d-dimer, immature granulocytes, albumin, chlorhexidine, glycopyrrolate, palliative care encounter | 5-FCV, TTS | AUC 0.956 (95% CI: 0.952, 0.959) | U | U | U | H | H |
| Navlakha et al. | 354 | ML, RF, DT | 40 out of 267 clinical variables (3 most important individual lab variables: platelets, ferritin, and AST (aspartate aminotransferase) | 10-FCV | AUC 70–85 | L | H | H | H | H |
| Shashikumar et al. | 777 | DL | vital signs, laboratory values, sequential-organ failure assessment (SOFA) scores, Charlson comorbidity index scores (CCI) index, demographics, length of stay, outcomes | Ext. val., 10-FCV | AUC 0.918 | L | H | H | L | H |
| Bertsimas et al. | 3,927 | XGBoost | Increased age, decreased oxygen saturation (<93%), elevated levels of CRP (>130 mg/L), blood urea nitrogen, blood creatinine | Cross-validation | AUC 0.90 (95% CI: 0.87–0.94) | H | H | U | L | H |
*CPP = COVID-19 Positive Patients, Abbreviations of medical terms included in this Table are provided in the Appendix.
**L: Low, H: High, U: Unclear
Results for Prognostic Imaging models.
| Study, Country, Outcome | No. of CPP* | AI methods | Predictors | Val. methods | Performance (AUC, Accuracy (Acc%), Sensitivity (SEN%), Specificity (SPE%), PPV/NPV (%), (95% CI)) | Risk of Bias**: Participants/Predictors/Outcome/Analysis/Overall | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Fakhfakh et al. | 42 | RNN, CNN | Unclear | unspecified | Acc 92 | H | U | U | L | H |
| Zhu et al. | 408 | SVM, LR | Imaging features | 5-FCV | Acc 85.91 | L | U | U | H | H |
| Qi et al. | 31 | LR, RF | Imaging features (CT radiomics) | 5-FCV | AUC 0.97, SEN 100, SPE 89, (95%CI 0.83–1.0) | U | L | L | H | H |
| Xiao et al. | 408 | DL, CNN, ResNet34 (RNN) | Imaging features | 5-FCV | AUC 0.987 (95% CI: 0.968–1.00), Acc 97.4 | L | U | U | H | H |
| Cohen et al. | 80 | NN | CXR features | not performed | U | U | U | H | H | |
| Salvatore et al. | 98 | LR | Imaging features | not performed | Acc 81, SEN 88, SPE 78 | H | U | U | H | H |
| Liu et al. | 134 | CNN | Imaging features (APACHE-II, NLR, d-dimer level) | Ext. val. | AUC 0.93, (95% CI: 0.87–0.99) | L | U | U | U | U |
*CPP = COVID-19 Positive Patients, Abbreviations of medical terms included in this Table are provided in the Appendix.
**L: Low, H: High, U: Unclear
Results for diagnostic and prognostic Imaging models.
| Study, Country, Outcome | No. of CPP* | AI methods | Predictors | Val. methods | Performance (AUC, Accuracy (Acc%), Sensitivity (SEN%), Specificity (SPE%), PPV/NPV (%), (95% CI)) | Risk of Bias**: Participants/Predictors/Outcome/Analysis/Overall | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Chassagnon et al. | 693 | DL, 2D-3D CNN, RBF SVM, Linear SVM, AdaBoost, RF, DT, XGBoost | 15 radiomics features: imaging from the disease regions (5features), lung regions (5features) and heart features (5features), biological and clinical data (6features: age, sex, high blood pressure (HBP), diabetes, lymphocyte count, CRP level), image indexes (2features: disease extent and fat ratio). | TTS | Acc 70, SEN 64, SPE 77 (Holistic Multi-Omics Profiling & Staging), Acc 71, SEN 74, SPE 82 (AI prognosis model performance) | L | U | L | L | U |
*CPP = COVID-19 Positive Patients, Abbreviations of medical terms included in this Table are provided in the Appendix.
**L: Low, H: High, U: Unclear