| Literature DB >> 33842685 |
Norah Alballa1, Isra Al-Turaiki2.
Abstract
The existence of widespread COVID-19 infections has prompted worldwide efforts to control and manage the virus, and hopefully curb it completely. One important line of research is the use of machine learning (ML) to understand and fight COVID-19. This is currently an active research field. Although there are already many surveys in the literature, there is a need to keep up with the rapidly growing number of publications on COVID-19-related applications of ML. This paper presents a review of recent reports on ML algorithms used in relation to COVID-19. We focus on the potential of ML for two main applications: diagnosis of COVID-19 and prediction of mortality risk and severity, using readily available clinical and laboratory data. Aspects related to algorithm types, training data sets, and feature selection are discussed. As we cover work published between January 2020 and January 2021, a few key points have come to light. The bulk of the machine learning algorithms used in these two applications are supervised learning algorithms. The established models are yet to be used in real-world implementations, and much of the associated research is experimental. The diagnostic and prognostic features discovered by ML models are consistent with results presented in the medical literature. A limitation of the existing applications is the use of imbalanced data sets that are prone to selection bias.Entities:
Keywords: COVID-19; Machine learning; artificial intelligence; diagnosis; feature selection; prognosis
Year: 2021 PMID: 33842685 PMCID: PMC8018906 DOI: 10.1016/j.imu.2021.100564
Source DB: PubMed Journal: Inform Med Unlocked ISSN: 2352-9148
Nomenclature of abbreviations.
| Acronyms | Definition |
|---|---|
| ALC | absolute lymphocyte count |
| ALT | alanine aminotransferase |
| AST | aspartate aminotransferase |
| BUN | blood urea nitrogen |
| CKD | chronic kidney disease |
| CRP | C-reactive protein |
| cTnT | cardiac troponin T |
| cTnI | cardiac troponin I |
| hsCRP | high-sensitivity C-reactive protein level |
| IL-6 | interleukin-6 |
| INR | international normalized ratio |
| LDH | lactate dehydrogenase |
| MCHC | mean corpuscular hemoglobin concentration |
| MCV | mean corpuscular volume |
| RDW | red blood cell distribution width |
| WBC | white blood cells |
| TWRF | trees weighting random forest |
| NB | naive Bayes |
| LR | logistic regression |
| SVM | support vector machine |
| KNN | k-nearest neighbors |
| GBDT | gradient boosted decision tree |
| XGBDT | extreme gradient boosted decision tree |
| NN | neural network |
| XGBoost | extreme gradient boosting |
| DNN | deep neural networks |
| DT | decision tree |
| ET | extremely randomized trees |
| PLS | partial least squares |
| EN | elastic net |
| FDA | bagged flexible discriminant analysis |
| LASSO | least absolute shrinkage and selection operator |
| BN | Bayesian network |
| SGD | stochastic gradient descent |
| MLP | multilayer perceptron |
Machine learning algorithms and their main features.
| ML Algorithm | Basic Idea | Features |
|---|---|---|
| NB | Probabilistic classifier | Cannot handle missing data, stable performance [ |
| SVM | Hyperplane optimization | Highly accurate models, less likely to suffer from |
| DT | tree-structured model | Robust, for categorical data, easy to interpret. |
| RF | DT ensemble method | Effective for highly complex problems, best for high-dimensional data sets, can handle missing data and imbalanced data sets. |
| AdaBoost | Ensemble algorithm | Improves the performance of individual weak classifiers, sensitive to noise. |
| KNN | Based on a distance metric to measure the distance between data points. | Choice of a distance metric affects performance; known as |
| GBDT | Ensemble tree induction, seeks to produce a model that minimizes the loss function | Highly flexible [ |
| LR | Predicts the probability that a given data point belongs to a certain class | Easy calculation, can handle continuous numerical values, cannot handle non-linear data. |
| ANN | Inspired by networks of biological neurons | Highly accurate models, difficult to interpret the model (black-box models), requires a large number of parameters. |
| ET | Ensemble tree induction | Good performance, easy to implement, less computational time, fewer optimization parameters [ |
Fig. 1Flowchart of the study selection process.
Fig. 2Relationships between ML approaches and COVID-19 applications reviewed in this article.
Diagnosing COVID-19.
| Study; outcome | Highest-weighted features | ML approaches | Sample size (no. of positive cases) | Performance |
|---|---|---|---|---|
| Age, CT scan result, temperature, lymphocyte, fever, coughing | XGBoost | 413 patients (−) | Sensitivity of 92.5% and specificity of 97.9% | |
| MCHC, eosinophils count, albumin, INR, prothrombin activity % | RF, DNN, and XGBoost (selected) | 5333 patients (160 positive) | AUC of 97%, sensitivity of 81.9%, specificity of 97.9% | |
| Ferritin, WBC, eosinophil, temperature, CRP, LDH, D-dimer, basophil count, monocyte %, AST (in descending order of importance) | XGBoost | 75,991 patients (7335 positive) | Accuracy of 86.4%, specificity of 86.8%, sensitivity of 82.4% | |
| MISSING arterial tactic acid, age, leukocyte count, platelets, creatinine | LR, NN, RF, SVM, and (XGBoost selected) | 5644 (556 positive) | XGBoost model achieved AUC of 0.66, sensitivity of 75%, and specificity of 49% | |
| Total bilirubin, glucose, creatinine, LDH, CK-MB, potassium, total protein, calcium, magnesium, PDW, basophils | RF | 253 samples from 169 suspected patients (105 samples from 27 patients confirmed positive) | AUC of 99.26%, a sensitivity of 100%, and a specificity of 94.44% with an independent test set | |
| AST, lymphocytes, LDH, WBC, eosinophils, ALT, age | DT, ET, KNN, LR, NB, SVM, TWRF, and (RF selected) | 279 patients (177 positive) | AUC of 84%, accuracy of 82%, sensitivity of 92%, PPV of 83%, and specificity of 65% | |
| Leukocyte count, RDW, hemoglobin, serum calcium | RF | 1528 patients (65 positive) | Accuracy of 81%, area under the ROC curve of 0.74, sensitivity of 60%, and specificity of 82% | |
| Lymphocyte, eosinophil, basophil, and neutrophil cell count | Binary LR | 400 total patients (258 positive) | AUC of 88.9%, sensitivity of 80.3%, and PPV of 92.3% | |
| Age, gender, prior medical conditions, smoking habits, fever, sore throat, cough, shortness of breath, loss of taste or smell | LR | 43,752 surveys (498 self-reported COVID-19 positive) | AUC of 0.737 | |
| Neutrophil count, absolute lymphocyte count, hematocrit, male sex | LR | 2777 patients (368 PCR positive) | C-statistic of 78%, sensitivity of 86–93%, and specificity of 35–55% | |
| LDH, ferritin, CRP, calcuim, lymphocytes | LR, DT, RF, and (GBDT, selected) | 5893 patients (1402 positive) | AUC 83.8%, sensitivity 75.8%, and specificity reached 74% with an independent data set | |
| Eosinophils, basophils, and CRP, calcium, presentation oxygen requirement, respiratory rate | LR, RF and (XGBDT, selected) | 114,957 patients (437 positive) | Emergency department and admissions models: AUCs of 88.1% and 87.1%, and accuracies of 92.3% and 92.5% respectively | |
| [not mentioned] | ANN, CNN, RNN, CNNLSTM, and CNNRNN, and (LSTM, selected) | 600 patients (80 positive) | AUC of 62.50%, accuracy of 86.66%, recall of 99.42% | |
| Age, LDH, AST, CRP, calcium, fibrinogen, XDPs, WBC | RF, NB, LR, SVM, and k- KNN | 1624 patients (52% COVID-19 positive) | AUC ranged from 83% to 90% | |
| Inflammatory markers, especially LDH, CRP, and the combination of CRP, LDH, and ferritin | RF, LR, SVM, multilayer perceptron, stochastic gradient descent, XGBoost, and ADABoost | 1455 records (182 positive) | AUC of 91%, sensitivity of 93%, specificity of 64% | |
| Monocytes, platelets, leukocytes, urea, potassium, eosinophils, hemoglobin, lymphocytes, CRP (from highest to lowest) | RF, extra trees and LR as a first level, then XGBoost for the second level | 5644 patients (559 positive) | AUC of 99.38%, sensitivity of 98.72% and specificity of 99.99% | |
| Age, IL-6, systolic blood pressure, monocyte %, fever classification | LR, Ridge regularization, DT, ADABoost, and Lasso regression (selected) | 132 patients (26 positive) | AUC of 84.1% F-1 score of 0.571, recall of 1.000, specificity of 0.727, and precision of 0.400 | |
| [not mentioned]. All 16 features used: mean platelet volume, leukocytes, MCV, creatinine, red blood cells, basophils, monocytes, potassium, lymphocytes, MCHC, RDW, sodium, MCHC, eosinophils, CRP, urea | SVM, SMOTEBoost, and ensembling | 599 patients (81 positive) | Specificity of 92.16%, NPV of 95.29%, and sensitivity of 63.98% |
Mortality risk and severity of COVID-19 prediction.
| Study | Outcome | Highest-weighted features | ML approaches | Sample size (no. of survivors and non-survivors) | Performance |
|---|---|---|---|---|---|
| Prediction of mortality and critical events | Acute Kidney Injury, LDH, tachypnea, glucose, diastolic blood pressure, CRP | XGBoost | 4098 patients (−) | AUC of 80% at 3 days 79% at 5 days, 80% at 7 days, and 81% at 10 days | |
| Prediction of mortality and critical COVID-19 | LDH, lymphocytes, hsCRP | XGBoost | 375 patients (201 survivors, 174 non-survivors) | Accuracy of 93% | |
| Prediction of mortality and critical COVID-19 | LDH, lymphocytes, hsCRP | XGBoost | 404 patients (213 survivors and 191 non-survivors) | Accuracy of 90% | |
| Prediction of mortality | Age, hsCRP, SpO2, neutrophil and lymphocyte count, D-dimer, AST, GFR | XGBoost | 296 patients (19 non-survivor) | AUC of 88% | |
| Prediction of mortality | Age, male sex, higher BMI, higher respiratory rate, higher heart rate, CKD | LR, XGBoost (selected) | 8770 patients (1114 non-survivors) | AUC of 86% | |
| Prediction of mortality | Age, SpO2, CRP, BUN, blood creatinine | XGBoost | 3927 patients (−) | AUC ranged between 92% and 81% using three validation cohorts. | |
| Prediction of mortality | severity, age, serum levels of hs-CRP, LDH, ferritin, IL-10 | XGBoost | 1270 patients (−) | Precision ¿90%, sensitivity ¿85%, and F1 scores ¿0.90 | |
| Prediction of mortality risk | CRP, lactic acid, calcium, BUN, serum albumin | LR and SVM (selected) | 398 patients (355 survivors and 43 non-survivors from COVID-19) | AUC 93%, 91% sensitivity, and 91% specificity | |
| Prediction of critical COVID-19 | Age, GSH, CD3 ratio, total protein | SVM | 336 patients (26 severe/critical) | AUC of 97.57% | |
| Prediction of critical COVID-19 | Age, neutrophil %, calcium, monocyte %, urine test values (urine protein, red blood cells (occult), and pH (urine)) | SVM | 137 patients (75 severe) | Accuracy of 81.48% | |
| Prediction of critical COVID-19 | IL-6, high-sensitivity cTnI, procalcitonin, hsCRP, chest distress, calcium | SVM | 172 patients (60 severe) | SVM achieved accuracy of 91.38%, sensitivity of 90% and specificity of 94% | |
| Prediction of ICU admission | pCO2, creatinine, pH | SVM | 556 patients (35 admitted, 16 ICU) | AUC of 98%, a sensitivity of 80%, and a specificity of 96% | |
| Prediction of mortality risk | Age, hsCRP, lymphocyte count, D-dimer | PLS regression, EN model, RF, FDA, and LR (selected) | 183 patients (115 survivors and 68 non-survivors from COVID-19) | AUC of 88.1%, sensitivity of 83.9%, and specificity of 79.4% | |
| Prediction of mortality | Heart failure, procalcitonin, LDH, COPD, SpO2, heart rate, age | LR | 641 patients (195 admitted to the ICU, 82 non-survivors) | AUC of 82% | |
| Prediction of ICU admission | LDH, procalcitonin, smoking history, SpO2, lymphocyte count | LR | 641 patients (195 admitted to the ICU, 82 non-survivors) | AUC of 74% | |
| Prediction of critical COVID-19 | Comorbidities, respiratory rate, CRP, LDH | LR | 125 patients (32 severe) | AUC of 94.4%, sensitivity of 94.1%, and specificity of 90.2%. | |
| Prediction of mortality risk | Age, lymphocyte count, LDH, SpO2 | LR | 444 patients [299 training, 145 validation, 155/299 and 69/145 non-survivors] | (c = 0·89) and (c = 0·98) for internal and external validation. | |
| Prediction of critical COVID-19 | Age, CRP, D-dimer, product of N/L*CRP*D-dimer | LR | 377 patients (172 severe, 106 non-severe) | AUC of 87.9%, specificity of 73.7% and sensitivity of 88.6% | |
| Prediction of critical COVID-19 | IL-6, CRP, hypertension | LR | 127 patients (16 severe) | AUC of 90.0% | |
| Prediction of critical COVID-19 | Older age; higher LDH, CRP, RDW, BUN, and direct bilirubin; lower albumin | LASSO regression, DT, RF, and SVM, and LR (selected) | 372 patients (72 severe) | AUC of 85.3%, a sensitivity of 77.5%, and specificity of 78.4% | |
| Prediction of mortality and critical COVID-19 | cTnT | Univariate LR | 427 patients (89 non-survivors) | AUC of 94% | |
| Prediction of mortality and critical COVID-19 | LDH | Univariate LR | 427 patients (89 non-survivors) | AUC of 89% | |
| Prediction of mortality and critical COVID-19 | CRP | Univariate LR | 427 patients (89 non-survivors) | AUC of 87% | |
| Prediction of mortality and critical COVID-19 | Albumin | Univariate LR | 427 patients (89 non-survivors) | AUC of 87% | |
| Prediction of mortality and critical COVID-19 | D-dimer | Univariate LR | 427 patients (89 non-survivors) | AUC of 84% | |
| Prediction of mortality and critical COVID-19 | Ferritin | Univariate LR | 427 patients (89 non-survivors) | AUC of 77% | |
| Prediction of mortality and critical COVID-19 | Age, high LDH, low albumin | Multivariate LR | 427 patients (89 non-survivors) | AUC of 88%–89%. | |
| Prediction of mortality | Decreased lymphocyte ratio, elevated BUN, raised D-dimer | Multivariate LR | 336 severe patients (34 non-survivors) | AUC of 99.4%, sensitivity of 100.0% and specificity of 97.2% | |
| Prediction of mortality | IL-6 and lymphocyte subsets (CD8+ T cell) | LR | 1018 patients (−) | AUC of 90.7% | |
| Prediction of mortality and the outcome | Creatine kinase | LR | 127 patients (36 non-survivors) | AUC of 86.4% | |
| Prediction of mortality and outcome | CRP | LR | 127 patients (36 non-survivors) | AUC of 87% | |
| Prediction of mortality and outcome | Ferritin | LR | 127 patients (36 non-survivors) | AUC of 83.3% | |
| Prediction of mortality and outcome | IL-6 | LR | 127 patients (36 non-survivors) | AUC of 78.1% | |
| Prediction of mortality and outcome | Lymphocyte CD3+ | LR | 127 patients (36 non-survivors) | AUC of 91.5% | |
| Prediction of mortality and outcome | LDH | LR | 127 patients (36 non-survivors) | AUC of 92.8% | |
| Prediction of mortality and outcome | Troponin I | LR | 127 patients (36 non-survivors) | AUC of 93.9% | |
| Prediction of mortality and outcome | Prothrombin time | LR | 127 patients (36 non-survivors) | AUC of 92% | |
| Prediction of mortality and outcome | Procalcitonin | LR | 127 patients (36 non-survivors) | AUC of 90% | |
| Prediction of critical COVID-19 | Gender, APACHE II, SOFA, lymphocytes (including subsets), CRP, LDH, AST, cTnT, BNP, WBC, neutrophil count, urea | LR | 47 patients (24 severe) | (not specified) | |
| Prediction of critical COVID-19 | LDH | LR | 47 patients (24 severe) | AUC of 97.27%, sensitivity 100.00% and specificity 86.67% | |
| Prediction of critical COVID-19 | AST | LR | 47 patients (24 severe) | AUC of 92.31% | |
| Predict critical COVID-19 | CPR | LR | 47 patients (24 severe) | AUC of 92.92% | |
| Prediction of critical COVID-19 | Lymphocyte counts (less than 1.045 × 109/L) | LR | 47 patients (24 severe) | AUC of 98.45%, specificity 91.30% and sensitivity 95.24% | |
| Prediction of critical COVID-19 | SOFA score | LR | 47 patients (24 severe) | AUC of 94.93% | |
| Prediction of critical COVID-19 | CT score | LR | 47 patients (24 severe) | AUC of 95.28% | |
| Prediction of mortality risk | Sex, age | SVM, KNN, RF, GB, and (LR, selected) | 3524 patients (74 non-survivors) | AUC of 83% | |
| Prediction of mortality risk | Having a chronic disease; gastrointestinal, kidney, cardiac, respiratory symptoms | Autoencoder, LR, RF, SVM, one-class SVM, isolation forest, local outlier factor | Two data sets: A) 28,958 patients (530 non-survivors) B) 1448 patients (123 non-survivors) | Autoencoder model achieved around 73% AUC, and 97% accuracy. | |
| Prediction of severity | WHO severity classification, acute kidney injury, age, LDH, lymphocytes, aPTT | Bayesian network analysis | 295 patients (−) | ROC of 83.8% and 91% for the models based on WHO classification only, and EPI-SCORE, respectively. | |
| Prediction of ICU admission | Age, fever, tachypnea with or without respiratory crackles | DT | 10,504 (1353 hospitalized, 83 ICU admission) | AUC of 76%, accuracy 68%, and recall 71% | |
| Prediction of critical COVID-19 | Age, hemoptysis, unconsciousness, comorbidities, cancer history, neutrophil-to- lymphocyte ratio, LDH, direct bilirubin | LASSO then LR | 2300 patients (−) | AUCs of 88% in both the training and validation cohorts | |
| Prediction of critical COVID-19 | BUN, age, absolute neutrophil count, RDW, SpO2, serum sodium | LASSO | 11,095 patients (8499 survivors, 2596 non-survivors) | AUCs of 86%, 82%, and 82%, respectively for internal and external validation | |
| Survival analysis and discharge time | Age, sex | stagewise GB, IPCRidge, CoxPH, Coxnet, Componentwise GB, fast SVM, and fast Kernel SVM | 1182 patients (−) | C-index of stagewise GB: 71.47 | |
| Prediction of mortality | Leukomonocyte %, urea, age, SpO2 | LR, simplified LR, and (GBDT, selected) | 2924 patients (257 non-survivors) | AUC of 94.1% | |
| Prediction of mortality risk, up to 20 days in advance | Increased consciousness, male sex, sputum, BUN, respiratory rate, D-dimer, comorbidities, age. Also decreased platelet count, albumin, SpO2, lymphocytes, CKD | LR, SVM, GBDT, and NN | 2520 COVID-19 patients with known outcomes (survivors or non-survivors) | AUC ranging from 91.86% to 97.62% in an internal validation cohort and two external validation cohorts |
Fig. 3Summary of the ML algorithms selected by the reported studies.
Fig. 4Frequently reported features for predicting COVID-19 diagnosis.
Fig. 5Frequently reported features for predicting mortality and severe COVID-19.