| Literature DB >> 34575670 |
Francesca Bottino1, Emanuela Tagliente1, Luca Pasquini2,3, Alberto Di Napoli2,4, Martina Lucignani1, Lorenzo Figà-Talamanca5, Antonio Napolitano1.
Abstract
More than a year has passed since the report of the first case of coronavirus disease 2019 (COVID), and increasing deaths continue to occur. Minimizing the time required for resource allocation and clinical decision making, such as triage, choice of ventilation modes and admission to the intensive care unit is important. Machine learning techniques are acquiring an increasingly sought-after role in predicting the outcome of COVID patients. Particularly, the use of baseline machine learning techniques is rapidly developing in COVID mortality prediction, since a mortality prediction model could rapidly and effectively help clinical decision-making for COVID patients at imminent risk of death. Recent studies reviewed predictive models for SARS-CoV-2 diagnosis, severity, length of hospital stay, intensive care unit admission or mechanical ventilation modes outcomes; however, systematic reviews focused on prediction of COVID mortality outcome with machine learning methods are lacking in the literature. The present review looked into the studies that implemented machine learning, including deep learning, methods in COVID mortality prediction thus trying to present the existing published literature and to provide possible explanations of the best results that the studies obtained. The study also discussed challenging aspects of current studies, providing suggestions for future developments.Entities:
Keywords: COVID; computer Tomography (CT); deep learning; imaging; machine learning; mortality; prediction
Year: 2021 PMID: 34575670 PMCID: PMC8467935 DOI: 10.3390/jpm11090893
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Studies Data Sources, Samples and Validation Characteristics.
| Reference | Centers | Location | Survived and | Type of Validation | Sample Size Train | Sample Size Test | Online Available Dataset |
|---|---|---|---|---|---|---|---|
| [ | Five centers | United states | 2392 survived, 1323 non survived (deaths up to 3 gg:140; 5 gg: 281; 7 gg: 393;10 gg: 509) | Internal, External, Internal Prospectively, Prospectively Merged | 1514 (deaths up to 3 gg: 40; 5 gg: 74; 7 gg: 112; 10 gg: 182) (center1) | external: 2201 (deaths up to 3 gg:135; 5 gg:276; 7 gg: 382; 10 gg:494) (center 2, 3, 4, 5; time1) prospectively merge: 383 (deaths up to 3 gg: 3; 5 gg: 5; 7 gg: 11; 10 gg:15) (all five center merged; time2) | NO |
| [ | Single center | United Kingdom | 275 survived, 93 non survived | Internal | 318 | 80 | NO |
| [ | Single center | United Kingdom | 275 survived, 93 non survived | Internal | 318 | 80 | NO |
| [ | Four centers | Korea | 299 survived, 214 non survived | Internal, External | 361 (195 survived, 212 non survived) (center1) | external: 106 (survived 104, non survived 2) (center2,3,4) | NO |
| [ | Thirty centers | Italy | 3182 survived, 712 non survived, 41.5% of whom resident in Central/Southern Italian regions (15.6% death north italy; 6.4% center-south); | Merged | 2725 (all thirty centers merged) | 1169 (all thirty centers merged) | NO |
| [ | Single center | United states | 355 survived, 43 non survived | Internal | 318 | 80 | NO |
| [ | Two multicentric dataset open source | China | Cohort1: 28428 survived, 530 non survived; Cohort2: 1325 survived, 123 non survived | Two double merged | Training (1): 8687 (chort1); training (2): 434 (chort2) | double merge (1): first 14190, secondly 6081 (cohort1), double merge (2): first 710, secondly 304 (chort2) | YES |
| [ | Single center | China | 2737 survived, 259 non survived | Internal | 2339 (center1) | internal: 585 (center1); external: 72 (70 survived, 2 non survived) (center2) | NO |
| [ | Single center | China | 142 survived; 39 non survived | Internal | 154 | 27 | NO |
| [ | Two multicentric dataset open source | China | 662 survived, 57 non survived (169933 slices) | Merged | YES | ||
| [ | Single center | United states | 2985 survived, 506 non survived | Internal | 2793 | 698 | NO |
| [ | Three centers | China | 1906 survived; 254 non survived | Internal, Two External | 621 (535 survived, 86 non survived) (center1) | internal: 622 (533 survived, 89 non survived) (center1); external (1): 801 (741 survived, 60 non survived) (center2); external (2): 116 (97 survived, 19 non survived) (center3) | NO |
| [ | Thirty three centers | Italy, Spain, United States | 2302 survived; 760 non survived | Merged, Three External | 2755 (25 centers merged) | merge: 760 (1:25 centers), external (1): 323 (center26), external (2): 219 (27:32 centers), external (3): 323 (center33) | NO |
| [ | Multicentric database | Korea | 7772 survived, 228 non survived the dataset was splitted according to the ratio 7:3 | Merged | 5600 | 2400 | NO |
| [ | Two centers | China | 1198 survived; 72 non survived | Internal, External | 554 (513 survival, 41non survival) (center1) | Internal: 233 (217 survival, 16 non survival) (center1) External: 286 (279 survival, 7 non survival) (center2) | NO |
| [ | Five centers | United states | 3519 survived; 510 non survived | Five Internal | Training (1): 463 (center1), training (2): 1151 (center2), training (3): 524 (center3), training (4): 378 (center4), training (5): 340 (center2) | Internal (1): 148 (center1), internal (2): 493 (center2), internal (3): 225 (center3), internal (4): 162 (center4), internal (5): 145 (center2) | NO |
| [ | Two centers | China | 148 survived, 99 non survived | Internal, External | 183 (115 survived, 68 non survived) (center1) | external: 64 (33 survived, 31 non survived) (center2) | NO |
| [ | Single center | China | (1) 298 Survivaved, 187 non survived, (2) 189 survived, 162 non survived | Internal Prospectively Internal | Training (1): 375 (time1), training (2): 246 | internal prospectively (1): 110 (time2), internal (2): 105 | NO |
| [ | Single center | United states | 3226 survived, 1087 non survived | Internal | 3468 | 845 | NO |
| [ | Two centers | Iran, United States | 193 patients | External | 105 (center1) | 88 (center1) | NO |
| [ | Multicentric database | United States | 5308 patients | Internal | 3597 (2909 survived, 688 non survived) | 1711 | Researcher affiliated with Mass General Brigham may apply for access |
| [ | Multicentric database | United States | 648 survived, 87 non survived | Merged | NO | ||
| [ | Single center | Italy | 266 survived, 75 non survived | Internal | NO | ||
| [ | Multicentric database | United States | 2619 survived, 776 non survived | Merged | 2054 (1602 survived, 452 non survived) | Internal: 477 (361 survived 116 non survived) | NO |
Studies Features and Feature Ranking Techniques. Abbreviations: CRP, C-Reactive Protein; LDH, Lactate Dehydrogenase; OS, Oxygen Saturation; BUN, Blood Urea Nitrogen; RDW, Red Cell Distribution Width; DBP, Diastolic Blood Pressure; RP, Respiratory pathology; CKD, Chronic kidney disease; IHD, Ischemic heart disease; CE, Cerebrovascular event; EGFR, Estimated glomerular filtration rate; MPV, Mean platelet volume; PLCR, Platelet large-cell ratio; PT, Prothrombin time; PDW, Platelet distribution width; AA, Aspartate aminotransferase; ISR, International standard ratio; BMI, Body mass index; LOS, Lenght of stay in hospital; sBP, sistolic blood pressure; dBP, diastolic blood pressure; tAC, anticoagulation treatment; RR, Respiratory rate; MCV, Mean Corpuscolar Volume; IL-10: Interleukina—10. Yes if Included the study; No if Not Included in the study.
| Reference | Features Type | Features Class | Features Selection | Dimension Reduction | Most Important Features (in Order) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Binaries | Continuous | Images | Demographics | Commorbities | Syntoms | Vital Signs | Laboratory | Tratment | SHAP | LASSO | Others | |||
| [ | Yes | Yes | No | Yes | Yes | No | Yes | Yes | No | Yes | Yes | No | 73 to 10 | Age, Anion Gap, CRP, LDH, OS, BUN, Ferritin, RDW, DBP, Lactate |
| [ | Yes | No | No | Yes | Yes | Yes | No | No | Yes | Yes | No | No | 22 | Altered mental status, Dyspnea, Age, Gender, Cough, RP, Hyperthension, Fever, CKD, IHD, CE, Myalgia, Smoking history, Cardiac failure, Days of symptoms, Obesity, Diarrea or vomiting, Anosmia e/o ageusia, Liver cirrosis, Diabetes, Abdominal pain |
| [ | Yes | Yes | No | Yes | Yes | Yes | Yes | Yes | Yes | No | No | Yes | 73 to 30 | Lymphocytes count, Neutrophils, Albumin, LDH, Neutrophil count, CRP, Prothrombin activity, Calcium, Urea, EGFR, Monocytes, Globulin, Eosinophils, Glucose, RDW, HCO3-, RDW STD, Platelet count, MPV, PLCR, PT, Total protein, PDW, AA, Thrombocytopenia, Eosinophil count, Alkaline phosphatase, ISR, Age, Gender |
| [ | Yes | No | No | Yes | Yes | No | No | Yes | No | No | No | Yes | 12 | EGFR, CRP, Age, Diabetes, Gender, Hyperthension, Smoking, Lung Disease, Myocardial infarction, Obesity, Hearth failure, Cancer |
| [ | No | Yes | No | No | No | No | No | Yes | No | Yes | No | Yes | 26 to 5 | CRP, BUN, serum calcium, serum albumin, lactic acid |
| [ | Yes | No | No | Yes | Yes | Yes | No | No | Yes | No | No | No | No | No |
| [ | Yes | Yes | No | Yes | Yes | Yes | Yes | Yes | Yes | No | No | Yes | 1224 to 83 | LDH, BUN, Lymphocyte count, age, SPO2, Platlets, CRP, IL-10, HDL-C, SAO2 |
| [ | Yes | Yes | No | Yes | Yes | Yes | Yes | Yes | Yes | No | No | Yes | 56 to 5 | D-dimer, O2 index, Lymphocyte count, CRP, Doarrhea |
| [ | Yes | Yes | Yes | No | No | No | No | Yes | No | No | No | No | No | No |
| [ | Yes | Yes | No | Yes | Yes | No | Yes | No | Yes | Yes | No | No | 34 to 20 | Vented, Respiration, BMI, LOS, Race, Pulse, ICU_adm, sBP, dBP, Temp, Pressors, tAC duration in days, Steroid Treatment duration in days, Hearth, Diabetes, Cancer, Steroid, tAC, Hypertension, Gender |
| [ | Yes | Yes | No | Yes | Yes | No | Yes | No | No | No | Yes | No | 34 to 8 | Consciousness, male sex, sputum, BUN, RR, D-dimer, number of comorbidities, age |
| [ | Yes | Yes | No | Yes | Yes | No | Yes | Yes | No | Yes | No | No | 22 | Age, BUN, CRP, OS, Blood creatinine, Blood glucose, AA, Platelets, MCV, White Blood Cells |
| [ | Yes | No | No | Yes | Yes | Yes | No | No | No | No | Yes | Yes | Not specified | Cox analysis: age > 70, male sex, disability, symptoms, infection at home; LASSO: age > 80, taking of acarbose, age > 70, taking of metformin, underlying cancer; RF: cluster infection, infection from personal contact or visit, underlying hypertension, age > 80 |
| [ | Yes | Yes | No | Yes | Yes | Yes | No | Yes | Yes | No | Yes | No | 48 to 6 | Severity, CRP, Age, LDH, Serum ferritin, IL-10 |
| [ | Yes | No | No | Yes | Yes | No | No | No | No | No | No | No | No | No |
| [ | Yes | Yes | No | Yes | Yes | Yes | Yes | Yes | No | No | No | Yes | 20 to 4 | Age, CRP, Lymphocyte count and d-dimer |
| [ | Yes | Yes | No | Yes | No | Yes | No | Yes | No | No | No | Yes | 75 to 3 | LDH, Lymphocyte count, CRP |
| [ | No | Yes | No | Yes | Yes | No | No | No | Yes | Yes | No | No | 48 to 10 | sBP, dBP, Age, LDH, SPO2, RR, BUN, Troponin level, D-dimer level, Charlson comorbidity score |
| [ | No | No | Yes | No | No | No | No | No | No | No | No | No | No | No |
| [ | Yes | Yes | No | Yes | Yes | No | Yes | Yes | Yes | Yes | No | No | Not specified | EGFR, use of ventilation, Lymphocyte count, Neutrophil count, RR, procalcitonin, serum anion gap, serum potassium |
| [ | Yes | Yes | No | Yes | Yes | No | No | Yes | Yes | Yes | No | No | 109 to 10 | Age, LDH, Ferritin, Neutrophil count, INR, Procalcitonin, CRP, Hemoglobin, AA, D-dimer |
| [ | Yes | Yes | No | Yes | Yes | No | Yes | Yes | Yes | No | No | Yes | Not specified | Age, Creatinine, AA, OS, Lymphocytes, Platelets, Hemoglobin, Quick SOFA 2 |
| [ | Yes | Yes | No | Yes | No | No | Yes | Yes | No | No | No | Yes | 142 | Pulse oximetry, RR, sBP, BUN, white blood cell, Age, Length of stay, lymphocyte per cent |
Best performing methods and metrics results. Abbreviations: XGBoost, Extreme Gradient Boosting; ANN, Artificial Neural Network; DNN, Dense Neural Network; RF, Random Forest; SVM, Support Vector Machine; GBDT, Gradient Boosting Decision Tree; PLS, partial least square; LR, Logistic Regressor; NN, Neural Network; MLP, Multi-Layer Perceptron; LightGBM, Light Gradient Boosting Machine. No Not Included in the study.
| Ref. | Machine Learning Technique | Metrics | k-Fold | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Accuracy | AUC-ROC | AU-PRC | Sensitivity | Specificity | PPV | NPV | F1-Score | MCC | Balanced Accuracy | |||
| [ | Ensemble (XGBoost) | int val (3 days): 97.6%; ext val (3 days): 93.6%; int val prosp (3 days): 97.1% merged val prosp (3 days): 94.2% | int val (3 days): 89%, ext val (3 days): 87.7%; int val prosp (3 days): 96.2%; merged val prosp (3 days): 87.9% | int val (3 days): 44.5%, ext val (3 days): 44.4% int val prosp (3 days): 55.1%; merged val prosp (3 days): 13.1% | int val (3 days): 44.2%, ext val (3 days): 37% int val prosp (3 days): 50% merged val prosp (3 days): 33.3% | int val (3 days):99.1%, ext val (3 days): 93.6%, int val prosp (3 days): 97.1%; merged val prosp (3 days): 94.2% | No | No | int val (3 days): 49.8%; ext val (3 days): 41.7% int val prosp (3 days): 28.6%; merged val prosp (3 days): 14.3% | No | No | 10 |
| [ | ANN | int val 86.25% | int val: 90.12% | No | int val: 87.5% | int val: 85.9% | int val: 60.87% | int val: 96.49% | No | No | No | 10 |
| [ | Cox Regressor | int val: 83.75% | int val: 86.9% | No | int val: 50% | int val: 96.6% | int val: 84.6% | int val: 83.6% | No | No | No | 10 |
| [ | Ensemble (DNN + RF) | int val: 93% ext val: 92% | No | No | int val: 92% ext val: 100% | int val: 93%, ext val: 100% | No | No | No | No | int val: 93%, ext val: 96% | 100 |
| [ | Ensemble (RF) | merged val: 83.4% | No | No | merged val: 95.2% | merged val: 30.8% | No | No | merged val: 90.4% | No | No | 10 |
| [ | SVM | No | int val: 93% | int val: 76% | int val: 91% | int val: 91% | No | No | No | No | No | Unclear |
| [ | Auto encoder | No | No | No | No | No | No | No | No | No | No | Unclear |
| [ | Ensemble (GBDT) | int val (severe): 88.9%, int val (non-severe): 92.4%, int val(total): 79.9% | int val (severe): 94.1%, int val (non-severe): 93.2%, int val(total):91.8% | No | int val (severe): 89.9%, int val (non-severe): 94% int val(total):77.4% | int val (severe): 78.8%, int val (non-severe): 61.9%; int val(total):90.3% | int val (severe): 43.2% int val (non-severe):35.1% int val(total):48.3% | int val (severe):97.8% int val (non-severe): 97.9% int val(total):97.2% | No | No | No | 5 |
| [ | DNN | No | int val: 96.8% | No | No | No | No | No | No | No | No | 5 |
| [ | PLS | merged val: 78.73% | merged val: 85.6% | No | merged val: 88.24% | merged val: 78.26% | merged val: 16.67% | merged val: 99.26% | No | merged val: 52.36% | No | 10 |
| [ | Ensemble (CatBoost) | int val: 80.3% | int val: 85% | No | No | No | int val: 79% | int val: 81.6% | No | No | No | Unclear |
| [ | Ensemble (LR,SVM,GBDT,NN) | int val: 96.21% ext val 1: 97.6% ext val 2: 92.46% | int val:92.4%, ext val 1: 95.5%, ext val 2: 87.9% | No | No | No | No | No | No | No | No | 10 |
| [ | Ensemble (XGBoost) | merged val: 85.02%, ext val 1: 74.92%, ext val 2:86.76%, ext val 3: 61.3% | merged val:90.19%, ext val 1:87.45%, ext val 2:91.62%, ext val 3:80.66% | No | No | merged val: 86.58%, ext val 1: 74.23%, ext val 2: 87.43%, ext val 3: 58.12% | merged val:66.3%, ext val 1:25.74%, ext val 2:48.94%, ext val 3:24.18% | merged val:93.02%, ext val 1: 97,3% ext val 2:97,09%, ext val 3:94,71% | No | No | No | Unclear |
| [ | SVM | No | merged val: 96.2% | No | merged val: 92.0% | merged val: 91.8% | merged val: 25.7% | merged val: 99.7% | No | No | merged val: 91.9% | 10 |
| [ | Ensemble (XGBoost) | int val: 99.1% ext val: 99.7% | No | No | int val: 87.5% ext val: 85.7% | No | int val: 99.1% ext val: 99.7% | No | No | No | No | 500 |
| [ | Federate learning (MLP) | int val: 78% | int val: 83.6% | int val: 27.6% | int val: 80.5% | int val: 70.2% | No | No | int val: 32.8% | No | No | 490 |
| [ | LR | No | int val: 89.5%, ext val: 88.1% | No | int val: 89.2% ext val: 83.9% | int val: 68.7% ext val: 79.% | No | No | No | No | No | 10 |
| [ | Ensemble (XGBoost) | No | No | No | int val: 95%ext val prosp: 98% | No | int val: 95%ext val prosp: 91% | No | int val: 95%ext val prosp: 94% | No | No | 500 |
| [ | Ensemble XGBoost | No | int val: 90.3% | int val: 79.1% | int val: 83.8% | int val: 83.6% | int val: 60,9% | int val: 94,4% | No | No | No | 10 |
| [ | LR | No | ext val: 73.6% | No | No | No | No | No | No | No | No | Unclear |
| [ | Ensemble (RF) | No | No | No | No | No | No | No | Int val: 87% | No | No | Unclear |
| [ | LightGBM | No | merged val: 88% | No | No | No | No | No | No | No | No | 10 |
| [ | Ensemble (RF) | No | int val: 84% | No | int val: 78.8% | int val: 77.4% | No | No | No | No | No | Unclear |
| [ | Ensemble GBDT | merged val prosp: 96% | merged val prosp: 99% | No | merged val prosp: 24% | merged val prosp: 97% | merged val prosp: 90% | merged val prosp: 98% | No | No | No | Unclear |