| Literature DB >> 36098861 |
N Casillas1,2, A M Torres3, M Moret4, A Gómez4, J M Rius-Peris3,5, J Mateo3.
Abstract
Recently, global health has seen an increase in demand for assistance as a result of the COVID-19 pandemic. This has prompted many researchers to conduct different studies looking for variables that are associated with increased clinical risk, and find effective and safe treatments. Many of these studies have been limited by presenting small samples and a large data set. Using machine learning (ML) techniques we can detect parameters that help us to improve clinical diagnosis, since they are a system for the detection, prediction and treatment of complex data. ML techniques can be valuable for the study of COVID-19, especially because they can uncover complex patterns in large data sets. This retrospective study of 150 hospitalized adult COVID-19 patients, of which we established two groups, those who died were called Case group (n = 53) while the survivors were Control group (n = 98). For analysis, a supervised learning algorithm eXtreme Gradient Boosting (XGBoost) has been used due to its good response compared to other methods because it is highly efficient, flexible and portable. In this study, the response to different treatments has been evaluated and has made it possible to accurately predict which patients have higher mortality using artificial intelligence, obtaining better results compared to other ML methods.Entities:
Keywords: Artificial intelligence; COVID-19; Machine learning; Mortality; Prediction; SARS-CoV-2; XGB
Mesh:
Year: 2022 PMID: 36098861 PMCID: PMC9469825 DOI: 10.1007/s11739-022-03033-6
Source DB: PubMed Journal: Intern Emerg Med ISSN: 1828-0447 Impact factor: 5.472
Demographic characteristics, comorbidities, clinical, and laboratory findings on admission
| All adults ( | Case patients ( | Controls ( | ||
|---|---|---|---|---|
| Age, median, year | 69 (28–97) | 71 (49–92) | 67 (28–97) | 0.536 |
| Sex | <0.0001 | |||
| -Male | 93 (62%) | 43 (46%) | 50 (54%) | |
| -Female | 58 (38%) | 10 (17%) | 48 (83%) | |
| Comorbidities | ||||
| -High Blood Pressure | 82 (54%) | 32 (60%) | 50 (51%) | 0.193 |
| -Dyslipemia | 58 (38%) | 24 (45%) | 34 (35%) | 0.146 |
| -Diabetes | 42 (28%) | 25 (48%) | 17 (18%) | <0.0001 |
| -Coagulopathy disease | 35 (23%) | 17 (32%) | 18 (18%) | 0.046 |
| -Hyperuricemia | 27 (18%) | 11(21%) | 16 (16%) | 0.321 |
| -COPD* | 15 (10%) | 7 (13%) | 8 (8%) | 0.238 |
| -OSA* | 15 (10%) | 7 (13%) | 8 (8%) | 0.238 |
| -Dementia | 12 (8%) | 6 (11%) | 6 (6%) | 0.206 |
| -Asthma | 4 (2%) | 2 (4%) | 2 (2%) | 0.439 |
| -Autoimmune disease | 7 (4%) | 1 (2%) | 6 (6%) | 0.226 |
| -Active Cancer | 3 (2%) | 2 (4%) | 1 (1%) | 0.286 |
| BMI | 0.011 | |||
| -Normal | 17 (11%) | 4 (8%) | 13 (13%) | |
| -Overweight I | 11 (7%) | 4 (8%) | 7 (8%) | |
| -Overweight II | 33 (22%) | 6 (11%) | 27 (27%) | |
| -Obesity I | 19 (12%) | 3 (6%) | 16 (16%) | |
| -Obesity II | 15 (10%) | 7 (13%) | 8 (8%) | |
| -Obesity III | 3 (2%) | 1 (2%) | 2 (2%) | |
| Clinical debut | ||||
| -Dyspnoea | 114 (75%) | 45 (85%) | 69 (70%) | 0.035 |
| -Cough | 111 (73%) | 41 (77%) | 70 (71%) | 0.278 |
| -Fever | 95 (63%) | 35 (66%) | 60 (61%) | 0.827 |
| -Diarrhoea | 31 (21%) | 9 (17%) | 22 (22%) | 0.283 |
| -Nausea and vomiting | 16 (10%) | 3 (6%) | 13 (13%) | 0.126 |
| -Neurological symptoms (anosmia/ageusia) | 13 (9%) | 4 (7%) | 9 (9%) | 0.495 |
| Initial laboratory tests, median | ||||
| -Hemoglobin (g/dl) | 13.56 | 13.67 | 13.5 | 0.361 |
| -Platelets (k/mcl) | 192 | 189 | 194 | 0.233 |
| -Lymphocytes (k/mcl) | 0.58 | 0.39 | 0.68 | 0.360 |
| -Fibrinogen (mg/dl) | 408 | 402 | 410 | 0.750 |
| -D-dimer (ng/ml) | 1983 | 4743 | 676 | <0.0001 |
| -Ferritin (ng/ml) | 1319 | 1748 | 1112 | 0.610 |
| -Creatinine (mg/dl) | 1.27 | 1,42 | 1.18 | 0.346 |
| -Albumin (g/dl) | 4.44 | 5.22 | 3.43 | 0.102 |
| -Creatine-kinase (IU/l) | 237 | 331 | 188 | 0.028 |
| -Lactate dehydrogenase (IU/l) | 725 | 778 | 697 | 0.959 |
| -Reactive C protein (mg/l) | ||||
| 137 | 171 | 118 | 0.194 | |
| COVID-19 Treatment | ||||
| -Antibiotics | 139 (92%) | 50 (94%) | 89 (91%) | 0.692 |
| -Azithromycin | 128 (85%) | 42 (79%) | 86 (88%) | 0.095 |
| -Hydroxychloroquine | 118 (78%) | 37 (70%) | 81 (83%) | 0.042 |
| -Antiviral | ||||
| Lopinavir-ritonavir | 43 (28%) | 16 (30%) | 27 (27%) | 0.212 |
| Darunavir-cobicistat | 30 (20%) | 14 (26%) | 16 (16%) | 0.266 |
| Remdesivir | 0 (0%) | 0 (0%) | 0 (0%) | - |
| -Steroids | 93 (61%) | 37 (70%) | 56 (57%) | 0.087 |
| Pulse | 80 (53%) | 33 (62%) | 47 (48%) | 0.065 |
| -Immunosuppressant | ||||
| Tocilizumab | 13 (9%) | 8 (15%) | 5 (5%) | 0.105 |
| Baricitinib | 2 (1%) | 1 (2%) | 1 (1%) | 0.302 |
| Cyclosporine | 6 (4%) | 3 (5%) | 3 (3%) | 0.423 |
| -N-acetylcysteine | 66 (44%) | 27 (51%) | 39 (40%) | 0.126 |
| -Anticoagulant | ||||
| Prophylactic dose | 66 (44%) | 12 (23%) | 54 (55%) | <0.0001 |
| Intermediate dose | 59 (39%) | 29 (55%) | 30 (31%) | 0.003 |
| Therapeutic dose | 16 (10%) | 9 (17%) | 7 (7%) | 0.061 |
| Coagulation disorders during admission | 13 (7%) | 8 (15%) | 5 (5%) | |
| -Bleeding | 6 (54%) | 4 (57%) | 2(40%) | 0.114 |
| Major bleeding | 2 (33%) | 2 (50%) | 0 (0%) | |
| -Thrombosis | 7 (63%) | 4 (57%) | 3(60%) | 0.358 |
| Pulmonary embolism (PE) | 4 (57%) | 3 (75%) | 1 (33%) | |
| Deep venous thrombosis (DVT) | 2 (28%) | 0 (0%) | 2 (67%) | |
| Both (PE + DVT) | 1 (14%) | 1 (25%) | 0 (0%) | |
Therapy administered and complications during hospitalization COPD Chronic Obstructive Lung Disease, OSA Obstructive sleep apnoea
Fig. 1Training and validation scheme for machine learning methods
Fig. 2Mortality predictors in COVID-19. This figure shows the different parameters that have been identified as predictors of mortality. Section A presents the initial laboratory parameters, section B comorbidities, and section C laboratory evolution and complications during hospital admission of COVID-19 patients
The table displays the mean values and standard deviation of balanced accuracy, recall, precision, AUC, score, MCC, DYI and Kappa score of the ML system and the proposed model evaluated in this work
| Method | Balanced accuracy (%) | Recall | Precision | AUC |
|---|---|---|---|---|
| GNB | 77,32 | 77,41 | 76,77 | 0,77 |
| SVM | 78,89 | 80,01 | 79,92 | 0,80 |
| DT | 82,66 | 82,76 | 82,08 | 0,82 |
| KNN | 85,84 | 85,94 | 85,23 | 0,85 |
| XGB | 93,22 | 93,33 | 92,55 | 0,93 |
| Method | F1 score | MCC | DYI | Kappa |
| GNB | 77,09 | 68,61 | 77,32 | 68,84 |
| SVM | 79,78 | 71,00 | 80,02 | 71,24 |
| DT | 82,42 | 73,35 | 82,67 | 73,60 |
| KNN | 85,59 | 76,18 | 85,85 | 76,43 |
| XGB | 92,94 | 82,72 | 93,22 | 82,99 |
Fig. 3Section A: ROC curves for the five assessed machine learning predictors. Section B: The figure shows the radar plot of the training phase (left) and test (right) for the prediction of mortality in COVID-19 patients
The table shows the results of the comparison of the proposed method and the various published methods for the prediction of mortality in COVID-19 patients
| Author | Classifier | ACC | AUC | |
|---|---|---|---|---|
| Li et al. [ | LASSO | – | >80 | – |
| Hu et al. [ | AUROCs | – | 88 | – |
| Mele et al. [ | PM2.5 | – | – | 77.8 |
| Di Castelnuovo et al. [ | RF | 83.4 | – | 90.4 |
| Proposed | XGB | 93.22 | 93 | 92.94 |