| Literature DB >> 34983496 |
Khadijeh Moulaei1, Mostafa Shanbehzadeh2, Zahra Mohammadi-Taghiabad3, Hadi Kazemi-Arpanahi4,5.
Abstract
BACKGROUND: The coronavirus disease (COVID-19) hospitalized patients are always at risk of death. Machine learning (ML) algorithms can be used as a potential solution for predicting mortality in COVID-19 hospitalized patients. So, our study aimed to compare several ML algorithms to predict the COVID-19 mortality using the patient's data at the first time of admission and choose the best performing algorithm as a predictive tool for decision-making.Entities:
Keywords: Artificial intelligence; COVID-19; Coronavirus; Machine learning; Prediction hospital mortality
Mesh:
Year: 2022 PMID: 34983496 PMCID: PMC8724649 DOI: 10.1186/s12911-021-01742-0
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig.1Flowchart describing patient selection
Confusion matrix
| Output | Predicted values | |
|---|---|---|
| Death (+) | Survive (−) | |
| Actual value | ||
| Death (+) | TP | FN |
| Survive (−) | FP | TN |
True Positive = the number of cases that are truly classified as positive by the algorithm
False Positive = the number of cases that are falsely classified as positive by the algorithm
False Negative = the number of cases that are falsely classified as negative by the algorithm
True Negative = the number of cases that are truly classified as negative by the algorithm
The performance evaluation measures
| Performance criteria | Calculation |
|---|---|
| Accuracy | (TP + TN) /(TP + TN + FP + FN) |
| Precision | TP/(TP + FP) |
| Sensitivity/ Recall | TP/(TP + FN) |
| Specificity | TN/(TN + FP) |
True Positive (TP), True Negative (TN), False Positive (FP), False Negative (FN)
Identifying the initial list of features affecting mortality in patients with COVID-19
| Classes | Number of suggested features | Delphi round | Final features | Included features | Excluded features | |
|---|---|---|---|---|---|---|
| < 75% | 75% < | |||||
| Demographic | 9 | 6 | 3 | 3 | Gender, age, length of hospitalization | Body mass index, blood group, marital status, ethnicity, place of birth, level of education |
| Risk factors | 10 | 2 | 7 | 7 | Smoking, ICU admission, hypertension, pneumonia, diabetes, cardiac disease, another underline disease | Recent travel, exposure type |
| Clinical manifestations | 23 | 9 | 14 | 14 | Dyspnea, sore throat, runny nose, loss of taste, loss of smell, contusion, muscular pain, chill, fever, cough, nausea/ vomiting, chest pain and pressure, headache, gastrointestinal symptoms | Weakness, sneezing, exudative pharyngitis, mucus or phlegm, conjunctivitis, hemoptysis, anorexia, dry mouth, decrease consciousness |
| Laboratory results | 24 | 11 | 13 | 13 | Blood urea nitrogen, white cell count, C-reactive protein, hypersensitive troponin, glucose, erythrocyte sedimentation rate, creatinine, alkaline phosphatase, aspartate aminotransferase, alanine aminotransferase, absolute lymphocyte count, absolute neutrophil count, | Hematocrit, red cell count, hemoglobin, total bilirubin, thromboplastin time, prothrombin time, albumin calcium, phosphorus, magnesium, sodium, potassium |
| Therapeutic plan | 1 | 0 | 1 | 1 | Oxygen therapy | |
Features affecting predicting mortality in patients with COVID-19
| Row | Features name | Degree of importance | Row | Features name | Degree of importance |
|---|---|---|---|---|---|
| 1 | Dyspnea | 0.5532 | 21 | Chest pain and pressure | 0.2256 |
| 2 | ICU admission | 0.5409 | 22 | Absolute neutrophil count | 0.2123 |
| 3 | Oxygen therapy | 0.3789 | 23 | Headache | 0.1992 |
| 4 | Age | 0.3207 | 24 | Gender | 0.186 |
| 5 | Fever | 0.3142 | 25 | Gastrointestinal symptoms | 0.1802 |
| 6 | Cough | 0.3072 | 26 | White cell count | 0.1702 |
| 7 | Loss of taste | 0.2944 | 27 | C-reactive protein | 0.1574 |
| 8 | Loss of smell | 0.2923 | 28 | Hypersensitive troponin | 0.1428 |
| 9 | Hypertension | 0.2768 | 29 | Pneumonia | 0.1066 |
| 10 | Contusion | 0.2744 | 30 | Glucose | 0.0906 |
| 11 | Muscular Pain | 0.2731 | 31 | Erythrocyte sedimentation rate | 0.0826 |
| 12 | Chill | 0.2537 | 32 | Creatinine | 0.0716 |
| 13 | Runny noise | 0.2532 | 33 | Alkaline phosphatase | 0.0678 |
| 14 | Blood urea nitrogen | 0.2524 | 34 | Length of hospitalization | 0.0626 |
| 15 | Diabetes | 0.2506 | 35 | Aspartate aminotransferase | 0.0445 |
| 16 | Sore throat | 0.25 | 36 | Smoking | 0.0427 |
| 17 | Absolute lymphocyte count | 0.2339 | 37 | Alanine aminotransferase | 0.0319 |
| 18 | Nausea/vomiting | 0.2301 | 38 | Platelet count | 0.0210 |
| 19 | Other under line disease | 0.2282 | 39 | ||
| 20 | Cardiac disease | 0.2274 |
Descriptive statistics of the Features
| Features (quantitative) | Range | Mean (SD) |
|---|---|---|
| Age (year) | 18–100 | 57.25 (17.8) |
| Leng of hospitalization | 1–32 | 61.89 (13.25) |
| Creatinine (mg/dL) | 0.1–17.9 | 51.39 (14.4) |
| White-cell count | 1300–63,000 | 82.34 (4897.4) |
| Platelet count | 108,000–691,000 | 66.2 (38.1) |
| Absolute lymphocyte count | 2–95 | 23.74 (11.8) |
| Absolute neutrophil count | 8–98 | 74.52 (12.3) |
| Blood urea nitrogen | 0.5–251 | 42.52 (31.7) |
| Aspartate aminotransferase | 3.8–924 | 44.45 (53.5) |
| Alanine aminotransferase | 2–672 | 38.29 (41.6) |
| Glucose | 18–994 | 36.09 (74.2) |
| Lactate dehydrogenase | 4.6–6973 | 55.68 (339.0) |
| Prothrombin time | 0.9–46.8 | 42.82 (23.9) |
| Alkaline phosphatase | 9.6–2846 | 21.12 (39.2) |
| Erythrocyte sedimentation rate | 2–258 | 40.65 (28.8) |
Performance evaluation of the selected ML algorithms for COVID-19 death prediction
| Algorithms | Sensitivity (%) | Specificity (%) | Accuracy (%) | Precision (%) | ROC (%) |
|---|---|---|---|---|---|
| Random forest | 90.70 | 95.10 | 95.03 | 94.23 | 99.02 |
| XGBoost | 90.89 | 95.01 | 94.25 | 92.43 | 98.18 |
| KNN | 97.38 | 82.15 | 89.56 | 80.11 | 96.78 |
| MLP | 90.81 | 91.07 | 91.25 | 87.19 | 96.49 |
| Logistic regression | 91.45 | 84.47 | 91.23 | 83.94 | 94.22 |
| J48 decision tree | 87.77 | 94.47 | 92.17 | 89.97 | 92.19 |
| Naïve Bayes | 90.44 | 84.31 | 87.47 | 81.32 | 92.05 |
Fig. 2Visual comparisons of ML algorithm capabilities for COVID-19 death prediction
Fig. 3ROC chart of selected ML algorithms