| Literature DB >> 35513811 |
Negar Bakhtiarvand1, Mehdi Khashei1,2, Mehdi Mahnam3,4, Somayeh Hajiahmadi5.
Abstract
BACKGROUND: Coronavirus outbreak (SARS-CoV-2) has become a serious threat to human society all around the world. Due to the rapid rate of disease outbreaks and the severe shortages of medical resources, predicting COVID-19 disease severity continues to be a challenge for healthcare systems. Accurate prediction of severe patients plays a vital role in determining treatment priorities, effective management of medical facilities, and reducing the number of deaths. Various methods have been used in the literature to predict the severity prognosis of COVID-19 patients. Despite the different appearance of the methods, they all aim to achieve generalizable results by increasing the accuracy and reducing the errors of predictions. In other words, accuracy is considered the only effective factor in the generalizability of models. In addition to accuracy, reliability and consistency of results are other critical factors that must be considered to yield generalizable medical predictions. Since the role of reliability in medical decisions is significant, upgrading reliable medical data-driven models requires more attention.Entities:
Keywords: COVID-19; Data analysis; Disease severity; Forecasting and modeling; Multiple linear regression (MLR); Reliability and accuracy
Mesh:
Year: 2022 PMID: 35513811 PMCID: PMC9069125 DOI: 10.1186/s12911-022-01861-2
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 3.298
Recent studies on predicting the severity of Covid-19 patients
| Author/[Ref.] | Scope | Attributes | Methods | Performance | Size | Country |
|---|---|---|---|---|---|---|
| Zhang et al. [ | Severity of COVID-19 | Clinical and laboratory variables | Univariable and multivariable logistic regression models | AUC=0.906 | 80 | China |
| Hajiahmadi et al. [ | ICU and death | CT severity score | Logistic regression model | AUC=0.764 | 192 | Iran |
| Homayounieh et al. [ | ICU and death | Interpretation of radiologists, clinical variables, lung radiomics | Multiple logistic regression model | AUC =0.84 (for ICU admission) | 315 | Iran |
| Huang et al. [ | Severe cases | Clinical and laboratory data | Single-factor and multivariate logistic regression | AUC = 0.985 (95% CI 0.968–1.00) | 125 | China |
| Zhou et al. [ | Severe cases | clinical, laboratory, and CT data | Multivariable logistic regression | AUC =0.952 | 134 | China |
| Xiao et al. [ | Severe illness | Demographic, clinical, laboratory, and radiological data | Univariable and multivariable logistic regression models | AUC= 0.861 (95% CI 0.811–0.902) | 243 | China |
| Wei et al. [ | Common and severe patients | Clinical and CT data | Multivariate logistic regression | AUC=0.95 | 81 | China |
| Dong et al. [ | Survival | Clinical and laboratory findungs | Multivariable Cox regression model | AUC= 0.922 (14 days) AUC= 0.881 (21 days) | 628 | China |
| Bai et al. [ | Severity of disease | Clinical, laboratory, and CT data | Logistic regression model, LDA, SVM, MLP and LSTM | AUC=0.954 | 133 | China |
| Al-Najjar and Al-Rousan [ | Recovered and death cases | Sex, birth year, country, region, group, infection reason, and confirmed date on the outcome | Neural network | Accuracy=0.938 Accuracy=0.995 | 1308 | South Korea |
| Li et al. [ | Severe cases | CT scan data and clinical biochemical attributes | Machine-learning models | AUC =0.93 | 46 | China |
| Matos et al. [ | Mechanical ventilation, death | CT scan and clinical attributes | GLM, PBR, CIT, and SVL | AUC =0.92 | 106 | Italy |
| Ning et al. [ | Negative, mild, and severe cases | CT images and clinical features | CNN, DNNs, and PLR | AUC = 0.944 (negative) AUC = 0.860 (mild) AUC = 0.884 (severe) | 1521 | China |
| Zhou et al. [ | Severe cases | Clinical factors | GA and SVM | Accuracy: over 0.94 Accuracy= 0.80 | 144 25 | China |
| Yan et al. [ | Survival for severe cases | Clinical data | XGBoost algorithm | Accuracy=0.93 | 375 | China |
| Shi et al. [ | Severe cases | Clinical and radiological findings | LASSO logistic regression | AUC= 0.890 | 196 | China |
| Bi et al. [ | Severe illness | Fibrinogen-to-albumin ratio (FAR) and platelet count (PLT) | Multivariate cox analysis | AUC=0.754 | 113 | China |
| Zhou et al. [ | Severe cases | Body temperature, cough, dyspnea, hypertension, cardiovascular disease, chronic liver disease, and chronic kidney disease | Multivariable logistic regression | AUC= 0.862 (95% CI 0.801–0.925) | 366 | China |
| Cheng et al. [ | ICU transfer | Signs, nursing assessments, laboratory features and electrocardiograms | Random forest | AUC= 0.799 (95% CI 0.752–0.846) | 1987 | USA |
| McRae et al. [ | Death | CRP, NT-proBNP, MYO, D-dimer, PCT, CK-MB, cTnI | Logistic regression model | AUC= 0.94 (95% CI 0.89–0.99) | 160 | China |
List of independent variables (clinical factors)
| Clinical factor | ID | Symbol | Clinical factor | ID | Symbol |
|---|---|---|---|---|---|
| Albumin/globulin | A/G | X1 | Mean corpuscular hemoglobin | MCH | X26 |
| Albumin | ALB | X2 | Mean corpuscular-hemoglobin concentration | MCHC | X27 |
| Alkaline phosphatase | ALP | X3 | Mean corpuscular volume | MCV | X28 |
| Glutamic-pyruvic transaminase | ALT | X4 | Absolute value of monocytes | Mono# | X29 |
| Activated partial thromboplastin time | APTT | X5 | Percentage of monocytes | Mono% | X30 |
| Glutamic oxalacetic transaminase | AST | X6 | mean platelet volume | MPV | X31 |
| Absolute value of basophil | Baso# | X7 | Platelet large cell ratio | P-LCR | X32 |
| Percentage of basophils | Baso% | X8 | PCT plateletocrit | PCT | X33 |
| Blood urea nitrogen | BUN | X9 | Platelet distribution width | PDW | X34 |
| Creatine Kinase Isoenzyme | CK-MB | X10 | Blood platelet count | PLT | X35 |
| Creatinine | CREA | X11 | Prothrombin time | PT | X36 |
| C-reactive protein | CRP | X12 | International normalized ratio | PT-INR | X37 |
| Cystatin C | CysC | X13 | Red blood cell count | RBC | X38 |
| Direct bilirubin | D-BIL | X14 | CV value of RBC distribution width | RDW-CV | X39 |
| Absolute value of eosinophils | Eos# | X15 | SD value of erythrocyte distribution width | RDW-SD | X40 |
| Percentage of eosinophils | Eos% | X16 | Sialic acid | SA | X41 |
| Fibrinogen | FIB | X17 | Total bile acid | TBA | X42 |
| Gamma-glutamyl transpeptidase | GGT | X18 | Total bilirubin | TBIL | X43 |
| Globulin | GLO | X19 | Thrombin time | TT | X44 |
| Glucose (fasting) | GLU | X20 | Uric acid | UA | X45 |
| Hemoglobin | Hb | X21 | Î | X46 | |
| Hematocrit value | Hct | X22 | Neutrophil absolute value | Neut# | X47 |
| Lactic dehydrogenase | LDH | X23 | Neutrophil percentage | Neut% | X48 |
| Lymphocyte absolute value | Lymph# | X24 | D-dimer | SF8200_D-Dimer | X49 |
| Percentage of lymphocytes | Lymph% | X25 | Cholinesterase | CHE | X50 |
Results of RbR model using all clinical factors
| Variable | Coefficient | Std. error | t-Statistic | Prob. |
|---|---|---|---|---|
| Constant | 1.195739 | 0.8949 | ||
| X1 | 0.766133 | 0.669994 | 1.143491 | 0.2556 |
| X2 | 0.369865 | 0.1150 | ||
| X3 | 0.207036 | 0.1979 | ||
| X4 | 0.234136 | 0.5351 | ||
| X5 | 0.350635 | 0.224713 | 1.560370 | 0.1218 |
| X6 | 0.124960 | 0.206834 | 0.604155 | 0.5471 |
| X7 | 0.392064 | 0.221517 | 1.769907 | 0.0798 |
| X9 | 0.419054 | 0.302267 | 1.386370 | 0.1687 |
| X10 | 0.142200 | 0.5736 | ||
| X11 | 0.228173 | 0.6627 | ||
| X12 | 0.441895 | 0.324018 | 1.363796 | 0.1757 |
| X13 | 0.782662 | 0.239895 | 3.262523 | 0.0015 |
| X14 | 0.327671 | 0.4084 | ||
| X15 | 0.483778 | 0.200362 | 2.414519 | 0.0176 |
| X17 | 0.140422 | 0.181415 | 0.774039 | 0.4407 |
| X18 | 0.446554 | 0.279914 | 1.595327 | 0.1138 |
| X19 | 0.350373 | 0.507981 | 0.689735 | 0.4920 |
| X20 | 0.193640 | 0.8168 | ||
| X21 | 3.555349 | 0.2067 | ||
| X22 | 5.660556 | 4.708489 | 1.202202 | 0.2321 |
| X23 | 0.282944 | 0.357990 | 0.790367 | 0.4312 |
| X24 | 0.212730 | 0.0164 | ||
| X26 | 0.937514 | 0.1629 | ||
| X27 | 3.450157 | 2.378494 | 1.450564 | 0.1500 |
| X28 | 1.291966 | 0.4462 | ||
| X29 | 0.626379 | 0.410115 | 1.527327 | 0.1298 |
| X30 | 0.416580 | 0.1462 | ||
| X31 | 0.316610 | 0.804939 | 0.393334 | 0.6949 |
| X32 | 0.816533 | 0.7310 | ||
| X33 | 0.826578 | 0.3663 | ||
| X34 | 0.137479 | 0.291786 | 0.471164 | 0.6385 |
| X35 | 0.518478 | 0.863308 | 0.600571 | 0.5495 |
| X36 | 3.783075 | 6.284011 | 0.602016 | 0.5485 |
| X37 | 6.329394 | 0.5760 | ||
| X38 | 3.175541 | 0.5743 | ||
| X39 | 0.850181 | 0.9268 | ||
| X40 | 0.260570 | 0.524381 | 0.496910 | 0.6203 |
| X41 | 0.303455 | 0.300802 | 1.008819 | 0.3155 |
| X42 | 0.245079 | 0.3449 | ||
| X43 | 0.249179 | 0.295612 | 0.842924 | 0.4013 |
| X44 | 0.380763 | 0.267197 | 1.425027 | 0.1573 |
| X45 | 0.192436 | 0.217392 | 0.885203 | 0.3782 |
| X46 | 0.214868 | 0.0044 | ||
| X47 | 0.458820 | 0.3271 | ||
| X49 | 0.075227 | 0.183322 | 0.410354 | 0.6824 |
| X50 | 0.206853 | 0.0102 | ||
Results of RbR model to analyze effective clinical factors on severity of COVID-19 patients
| Variable | Coefficient | Std. error | t-Statistic | Prob. |
|---|---|---|---|---|
| Constant | 0.252632 | 0.140872 | 1.793350 | 0.0751 |
| X12 | 0.386337 | 0.194240 | 1.988966 | 0.0487 |
| X13 | 0.651536 | 0.187129 | 3.481747 | 0.0007 |
| X18 | 0.456707 | 0.160788 | 2.840435 | 0.0052 |
| X21 | 0.142370 | 0.0035 | ||
| X23 | 0.719013 | 0.172711 | 4.163093 | 0.0001 |
| X25 | 0.147318 | 0.0000 | ||
| X36 | 0.466564 | 0.169386 | 2.754448 | 0.0067 |
Comparison of performance of proposed models
| Models | Evaluation metrics | ||
|---|---|---|---|
| Sensitivity (%) | Specificity (%) | Accuracy (%) | |
| Classic regression model | 95.70 | 85.50 | 90.30 |
| RbR model | 98.60 | 88.20 | 93.10 |
Fig. 1Comparison of performance of two proposed predictive models
Fig. 2The ROC curves of proposed models
The ROC analysis of proposed models
| Model | AUC | 95%CI | |
|---|---|---|---|
| Classic regression model | 0.906 | 0.851–960 | 0 |
| RbR model | 0.934 | 0.887–980 | 0 |