| Literature DB >> 34322640 |
Salman Khazaei1, Somayeh Najafi-GhOBADI2, Vajihe Ramezani-Doroh3,4.
Abstract
OBJECTIVES: Chronic kidney disease (CKD) is one of the main causes of morbidity and mortality worldwide. Detecting survival modifiable factors could help in prioritizing the clinical care and offers a treatment decision-making for hemodialysis patients. The aim of this study was to develop the best predictive model to explain the predictors of death in Hemodialysis patients by data mining techniques.Entities:
Keywords: Data mining; Decision tree; Hemodialysis; Kidney failure; Logistic regression; Neural network; Support vector machine; Survival
Mesh:
Year: 2021 PMID: 34322640 PMCID: PMC8283642 DOI: 10.15167/2421-4248/jpmh2021.62.1.1837
Source DB: PubMed Journal: J Prev Med Hyg ISSN: 1121-2233
Descriptive statistics of discrete features related to participants.
| Variables | Alive | Death | ||
|---|---|---|---|---|
| N | % | N | % | |
| 256 | 57 | 208 | 51 | |
| 76 | 17 | 6 | 1.5 | |
| 194 | 43.3 | 260 | 63.7 | |
| 287 | 64 | 248 | 61 | |
| 366 | 82 | 296 | 72 | |
| 399 | 89 | 331 | 81 | |
| 153 | 34 | 126 | 31 | |
| 407 | 91 | 361 | 89 | |
| 2 | 0.5 | 3 | 0.7 | |
| 312 | 70 | 246 | 60 | |
| 269 | 60 | 239 | 59 | |
| 91 | 20.3 | 116 | 28.4 | |
| 69 | 15.4 | 66 | 16 | |
| 8 | 2 | 7 | 1.7 | |
| 310 | 69 | 242 | 59 | |
Summary of continues variables related to participants.
| Variables | Alive | Death | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| N | Min | Max | Mean | Std. Dev | N | Min | Max | Mean | Std. Dev | |
| Age at diagnosis (yr) | 448 | 8 | 83 | 50.29 | 15.73 | 408 | 18 | 88 | 61.97 | 10.92 |
| BMI (kg/m2) | 448 | 12.33 | 41.26 | 23.09 | 4.21 | 408 | 14.69 | 38.09 | 23.31 | 3.79 |
| Dialysis duration (hour) | 448 | 2 | 4 | 3.65 | 0.46 | 408 | 2 | 4 | 3.63 | 0.46 |
| Dialysis weekly time (hour) | 448 | 2 | 16 | 10.15 | 2.33 | 408 | 2 | 16 | 10.24 | 2.22 |
| Hemoglobin (gr/dlit) | 448 | 1.3 | 17.4 | 10.48 | 2.06 | 408 | 4 | 15.7 | 10.47 | 1.75 |
| Hematocrit levels (%) | 448 | 16.1 | 60 | 192.52 | 72.085 | 408 | 16.1 | 60 | 32.46 | 6.05 |
| Plt (1,000/mm3) | 448 | 21 | 463 | 192.52 | 72.08 | 408 | 27 | 670 | 187.73 | 68.68 |
| Sodium (mg/dlit) | 448 | 105 | 198 | 138.78 | 6.8 | 408 | 106 | 193 | 138.89 | 7.56 |
| Potassium (mg/dlit) | 448 | 3 | 9.6 | 4.9 | 0.94 | 408 | 3 | 9.6 | 4.96 | 0.95 |
| Calcium (mg/dlit) | 448 | 5.1 | 12 | 8.9 | 0.90 | 408 | 5.9 | 11.3 | 8.7 | 0.84 |
| Phosphor (mg/dlit) | 448 | 2.3 | 12.3 | 5.11 | 1.55 | 408 | 1.7 | 12 | 5.13 | 1.6 |
| Iron (ug/ dlit) | 448 | 2 | 1028 | 111.90 | 115.48 | 408 | 3 | 520 | 98.3 | 73.49 |
| Uric acid (mg/dlit) | 448 | 1 | 14.6 | 6.77 | 1.54 | 408 | 3.2 | 13.7 | 6.70 | 1.42 |
| Albumin (g/dlit) | 448 | 1 | 5.6 | 3.74 | 0.73 | 408 | 1 | 6.4 | 3.59 | 0.72 |
| Alk (U/L) | 448 | 4.6 | 2612 | 316.14 | 262.65 | 408 | 4.2 | 2349 | 301.58 | 213.35 |
| Urea reduction ratio (%) | 448 | 1 | 0.96 | 0.643 | 0.124 | 408 | 0 | 0.89 | 0.62 | 0.13 |
| Time to mortality/follow-up (yr) | 448 | 0.08 | 10.70 | 2.35 | 2.38 | 408 | 0.08 | 10.30 | 2.23 | 2.18 |
Fig. 1.Importance of variables estimated by the decision tree.
Fig. 2.Importance of variables estimated by the natural network.
Fig. 3.Importance of variables estimated by the support vector machine.
Logistic regression model.
| Variables | B | OR | Wald | P-value |
|---|---|---|---|---|
| - | - | - | - | |
| -0.087 | 0.917 | 135.731 | 0.000 | |
| - | - | - | - | |
| 0.002 | 1.002 | 7.040 | 0.008 | |
| 1.562 | 4.767 | 6.120 | 0.013 | |
| - | - | - | - |
*OR: Odds Ratio which is calculated as Exp (β).
Mean and standard deviation of total accuracy, sensitivity, specificity, positive likelihood ratio and negative likelihood ratio for DT, NN, SVM and LR.
| Models | Total accuracy | Sensitivity | Specificity | Positive likelihood ratio | Negative likelihood ratio | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Mean | SD | Mean | SD | Mean | SD | Mean | SD | Mean | SD | |
| Decision tree | 0.66 | 0.05 | 0.65 | 0.06 | 0.67 | 0.05 | 2.03 | 0.47 | 0.53 | 0.11 |
| Neural network | 0.61 | 0.05 | 0.58 | 0.09 | 0.72 | 0.07 | 1.7 | 0.38 | 0.65 | 0.17 |
| Support vector machine | 0.68 | 0.04 | 0.66 | 0.07 | 0.70 | 0.04 | 2.25 | 0.45 | 0.48 | 0.10 |
| Logistic regression | 0.71 | 0.05 | 0.69 | 0.07 | 0.72 | 0.04 | 2.48 | 0.53 | 0.43 | 0.11 |