| Literature DB >> 32775818 |
Yohei Komaru1,2, Teruhiko Yoshida1,2, Yoshifumi Hamasaki2, Masaomi Nangaku1,2, Kent Doi3.
Abstract
INTRODUCTION: For patients with end-stage renal disease (ESRD), due to the heterogeneity of the population, appropriate risk assessment approaches and strategies for further follow-up remain scarce. We aimed to conduct a pilot study for better risk stratification, applying machine learning-based classification to patients with ESRD who newly started maintenance hemodialysis.Entities:
Keywords: end-stage renal disease; hemodialysis; hierarchical clustering; machine learning; renal replacement therapy; risk prediction
Year: 2020 PMID: 32775818 PMCID: PMC7403509 DOI: 10.1016/j.ekir.2020.05.007
Source DB: PubMed Journal: Kidney Int Rep ISSN: 2468-0249
Figure 1Patient flowchart.
Variables included in the hierarchical analysis
| Domain | Variable (percentage of missing value, if not zero) |
|---|---|
| Demographic and physical characteristics | Age, BMI (2.0%), systolic blood pressure before hemodialysis |
| Blood test | Complete blood count (white blood cell, neutrophil percentage, monocyte percentage, eosinophil percentage, basophil percentage, hemoglobin, platelet, reticulocyte, |
| Urine test | pH, sodium, calcium (4.0%), urea nitrogen, creatinine, uric acid (1.0%), protein, |
ALT, alanine transaminase; BMI, body mass index; BNP, B-type natriuretic peptide; CRP, C-reactive protein; γ-GTP, γ-glutamyl transpeptidase; HbA1c, hemoglobin A1c; iPTH, intact parathyroid hormone; L-FABP, liver-type fatty acid-binding protein; MCHC, mean corpuscular hemoglobin concentration; MCV, mean corpuscular volume; NAG, N-acetyl-β-D-glucosaminidase; UIBC, unsaturated iron binding capacity.
Logarithmic transformation was conducted before clustering because of skewed distribution.
Figure 2Hierarchical clustering. Based on 46 standardized clinical variables, agglomerative hierarchical clustering was performed in 101 patients who had recently started hemodialysis. Each row represents 1 patient, and each column represents 1 clinical variable, the name of which is shown in the bottom; prefix “B-” means data obtained from blood sample, and “U-” means data from urine sample. Hierarchical clustering of both rows and columns yielded a heat map, in which colors red and blue reflect comparatively high and low value scaled by SD, respectively. ALT, alanine transaminase; BMI, body mass index; BNP, B-type natriuretic peptide; CRP, C-reactive protein; γ-GTP, γ-glutamyl transpeptidase; HbA1c, hemoglobin A1c; iPTH, intact parathyroid hormone; L-FABP, liver-type fatty acid-binding protein; MCHC, mean corpuscular hemoglobin concentration; MCV, mean corpuscular volume; NAG, N-acetyl-β-D-glucosaminidase; SD, standard deviation; UIBC, unsaturated iron binding capacity
Comparison of the 3 clusters
| Variables | Total cohort ( | Cluster 1 ( | Cluster 2 ( | Cluster 3 ( | |
|---|---|---|---|---|---|
| Age, yr | 67.0 [52.5–76.0] | 70.5 [54.0–77.3] | 54.0 [43.0–64.0] | 67.5 [52.8–75.0] | 0.006 |
| Female, | 34 (33.7) | 20 (32.3) | 10 (66.7) | 4 (16.7) | 0.005 |
| Height, cm | 163 [154–169] | 162 [154–169] | 154 [151–168] | 166 [160–170] | 0.093 |
| Body weight, kg | 62.6 [52.1–74.6] | 60.9 [54.1–71.8] | 66.5 [42.3–78.0] | 69.6 [52.4–87.2] | 0.28 |
| Systolic blood pressure, mm Hg | 150 [130–163] | 150 [140–170] | 160 [140–175] | 130 [120–140] | < 0.001 |
| Underlying renal disease, | 0.21 | ||||
| Diabetic kidney disease | 36 (35.6) | 22 (35.5) | 6 (40.0) | 8 (33.3) | |
| Nephrosclerosis | 14 (13.9) | 9 (14.5) | 3 (20.0) | 2 (8.3) | |
| Chronic glomerulonephritis | 11 (10.9) | 7 (11.3) | 2 (13.3) | 2 (8.3) | |
| ADPKD | 7 (6.9) | 7 (11.3) | 0 (0) | 0 (0) | |
| IgA nephropathy | 4 (4.0) | 4 (6.5) | 0 (0) | 0 (0) | |
| Others | 29 (28.7) | 13 (21.0) | 4 (26.7) | 12 (50.0) | |
| Blood test | |||||
| Total protein, g/dl | 6.0 [5.5–6.5] | 6.2 [5.5–6.5] | 5.8 [5.4–6.2] | 6.2 [5.5–6.7] | 0.53 |
| Blood urea nitrogen, mg/dl | 92.7 [77.0–108.4] | 91.6 [75.6–102.1] | 90.5 [78.7–119.4] | 102.0 [86.0–118.5] | 0.077 |
| Creatinine, mg/dl | 8.49 [6.98–10.97] | 8.89 [7.28–11.70] | 10.81 [7.84–12.66] | 6.74 [5.92–8.65] | < 0.00 |
| Potassium, mmol/l | 4.2 [3.8–4.8] | 4.1 [3.8–4.5] | 4.9 [4.4–5.3] | 4.3 [3.8–4.7] | 0.001 |
| White blood cell, ×103/mm3 | 5.9 [4.9–8.4] | 5.5 [4.6–6.1] | 9.3 [7.9–11.5] | 7.2 [5.4–9.4] | < 0.00 |
| Hemoglobin, g/dl | 8.9 [7.9–9.6] | 9.2 [8.4–9.9] | 7.9 [7.0–9.3] | 8.4 [7.5–9.3] | 0.020 |
| CRP, mg/dL | 0.24 [0.09–1.85] | 0.14 [0.05–0.34] | 0.75 [0.24–3.89] | 2.20 [0.41–5.64] | < 0.001 |
| Hemoglobin A1c, % | 5.7 [5.3–6.2] | 5.7 [5.3–6.3] | 5.6 [5.1–5.9] | 5.6 [5.3–6.4] | 0.56 |
| BNP, pg/ml | 221 [76–954] | 206 [80–480] | 1686 [613–2063] | 148 [43–1175] | < 0.001 |
| β2-microglobulin, mg/l | 17.2 [14.5–20.5] | 17.5 [14.9–20.1] | 20.6 [17.6–23.6] | 14.9 [13.4–18.3] | 0.014 |
| Urine test | |||||
| pH | 6.0 [5.0–7.0] | 6.5 [6.0–7.0] | 6.5 [5.5–7.0] | 5.0 [5.0–5.5] | < 0.001 |
| Creatinine, mg/dl | 59.2 [44.8–78.4] | 53.1 [45.7–74.2] | 50.8 [35.4–65.1] | 92.1 [56.5–112.8] | 0.002 |
| Protein, mg/dl | 167 [66.6–329] | 212 [81.8–325] | 345 [139–510] | 75.0 [36.3–186] | 0.001 |
| NAG, IU/L | 7.3 [4.8–11.9] | 6.8 [4.6–10.4] | 7.8 [4.6–15.5] | 9.7 [7.1–13.6] | 0.005 |
| α1-microglbulin, mg/l | 52.2 [34.4–73.3] | 51.3 [36.6–68.3] | 69.1 [42.8–80.2] | 48.3 [15.6–74.6] | 0.23 |
| L-FABP, ng/ml | 77.6 [39.2–110.8] | 89.0 [42.1–114.6] | 77.8 [64.1–119.7] | 46.7 [18.1–91.7] | 0.008 |
| Urine volume, ml/d | 1120 [743–1510] | 1100 [740–1530] | 1200 [630–2000] | 1030 [748–1313] | 0.77 |
| Hospital stay, d | 8 [5–20] | 6 [5–11] | 20 [12–42] | 21 [7–48] | < 0.001 |
| 90-d mortality, | 6 (5.9) | 1 (1.6) | 0 (0) | 5 (20.8) | 0.002 |
| 1-yr mortality, | 14 (13.9) | 3 (4.8) | 2 (13.3) | 9 (37.5) | < 0.001 |
ADPKD, autosomal dominant polycystic kidney disease; BNP, B-type natriuretic peptide; CRP, C-reactive protein; L-FABP, liver-type fatty acid-binding protein; N-acetyl-β-D-glucosaminidase.
Statistically significant difference (P < 0.05) in comparing 2 groups using the Steel-Dwass test for multiple comparison.
P < 0.05 by the Kruskal-Wallis test or Pearson’s χ2 test.
Figure 3Survival analysis of the 3 clusters suggested by the hierarchical clustering. Patients in cluster 3 showed a significantly worse survival rate compared with cluster 1, in the Kaplan-Meier analysis of the follow-ups 1 year after hemodialysis therapy was initiated. The difference between cluster 3 and cluster 1 was still significant after Bonferroni’s correction for multiple comparisons (P < 0.001). ∗P < 0.05.
Cox proportional hazard model for 1-year mortality in patients initiating hemodialysis
| Variable | Model 1 HR [95% CI] | Model 2 | Model 3 | Model 4 | Model 5 |
|---|---|---|---|---|---|
| Age | 1.07 | 1.06 | 1.07 | 1.07 | 1.07 |
| Sex: women | 0.31 [0.04–1.43] | 0.22 [0.03–1.14] | 0.26 [0.03–1.18] | 0.25 [0.03–1.33] | 0.38 [0.05–1.51] |
| Cluster 3 (cluster 1 as reference) | 10.2 | 6.44 | 8.02 | 11.8 | 10.5 |
| Systolic blood pressure | 0.98 [0.95–1.01] | ||||
| Serum creatinine | 0.76 | ||||
| Serum potassium | 0.49 [0.20–1.18] | ||||
| BNP | 1.00 [0.99–1.00] |
BNP, B-type natriuretic peptide; CI, confidence interval; HR, hazard ratio.
Model 1 (base model): age + sex + cluster; model 2: model 1 + systolic blood pressure; model 3: model 1 + serum creatinine; model 4: model 1 + serum potassium; model 5: model 1 + BNP.
P < 0.05.
P < 0.01.