| Literature DB >> 35967110 |
Thomas Ferguson1,2, Pietro Ravani3,4, Manish M Sood5, Alix Clarke3,4, Paul Komenda1,2, Claudio Rigatto1,2, Navdeep Tangri1,2.
Abstract
Introduction: Prediction of disease progression at all stages of chronic kidney disease (CKD) may help improve patient outcomes. As such, we aimed to develop and externally validate a random forest model to predict progression of CKD using demographics and laboratory data.Entities:
Keywords: CKD progression; machine learning; predictive modeling
Year: 2022 PMID: 35967110 PMCID: PMC9366291 DOI: 10.1016/j.ekir.2022.05.004
Source DB: PubMed Journal: Kidney Int Rep ISSN: 2468-0249
Baseline characteristics of development and validation cohorts
| Clinical characteristics | Training | Internal testing | External validation |
|---|---|---|---|
| Age | 59.3 (17) | 59.3 (17.1) | 55.5 (16.2) |
| Male sex | 25,829 (48%) | 11,017 (48%) | 57,168 (53%) |
| eGFR | 82.2 (27.1) | 82.2 (27.3) | 86.0 (22.4) |
| Urine ACR (mg/mmol) | 1.1 (0.5–4.7) | 1.1 (0.5–4.7) | 0.8 (0.4–2.2) |
| Comorbid conditions | |||
| Diabetes | 24,460 (45%) | 10,428 (45%) | 43,504 (41%) |
| Hypertension | 37,701 (70%) | 16,275 (70%) | 54,637 (51%) |
| Congestive heart failure | 2840 (5%) | 1187 (5%) | 5808 (5%) |
| Prior stroke | 1937 (4%) | 832 (4%) | 5590 (5%) |
| MI | 1380 (3%) | 608 (3%) | 6382 (6%) |
| Laboratory characteristics | |||
| Urea (mmol/l) | 6.6 (4.0) | 6.6 (4.1) | 6.8 (3.9) |
| Serum hemoglobin (g/l) | 134 (19) | 133 (19) | 143 (16) |
| Glucose (mmol/l) | 7.9 (4.1) | 7.9 (4.1) | 7.4 (4.1) |
| Serum albumin (g/l) | 37 (6) | 37 (6) | 41 (4) |
| Events (5 yr) | |||
| 40% decline | 3965 (7.3%) | 1658 (7.2%) | 5106 (4.8%) |
| Kidney failure | 246 (0.5%) | 102 (0.4%) | 367 (0.3%) |
| Composite | 4211 (7.8%) | 1760 (7.6%) | 5473 (5.1%) |
ACR, albumin-to-creatinine ratio; eGFR, estimated glomerular filtration rate; MI, myocardial infarction.
Results of the 22-variable random forest model in the internal testing and external validation cohorts for prediction of 40% decline in eGFR or kidney failure
| Time frame, yr | Internal testing cohort (Manitoba) | External validation cohort (Alberta) | ||
|---|---|---|---|---|
| AUC (95% CI) | Brier score (95% CI) | AUC (95% CI) | Brier score (95% CI) | |
| 1 | 0.90 (0.89–0.92) | 0.02 (0.01–0.02) | 0.87 (0.86–0.89) | 0.01 (0.01–0.01) |
| 2 | 0.88 (0.87–0.89) | 0.03 (0.03–0.04) | 0.87 (0.86–0.88) | 0.01 (0.01–0.01) |
| 3 | 0.86 (0.85–0.87) | 0.05 (0.04–0.06) | 0.86 (0.85–0.86) | 0.02 (0.02–0.02) |
| 4 | 0.85 (0.84–0.86) | 0.06 (0.05–0.07) | 0.85 (0.84–0.86) | 0.03 (0.03–0.03) |
| 5 | 0.84 (0.83–0.85) | 0.07 (0.06–0.09) | 0.84 (0.84–0.85) | 0.04 (0.04–0.04) |
ACR, albumin-to-creatinine ratio; eGFR, estimated glomerular filtration rate.
Figure 1Calibration for the 22-variable random forest model for prediction of 40% decline in eGFR or kidney failure at (a) 2 years and at (b) 5 years. eGFR, estimated glomerular filtration rate.
Figure 2Relationship between predicted risk from the random forest algorithm and the occurrence of the primary outcome (40% decline in eGFR or kidney failure) at (a) 2 years and at (b) 5 years. eGFR, estimated glomerular filtration rate.
Overview of model performance for the 22-variable random forest model
| 2 Yr | Internal testing cohort ( | 2 Yr | External validation cohort ( | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Population | Sens | Spec | NPV/PPV | Population | Sens | Spec | NPV/PPV | ||
| 0.85 | Lowest 30 | 97 | 31 | >99 | 0.84 | Lowest 30 | 96 | 31 | >99 |
| 1.61 | Lowest 45 | 95 | 47 | >99 | 1.31 | Lowest 45 | 93 | 46 | >99 |
| 1.95 | Lowest 50 | 94 | 52 | >99 | 1.51 | Lowest 50 | 91 | 51 | >99 |
NPV, negative predictive value; PPV, positive predictive value; Sens, sensitivity; Spec, specificity.
Data presented in percentage.