| Literature DB >> 34886378 |
Francesco Bellocchio1, Caterina Lonati2, Jasmine Ion Titapiccolo1, Jennifer Nadal3, Heike Meiselbach4, Matthias Schmid3, Barbara Baerthlein5, Ulrich Tschulena6, Markus Schneider3, Ulla T Schultheiss7,8, Carlo Barbieri6, Christoph Moore6, Sonja Steppan6, Kai-Uwe Eckardt4,9, Stefano Stuard6, Luca Neri1.
Abstract
Current equation-based risk stratification algorithms for kidney failure (KF) may have limited applicability in real world settings, where missing information may impede their computation for a large share of patients, hampering one from taking full advantage of the wealth of information collected in electronic health records. To overcome such limitations, we trained and validated the Prognostic Reasoning System for Chronic Kidney Disease (PROGRES-CKD), a novel algorithm predicting end-stage kidney disease (ESKD). PROGRES-CKD is a naïve Bayes classifier predicting ESKD onset within 6 and 24 months in adult, stage 3-to-5 CKD patients. PROGRES-CKD trained on 17,775 CKD patients treated in the Fresenius Medical Care (FMC) NephroCare network. The algorithm was validated in a second independent FMC cohort (n = 6760) and in the German Chronic Kidney Disease (GCKD) study cohort (n = 4058). We contrasted PROGRES-CKD accuracy against the performance of the Kidney Failure Risk Equation (KFRE). Discrimination accuracy in the validation cohorts was excellent for both short-term (stage 4-5 CKD, FMC: AUC = 0.90, 95%CI 0.88-0.91; GCKD: AUC = 0.91, 95% CI 0.86-0.97) and long-term (stage 3-5 CKD, FMC: AUC = 0.85, 95%CI 0.83-0.88; GCKD: AUC = 0.85, 95%CI 0.83-0.88) forecasting horizons. The performance of PROGRES-CKD was non-inferior to KFRE for the 24-month horizon and proved more accurate for the 6-month horizon forecast in both validation cohorts. In the real world setting captured in the FMC validation cohort, PROGRES-CKD was computable for all patients, whereas KFRE could be computed for complete cases only (i.e., 30% and 16% of the cohort in 6- and 24-month horizons). PROGRES-CKD accurately predicts KF onset among CKD patients. Contrary to equation-based scores, PROGRES-CKD extends to patients with incomplete data and allows explicit assessment of prediction robustness in case of missing values. PROGRES-CKD may efficiently assist physicians' prognostic reasoning in real-life applications.Entities:
Keywords: artificial intelligence; chronic kidney disease (CKD); end-stage kidney disease (ESKD); kidney replacement therapy (KRT); machine learning; naïve Bayes classifiers; precision medicine; risk prediction
Mesh:
Year: 2021 PMID: 34886378 PMCID: PMC8656741 DOI: 10.3390/ijerph182312649
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1The Bayesian Network structure of PROGRES-CKD. (a) PROGRESS-CKD-6; (b) PROGRESS-CKD-24.
Variables included in PROGRES-CKD models.
| PROGRES-CKD-6 | PROGRES-CKD-24 | ||
|---|---|---|---|
| Group | Variable | ||
| Demographics and anthropometrics | |||
| Age | X | X | |
| Gender | X | X | |
| BMI, Kg/m2 | X | X | |
| Smoking status | X | X | |
| Kidney function | |||
| Albumin, g/dL | X | X | |
| Albumin Creatinine Ratio (ACR), mg/mmol ** | X | X | |
| Calcium, mg/dL | X | X | |
| eGFR, (ml/min/173 m2) | X | X | |
| regressGFR * | X | X | |
| Hemoglobin, g/dL | X | X | |
| Phosphate, mg/dL | X | X | |
| Urine protein, g/24 h | X | X | |
| Parathyroid hormone, ng/L | X | X | |
| Sodium, mmol/L | X | X | |
| Ferritin, microg/L | X | X | |
| Etiology of kidney disease | |||
| Diabetes | X | X | |
| Hypertension | X | ||
| Glomerulonephritis | X | X | |
| Polycystic | X | X | |
| Comorbidities | |||
| Cerebrovascular disease | X | X | |
| Chronic Pulmonary Disease | X | X | |
| Congestive heart failure | X | X | |
| Connective tissue disorder | X | ||
| Coronary artery disease | X | ||
| Dementia | X | X | |
| Diabetes with organ damage | X | X | |
| Diabetes without complications | X | ||
| Hemiplegia | X | ||
| Hypertension | X | ||
| Mild liver disease | X | X | |
| Moderate or severe liver disease | X | X | |
| Peripheral vascular disease | X | X | |
| Other | |||
| Number of hospitalizations | X | X | |
| Systolic blood pressure | X | X | |
* Slope of linear regression of eGFR values over the last 12 months. ** Urine Protein-Creatinine Ratio was converted to ACR by ACR = Urin protein*PCR (Urine protein = 0.6) (please, see the Supplementary Material for the conversion table).
Baseline characteristics of patients from the FMC NephroCare and GCKD cohorts.
| FMC Cohort | GCKD Cohort | |||
|---|---|---|---|---|
| Variable |
| Mean ± SD or Median (IQR) or |
| Mean ± SD or Median (IQR) or |
| Stage 3 | 11,965 | 11,965 (53.1%) | 3593 | 3593 (88.54%) |
| Stage 4 | 8026 | 8026 (35.62%) | 460 | 460 (11.34%) |
| Stage 5 | 2544 | 2544 (11.29%) | 5 | 5 (0.12%) |
| Age (year) | 22,535 | 72.15 ± 11.7 | 4058 | 62.12 ± 10.50 |
| BMI (kg/cm2) | 21,655 | 30.63 ± 10.92 | 4015 | 30.03 ± 5.91 |
| eGFR ((mL/min/1.73 m2) | 22,535 | 31.93 ± 13.4 | 4058 | 41.92 ± 9.76 |
| Albumin (g/dL) | 19,004 | 4.19 ± 0.4 | 4055 | 3.85 ± 0.42 |
| Ferritin (µg/L) | 7303 | 222.18 ± 260.98 | 1044 | 200.48 ± 196.11 |
| Hemoglobin (g/dL) | 21,916 | 12.65 ± 1.83 | 3978 | 13.49 ± 1.69 |
| Phosphate (mg/dL) | 20,362 | 3.65 ± 0.74 | 4058 | 3.45 ± 0.64 |
| Calcium (mg/dL) | 20,686 | 9.36 ± 0.73 | 4058 | 9.07 ± 0.63 |
| Sodium (mmol/L) | 20,612 | 140.17 ± 3.16 | 4057 | 139.70 ± 3.14 |
| PTH (ng/L) | 9466 | 131.84 ± 150.12 | 0 | - |
| ACR (mg/mmol) | 90 | 138.67 ± 568.28 | 3999 | 393.63 ± 888.48 |
| Proteinuria (g/24 h) | 8780 | 3.58 ± 150.29 | 0 | - |
| Systolic (mmHg) | 17,963 | 137.33 ± 18.41 | 4030 | 140.27 ± 20.53 |
| CRP (mg/L) | 13,468 | 4.23 (7.63) | 4056 | 2.41 (4.27) |
| Glucose (mg/dL) | 19,499 | 126.45 ± 48.59 | 0 | - |
| HDL Cholesterol (mg/dL) | 7074 | 48.3 ± 16.74 | 4051 | 50.72 ± 17.35 |
| LDL Cholesterol (mg/dL) | 7084 | 107.59 ± 219.29 | 4051 | 116.33 ± 42.93 |
| Triglyceride (mg/dL) | 15,191 | 142.77 (95.72) | 4050 | 173.38 (126.45) |
| hsTNT (ng/L) | 0 | - | 3976 | 13 (11) |
| Uric Acid (mg/dL) | 20,273 | 6.68 ± 1.61 | 4058 | 7.40 ± 1.92 |
| Gender (M) | 22,535 | 11,349 (50.36%) | 4058 | 2510 (61.85%) |
| Etiology Diabetes | 22,535 | 3614 (16.04%) | 4058 | 666 (16.41%) |
| Etiology Polycystic | 22,535 | 477 (2.12%) | 4058 | 157 (3.87%) |
| Etiology Hypertension | 22,535 | 5281 (23.43%) | 4058 | 1011 (24.91%) |
| Etiology Glomerulonephrite | 22,535 | 987 (4.38%) | 4058 | 623 (15.35%) |
| Smoking status: ex-smoker | 3502 | 3502 (15.54%) | 1819 | 1819 (44.96%) |
| Smoking status: no smoker | 10,066 | 10,066 (44.67%) | 1649 | 1649 (40.76%) |
| Smoking status: smoker | 2274 | 2274 (10.09%) | 578 | 578 (14.29%) |
| Alcohol: abuse | 8636 | 8636 (38.32%) | 771 | 771 (19.10%) |
| Alcohol: moderate | 0 | 0 (0%) | 3265 | 3265 (80.90%) |
| Alcohol: abstinence | 6984 | 6984 (30.99%) | 0 | 0 (%) |
| Peripheral Vascular Disease | 22,535 | 1875 (8.32%) | 4058 | 424 (10.45%) |
| Coronary Artery Disease | 22,535 | 4336 (19.24%) | 4058 | 908 (22.38%) |
| Congestive Heart Failure | 22,535 | 1887 (8.37%) | 4058 | 776 (19.12%) |
| Cerebrovascular Disease | 22,535 | 1876 (8.32%) | 4058 | 472 (10.52%) |
| Connective Tissue Disorder | 22,535 | 399 (1.77%) | 0 | - |
| Cancer | 22,535 | 2469 (10.96%) | 4058 | 532 (13.11%) |
| Diabetes | 22,535 | 9021 (40.03%) | 4058 | 1545 (38.07%) |
| Anemia | 22,535 | 9800 (43.49%) | 4058 | 1057 (26.05%) |
| Hypertension | 22,535 | 17,871 (79.3%) | 4058 | 3951 (97.36%) |
| Atrial Fibrillation | 22,535 | 2337 (10.37%) | 4058 | 876 (21.59%) |
| Diabetes Without Complications (CCI) | 22,535 | 3013 (13.37%) | 4058 | 1545 (38.07%) |
| Chronic Pulmonary Disease (CCI) | 22,535 | 1618 (7.18%) | 4058 | 285 (7.02%) |
| Psychiatric Disease | 22,535 | 177 (0.79%) | 0 | - |
| Liver Disease | 22,535 | 987 (4.38%) | 0 | - |
| RRT in 24 months | 9407 | 1817 (19.32%) | 3684 | 80 (2.17%) |
| RRT in 6 months | 18,504 | 801 (4.33%) | 3888 | 11 (0.28%) |
Figure 2Calibration of (A) PROGRES-CKD-6, and (B) PROGRES-CKD-24 in the FMC cohort. Bar graph denotes the incidence of RRT initiation events observed in each quintile of risk (left axis); line graph denotes the fraction of RRT initiation events in each quintile with respect to the total number of RRT initiation events (right axis). Endpoint horizons: 6 months for PROGRES-CKD-6; 24 months for PROGRES-CKD-24.
Figure 3Calibration of (A) PROGRES-CKD-6, and (B) PROGRES-CKD-24 in the GCKD cohort. Bar graph denotes the incidence of RRT initiation events observed in each quintile of risk (left axis); line graph denotes the fraction of RRT initiation events in each quintile with respect to the total number of RRT initiation events (right axis). Endpoint horizons: 6 months for PROGRES-CKD-6; 24 months for PROGRES-CKD-24.
Comparison between discrimination ability of (A) PROGRES-CKD-6 and (B) PROGRES-CKD-24 and that of Tangri’s Kidney Failure Risk Equations (KFREs) in the FMC and the GCKD cohort. The two scores were computed considering only complete cases (column “Effective sample size”), while patients with missing data were not included in the analysis. Endpoint horizons: 6 months for PROGRES-CKD-6; 24 months for PROGRES-CKD-24. Imputation method: Listwise. Non-inferiority was defined as AUC < 0.05, while superiority was set at ΔAUC ≥ 0.05. * Delta AUC: AUC of Tangri’s KFRE–AUC of PROGRES-CKD model.
| Model | Validation Cohort | Comparator Model | AUC PROGRES-CKD | Delta AUC * | Effective Sample Size | |
|---|---|---|---|---|---|---|
| PROGRES-CKD-6 | ||||||
| FMC NephroCare | ||||||
| 4VAR | 0.90 | −0.012 | 0.3255 | 927 | ||
| 6VAR | 0.90 | −0.016 | 0.2220 | 927 | ||
| GCKD | ||||||
| 4VAR | 0.91 | −0.146 | 0.0016 | 459 | ||
| 6VAR | 0.91 | −0.149 | 0.0013 | 459 | ||
| PROGRES-CKD-24 | ||||||
| FMC NephroCare | ||||||
| 4VAR | 0.87 | 0.020 | 0.0483 | 1081 | ||
| 6VAR | 0.87 | 0.018 | 0.0888 | 1081 | ||
| GCKD | ||||||
| 4VAR | 0.85 | 0.030 | 0.0105 | 3999 | ||
| 6VAR | 0.85 | 0.027 | 0.0246 | 3999 | ||
PROGRES-CKD-24 and Experts’ ratings of CKD progression risk.
| Experts | |||||
|---|---|---|---|---|---|
| PROGRES-CKD-24 | Expert 1 | Expert 2 | Expert 3 | Expert 4 | |
| AUC | 0.96 | 0.84 | 0.72 | 0.86 | 0.76 |
| Sensitivity | 0.76 | 0.80 | 0.50 | 0.75 | 0.60 |
| Specificity | 0.96 | 0.84 | 0.89 | 1.00 | 0.82 |
Figure 4Potential impact simulation of PROGRES-CKD-24 implementation in a hypothetical CKD cohort. Flowcharts showing patients’ referral to intensified intervention programs based on (A) experts’ ratings, and (B) PROGRES-CKD scores; (C) Number of ESKD events within 24 months: both observed and saved cases are shown; D) Number of patients needed to treat to save 1 patient; “all-in strategy” involves referral of all stage 3 patients to the intensified healthcare program. Abbreviations: ESKD, end-stage kidney disease; NNT, Number needed to treat.
Figure 5Discrimination ability of PROGRES-CKD and KFREs and percentage of computed scores by each prediction tool. Only cases with complete medical information were included in this analysis. (A) RRT prediction within 6 months; (B) RRT prediction within 24 months. Bars denote AUC (left y-axis), while dots denote the percentage of computed scores on the total number of recruited patients in each cohort (right y-axis). Abbreviations: P-CKD6, PROGRES-CKD-6; P-CKD24, PROGRES-CKD-24; 4VAR, KFRE 4 variables; 6VAR, KFRE 6 variables.