| Literature DB >> 36050407 |
Zhenliang Fan1, Qiaorui Yang2, Zhuohan Xu3, Ke Sun3, Mengfan Yang2, Riping Yin2, Dongxue Zhao2, Junfen Fan1, Hongzhen Ma1, Yiwei Shen4, Hong Xia5.
Abstract
Chronic kidney disease (CKD) has become a worldwide public health problem and accurate assessment of renal function in CKD patients is important for the treatment. Although the glomerular filtration rate (GFR) can accurately evaluate the renal function, the procedure of measurement is complicated. Therefore, endogenous markers are often chosen to estimate GFR indirectly. However, the accuracy of the equations for estimating GFR is not optimistic. To estimate GFR more precisely, we constructed a classification decision tree model to select the most befitting GFR estimation equation for CKD patients. By searching the HIS system of the First Affiliated Hospital of Zhejiang Chinese Medicine University for all CKD patients who visited the hospital from December 1, 2018 to December 1, 2021 and underwent Gate's method of 99mTc-DTPA renal dynamic imaging to detect GFR, we eventually collected 518 eligible subjects, who were randomly divided into a training set (70%, 362) and a test set (30%, 156). Then, we used the training set data to build a classification decision tree model that would choose the most accurate equation from the four equations of BIS-2, CKD-EPI(CysC), CKD-EPI(Cr-CysC) and Ruijin, and the equation was selected by the model to estimate GFR. Next, we utilized the test set data to verify our tree model, and compared the GFR estimated by the tree model with other 13 equations. Root Mean Square Error (RMSE), Mean Absolute Error (MAE) and Bland-Altman plot were used to evaluate the accuracy of the estimates by different methods. A classification decision tree model, including BSA, BMI, 24-hour Urine protein quantity, diabetic nephropathy, age and RASi, was eventually retrieved. In the test set, the RMSE and MAE of GFR estimated by the classification decision tree model were 12.2 and 8.5 respectively, which were lower than other GFR estimation equations. According to Bland-Altman plot of patients in the test set, the eGFR was calculated based on this model and had the smallest degree of variation. We applied the classification decision tree model to select an appropriate GFR estimation equation for CKD patients, and the final GFR estimation was based on the model selection results, which provided us with greater accuracy in GFR estimation.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36050407 PMCID: PMC9436941 DOI: 10.1038/s41598-022-19185-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Clinical and demographic data of patients (Mean (P25–P75)).
| Training set (n = 362) | Test set (n = 156) | Total population (n = 518) | |
|---|---|---|---|
| Age (year) | 60.2 (50–71) | 61.7 (52–72) | 60.6 (51–71) |
| Male | 224 (61.9%) | 104 (66.7%) | 328 (63.3%) |
| Height (cm) | 167.6 (158–170) | 164.2 (158–170) | 166.6 (158–170) |
| Weight (kg) | 68.8 (55.6–72.2) | 65.4 (57.0–77.0) | 67.8 (55.9–73.0) |
| Body surface area (m2) | 1.7 (1.6–1.8) | 1.7 (1.6–1.9) | 1.7 (1.6–1.8) |
| BMI | 25.4 (21.3–26.6) | 24.2 (21.8–26.3) | 25.0 (21.5–26.5) |
| SBP (mmHg) | 142.2 (126.0–157.0) | 147.3 (128.3–164.0) | 143.7 (127.0–158.3) |
| DBP (mmHg) | 80.2 (70.0–88.3) | 79.2 (68.0–89.0) | 79.9 (69.0–89.0) |
| Smoking | 68 (18.8%) | 32 (20.5%) | 100 (19.3%) |
| Drinking | 46 (12.7%) | 26 (16.7%) | 72 (13.9%) |
| Unilateral nephrectomy | 6 (1.7%) | 8 (5.1%) | 14 (2.7%) |
| Grade 1 | 75 (20.7%) | 24 (15.4%) | 99 (19.1%) |
| Grade 2 | 69 (19.1%) | 36 (23.1%) | 105 (20.3%) |
| Grade 3 | 160 (44.2%) | 82 (52.6%) | 242 (46.7%) |
| Diabetes | 104 (28.7%) | 60 (38.5%) | 164 (31.7%) |
| Cardiovascular disease | 43 (11.9%) | 28 (18.0%) | 71 (13.7%) |
| Cerebral hemorrhage | 3 (0.8%) | 2 (1.3%) | 5 (1.0%) |
| Cerebral infarction | 28 (7.7%) | 9 (5.8%) | 37 (7.1%) |
| Hyperuricemia | 67 (18.5%) | 28 (18.0%) | 95 (18.3%) |
| Gout | 45 (12.4%) | 21 (13.5%) | 66 (12.7%) |
| Edema | 130 (35.9%) | 68 (43.6%) | 198 (38.2%) |
| MCD | 1 (0.3%) | 0 (0%) | 1 (0.2%) |
| IgAN | 30 (8.3%) | 7 (4.5%) | 37 (7.1%) |
| MsPGN | 13 (3.6%) | 1 (0.6%) | 14 (2.7%) |
| MPGN | 1 (0.3%) | 0 (0%) | 1 (0.2%) |
| MN | 5 (1.4%) | 1 (0.6%) | 6 (1.2%) |
| FSGS | 2 (0.6%) | 1 (0.6%) | 3 (0.6%) |
| Hypertensive kidney lnjury | 10 (2.8%) | 6 (3.9%) | 16 (3.1%) |
| Diabetic nephropathy | 44 (12.2%) | 30 (19.2%) | 74 (14.3%) |
| HSP | 2 (0.6%) | 0 (0%) | 2 (0.4%) |
| Lupus nephritis | 3 (0.8%) | 2 (1.3%) | 5 (1.0%) |
| Hyperuricemic nephropathy | 13 (3.6%) | 6 (3.9%) | 19 (3.7%) |
| Polycystic kidney | 4 (1.1%) | 0 (0%) | 4 (0.8%) |
| Without renal biopsy | 234 (64.6%) | 102 (65.4%) | 336 (64.9%) |
| Glucocorticoid | 30 (8.3%) | 14 (9.0%) | 44 (8.5%) |
| Immunosuppressor | 47 (13.0%) | 14 (9.0%) | 61 (11.8%) |
| Diuretic | 136 (37.6%) | 66 (42.3%) | 202 (39.0%) |
| Uric acid lowering therapy | 212 (58.6%) | 83 (53.2%) | 295 (57.0%) |
| SGLT2i | 3 (0.8%) | 6 (3.9%) | 9 (1.7%) |
| RASi | 117 (32.3%) | 45 (28.9%) | 162 (31.3%) |
| Statin | 132 (36.5%) | 58 (37.2%) | 190 (36.7%) |
| Calcium dobesilate | 32 (8.8%) | 10 (6.4%) | 42 (8.1%) |
| White blood cell (× 109) | 6.0 (4.5–7.0) | 6.28 (4.9–7.2) | 6.1 (4.7–7.0) |
| Red blood cell (× 109) | 4.3 (2.9–3.9) | 5.4 (2.8–3.8) | 4.6 (2.9–3.9) |
| Hemoglobin (g/L) | 102.3 (87.0–118.0) | 98.6 (82.0–113.8) | 101.2 (86.0–116.0) |
| Platelet (× 109) | 185.4 (141.0–221.0) | 188.1 (140.5–225.5) | 186.2 (141.0–222.3) |
| C-Reactive protein (mg/L) | 9.0 (1.0–4.9) | 7.7 (1.0–5.0) | 8.6 (1.0–4.9) |
| AG (mmol/L) | 8.4 (4.1–12.4) | 9.4 (6.1–13.0) | 8.7 (4.6–12.6) |
| Lactic acid (mmol/L) | 1.3 (0.9–1.5) | 1.3 (0.9–1.5) | 1.3 (0.9–1.5) |
| Osmotic pressure (mosm/L) | 283.2 (280.0 -289.0) | 283 (280.6–290.4) | 283.2 (280.1–289.6) |
| FPG (mmol/L) | 5.1 (4.2–5.3) | 5.2 (4.3–5.6) | 5.2 (4.2–5.4) |
| Glycated hemoglobin (%) | 6.0 (5.3–6.3) | 6.0 (5.3–6.5) | 6.0 (5.3–6.3) |
| Serum potassium (mmol/L) | 7.4 (3.9–4.6) | 4.3 (3.9–4.6) | 6.5 (3.9–4.6) |
| Serum sodium (mmol/L) | 141.0 (139.4–142.5) | 140.4 (139.9–143.0) | 140.8 (139.5–142.7) |
| Serum calcium (mmol/L) | 3.1 (2.1–2.3) | 2.2 (2.1–2.3) | 2.8 (2.1–2.3) |
| Serum phosphate (mmol/L) | 2.4 (1.1–1.6) | 1.5 (1.2–1.6) | 2.1 (1.1–1.6) |
| Triglyceride (mmol/L) | 1.8 (1.1–2.1) | 3.6 (1.1–2.2) | 2.4 (1.1–2.1) |
| Total cholesterol (mmol/L) | 4.5 (3.6–5.2) | 4.4 (3.5–5.1) | 4.5 (3.5–5.2) |
| High-density lipoprotein (mmol/L) | 1.1 (0.9–1.3) | 1.1 (0.9–1.3) | 1.1 (0.9–1.3) |
| Llow-density lipoprotein (mmol/L) | 2.3 (1.7–2.9) | 2.3 (1.6–2.8) | 2.3 (1.7–2.8) |
| Total serum protein (g/L) | 62.5 (57.4–67.4) | 62.3 (57.8–67.1) | 62.4 (57.6–67.3) |
| Aalbumin (g/L) | 35.1 (32.0–38.9) | 35.0 (31.9–39.1) | 35.1 (32.0–38.9) |
| Globulin (g/L) | 27.4 (24.0–30.5) | 27.2 (23.7–29.6) | 27.3 (24.0–30.1) |
| Prealbumin (mg/L) | 287.4 (241.0–336.0) | 285.3 (235.5–342.0) | 286.8 (237.8–336.3) |
| Homocysteine (μmol/L) | 26.7 (15.9–28.3) | 27.0 (16.3–29.7) | 26.8 (16.0–29.1) |
| − | 76 (21.0%) | 28 (18.0%) | 104 (20.1%) |
| ± | 33 (9.1%) | 20 (12.8%) | 53 (10.2%) |
| + | 80 (22.1%) | 32 (20.5%) | 112 (21.6%) |
| ++ | 122 (33.7%) | 48 (30.8%) | 170 (32.8%) |
| +++ | 51 (14.1%) | 28 (18.0%) | 79 (15.3%) |
| Urine α1 microglobulin (mg/L) | 42.4 (16.3–63.3) | 59.6 (22.3–70.8) | 47.6 (18.0–64.1) |
| Urine β2 microglobulin (μg/L) | 8891.3 (183.6–10,928.0) | 6720.6 (111.7–7788.1) | 8237.6 (172.9–10,512.7) |
| Microalbuminuria (mg/L) | 875.2 (74.6–1248.6) | 962.9 (61.0 -1378.3) | 901.6 (73.1–1303.8) |
| Urine NAG-enzyme (u/L) | 12.4 (6.5–14.2) | 11.6 (6.7–14.3) | 12.1 (6.6–14.2) |
| Urinary albumin creatinine ratio (mg/umol) | 0.2 (0.01–0.23) | 0.2 (0.01–0.29) | 0.2 (0.01–0.25) |
| 24-hour urine volume (L) | 6.3 (1.2–2.1) | 1.7 (1.2–2.1) | 4.9 (1.2–2.1) |
| 24-hour urinary protein quantity (mg) | 1818.9 (221.0–2665.0) | 2020.4 (231.0–2975.0) | 1879.0 (224.0–2781.9) |
| B-type natriuretic peptide (ng/L) | 195.0 (19.0–132.1) | 193.7 (27.9–179.4) | 194.6 (20.4–148.2) |
| Free triiodothyronine (pmol/L) | 4.5 (3.0–4.0) | 3.3 (2.9–3.8) | 4.1 (3.0 -3.9) |
| Free thyroxine (pmol/L) | 12.6 (11.6–14.1) | 12.1 (11.0–13.9) | 12.4 (11.4–14.1) |
| Thyroid stimulating hormone (mIU/L) | 5.3 (1.1–2.7) | 3.7 (1.0 -2.5) | 4.8 (1.0–2.7) |
| Parathyroid hormone (μg/mL) | 89.1 (11.4–136.1) | 99.9 (13.5–142.2) | 92.3 (12.1–137.3) |
| Ferroprotein (ng/mL) | 226.2 (85.1–279.0) | 255.0 (98.7–323.2) | 235.0 (89.6–292.8) |
| Uric acid (umol/L) | 437.9 (347.8–518.3) | 453.2 (363.8–532.5) | 442.5 (349.0–525.0) |
| Serum creatinine (umol/L) | 312.5 (146.5–424.8) | 371.2 (174.3–493.5) | 330.2 (150.8–451.0) |
| Urea nitrogen (mmol/L) | 17.1 (9.0–21.5) | 18.1 (9.5–22.8) | 17.4 (9.1–21.9) |
| Cystatin C (mg/L) | 5.6 (2.0–3.6) | 4.1 (2.0–3.7) | 5.6 (2.0–3.7) |
| Glomerular filtration rate (mL/min) | 28.3 (15.3–38.0) | 26.3 (14.8–34.0) | 27.7 (15.2–36.8) |
Figure 1Relative importance of variables.
Figure 2Classification decision tree model.
GFR estimated by decision tree model and traditional equations based on BSA for test set (Mean (P25–P75)).
| Equationa | Test set (n = 156) |
|---|---|
| Cockcroft gault | 64.6 (30.7–85.9) |
| MDRD | 59.9 (25.42–84.78) |
| Abbreviated MDRD | 23.4 (9.9–33.69) |
| Chinese modification MDRD | 72.0 (30.6–101.9) |
| Chinese modification abbreviated MDRD | 30.6 (12.9–43.9) |
| CKD-EPI(Cr) | 23.9 (9.3–33.4) |
| CKD-EPI(CysC) | 24.2 (13.2–28.8) |
| CKD-EPI(Cr-CysC) | 22.7 (10.7–29.9) |
| Asian modified CKD-EPI(Cr) | 25.2 (9.8–35.3) |
| BIS-2 | 27.6 (15.8–34.2) |
| MacIsaac | 31.3 (19.3–38.7) |
| Ruijin | 28.2 (14.6–38.2) |
| Xiangya | 38.8 (27.1–49.0) |
| Decision tree classifier | 25.3 (13.6–32.1) |
| sGFRb | 26.6 (15.1–34.6) |
aAll GFR estimation equations were converted to a uniform unit, mL/min per 1.73 m2.
bsGFR: GFR was measured by 99mTc-DTPA, and the GFR was converted to 1.73 m2 standard body surface area based on the patient's body surface area.
RMSE and MAE of various estimation equations.
| RMSE | MAE | |
|---|---|---|
| Test set (n = 156) | Test set (n = 156) | |
| Cockcroft gault | 50.4 | 38.7 |
| MDRD | 48.8 | 35.0 |
| Abbreviated MDRD | 12.3 | 9.6 |
| Chinese modification MDRD | 63.7 | 46.3 |
| Chinese modification abbreviated MDRD | 15.9 | 11.7 |
| CKD-EPI(Cr) | 12.7 | 9.9 |
| CKD-EPI(CysC) | 14.0 | 9.5 |
| CKD-EPI(Cr-CysC) | 12.8 | 9.3 |
| Asian modified CKD-EPI(Cr) | 13.2 | 10.1 |
| BIS-2 | 11.9 | 8.8 |
| MacIsaac | 15.0 | 10.8 |
| Ruijin | 11.6 | 8.9 |
| Xiangya | 16.2 | 13.9 |
| Decision tree classifier | 12.2 | 8.5 |
Figure 3Variations in estimates of GFR in different equations of test data. (a) eGFR: Glomerular filtration rate was estimated based on an equation or model; (b) sGFR: GFR was measured by 99mTc-DTPA, and the GFR was converted to 1.73 m2 standard body surface area based on the patient's body surface area.
Figure 4Bland–Altman diagram for the test set. (a) eGFR: Glomerular filtration rate was estimated based on an equation or model; (b) sGFR: GFR was measured by 99mTc-DTPA, and the GFR was converted to 1.73 m2 standard body surface area based on the patient's body surface area.