| Literature DB >> 35845598 |
Xiaoqian Yan1, Ximin Li1, Ying Lu1, Dongfang Ma2, Shenghong Mou2, Zhiyuan Cheng2, Yuan Ding3, Bin Yan3, Xianzhen Zhang1, Gang Hu1.
Abstract
Objective: To establish a prediction model for the risk evaluation of chronic kidney disease (CKD) to guide the management and prevention of CKD.Entities:
Year: 2022 PMID: 35845598 PMCID: PMC9286960 DOI: 10.1155/2022/6561721
Source DB: PubMed Journal: Evid Based Complement Alternat Med ISSN: 1741-427X Impact factor: 2.650
Comparison of clinical data between the CKD and non-CKD groups.
| Features | CKD group | Non-CKD group | Statistics |
|
|---|---|---|---|---|
| Age (year) | 63.87 ± 19.84 | 50.22 ± 20.45 |
| <0.001 |
| Gender (male/female) | 681/582 | 1031/917 |
| 0.582 |
| White blood cell count ( | 24.92 ± 0.41 | 24.88 ± 0.39 |
| 0.010 |
| Lymphocytes/monocytes | 4.0 (2.9, 5.7) | 4.0 (2.5, 5.5) |
| <0.001 |
| Hemoglobin (g/L) | 127.39 ± 20.24 | 128.21 ± 19.97 |
| 0.256 |
| Platelet count ( | 28.23 ± 0.41 | 28.36 ± 0.41 |
| <0.001 |
| Urine glucose (positive (%)) | 99 (9.04%) | 111 (6.09%) |
| 0.003 |
| Urine white blood cells (positive (%)) | 316 (28.67%) | 342 (18.62%) |
| <0.001 |
| Urinary occult blood (positive (%)) | 148 (13.52%) | 181 (9.93%) |
| 0.004 |
| Urine white blood cells (cells/ | 6 (2.22) | 5 (2.14) |
| <0.001 |
| Urine red blood cells (cells/ | 7 (4.14) | 5 (3.11) |
| <0.001 |
| Blood potassium (mmol/L) | 4.00 ± 0.45 | 3.96 ± 0.38 |
| 0.009 |
| Blood sodium (mmol/L) | 140.77 ± 3.67 | 140.66 ± 3.01 |
| 0.412 |
| Total cholesterol (mmol/L) | 4.78 ± 1.31 | 4.61 ± 1.21 |
| 0.001 |
| Triglyceride (mmol/L) | 1.30 (0.90, 2.05) | 1.22 (0.87, 1.82) |
| 0.006 |
| Low-density lipoprotein (mmol/L) | 2.68 ± 0.95 | 2.72 ± 0.84 |
| 0.177 |
| Total bilirubin ( | 11.00 (8.20, 15.10) | 11.35 (8.40, 15.40) |
| 0.163 |
| Direct bilirubin ( | 3.3 (2.3, 4.9) | 2.9 (2.0, 4.2) |
| <0.001 |
| Alanine aminotransferase (mmol/L) | 17.0 (12.0, 25.0) | 17.0 (12.0, 26.5) |
| 0.211 |
| Fasting blood glucose (mmol/L) | 5.52 (4.86, 6.70) | 5.05 (4.56, 5.71) |
| <0.001 |
| Blood urea nitrogen (mmol/L) | 5.26 ± 1.97 | 4.77 ± 1.79 |
| <0.001 |
| Serum creatinine ( | 70.79 ± 18.11 | 62.19 ± 16.38 |
| <0.001 |
| Blood uric acid ( | 302.18 ± 104.63 | 292.58 ± 94.92 |
| 0.010 |
| Albumin (g/L) | 41.08 ± 4.95 | 40.67 ± 5.23 |
| 0.031 |
| Globulin (g/L) | 26.76 ± 5.31 | 26.10 ± 4.61 |
| <0.001 |
| Thrombin time (s) | 17.27 ± 1.91 | 16.64 ± 2.94 |
| <0.001 |
| International normalized ratio | 1.03 ± 0.24 | 1.00 ± 0.12 |
| <0.001 |
| Fibrinogen (g/L) | 3.11 ± 1.13 | 3.09 ± 1.12 |
| 0.807 |
Note: lymphocyte/monocyte, ratio of peripheral blood lymphocyte count to monocyte count.
Correlation analysis between the incidence of CKD and laboratory test indicators.
| Features |
|
|
|---|---|---|
|
| 0.122 | <0.001 |
|
| 0.053 | 0.003 |
|
| 0.102 | <0.001 |
| Red blood cell count ( | −0.075 | <0.001 |
|
| −0.185 | <0.001 |
| Platelet distribution width (%) | −0.077 | <0.001 |
|
| 0.129 | <0.001 |
| Urinary occult blood (positive/negative) | 0.053 | 0.004 |
|
| 0.117 | <0.001 |
| Urine white blood cells (cells/ | 0.077 | <0.001 |
|
| 0.160 | <0.001 |
| Prothrombin time (s) | 0.111 | <0.001 |
| International normalized ratio | 0.026 | 0.167 |
|
| 0.136 | <0.001 |
|
| 0.226 | <0.001 |
|
| 0.047 | 0.008 |
|
| 0.054 | 0.002 |
| Albumin (g/L) | 0.026 | 0.141 |
Note: refers to the feature retained after deletion of features with similar clinical significance according to the size of the correlation coefficient.
Ranking of the top 15 XG Boost model features.
| Features |
|
|---|---|
| Protein | 0.220 |
| Urine red blood cells | 0.209 |
| Age | 0.050 |
| Serum-creatinine | 0.032 |
| Gender | 0.017 |
| Albumin-creatinine ratio | 0.015 |
| Leukocyte | 0.013 |
| Erythrocyte | 0.013 |
| Platelet distribution width | 0.009 |
| High-sensitivity C-reactive protein | 0.008 |
| Hemoglobin | 0.008 |
| Hemoglobin A1c | 0.008 |
| Platelet | 0.008 |
| Albumin | 0.007 |
| Potassium | 0.007 |
Prediction performance of the different models.
| Model | Indicator | Indication with MD-BERT-LGBM | ||||||
|---|---|---|---|---|---|---|---|---|
| Accuracy | Precision | Recall | AUC | Accuracy | Precision | Recall | AUC | |
| XGBoost | 0.9088 | 0.9175 | 0.8244 | 0.9549 | 0.9357 | 0.9425 | 0.8782 | 0.9719 |
| SVM | 0.8048 | 0.8330 | 0.5828 | 0.8705 | 0.7992 | 0.8392 | 0.5575 | 0.8704 |
| NB | 0.7811 | 0.8326 | 0.4973 | 0.8460 | 0.8086 | 0.8670 | 0.5556 | 0.7693 |
| RF | 0.9020 | 0.9318 | 0.7905 | 0.9519 | 0.9108 | 0.9550 | 0.7927 | 0.9716 |
| LR | 0.8276 | 0.7868 | 0.7225 | 0.8903 | 0.8489 | 0.8187 | 0.7551 | 0.9045 |
SVM, support vector machine, RF, random forest, NB, Naïve Bayes, LR, logistic regression model.
Figure 1Receiver operating characteristic curves of the different models. SVM, support vector machine, RF: random forest, NB, Naive Bayes, LR, Logistic regression model.