| Literature DB >> 32024311 |
Cheng-Sheng Yu1,2, Chang-Hsien Lin1,2, Yu-Jiun Lin1,2, Shiyng-Yu Lin1,2, Sen-Te Wang1,2, Jenny L Wu1,2, Ming-Hui Tsai3, Shy-Shin Chang1,2.
Abstract
BACKGROUND: Preventive medicine and primary health care are essential for patients with chronic kidney disease (CKD) because the symptoms of CKD may not appear until the renal function is severely compromised. Early identification of the risk factors of CKD is critical for preventing kidney damage and adverse outcomes. Early recognition of rapid progression to advanced CKD in certain high-risk populations is vital.Entities:
Keywords: chronic kidney disease; clustering; heatmap; multivariate statistical analysis; risk factors
Year: 2020 PMID: 32024311 PMCID: PMC7073732 DOI: 10.3390/jcm9020403
Source DB: PubMed Journal: J Clin Med ISSN: 2077-0383 Impact factor: 4.241
Descriptive statistics and testing of medical variables in health examination data with chronic kidney disease.
| CKD Stage 1 | CKD Stage 2 | CKD Stage 3–5 | ||
|---|---|---|---|---|
| Factors | No. (%) | No. (%) | No. (%) | |
| Sex | ||||
| Female | 1017 (59.3%) | 96(25.9%) | 63(31.2%) | <0.001 |
| Male | 698 (40.7%) | 274(74.1%) | 139(68.8%) | |
| Hypertension | ||||
| Normal | 1299 (75.7%) | 214(57.8%) | 112(55.4%) | <0.001 |
| High | 416 (24.3%) | 156(42.2%) | 90(44.6%) | |
| Median (IQR) | ||||
| Age, years | 42 (35–51) | 53 (46–60.75) | 63.5 (55.25–73) | <0.001 |
| Albumin, g/dL | 4.6 (4.4–4.8) | 4.6 (4.5–4.8) | 4.3 (3.8–4.5) | <0.001 |
| BMI, kg/m2 | 23.4 (21.1–25.99) | 24.7 (22.7–27.02) | 25.05 (22.15–27.98) | <0.001 |
| WC, cm | 80.5 (73.5–88) | 86 (80–92) | 87.58 (82–95.75) | <0.001 |
| AFP, ng/mL | 2.35 (1.62–3.27) | 2.62 (1.98–3.62) | 2.69 (2.15–4.48) | <0.001 |
| ALKp, IU/L | 60 (50–72) | 67 (55–79) | 79.69 (65.68–96) | <0.001 |
| GOT, IU/L | 20 (17–24) | 23 (20–28) | 23.1 (19–34) | <0.001 |
| GPT, IU/L | 18 (13–28) | 23 (17–33) | 22 (15–35) | <0.001 |
| T_Bilirubin, mg/dL | 0.6 (0.4–0.8) | 0.6 (0.5–0.8) | 0.6 (0.4–0.9) | 0.002 |
| γGT, U/L | 17 (12–27) | 23 (17–34) | 34.63 (21–70.90) | <0.001 |
| CAPscore, dB/m | 241 (208–281) | 260 (227–301.8) | 246 (203–296.5) | <0.001 |
| Escore, kPa | 4.3 (3.5–5.1) | 4.6 (3.7–5.6) | 7.9 (4.925–13.525) | <0.001 |
| BUN, mg/dL | 12 (10–14.34) | 15 (13–18) | 23.5 (16.78–38.82) | <0.001 |
| Creatinine, mg/dL | 0.7 (0.6–0.8) | 1 (1–1.1) | 1.7 (1.425–3.9) | <0.001 |
| UA, mg/dL | 5.1 (4.3–6.3) | 6.3 (5.5–7.3) | 6.6 (5.7–7.975) | <0.001 |
| Cholesterol, mg/dL | 187 (165–209) | 191.5 (165–217) | 169 (130.2–195) | <0.001 |
| HbA1C, % | 5.4 (5.2–5.6) | 5.525 (5.3–5.9) | 5.7 (5.3–6.3) | <0.001 |
| HDL, mg/dL | 54 (45–65) | 49 (41–58) | 46 (37–57.9) | <0.001 |
| LDL, mg/dL | 121 (102–143) | 131 (102.2–152) | 107 (80.25–128.75) | <0.001 |
| TSH, μIU/mL | 1.81(1.2–2.545) | 2.075 (1.465–2.835) | 2.009 (1.54–2.565) | <0.001 |
CKD, chronic kidney disease; BMI, body mass index; WC, waist circumference; AFP, alpha-fetoprotein; ALKp, alkaline phosphatase; GOT, serum glutamic-oxalocetic transaminase; GPT, serum glutamic-pyruvic transaminase; γGT, γ-Glutamyl transpeptidase; BUN, blood urea nitrogen; UA, uric acid; HDL, high-density lipoprotein; LDL, low-density lipoprotein; HbA1C, glycated hemoglobin; TSH, thyroid-stimulating hormone; IU, international units; U, μmol/min; μIU, micro-international units. The upper part includes categorical variables and the others are continuous variables, which are general physiological indices in the first section, hepatic indices and nephritic elements in the second section, and blood lipid and thyroid indices in the final section.
Multivariate logistic regression analysis of whole biomarkers related to chronic kidney disease.
| Factors | Odds Ratio | 95% CI OR | VIF | ΔVIF | |
|---|---|---|---|---|---|
| BMI, kg/m2 | 0.904 | (0.854, 0.958) | 3.977 | 3.975 |
|
| WC, cm | 1.046 | (1.023, 1.069) | 4.170 | 4.169 |
|
| Cholesterol, mg/dL | 0.983 | (0.973, 0.994) |
|
|
|
| Hypertension | 0.213 | (0.654, 1.099) | 1.123 | 1.122 | 0.213 |
| HbA1C, % |
| (1.087, 1.434) | 1.223 | 1.178 |
|
| HDL, mg/dL | 1.005 | (0.994, 1.016) | 1.945 | 1.334 | 0.363 |
| LDL, mg/dL | 1.016 | (1.005, 1.028) |
|
|
|
| Albumin, g/dL | 0.793 | (0.536, 1.174) | 1.355 | 1.364 | 0.247 |
| ALKp, IU/L | 1.001 | (0.995, 1.006) | 1.202 | 1.209 | 0.761 |
| GOT, IU/L | 1.030 | (1.012, 1.050) | 3.832 | 3.801 |
|
| GPT, IU/L | 0.977 | (0.966, 0.988) | 3.847 | 3.803 |
|
| γGT, U/L | 1.002 | (0.997, 1.006) | 1.452 | 1.448 | 0.425 |
| T_Bilirubin, mg/dL | 1.088 | (0.930, 1.273) | 1.182 | 1.180 | 0.291 |
| CAPscore, dB/m | 1.003 | (1.000, 1.006) | 1.646 | 1.647 |
|
| Escore, kPa | 1.012 | (0.985, 1.040) | 1.622 | 1.595 | 0.393 |
| AFP, ng/mL | 1.000 | (1.000, 1.000) | 1.141 | 1.129 | 0.528 |
| BUN, mg/dL |
| (1.190, 1.270) | 1.063 | 1.062 |
|
| UA, mg/dL |
| (1.349, 1.619) | 1.346 | 1.333 |
|
| TSH, μIU/mL | 1.047 | (1.004, 1.091) | 1.016 | 1.015 | 0.031 |
* indicates the p value < 0.001, † indicates the p-value < 0.05. BMI, body mass index; WC, waist circumference; HbA1C, glycated hemoglobin; HDL, high-density lipoprotein; LDL, low-density lipoprotein; ALKp, alkaline phosphatase; GOT, serum glutamic-oxalocetic transaminase; GPT, serum glutamic-pyruvic transaminase; γGT, γ-Glutamyl transpeptidase; AFP, alpha-fetoprotein; BUN, blood urea nitrogen; UA, uric acid; TSH, thyroid-stimulating hormone; IU, international units; U, μmol/min; μIU, micro-international units. Chronic kidney disease is the dependent variable, and all the biomarkers are the independent variables in the logistic analysis. The odds ratio represents the exp(β), which is the exponential of the estimator in logistic regression with 95% confidence interval. In addition, the variance inflation factor (VIF) for each variable is calculated to check multicollinearity. Factors with high odds ratio or significant p-value are marked in bold. Factors with high VIF values are shaded. ΔVIF records the variance inflation factor (VIF) after removing the predictor variables with high VIF value.
Figure 1A flowchart of the step-by-step procedure from data collection and preprocessing to statistical analyses.
Figure 2(a) Variable importance ordered by the accuracy of mean decrease in random forest. The leading variables obtained by random forest list in a with a darker blue; conversely, less prominent variables are indicated in a lighter blue. (b) area under the ROC curve of random forest. The rainbow bar indicates the value of specificity in the false-positive rate.
Figure 3A clustering heatmap illustrating the classification of chronic kidney disease (CKD) in health examination data. Both the rows of CKD patients and the columns of biomarkers have been clustered, respectively; row data is also normalized into Z-score, simultaneously. Moreover, the mapping grids in the center have been colored according to their z scores.