| Literature DB >> 32741377 |
Lijuan Wu1,2, Yong Hu1,2, Xiangzhou Zhang1,2, Weiqi Chen1,2, Alan S L Yu3, John A Kellum4, Lemuel R Waitman5, Mei Liu6.
Abstract
BACKGROUND: Likelihood of developing acute kidney injury (AKI) increases with age. We aimed to explore whether the predictability of AKI varies between age groups and assess the volatility of risk factors using electronic medical records (EMR).Entities:
Keywords: Acute kidney injury (AKI); Electronic medical record; Machine learning; Risk factor; Risk prediction
Year: 2020 PMID: 32741377 PMCID: PMC7397647 DOI: 10.1186/s12882-020-01980-w
Source DB: PubMed Journal: BMC Nephrol ISSN: 1471-2369 Impact factor: 2.388
Clinical variables extracted for the study cohort
| Feature Category | # of Variables | Details |
|---|---|---|
| 3 | Age, race, gender; | |
| 5 | BMI, diastolic BP, systolic BP, pulse, temperature; | |
| 14 | Albumin, ALT, AST, Ammonia, Calcium, BUN, Bilirubin, CK-MB, CK, Glucose, Lipase, Platelets, Troponin, WBC; | |
| 315 | University Health System Consortium (UHC) APR-DRG; (e.g. liver transplant, heart &/or lung transplant, etc.) | |
| 1271 | All medications are mapped to RxNorm ingredient; (e.g. lithium carbonate, pentostatin, ospemifene, oxybutynin, etc.) | |
| 280 | ICD9 codes mapped to CCS major diagnoses. (e.g. Nervous system congenital anomalies, other congenital anomalies, etc.) |
Demographic characteristics and AKI onset time of patients by age category
| Variable | Age 18–35 ( | Age 36–55 ( | Age 56–65 ( | Age > 65 ( | ||||
|---|---|---|---|---|---|---|---|---|
| AKI | Non-AKI | AKI | Non-AKI | AKI | Non-AKI | AKI | Non-AKI | |
| n (%) | 983 (7.29) | 11,890 (92.71) | 2222 (8.82) | 22,975 (91.18) | 1906 (10.53) | 16,192 (89.47) | 2193 (10.55) | 18,596 (98.45) |
| White | 660 (67.14) | 8038 (67.60) | 1537 (69.17) | 16,652 (72.48) | 1444 (75.76) | 13,024 (80.43) | 1767 (80.57) | 15,463 (83.15) |
| Black | 150 (15.26) | 1958 (16.47) | 420 (18.90) | 3808 (16.57) | 264 (13.85) | 1926 (11.89) | 231 (10.53) | 1644 (8.84) |
| Asian | 7 (0.71) | 147 (1.24) | 12 (0.54) | 190 (0.83) | 17 (0.89) | 112 (0.69) | 18 (0.82) | 151 (0.81) |
| Other | 121 (12.31) | 1792 (15.07) | 253 (11.39) | 2325 (10.12) | 181 (9.50) | 1130 (6.98) | 177 (8.07) | 1338 (7.20) |
| Male | 549 (55.85) | 6066 (51.02) | 1302 (58.60) | 12,337 (53.70) | 1170 (61.39) | 9297 (57.42) | 1288 (58.73) | 10,150 (54.58) |
| Unknown | 39 (4.16) | 1209 (10.13) | 92 (4.14) | 1848 (8.04) | 57 (2.99) | 830 (5.13) | 65 (2.96) | 947 (5.09) |
| < 18.5 | 81 (8.63) | 563 (4.72) | 56 (2.52) | 537 (2.34) | 44 (2.31) | 479 (2.96) | 69 (3.15) | 680 (3.66) |
| 18.5–24.9 | 325 (34.65) | 4076 (34.15) | 460 (20.70) | 5133 (22.34) | 379 (19.88) | 3630 (22.42) | 535 (24.40) | 5542 (29.80) |
| 25.0–29.9 | 201 (21.43) | 2594 (21.73) | 567 (25.52) | 5879 (25.59) | 493 (25.87) | 4454 (27.51) | 701 (31.97) | 5950 (32.00) |
| > 30.0 | 292 (31.13) | 3493 (29.27) | 1047 (47.12) | 9578 (41.69) | 933 (48.95) | 6799 (41.99) | 823 (37.53) | 5477 (29.45) |
| LT | 9 (0.96) | 9 (0.08) | 68 (3.06) | 49 (0.21) | 65 (3.41) | 69 (0.43) | 15 (0.68) | 20 (0.11) |
| CF | 121 (12.90) | 564 (4.73) | 26 (1.17) | 125 (0.54) | 6 (0.31) | 39(.24) | 0 (0.00) | 3 (0.02) |
| HF | 6 (0.64) | 19 (0.16) | 45 (2.03) | 127 (0.55) | 36 (1.89) | 121 (0.75) | 62 (2.83) | 277 (1.49) |
| ND | 193 (20.58) | 1040 (8.71) | 203 (9.14) | 1844 (8.03) | 156 (8.18) | 1416 (8.75) | 172 (7.84) | 1721 (9.25) |
| ED | 112 (11.94) | 585 (4.90) | 228 (10.26) | 2283 (9.94) | 239 (12.54) | 1959 (12.10) | 271 (12.36) | 2551 (13.72) |
| EH | 104 (11.09) | 886 (7.42) | 648 (29.16) | 5834 (25.39) | 825 (43.28) | 5968 (36.86) | 1053 (48.02) | 8715 (6.86) |
| T | 403 (42.96) | 2089 (17.50) | 791 (35.60) | 3742 (16.29) | 567 (29.75) | 2662 (16.44) | 515 (23.48) | 2892 (15.55) |
| V | 366 (39.02) | 1936 (16.22) | 762 (34.29) | 4485 (19.52) | 620 (32.53) | 3656 (22.58) | 655 (29.87) | 489 (22.53) |
| A | 755 (80.49) | 9883 (82.81) | 1816 (81.73) | 19,782 (86.10) | 1585 (83.16) | 14,097 (87.06) | 1889 (86.14) | 16,601 (89.27) |
| Days | 3 [2–6] | – | 3 [2–5] | – | 3 [2–6] | – | 3 [2–6] | – |
Note: AKI Acute kidney injury, Non-AKI Not acute kidney injury. Values for categorical variables are given as number (percentage), AKI onset time is given as median [interquartile range] of admission days
Fig. 1Venn diagram for the top 200 features identified in four age groups. This figure shows the number of overlapping features identified as top 200 across the four age groups
Fig. 2Heat map of the top-200 risk factors appeared in all four age groups. The figure shows the corresponding ranking of each factor in the GBM model
Fig. 3Prediction trends of the top-ranking features with under-sampling for the four age groups across different machine learning models
Fig. 4ROC curves of random forest without under-sampling for the four age groups
Significant level of pairwise comparison
| Age Group | G1 ~ G2 | G1 ~ G3 | G1 ~ G4 | G2 ~ G3 | G2 ~ G4 | G3 ~ G4 |
|---|---|---|---|---|---|---|
| Prevalence (%) | 7.29 ~ 8.82 | 7.29 ~ 10.53 | 7.29 ~ 10.55 | 8.82 ~ 10.53 | 8.82 ~ 10.55 | 10.53 ~ 10.55 |
| 0.97 | ||||||
| AUC (LR) | 0.759 [0.728–0.789] ~ 0.760 [0.750–0.770] | 0.759 [0.728–0.789] ~ 0.746 [0.731–0.760] | 0.759 [0.728–0.789] ~ 0.725 [0.713–0.736] | 0.760 [0.750–0.770] ~ 0.746 [0.731–0.760] | 0.760 [0.750–0.770] ~ 0.725 [0.713–0.736] | 0.746 [0.731–0.760] ~ 0.725 [0.713–0.736] |
| 0.81 | 0.13 | |||||
| AUC (SVM) | 0.767 [0.743–0.790] ~ 0.766 [0.753–0.780] | 0.767 [0.743–0.790] ~ 0.742 [0.729–0.756] | 0.767 [0.743–0.790] ~ 0.723 [0.702–0.744] | 0.766 [0.753–0.780] ~ 0.742 [0.729–0.756] | 0.766 [0.753–0.780] ~ 0.723 [0.702–0.744] | 0.742 [0.729–0.756] ~ 0.723 [0.702–0.744] |
| 0.10 | 0.12 | 0.06 | ||||
| AUC (LB) | 0.750 [0.728–0.773] ~ 0.753 [0.733–0.772] | 0.750 [0.728–0.773] ~ 0.738 [0.727–0.750] | 0.750 [0.728–0.773] ~ 0.712 [0.701–0.724] | 0.753 [0.733–0.772] ~ 0.738 [0.727–0.750] | 0.753 [0.733–0.772] ~ 0.712 [0.701–0.724] | 0.738 [0.727–0.750] ~ 0.712 [0.701–0.724] |
| 0.73 | 0.33 | 0.09 | ||||
| AUC (RF) | 0.784 [0.769–0.800] ~ 0.766 [0.754–0.777] | 0.784 [0.769–0.800] ~ 0.754 [0.741–0.768] | 0.784 [0.769–0.800] ~ 0.723 [0.709–0.737] | 0.766 [0.754–0.777] ~ 0.754 [0.741–0.768] | 0.766 [0.754–0.777] ~ 0.723 [0.709–0.737] | 0.754 [0.741–0.768] ~ 0.723 [0.709–0.737] |
| 0.11 | 0.11 |
Abbreviation:AUC the area under the receiver operating characteristic curve for top-200 features with under-sampling, LR Logistic Regression, SVM Support Vector Machine, LB LogistBoost, RF Random Forest, G1 18–35 age group; G2 36–55 age group, G3 56–65 age group, G4 > 65 age group. P value in bold represents p < 0.05