| Literature DB >> 32838613 |
Yang Li1,2,3,4,5, Xiaohong Chen1,2,3,4,5, Ziyan Shen1,2,3,4,5, Yimei Wang1,2,3,4,5, Jiachang Hu1,2,3,4,5, Yunlu Zhang1,2,3,4,5, Jiarui Xu1,2,3,4,5, Xiaoqiang Ding1,2,3,4,5.
Abstract
BACKGROUND: This study attempts to establish a Bayesian networks (BNs) based model for inferring the risk of AKI in gastrointestinal cancer (GI) patients, and to compare its predictive capacity with other machine learning (ML) models.Entities:
Keywords: Bayesian network; Gastrointestinal cancer; Group LASSO; acute kidney injury; disease prediction; machine learning
Mesh:
Year: 2020 PMID: 32838613 PMCID: PMC7472473 DOI: 10.1080/0886022X.2020.1810068
Source DB: PubMed Journal: Ren Fail ISSN: 0886-022X Impact factor: 2.606
Figure 1.AKI incidence in varied demographics among patients with gastrointestinal cancers.
Risk factors of AKI in patients with gastrointestinal cancers in the derivation cohort.
| Variable | Total | AKI (%) | χ2 | cOR (95%CI) | aOR (95%CI) | |
|---|---|---|---|---|---|---|
| Comorbidities | ||||||
| Hypertension | 683 | 118 (17.3) | 5.511† | 0.019 | 1.29 (1.04∼1.60) | 1.15 (0.92∼1.44) |
| Diabetes | 348 | 78 (22.4) | 19.757† | <0.001 | 1.80 (1.39∼2.35) | 1.66 (1.27∼2.17) |
| Cancer category | ||||||
| Esophagus | 908 | 186 (20.5) | 35.104† | <0.001 | 1.80 (1.47∼2.20) | 1.83 (1.49∼2.25) |
| Stomach | 2453 | 340 (13.9) | 1.12 (0.95∼1.33) | 1.14 (0.97∼1.35) | ||
| Intestine | 2484 | 311 (12.5) | 1.00 | 1.00 | ||
| Cancer stage | ||||||
| Loco-regional | 5338 | 777 (14.6) | 2.796† | 0.095 | 0.79 (0.60∼1.04) | 0.78 (0.59∼1.03) |
| Metastases | 507 | 60 (11.8) | 1.00 | 1.00 | ||
| In-hospital condition | ||||||
| Emergent | 554 | 107 (19.3) | 12.441† | <0.001 | 1.50 (1.19∼1.87) | 1.38 (1.10∼1.74) |
| Normal | 5291 | 730 (13.8) | 1.00 | 1.00 | ||
| Treatment | ||||||
| Surgery | 2859 | 640 (22.4) | 301.444† | <0.001 | 7.34 (3.99∼13.50) | 8.87 (4.79∼16.44) |
| Chemotherapy | 2179 | 162 (7.4) | 2.04 (1.10∼3.81) | 2.40 (1.28∼4.50) | ||
| Interventional | 516 | 24 (4.7) | 1.24 (0.60∼2.57) | 1.49 (0.72∼3.11) | ||
| Untreated/palliative | 291 | 11 (3.8) | 1.00 | 1.00 | ||
| Liver function | ||||||
| ALT (≥80 U/L) | 311 | 76 (24.4) | 27.404† | <0.001 | 2.03 (1.55∼2.66) | 2.07 (1.58∼2.72) |
| AST (≥70 U/L) | 272 | 52 (19.1) | 5.352† | 0.021 | 1.44 (1.06∼1.97) | 1.42 (1.04∼1.95) |
| TBiL (≥20.4 μmol/L) | 508 | 93 (18.3) | 7.209† | 0.007 | 1.38 (1.09∼1.76) | 1.34 (1.05∼1.70) |
| Renal function | ||||||
| SCr (≥115 μmol/L) | 209 | 118 (56.5) | 313.701† | <0.001 | 8.87 (6.67∼11.79) | 7.93 (5.93∼10.60) |
| eGFR (≥90 mL/min/1.73m2) | 3263 | 302 (9.3) | 282.507‡ | <0.001 | 1.00 | 1.00 |
| eGFR (60∼89 mL/min/1.73m2) | 2282 | 390 (17.1) | 2.02 (1.72∼2.37) | 1.98 (1.67∼2.33) | ||
| eGFR (≤59 mL/min/1.73m2) | 300 | 145 (48.3) | 9.17 (7.10∼11.84) | 8.82 (6.73∼11.56) | ||
| SUA (≤359 μmol/L) | 4670 | 589 (12.6) | 91.391‡ | <0.001 | 1.00 | 1.00 |
| SUA (360∼420 μmol/L) | 709 | 117 (16.5) | 1.37 (1.10∼1.70) | 1.35 (1.08∼1.69) | ||
| SUA (421∼480 μmol/L) | 300 | 76 (25.3) | 2.35 (1.79∼3.09) | 2.25 (1.70∼2.98) | ||
| SUA (≥481 μmol/L) | 166 | 55 (33.1) | 3.43 (2.46∼4.80) | 3.29 (2.34∼4.62) | ||
| Biochemical test | ||||||
| Hypoalbuminemia | 1960 | 416 (21.2) | 114.578† | <0.001 | 2.22 (1.91∼2.57) | 2.08 (1.79∼2.42) |
| Anemia | 4205 | 725 (17.2) | 104.251† | <0.001 | 2.84 (2.31∼3.50) | 2.75 (2.23∼3.40) |
| Hyponatremia | 1290 | 352 (27.3) | 293.274† | <0.001 | 3.39 (2.90∼3.96) | 3.28 (2.80∼3.84) |
| Hypernatremia | 113 | 42 (37.2) | 5.34 (3.60∼7.92) | 5.08 (3.41∼7.56) | ||
| Hypokalemia | 796 | 187 (23.5) | 206.422† | <0.001 | 2.24 (1.86∼2.69) | 2.23 (1.85∼2.69) |
| Hyperkalemia | 90 | 51 (56.7) | 9.52 (6.22∼14.57) | 8.99 (5.86∼13.81) | ||
aaOR was adjusted for age, gender, and body mass index.
†refers to Pearson test (for binary and unordered categorical variables).
‡refers to Cochran–Mantel–Haenszel test (for ordered categorical variables).
AKI: Acute kidney injury; cOR: crude Odds ratio; aOR: adjusted Odds ratio; ALT: Alanine aminotransferase; AST: Aspartate aminotransferase; TBiL: Total Bilirubin; SCr: Serum creatinine; eGFR: estimated Glomerular filtration rate; SUA: Serum uric acid.
Predictive variables of AKI selected by gLASSO regression.
| Variate | OR (95% CI) | ||
|---|---|---|---|
| Diabetes | 1.64 (1.21∼2.24) | 0.002 | |
| Cancer Category (Esophagus) | 2.83 (2.24∼3.58) | <0.001 | |
| Cancer Category (Stomach) | 1.28 (1.07∼1.54) | 0.008 | |
| Treatment (Surgery) | 8.52 (4.39∼16.55) | <0.001 | |
| Treatment (Chemotherapy) | 2.23 (1.14∼4.38) | 0.019 | |
| Treatment (Interventional) | 2.04 (0.93∼4.47) | 0.074 | |
| ALT (≥80 U/L) | 1.79 (1.32∼2.44) | <0.001 | |
| SCr (≥115 umol/L) | 3.57 (2.01∼6.35) | <0.001 | |
| eGFR (60∼89 mL/min/1.73m2) | 1.81 (1.51∼2.16) | <0.001 | |
| eGFR (≤59 mL/min/1.73m2) | 4.04 (2.42∼6.76) | <0.001 | |
| SUA (360∼420 μmol/L) | 1.09 (0.85∼1.40) | 0.501 | |
| SUA (421∼480 μmol/L) | 1.79 (1.28∼2.51) | 0.001 | |
| SUA (≥481 μmol/L) | 1.79 (1.16∼2.77) | 0.008 | |
| Hypoalbuminemia | 1.26 (1.06∼1.50) | 0.008 | |
| Anemia | 1.61 (1.28∼2.02) | <0.001 | |
| Hyponatremia | 2.17 (1.82∼2.59) | <0.001 | |
| Hypernatremia | 3.34 (2.14∼5.22) | <0.001 | |
| Hypokalemia | 1.78 (1.44∼2.18) | <0.001 | |
| Hyperkalemia | 2.52 (1.53∼4.14) | <0.001 |
gLASSO: group LASSO; OR: Odds ratio; ALT: Alanine aminotransferase; SCr: Serum creatinine; eGFR: estimated Glomerular filtration rate; SUA: Serum uric acid.
Figure 2.Bayesian Network model of AKI risk factors in patients with gastrointestinal cancers. AKI: Acute kidney injury; ALT: Alanine aminotransferase; SCr: Serum creatinine; eGFR: estimated Glomerular filtration rate; SUA: Serum uric acid.
Figure 3.Receiver operating characteristic curves of AKI prediction models based on different ML algorithms.
Internal and external validation of AKI prediction models based on different ML algorithms.
| Models | Internal validation | External validation | ||||||
|---|---|---|---|---|---|---|---|---|
| Accuracy | Recall | PPV | NPV | Accuracy | Recall | PPV | NPV | |
| Bayesian Network | 87.0% | 26.5% | 60.7% | 88.8% | 85.1% | 24.8% | 54.3% | 87.4% |
| Decision tree | 86.1% | 13.5% | 56.2% | 87.2% | 84.6% | 10.9% | 52.4% | 85.7% |
| Random forest | 87.3% | 22.1% | 67.3% | 88.3% | 85.5% | 19.8% | 60.6% | 86.9% |
| Support vector machine | 86.2% | 14.1% | 56.7% | 87.2% | 84.5% | 10.9% | 50.0% | 85.7% |
| Logistic-score | 85.7% | 7.4% | 65.3% | 86.1% | 85.4% | 7.9% | 80.0% | 85.5% |
| Naive Bayes | 86.2% | 30.5% | 53.3% | 89.2% | 84.9% | 28.7% | 52.7% | 87.9% |
Accuracy rate is the sum of correctly classified cases test divided by the data set size (TP + TN)/(TP + TN + FP + FN). Recall rate is the positively classified cases divided by the positive cases TP/(TP + FN). Positive predictive value (PPV) is the proportion of positive cases that are true positives TP/(TP + FP). Negative predictive value (NPV) is the proportion of negative cases that are true negatives TN/(TN + FN).