| Literature DB >> 36060071 |
Luming Zhang1,2, Zichen Wang2,3, Zhenyu Zhou4, Shaojin Li5, Tao Huang2, Haiyan Yin1, Jun Lyu2.
Abstract
Sepsis-associated acute kidney injury (S-AKI) is very common and early prediction is beneficial. This study aiming to develop an accurate ensemble model to predict the risk of S-AKI based on easily available clinical information. Patients with sepsis from the United States (US) database Medical Information Mart for Intensive Care-IV were used as a modeling cohort to predict the occurrence of AKI by combining Support Vector Machine, Random Forest, Neural Network, and Extreme Gradient Boost as four first-level learners via stacking algorithm. The external validation databases were the eICU Collaborative Research Database from US and Critical Care Database comprising infection patients at Zigong Fourth People's Hospital from China, whose AUROC values for the ensemble model 48-12 h before the onset of AKI were 0.774-0.788 and 0.756-0.813, respectively. In this study, an ensemble model for early prediction of S-AKI onset was developed and it demonstrated good performance in multicenter external datasets.Entities:
Keywords: Artificial intelligence; Medicine; Nephrology
Year: 2022 PMID: 36060071 PMCID: PMC9429796 DOI: 10.1016/j.isci.2022.104932
Source DB: PubMed Journal: iScience ISSN: 2589-0042
Figure 1Flow chart for participant inclusion and model processing in the study
Baseline characteristics of included sepsis patients from three databases
| MIMIC-IV | eICU-CRD | ZG | p-value | |
|---|---|---|---|---|
| N | 21,038 | 24,352 | 505 | |
| Age (year) | 67 (56, 78) | 67(55, 77) | 72(62, 81) | <0.001 |
| Gender (%) | <0.001 | |||
| Male | 12,111 (57.6) | 12,871 (52.9) | 324 (64.2) | |
| Female | 8,927 (42.4) | 11,479 (47.1) | 181 (35.8) | |
| Weight (kg) | 79.0 (66.4, 95.0) | 80.6(65.8, 99.1) | / | / |
| Height (cm) | 170.0 (163.0, 178.0) | 168.0 (160.0, 177.8) | / | / |
| APS | 57.0 (42.0, 76.0) | 54.0 (40.0, 72.0) | / | / |
| Unit type (%) | <0.001 | |||
| MICU/SICU | 12,342 (58.7) | 19,405 (79.7) | 110 (21.8) | |
| Others | 8,696 (41.3) | 4,947 (20.3) | 395 (78.2) | |
| Ethnicity (%) | / | |||
| White | 14,057 (66.8) | 18,747 (77.0) | / | |
| Others | 6,981 (33.2) | 5,605 (23.0) | / | |
| Vasopressor (%) | <0.001 | |||
| No | 13,431 (63.8) | 18,943 (77.8) | 297 (58.8) | |
| Yes | 7,607 (36.2) | 5,409 (22.2) | 208 (41.2) | |
| Ventilator (%) | <0.001 | |||
| No | 5,826 (27.7) | 7,533 (30.9) | 114 (22.6) | |
| Yes | 15,212 (72.3) | 16,819 (69.1) | 391 (77.4) | |
| RRT (%) | <0.001 | |||
| No | 20,477 (97.3) | 22,896 (94.0) | 475 (94.1) | |
| Yes | 561 (2.7) | 1456 (6.0) | 30 (5.9) | |
| AKI (%) | <0.001 | |||
| No | 5,253 (25.0) | 18,140(74.5) | 403(79.8) | |
| Yes | 15,785 (75.0) | 6,212(25.5) | 102(20.2) | |
| Length of ICU stay (day) | 4.5 (3.0, 8.3) | 4.3 (2.9, 7.6) | 7.7 (3.9, 16.5) | <0.001 |
| Length of hospital stay (day) | 11.0 (7.0, 20.0) | 10.5 (6.5, 17.7) | 15.8 (6.9, 28.5) | <0.001 |
| ICU mortality (%) | <0.001 | |||
| No | 18,335 (87.2) | 21,903 (89.9) | 379 (75.0) | |
| Yes | 2,703 (12.8) | 2,449 (10.1) | 126 (25.0) |
APS: Acute Physiology Score; MICU: Medical Intensive Care Unit; SICU: Surgical Intensive Care Unit; Some of clinical information was not recorded in ZG database therefore replaced by ‘/’.
p-value for continuous variables were calculated by Kruskal-Wallis test and p-value for categorical variables were calculated by Chi-square test.
Figure 2The value of features among datasets 48–12 h before AKI onset
For the three databases, the relationship between mean values of features and time before AKI onset was visualized as a line graph; The distance between the error bars and the mean represents SE Although the age of individuals remained constant in the present study, since the composition of the training and testing cohorts are not identical, the mean and SE for each feature’s summary points were different.
Figure 3The predictive performance of first-level leaners and the ensemble model
AUROC values of four first-level leaners and the ensemble model predicting AKI 12, 24, 36, and 48 h before onset as tested by eICU-CRD datasets (a) and ZG datasets (b).
Evaluation metrics of the ensemble model in testing cohorts
| Hours to AKI | Sensitivity | Specificity | PPV | NPV | F1 | Accuracy | Balanced Accuracy |
|---|---|---|---|---|---|---|---|
| 48 | 0.650 | 0.764 | 0.412 | 0.896 | 0.505 | 0.741 | 0.707 |
| 36 | 0.690 | 0.737 | 0.400 | 0.903 | 0.506 | 0.727 | 0.713 |
| 24 | 0.724 | 0.723 | 0.400 | 0.912 | 0.516 | 0.724 | 0.724 |
| 12 | 0.695 | 0.738 | 0.404 | 0.905 | 0.511 | 0.729 | 0.717 |
| 48 | 0.700 | 0.757 | 0.398 | 0.917 | 0.507 | 0.746 | 0.728 |
| 36 | 0.685 | 0.771 | 0.411 | 0.913 | 0.514 | 0.754 | 0.728 |
| 24 | 0.840 | 0.716 | 0.408 | 0.951 | 0.549 | 0.740 | 0.778 |
| 12 | 0.780 | 0.743 | 0.415 | 0.935 | 0.542 | 0.750 | 0.762 |
PPV: Positive Predictive Values; NPV: Negative Predictive Values; Balanced Accuracy: (Sensitivity + Sensitivity)/2.
Figure 4Model explanations for the ensemble model
XAI methods for one S-AKI onset and one control patient are exhibited in (a) and (b); (a.1; b.1) represented LIME method; (a.2; b2) represented SHAP method, (a.3; b.3) represented Break Down method; (a.4; b.4) represented iBreakDown method. Bar plots to right direction represented positive prediction and bar plots to left direction represented negative prediction. Boxplots for iBreakDown represented the uncertainty of features contributions.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Medical Information Mart for Intensive Care-IV | Physionet | |
| eICU Collaborative Research Database | Physionet | |
| Critical Care Database comprising infection patients at Zigong Fourth People’s Hospital | Physionet | |
| Structured Query Language | Github | |
| SQL Server (2016) | Microsoft data platform | |
| R (v3.6.3) | R CRAN | |
| caret R package (v3.6.3) | R CRAN | |
| caretEnsemble R package (v3.6.3) (Ensemble model construction) | R CRAN | |
| ranger R package (v3.6.3) (Random Forest model construction) | R CRAN | |
| kernlab R package (v3.6.3) (Support Vector Machine construction) | R CRAN | |
| nnet R package (v3.6.3) (Neural Network construction) | R CRAN | |
| xgboost R package (v3.6.3) (eXtreme Gradient Boosting construction) | R CRAN | |
| DALEX R package (v3.6.3) | R CRAN | |
| DALEXtra R package (v3.6.3) | R CRAN | |
| lime R package (v3.6.3) | R CRAN | |
| iBreakDown R package (v3.6.3) | R CRAN | |
| dplyr R package (v3.6.3) | R CRAN | |
| ggplot2 R package (v3.6.3) | R CRAN | |
| shiny R package (v3.6.3) | R CRAN | |