| Literature DB >> 33664585 |
Qing Qian1,2, Jinming Wu2, Jiayang Wang2, Haixia Sun2, Lei Yang1.
Abstract
PURPOSE: To assess the performance of models for early prediction of acute kidney injury (AKI) in the Intensive Care Unit (ICU) setting. PATIENTS AND METHODS: Data were collected from the Medical Information Mart for Intensive Care (MIMIC)-III database for all patients aged ≥18 years who had their serum creatinine (SCr) level measured for 72 h following ICU admission. Those with existing conditions of kidney disease upon ICU admission were excluded from our analyses. Seventeen predictor variables comprising patient demographics and physiological indicators were selected on the basis of the Kidney Disease Improving Global Outcomes (KDIGO) and medical literature. Six models from three types of methods were tested: Logistic Regression (LR), Support Vector Machines (SVM), Random Forest (RF), eXtreme Gradient Boosting (XGBoost), Light Gradient Boosting Decision Machine (LightGBM), and Convolutional Neural Network (CNN). The area under receiver operating characteristic curve (AUC), accuracy, precision, recall and F-measure (F1) were calculated for each model to evaluate performance.Entities:
Keywords: acute kidney injury; deep learning; intensive care unit; machine learning; prediction models
Year: 2021 PMID: 33664585 PMCID: PMC7921629 DOI: 10.2147/IJGM.S289671
Source DB: PubMed Journal: Int J Gen Med ISSN: 1178-7074
Figure 1Time windows for data collection.
Figure 2Handling and splitting of imbalanced data.
Demographic Characteristics of the Patient Cohort at Baseline
| Age (Years) | BMI (kg/m2) | Male/Female | ||||
|---|---|---|---|---|---|---|
| AKI | Non-AKI | AKI | Non-AKI | AKI | Non-AKI | |
| Mean | 68.19 | 64.58 | 29.15 | 28.51 | – | – |
| Standard deviation | 14.90 | 16.92 | 7.71 | 8.09 | – | – |
| P (95% CI) | < 2.2×10−16 | 1.977×10−8 | – | |||
| Ratio | 1.46 | 1.19 | ||||
Statistic Summary of the 10-Times AUC Result for Each Model
| Group | Sum | Average | Variance |
|---|---|---|---|
| LightGBM | 9.04933 | 0.904933 | 0.000678 |
| Logistic regression | 7.355297 | 0.73553 | 9.51×10−5 |
| Random forest | 7.284963 | 0.728496 | 0.000132 |
| Support vector machine | 7.371453 | 0.737145 | 8.81×10−5 |
| eXtreme Gradient Boosting | 7.6001 | 0.76001 | 5.96×10−5 |
| Convolutional neural network | 7.183812 | 0.718381 | 0.000311 |
ANOVA of the 10-Times AUC Result for Each Model
| Source of Variation | SS | df | MS | F | P-value | F Crit |
|---|---|---|---|---|---|---|
| Between groups | 0.247513 | 5 | 0.049503 | 217.7206 | 1.71E-34 | 2.38607 |
| Within groups | 0.012278 | 54 | 0.000227 | |||
| Total | 0.259791 | 59 |
Abbreviations: SS, sum of squares of deviation from mean; DF, degree of freedom; MS, mean square; F, F-measure.
Predictive Models Results
| Accuracy | Precision | Recall | F1 | AUC | |
|---|---|---|---|---|---|
| LightGBM | 0.905 | 0.971 | 0.836 | 0.897 | 0.905 |
| eXtreme Gradient Boosting | 0.76 | 0.745 | 0.8 | 0.771 | 0.76 |
| Support vector machine | 0.737 | 0.763 | 0.695 | 0.727 | 0.737 |
| Logistic regression | 0.735 | 0.756 | 0.694 | 0.724 | 0.736 |
| Random forest | 0.729 | 0.727 | 0.743 | 0.735 | 0.728 |
| Convolutional neural network | 18 | 0.715 | 0.730 | 0.722 | 18 |
Studies on Predicting Acute Kidney Injury in ICU
| Studies | Models Tested | Best Model |
|---|---|---|
| Mengxin Sun et al | LR, RF, NB, SVM, CNN | SVM |
| Lindsay P. Zimmerman et al | LR, RF, MP | MP |
| Yikuan L et al | LR, RF, NB, SVM, GBDT, CNN | LR and SVM |
| Yuan Zhan et al | LR, RF, LightGBM | LightGBM |
Abbreviations: LR, logistic regression; RF, random forest; NB, Naïve Bayes; SVM, support vector machine; CNN, convolutional neural networks; MP, multilayer perceptron; GBDT, gradient boosting decision tree; LightGBM, Light Gradient Boosting Decision.
Figure 3The importance of each feature in each model.