| Literature DB >> 32957977 |
Yuan Wang1,2, Yake Wei3, Hao Yang4, Jingwei Li3,5, Yubo Zhou1, Qin Wu6.
Abstract
BACKGROUND: Acute Kidney Injury (AKI) is a shared complication among Intensive Care Unit (ICU), marked by high cost, high morbidity and high mortality. As the early prediction of AKI is critical for patients' outcomes and data mining is such a powerful prediction tool, many AKI prediction models based on machine learning methods have been proposed. Our motivation is inspired by the fact that the incidence of AKI is a changing temporal sequence affected by the joint action of patients' daily drug combinations and their physiological indexes. However, most existing models have not considered such a temporal correlation. Besides, due to great challenges caused by sparse, high-dimensional and highly imbalanced clinical data, it is hard to achieve ideal performance.Entities:
Keywords: Acute kidney injury (AKI); Drug combination; ETSM; Ensemble learning; Prediction
Mesh:
Year: 2020 PMID: 32957977 PMCID: PMC7507620 DOI: 10.1186/s12911-020-01245-4
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Feature geeration process of explicit indicator group. Take patient i as an example, the duration of ICU stay is from 20110912 to 20110920. Each color block stands for a series of vital signs and laboratory results value on a certain. This figure was generated by PowerPoint 2019
Fig. 2Feature generation process of implicit indicator group. P is the patient ID number. Each color block stands for a series of drugs taken on a certain day. This figure was generated by PowerPoint 2019
Fig. 3Framework of ETSM. This figure was generated by PowerPoint 2019
Study sample characteristics of ICUC and MIMIC III
| Original samples | 13053 | 52152 |
| AKI samples | 2035 | 29344 |
| Timing of AKI onset | Avg 3.92 | Avg 2.00 |
| Timing of AKI onset | Max 30 | Max 7 |
| Timing of AKI onset | Min 0 | Min 0 |
| Vital signs and laboratory results | 101 | 38 |
| Distinct drug | 75 | 3235 |
| Drug combination | 5154 | 3085 |
| Most widely used drug in samples | 91.49% | 68.82% |
| Patients with insufficient information | 1401 | 5559 |
| Samples used to predict AKI 24 hours ahead | 11501 | 46593 |
| Samples used to predict AKI 48 hours ahead | 10921 | 30217 |
Fig. 4Study sample characteristics of ICUC and MIMIC III. (a) ICUC-timing of AKI onset (b) MIMIC III-timing of AKI onset (c) ICUC-drugs popularity (d) MIMIC III-drugs popularity. These figure were generated by Excel 2019
Performance of imbalanced learning techniques on ICUC
| AUC | Sensitivity | F1-socre | AP | |||||
|---|---|---|---|---|---|---|---|---|
| 24h | 48h | 24h | 48h | 24h | 48h | 24h | 48h | |
| Random Undersample | 0.81 | 0.78 | 0.75 | 0.68 | 0.58 | 0.44 | 0.59 | 0.41 |
| Random Oversample | 0.78 | 0.69 | 0.64 | 0.43 | 0.62 | 0.44 | 0.66 | 0.46 |
| Cost-sensitive XGBoost | 0.78 | 0.70 | 0.64 | 0.45 | 0.61 | 0.46 | 0.67 | 0.47 |
Performance of prediction models on ICUC
| Model | AUC | Sensitivity | F1-score | AP | ||||
|---|---|---|---|---|---|---|---|---|
| 24h | 48h | 24h | 48h | 24h | 48h | 24h | 48h | |
| ETSM | 0.81 | 0.78 | 0.75 | 0.68 | 0.58 | 0.44 | 0.59 | 0.41 |
| AdaBoost | 0.78 | 0.75 | 0.66 | 0.62 | 0.54 | 0.41 | 0.60 | 0.41 |
| Random Forest | 0.73 | 0.75 | 0.51 | 0.59 | 0.54 | 0.44 | 0.60 | 0.40 |
| Naive Bayes | 0.53 | 0.52 | 0.09 | 0.05 | 0.15 | 0.07 | 0.60 | 0.41 |
| k-Nearest Neighbor | 0.63 | 0.62 | 0.37 | 0.30 | 0.36 | 0.31 | 0.59 | 0.41 |
Performance of prediction models on MIMIC III
| Model | AUC | Sensitivity | F1-score | AP | ||||
|---|---|---|---|---|---|---|---|---|
| 24h | 48h | 24h | 48h | 24h | 48h | 24h | 48h | |
| ETSM | 0.95 | 0.95 | 0.95 | 0.98 | 0.96 | 0.98 | 0.98 | 0.98 |
| AdaBoost | 0.89 | 0.93 | 0.93 | 0.97 | 0.93 | 0.96 | 0.98 | 0.98 |
| Random Forest | 0.78 | 0.78 | 0.91 | 0.97 | 0.86 | 0.91 | 0.93 | 0.97 |
| Naive Bayes | 0.67 | 0.65 | 0.61 | 0.66 | 0.68 | 0.73 | 0.82 | 0.86 |
| k-Nearest Neighbor | 0.72 | 0.82 | 0.64 | 0.83 | 0.76 | 0.88 | 0.83 | 0.93 |
Performance of derived ETSM on ICUC in the experiment of predicting AKI 24 hours ahead
| Model | AUC | Sensitivity | F1-score | AP |
|---|---|---|---|---|
| (95% CI) | (95% CI) | (95% CI) | (95% CI) | |
| ETSM | 0.810 ±0.002 | 0.746 ±0.003 | 0.577 ±0.003 | 0.594 ±0.004 |
| ETSM-ex | 0.737 ±0.002* | 0.629 ±0.004* | 0.470 ±0.002* | 0.470 ±0.003* |
| ETSM-bool | 0.759 ±0.002* | 0.654 ±0.005* | 0.512 ±0.003* | 0.530 ±0.004* |
| ETSM-times | 0.803 ±0.002* | 0.726 ±0.003* | 0.579 ±0.003 | 0.647 ±0.003 |
Note: CI = confident interval
*indicates ETSM significantly outperforms the baseline with p <0.01 using Student t-test
Performance of derived ETSM on ICUC in the experiment of predicting AKI 48 hours ahead
| Model | AUC | Sensitivity | F1-score | AP |
|---|---|---|---|---|
| (95% CI) | (95% CI) | (95% CI) 7 (95% CI) | ||
| ETSM | 0.776 ±0.002 | 0.683 ±0.004 | 0.437 ±0.003 | 0.406 ±0.004 |
| ETSM-ex | 0.775 ±0.003 | 0.684 ±0.006 | 0.434 ±0.003 | 0.396 ±0.005* |
| ETSM-bool | 0.786 ±0.003 | 0.702 ±0.005 | 0.453 ±0.003 | 0.434 ±0.006 |
| ETSM-times | 0.806 ±0.002 | 0.739 ±0.006 | 0.476 ±0.003 | 0.476 ±0.005 |
Note: CI = confident interval
*indicates ETSM significantly outperforms the baseline with p <0.01 using Student t-test
Fig. 5Performance of ETSM with different initialization on ICUC in the experiment of predicting AKI 24 hours ahead. This figure was generated by Excel 2019
Drug combinations with top 10 feature importance
| Rank | Drug Combination |
|---|---|
| 1 | 5, 21 |
| 2 | 43, 69 |
| 3 | 21, 46 |
| 4 | 21, 43 |
| 5 | 21, 23, 43 |
| 6 | 1, 43, 69 |
| 7 | 5, 21, 50 |
| 8 | 21, 43, 69 |
| 9 | 21, 32 |
| 10 | 1, 21, 43 |
Note: Each number represents a kind of drugs. The number is the index for this drug in the ICUC dataset
1: norvancomycin
5: ciprofloxacin lactate and sodium chloride injection
21: indometacin enteric-coated tablets
23: piperacillin sodium/tazobactam sodium
32: ibuprofen
43: ceftazidime for injection
46: cefathiamidine for injection
50: aztreonam for injection
69: naproxen