| Literature DB >> 29589567 |
Weiqi Chen1,2, Yong Hu3,4, Xiangzhou Zhang1,2, Lijuan Wu1,2, Kang Liu1,2, Jianqin He1,2, Zilin Tang1,2, Xing Song5, Lemuel R Waitman5, Mei Liu6.
Abstract
BACKGROUND: Acute kidney injury (AKI), characterized by abrupt deterioration of renal function, is a common clinical event among hospitalized patients and it is associated with high morbidity and mortality. AKI is defined in three stages with stage-3 being the most severe phase which is irreversible. It is important to effectively discover the true risk factors in order to identify high-risk AKI patients and allow better targeting of tailored interventions. However, Stage-3 AKI patients are very rare (only 0.2% of AKI patients) with a large scale of features available in EHR (1917 potential risk features), yielding a scenario unfeasible for any correlation-based feature selection or modeling method. This study aims to discover the key factors and improve the detection of Stage-3 AKI.Entities:
Keywords: Acute kidney injury (AKI); Causal feature selection; Causality discovery; Dimension reduction; Machine learning; Predictive modeling
Mesh:
Year: 2018 PMID: 29589567 PMCID: PMC5872516 DOI: 10.1186/s12911-018-0597-7
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
The KDIGO staging system for AKI
| AKI Stage | Serum Creatinine (SCr) Criteria |
|---|---|
| 1 | Increase > 26.4 μmol/L (0.3 mg/dL) or 1.5–1.9 times baseline |
| 2 | Increase 2.0–2.9 times baseline |
| 3 | Increase creatinine > 354 μmol/L (4.0 mg/dL) or 3 times baseline |
Clinical variables considered in building predictive models for Stage-3 AKI
| Feature Category | # of Variable | Details |
|---|---|---|
| Demographics | 3 | Age, gender, race |
| Vitals | 5 | BMI, diastolic BP, systolic BP, pulse, temperature |
| Lab Tests | 14 | Albumin, ALT, AST, Ammonia, Blood Bilirubin, BUN, Ca, CK-MB, CK, Glucose, Lipase, Platelets, Troponin, WBC |
| Comorbidities | 28 | UHC comorbidity |
| Admission Diagnosis | 129 | UHC APR-DRG |
| Medications | 482 | All medications are mapped to RxNorm ingredient |
| Medical History | 230 | ICD9 codes mapped to CCS major diagnoses |
Categories for vital signs
| Vitals | Categories |
|---|---|
| BMI | < 18.5, [18.5–24.9], [25.0–29.9], > 30.0, Unknown |
| Diastolic BP | < 80, [80–89], [90–99], > 100, Unknown |
| Systolic BP | < 120, [120–139], [140–159], > 160, Unknown |
| Pulse | < 50, [50–65], [66–80], [81–100], > 100, Unknown |
| Temperature | < 95.0, [95.0–97.6], [97.7–99.5], [99.5–104.0], > 104.0, Unknown |
Fig. 1The causal discovery process of McDsL. The structure learning phase is employed for dimensionality reduction, which the potential indirect causal features are deleted. The direction learning phase is proposed for discover many-to-one causalities
The converted model of two features
|
|
|
|
| |
|---|---|---|---|---|
| states |
|
|
| 1 |
| ⋮ | ⋮ | ⋮ | ⋮ | |
|
|
|
|
| |
|
|
|
| ||
| ⋮ | ⋮ | ⋮ | ⋮ | |
|
|
|
|
|
OR and 95% CI of combinations of discovered risk factors
| Combinations of risk factors | Risk factors | Odd Ratio [95% CI] | |||
|---|---|---|---|---|---|
| #1 | #2 | #3 | #4 | ||
| CoRF1 | 1 | 0 | 0 | 0 | 1.24 [0.69, 2.23] |
| CoRF2 | 0 | 1 | 0 | 0 | 1.47 [0.92, 2.34] |
| CoRF3 | 0 | 0 | 1 | 0 | 0.64 [0.26, 1.56] |
| CoRF4 | 0 | 0 | 0 | 1 | 0.66 [0.37, 1.19] |
| CoRF5 | 1 | 0 | 0 | 1 | 0.62 [0.30, 1.25] |
| CoRF6 | 1 | 0 | 1 | 0 | 0.48 [0.18, 1.29] |
| CoRF7 | 1 | 1 | 0 | 0 | 1.56 [0.90, 2.70] |
| CoRF8 | 0 | 1 | 0 | 1 | 1.48 [0.97, 2.26] |
| CoRF9 | 0 | 1 | 1 | 0 | 1.24 [0.67, 2.28] |
| CoRF10 | 0 | 0 | 1 | 1 | 0.30 [0.04, 2.12] |
| CoRF11 | 1 | 0 | 1 | 1 | 0.28 [0.04, 1.97] |
| CoRF12 | 1 | 1 | 0 | 1 | 0.94 [0.58, 1.53] |
| CoRF13 | 1 | 1 | 1 | 0 | 0.92 [0.52, 1.62] |
| CoRF14 | 0 | 1 | 1 | 1 | 1.20 [0.53, 2.72] |
| CoRF15 | 1 | 1 | 1 | 1 | 1.18 [0.66, 2.12] |
Fig. 2The prediction results of McDSL with five machine learning methods. AUC, F-score, Precision, Recall, and 95% confidence intervals obtained using different machine learning methods, with 10-fold cross-validation, for predicting Stage-3 AKI during hospital stay
The comparison of different feature selected features
| Models (# of variable) | AUC | F-score | Recall | Precision |
|---|---|---|---|---|
| McDSL (4) | 0.812 | 0.753 | 0.761 | 0.743 |
| McDSL + PLoS one (6) | 0.814 | 0.748 | 0.744 | 0.738 |
| LR (88) | 0.837 | 0.775 | 0.810 | 0.734 |