| Literature DB >> 28122032 |
Danielle Saly1, Alina Yang1, Corey Triebwasser2, Janice Oh2, Qisi Sun1, Jeffrey Testani1, Chirag R Parikh1,3,4, Joshua Bia3, Aditya Biswas3, Chess Stetson5, Kris Chaisanguanthum5, F Perry Wilson1,3,4.
Abstract
Despite recognition that Acute Kidney Injury (AKI) leads to substantial increases in morbidity, mortality, and length of stay, accurate prognostication of these clinical events remains difficult. It remains unclear which approaches to variable selection and model building are most robust. We used data from a randomized trial of AKI alerting to develop time-updated prognostic models using stepwise regression compared to more advanced variable selection techniques. We randomly split data into training and validation cohorts. Outcomes of interest were death within 7 days, dialysis within 7 days, and length of stay. Data elements eligible for model-building included lab values, medications and dosages, procedures, and demographics. We assessed model discrimination using the area under the receiver operator characteristic curve and r-squared values. 2241 individuals were available for analysis. Both modeling techniques created viable models with very good discrimination ability, with AUCs exceeding 0.85 for dialysis and 0.8 for death prediction. Model performance was similar across model building strategies, though the strategy employing more advanced variable selection was more parsimonious. Very good to excellent prediction of outcome events is feasible in patients with AKI. More advanced techniques may lead to more parsimonious models, which may facilitate adoption in other settings.Entities:
Mesh:
Year: 2017 PMID: 28122032 PMCID: PMC5266278 DOI: 10.1371/journal.pone.0169305
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Baseline Characteristics at the Onset of AKI.
| Training Cohort (n = 1098) | Validation Cohort (n = 1143) | P-Value | |
|---|---|---|---|
| Age (yr) | 62.1 (51.7–71.7) | 62.8 (51.8–71.7) | 0.85 |
| Male Sex (%) | 617 (56.3) | 628 (55.5) | 0.68 |
| Black (%) | 294 (26.8) | 309 (27.0) | 0.89 |
| Hispanic (%) | 28 (2.6) | 37 (3.3) | 0.33 |
| BMI | 28.0 (24.0–33.0) | 27.0 (23.0–32.0) | 0.02 |
| Anion Gap, per 1 unit | 8.0 (7.0–10.0) | 8.0 (6.0–10.0) | 0.95 |
| Bicarbonate, meq/L | 24.0 (22.0–27.0) | 25.0 (22.0–27.0) | 0.87 |
| BUN, mg/dL | 21.0 (12.0–32.0) | 21.0 (13.0–33.0) | 0.54 |
| BUN Slope, mg/dl/24h | 4.7 (0.9–9.7) | 4.5 (0.9–9.8) | 0.98 |
| Calcium, mg/dL | 8.4 (7.9–8.9) | 8.4 (7.9–8.9) | 0.96 |
| Chloride, meq/L | 104.0 (101.0–108.0) | 104.0 (100.0–108.0) | 0.97 |
| Creatinine, mg/dL | 1.4 (1.0–1.8) | 1.4 (1.0–1.9) | 0.39 |
| Creatinine Slope, mg/dl/24h | 0.4 (0.2–0.7) | 0.4 (0.2–0.7) | 0.69 |
| Glucose, mg/dL | 123.0 (100.0–156.0) | 122.0 (98.0–156.0) | 0.48 |
| Hematocrit, % | 31.0 (27.0–35.0) | 30.0 (27.0–35.0) | 0.17 |
| Hemoglobin, g/dL | 10.2 (8.9–11.7) | 10.1 (8.9–11.5) | 0.10 |
| Magnesium, meq/L | 2.0 (1.8–2.2) | 2.0 (1.8–2.2) | 0.83 |
| Mean Corpuscular Hemoglobin (MCH), pg/cell | 30.1 (2.8) | 30.1 (2.9) | 0.63 |
| MCH Concentration, g/dL | 33.0 (32.0–34.0) | 33.0 (32.0–34.0) | 0.16 |
| Platelet Count, 1000/uL | 185.0 (117.0–263.0) | 189.0 (131.0–261.0) | 0.26 |
| Potassium, meq/L | 4.2 (3.8–4.6) | 4.2 (3.8–4.6) | 0.50 |
| Red Cell Distribution, % | 15.6 (14.2–17.4) | 15.9 (14.4–18.0) | 0.002 |
| Sodium, meq/L | 137.0 (135.0–140.0) | 137.0 (134.0–140.0) | 0.92 |
| White Blood Cell Count, 100/uL | 9.5 (6.5–13.9) | 9.7 (6.6–14.4) | 0.25 |
| Pantoprazole Use | 51 (4.6%) | 54 (4.7%) | 0.93 |
| CHF | 356 (32.4) | 375 (32.9) | 0.82 |
| Diabetes | 309 (28.1) | 368 (32.2) | 0.04 |
| Cancer | 294 (26.8) | 288 (25.2) | 0.41 |
| Chronic Kidney Disease | 279 (25.4) | 307 (26.9) | 0.42 |
| Liver Disease | 164 (14.9) | 158 (13.8) | 0.45 |
Baseline characteristics at AKI onset. Comparisons between continuous covariates were made with rank-sum tests, and categorical covariates with chi-square tests.
1 AKI = Acute Kidney Injury as defined by KDIGO creatinine criteria.
2 CHF = Congestive Heart Failure.
3Pantoprazole was the only proton-pump inhibitor on formulary at the hospital at the time of this study.
Comparison of prognostic models in the validation cohort.
| Outcome | Conventional Model, Training Cohort | Alternative Model, Training Cohort | Conventional Model, Validation Cohort | Alternative Model, Validation Cohort |
|---|---|---|---|---|
| 0.89 (0.86–0.93) | 0.88 (0.86–0.90) | 0.82 (0.76–0.88) | 0.84 (0.80–0.89) | |
| 0.90 (0.88–0.93) | 0.85 (0.81–0.90) | 0.80 (0.75–0.84) | 0.80 (0.76–0.85) | |
| 0.44 (0.32–0.57) | 0.26 (0.21–0.30) | 0.17 (0.10–0.24) | 0.20 (0.14–0.26)* |
1 Death and dialysis are evaluated in terms of area under the receiver-operator characteristic curve (AUC).
2 Length of stay is evaluated in terms of the model R2.
3 * = p<0.05 compared to conventional model.
Fig 1Receiver-Operator Characteristic curves for Dialysis.
Receiver-operator characteristic (ROC) curves comparing the performance of conventional vs. alternative models in the prediction of dialysis in the validation cohort. Area under the curve for conventional model: 0.82 (0.76–0.88), alternative model 0.84 (0.80–0.89).
Fig 2Principal Components Analysis.
Colored points reflect individual level data, where individuals are mapped to a coordinate plane based upon 2 principal components derived from laboratory (panel A) and medication (panel B) data. Next to the colored plots, the covariate map appears. Covariates are mapped along the same two principal component vectors, helping to illustrate the correlations among several of the covariates. A) Laboratory covariates as mapped on two principal components. Based on laboratory values, a patient (represented as a dot) can be put anywhere on the coordinate plane. For the outcome of death within 7 days, red dots indicate an individual who died in that time frame, black an individual who did not. For LOS analyses, blue dots indicate shorter lengths of stay, with red dots indicating longer lengths of stay. Clustering of colors along one dimension of the plot suggests a significant relationship between that principal component and the outcome. Next to the patient plots is a plot showing each lab on the same two principal coordinate axes. Labs that are closer together a more correlated (for example, creatinine and BUN). Size of the text indicates strength of association between a given lab and that principal component. B) Medication covariates as mapped on two principal components. Based on medications received, a patient (represented as a dot) can be put anywhere on the coordinate plane. For the outcome of death within 7 days, red dots indicate an individual who died in that time frame, black an individual who did not. For LOS analyses, blue dots indicate shorter lengths of stay, with red dots indicating longer lengths of stay. Clustering of colors along one dimension of the plot suggests a significant relationship between that principal component and the outcome. Next to the patient plots is a plot showing each medication on the same two principal coordinate axes. Medications that are closer together a more correlated (for example, vancomycin and fentanyl). Size of the text indicates strength of association between a given lab and that principal component. Covariates ending in "category" are binary (ie D50 category is a 1 if the patient has received 50% dextrose infusion), whereas those ending in "dose" reflect the actual dose received. Higher resolution figures are available in S2 File.
Fig 3Receiver-Operator Characteristic curves for death.
Comparing the performance of conventional vs. alternative models in the prediction of death in the validation cohort. Area under the curve for conventional model: 0.80 (0.75–0.84), alternative model 0.80 (0.76–0.85).