| Literature DB >> 35055358 |
Shuo-Ming Ou1,2,3,4, Kuo-Hua Lee1,2,3,4, Ming-Tsun Tsai1,2,3,4, Wei-Cheng Tseng1,2,3,4, Yuan-Chia Chu5,6,7, Der-Cherng Tarng1,2,3,4,8.
Abstract
Sepsis survivors have a higher risk of long-term complications. Acute kidney injury (AKI) may still be common among sepsis survivors after discharge from sepsis. Therefore, our study utilized an artificial-intelligence-based machine learning approach to predict future risks of rehospitalization with AKI between 1 January 2008 and 31 December 2018. We included a total of 23,761 patients aged ≥ 20 years who were admitted due to sepsis and survived to discharge. We adopted a machine learning method by using models based on logistic regression, random forest, extra tree classifier, gradient boosting decision tree (GBDT), extreme gradient boosting, and light gradient boosting machine (LGBM). The LGBM model exhibited the highest area under the receiver operating characteristic curves (AUCs) of 0.816 to predict rehospitalization with AKI in sepsis survivors and followed by the GBDT model with AUCs of 0.813. The top five most important features in the LGBM model were C-reactive protein, white blood cell counts, use of inotropes, blood urea nitrogen and use of diuretics. We established machine learning models for the prediction of the risk of rehospitalization with AKI in sepsis survivors, and the machine learning model may set the stage for the broader use of clinical features in healthcare.Entities:
Keywords: acute kidney injury; artificial intelligence; machine learning; rehospitalization; sepsis; sepsis survivors
Year: 2022 PMID: 35055358 PMCID: PMC8777885 DOI: 10.3390/jpm12010043
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Clinical features and classes in sepsis survivors divided into training and testing dataset.
| All Patients | Training Set | Testing Set | |
|---|---|---|---|
| ( | ( | ( | |
| Demographic and clinical characteristics | |||
| Age, years | 76.4 (61.4, 85.2) | 76.4 (61.2, 85.2) | 76.4 (61.9, 85.2) |
| Male sex, | 8557 (36.0) | 5995 (36.0) | 2562 (35.9) |
| Smoking, | 5373 (22.6) | 3744 (22.5) | 1629 (22.9) |
| Alcohol consumption, | 3945 (16.6) | 2739 (16.5) | 1206 (16.9) |
| Underlying Comorbidities | |||
| Hypertension, | 13,238 (55.7) | 9271 (55.7) | 3967 (55.6) |
| Transient ischemic attack, | 579 (2.4) | 399 (2.4) | 180 (2.5) |
| Ischemic stroke, | 3343 (14.1) | 2358 (14.2) | 985 (13.8) |
| Hemorrhagic stroke, | 1084 (4.6) | 774 (4.7) | 310 (4.3) |
| Dementia, | 3190 (13.4) | 2218 (13.3) | 972 (13.6) |
| Diabetes mellitus, | 7803 (32.8) | 5432 (32.7) | 2371 (33.3) |
| Gout, | 2443 (10.3) | 1737 (10.4) | 706 (9.9) |
| Myocardial infarction, | 1852 (7.8) | 1272 (7.6) | 580 (8.1) |
| Coronary artery disease, | 6260 (26.3) | 4308 (25.9) | 1952 (27.4) |
| CHF, | 4759 (20.0) | 3282 (19.7) | 1477 (20.7) |
| Atrial fibrillation, | 2394 (10.1) | 1667 (10.0) | 727 (10.2) |
| Chronic liver disease, | 3875 (16.3) | 2729 (16.4) | 1146 (16.1) |
| Cirrhosis, | 1395 (5.9) | 996 (6.0) | 399 (5.6) |
| Peptic ulcer disease, | 5632 (23.7) | 3957 (23.8) | 1675 (23.5) |
| COPD, | 4469 (18.8) | 3110 (18.7) | 1359 (19.1) |
| Asthma, | 1192 (5.0) | 835 (5.0) | 357 (5.0) |
| PAOD, | 192 (0.8) | 130 (0.8) | 62 (0.9) |
| Autoimmune disease, | 821 (3.5) | 591 (3.6) | 230 (3.2) |
| Cancer, | 11,592 (48.8) | 8145 (49.0) | 3447 (48.4) |
| Valvular heart disease, | 1303 (5.5) | 908 (5.5) | 395 (5.5) |
| Critical conditions | |||
| ICU admission, | 12,962 (54.6) | 9041 (54.4) | 3921 (55.0) |
| Use of mechanical ventilators, | 8740 (36.8) | 6083 (36.6) | 2657 (37.3) |
| Use of inotropes, | 11,343 (47.7) | 7933 (47.7) | 3410 (47.8) |
| Laboratory data | |||
| Blood urea nitrogen, mg/dL | 24.0 (14.0, 51.0) | 24.0 (14.0, 51.0) | 24.0 (14.0, 50.0) |
| Creatinine, mg/dL | 1.1 (0.7, 2.1) | 1.1 (0.7, 2.2) | 1.1 (0.7, 2.1) |
| White blood cells, /mm3 | 8100 (5700, 11,900) | 8100 (5700, 11,900) | 8100 (5700, 12,000) |
| Hemoglobin, g/dL | 10.1 (8.9, 11.5) | 10.1 (8.9, 11.5) | 10.1 (9.0, 11.6) |
| Sodium, mmol/L | 139.0 (135.0, 142.0) | 139.0 (135.0, 142.0) | 139.0 (135.0, 142.0) |
| Potassium, mmol/L | 4.1 (3.6, 4.6) | 4.1 (3.6, 4.6) | 4.1 (3.6, 4.6) |
| Chloride, mmol/L | 103.0 (98.0, 106.0) | 103.0 (98.0, 106.0) | 103.0 (98.0, 106.0) |
| Calcium, mg/dL | 8.5 (8.0, 9.0) | 8.5 (8.0, 9.0) | 8.5 (8.0, 9.0) |
| Phosphate, mg/dL | 3.3 (2.6, 4.0) | 3.3 (2.6, 4.0) | 3.3 (2.7, 4.1) |
| HCO3, mmol/L | 23.7 (19.3, 28.0) | 23.7 (19.3, 28.0) | 23.8 (19.4, 28.0) |
| C-reactive protein, mg/dL | 3.4 (1.2, 9.0) | 3.4 (1.2, 9.1) | 3.3 (1.1, 8.7) |
| Albumin, mg/dL | 3.0 (2.6, 3.4) | 3.0 (2.6, 3.4) | 3.0 (2.6, 3.4) |
| Alanine transaminase, U/L | 25.0 (15.0, 44.0) | 25.0 (15.0, 45.0) | 25.0 (15.0, 44.0) |
| Aspartate transaminase, U/L | 29.0 (20.0, 51.0) | 29.0 (20.0, 51.0) | 29.0 (20.0, 50.0) |
| Alkaline phosphatase, U/L | 95.0 (70.0, 147.0) | 95.0 (69.0, 147.0) | 94.0 (70.0, 147.0) |
| Gamma-glutamyl transferase, U/L | 54.0 (25.0, 125.0) | 53.0 (25.0, 125.0) | 54.0 (24.0, 126.0) |
| Total bilirubin, mg/dL | 0.6 (0.4, 1.1) | 0.6 (0.4, 1.1) | 0.6 (0.4, 1.1) |
| HbA1c, % | 6.4 (5.8, 7.4) | 6.4 (5.8, 7.4) | 6.4 (5.8, 7.4) |
| Glucose, mg/dL | 116.0 (95.0, 156.0) | 116.0 (94.0, 155.0) | 117.0 (95.0, 157.0) |
| Uric acid, mg/dL | 5.5 (4.1, 7.1) | 5.5 (4.1, 7.1) | 5.6 (4.1, 7.1) |
| Cholesterol, mg/dL | 151.0 (122.0, 182.0) | 152.0 (122.0, 183.0) | 150.0 (121.0, 181.0) |
| LDL-C, mg/dL | 91.0 (70.0, 114.0) | 91.0 (70.0, 115.0) | 91.0 (69.0, 113.0) |
| HDL-C, mg/dL | 41.0 (32.0, 51.0) | 41.0 (32.0, 51.0) | 41.0 (32.0, 51.0) |
| INR | 1.1 (1.0, 1.2) | 1.1 (1.0, 1.2) | 1.1 (1.0, 1.2) |
| aPTT, seconds | 29.9 (27.1, 34.0) | 29.9 (27.2, 34.2) | 29.9 (27.1, 33.8) |
| D-Dimer, ug/mL | 3.6 (1.6, 8.1) | 3.6 (1.5, 7.7) | 3.9 (1.8, 9.3) |
| LDH, U/L | 253.0 (196.0, 361.0) | 252.0 (196.0, 361.0) | 255.0 (197.0, 361.0) |
| NT-proBNP, pg/mL | 3146.0 (836.5, 11,617.0) | 3142.0 (823.8, 11,648.5) | 3185.0 (856.8, 11,580.8) |
| Concomitant Medications | |||
| ACEI, | 2225 (9.4) | 1537 (9.2) | 688 (9.7) |
| ARB, | 6972 (29.3) | 4884 (29.4) | 2088 (29.3) |
| Alpha, blocker, | 6109 (25.7) | 4228 (25.4) | 1881 (26.4) |
| Beta blocker, | 8521 (35.9) | 5891 (35.4) | 2630 (36.9) |
| CCB, | 10,534 (44.3) | 7362 (44.3) | 3172 (44.5) |
| Warfarin, | 1263 (5.3) | 893 (5.4) | 370 (5.2) |
| DOAC, | 147 (0.6) | 106 (0.6) | 41 (0.6) |
| Aspirin, | 5445 (22.9) | 3743 (22.5) | 1702 (23.9) |
| Plavix, | 3267 (13.7) | 2242 (13.5) | 1025 (14.4) |
| Nitrate, | 6521 (27.4) | 4473 (26.9) | 2048 (28.7) |
| Statin, | 3446 (14.5) | 2387 (14.4) | 1059 (14.9) |
| Diuretic, | 14,714 (61.9) | 10,287 (61.9) | 4427 (62.1) |
| Spironolactone, | 4927 (20.7) | 3427 (20.6) | 1500 (21.0) |
| Metformin, | 3459 (14.6) | 2447 (14.7) | 1012 (14.2) |
| Sulfonylurea, | 2214 (9.3) | 1533 (9.2) | 681 (9.6) |
| Meglitinide, | 2150 (9.0) | 1495 (9.0) | 655 (9.2) |
| SGLT2 inhibitor, | 47 (0.2) | 33 (0.2) | 14 (0.2) |
| GLP1 receptor agonist, | 3 (0.0) | 3 (0.0) | 0 (0.0) |
| Dipeptidyl peptidase-4 inhibitor, | 2720 (11.4) | 1883 (11.3) | 837 (11.7) |
| Thiazolidinedione, | 283 (1.2) | 203 (1.2) | 80 (1.1) |
| Alpha-glucosidase inhibitor, | 1084 (4.6) | 744 (4.5) | 340 (4.8) |
| Insulin, | 11,163 (47.0) | 7810 (47.0) | 3353 (47.0) |
| NSAID, | 11,300 (47.6) | 7917 (47.6) | 3383 (47.5) |
| COX-2 inhibitor, | 3316 (14.0) | 2284 (13.7) | 1032 (14.5) |
| Proton pump inhibitor, | 13,642 (57.4) | 9506 (57.2) | 4136 (58.0) |
| Steroid, | 8227 (34.6) | 5781 (34.8) | 2446 (34.3) |
| Allopurinol, | 1583 (6.7) | 1110 (6.7) | 473 (6.6) |
| Febuxostat, | 1446 (6.1) | 1006 (6.0) | 440 (6.2) |
| Benzbromarone, | 1424 (6.0) | 1007 (6.1) | 417 (5.8) |
| Class/Outcome | |||
| Rehospitalization with AKI † | 8756 (36.9) | 6076 (36.5) | 2680 (37.6) |
Data are presented as n (%) or median and interquartile range. † AKI was defined based on the criteria from Kidney Disease: Improving Global Outcomes. Abbreviations: CHF, congestive heart failure; COPD, chronic obstructive pulmonary disease; PAOD, peripheral arterial occlusive disease; ICU, intensive care unit; HCO3, bicarbonate; HbA1c, hemoglobin A1c; LDL-C, low-density lipoprotein cholesterol; HDL-C, high-density lipoprotein-cholesterol; INR, international normalized ratio; aPTT, activated partial thromboplastin time; LDH, lactate dehydrogenase; NT-proBNP, N-terminal pro-brain natriuretic peptide; ACEI, angiotensin converting enzyme inhibitors; ARB, angiotensin receptor blocker; CCB, calcium channel blocker; DOAC, direct oral anticoagulant; SGLT2, sodium-glucose cotransporter 2 inhibitor; GLP1, glucagon-like peptide-1; NSAID, nonsteroidal anti-inflammatory drug; COX-2, cyclooxygenase-2; AKI, acute kidney injury.
Clinical features selected in machine learning algorithm.
| Demographics | Comorbidities | Laboratory Data | Containment Medications |
|---|---|---|---|
| Age | Hypertension | Blood urea nitrogen | ACEI |
| Gender | Transient ischemic attack | Creatinine | ARB |
| Smoking | Ischemic stroke | White blood cell counts | Alpha blocker |
| Alcohol consumption | Hemorrhagic stroke | Hemoglobin | Beta blocker |
| Dementia | Sodium | CCB | |
| Diabetes mellitus | Potassium | Warfarin | |
| Gout | Chloride | DOAC | |
| Myocardial infarction | Calcium | Aspirin | |
| Coronary artery disease | Phosphate | Plavix | |
| CHF | HCO3 | Nitrate | |
| Atrial fibrillation | C-reactive protein | Statin | |
| Chronic liver disease | Albumin | Diuretic | |
| Cirrhosis | Alanine transaminase | Spironolactone | |
| Peptic ulcer disease | Aspartate transaminase | Metformin | |
| COPD | Alkaline phosphatase | Sulfonylurea | |
| Asthma | Gamma-glutamyl transferase | Meglitinide | |
| PAOD | Total bilirubin | SGLT2 inhibitor | |
| Autoimmune disease | HbA1c | GLP1 receptor agonist | |
| Cancer | Glucose | DPP4 inhibitor | |
| Valvular heart disease | Uric acid | Thiazolidinedione | |
| ICU admission | Cholesterol | Alpha-glucosidase inhibitor | |
| Use of mechanical ventilators | LDL-C | Insulin | |
| Use of inotropes | HDL-C | NSAID | |
| INR | COX-2 inhibitor | ||
| aPTT | Proton pump inhibitor | ||
| D-dimer | Steroid | ||
| LDH | Allopurinol | ||
| NT-proBNP | Febuxostat | ||
| Benzbromarone |
Abbreviations: CHF, congestive heart failure; COPD, chronic obstructive pulmonary disease; PAOD, peripheral arterial occlusive disease; ICU, intensive care unit; HCO3, bicarbonate; HbA1c, hemoglobin A1c; LDL-C, low-density lipoprotein cholesterol; HDL-C, high-density lipoprotein-cholesterol; INR, international normalized ratio; aPTT, activated partial thromboplastin time; LDH, lactate dehydrogenase; NT-proBNP, N-terminal pro-brain natriuretic peptide; ACEI, angiotensin converting enzyme inhibitors; ARB, angiotensin receptor blocker; CCB, calcium channel blocker; DOAC, direct oral anticoagulant; SGLT2, sodium-glucose cotransporter 2 inhibitor; GLP1, glucagon-like peptide-1; DPP4, Dipeptidyl peptidase-4; NSAID, nonsteroidal anti-inflammatory drug; COX-2, cyclooxygenase-2.
Figure 1Comparison of (A) ROC curve, (B) PR curve and (C) predictive ability among different machine learning models. Abbreviations: ROC, receiver operating characteristic curve; PR, precision-recall; GBDT, gradient boosting decision tree; XGBoost, extreme gradient boosting; LGBM, light gradient boosting machine LGBM. Abbreviations: ROC, receiver operating characteristic curve; PR, precision-recall; AUC, area under the curve of receiver operating characteristic curve; LGBM, light gradient boosting machine; GBDT, Gradient Boosting Decision Tree; XGBoost, extreme gradient boosting.
Figure 2(A) The feature importance plot using SHAP value reveals the important features in the final LGBM model (B)The local bar plots of feature importance reveal the SHAP values for features in the model. The feature values are show in gray to the left of the feature names. Abbreviations: SHAP, SHapley Additive exPlanation; LGBM, light gradient boosting machine; CRP, C-reactive protein; WBC, white blood cell counts; BUN, blood urea nitrogen; CCB, calcium channel blocker; INR, international normalized ratio; Hgb, hemoglobin; Na, sodium; NSAID, non-steroidal anti-inflammatory drug; aPTT, activated partial thromboplastin time; AST, aspartate aminotransferase; INR, international normalized ratio.
Figure 3(A) The SHAP heatmap plot with the model’s predictions for important features. The heatmap plot function creates a plot with the features on the y-axis, the model inputs on the x-axis, and the SHAP values encoded on a color scale with a higher value being redder, and a lower value being bluer. The SHAP dependence plots showed the interaction effects between (B) CRP and WBC, (C) CRP and BUN and (D) WBC and BUN in the LGBM model. Abbreviations: SHAP, SHapley Additive exPlanation; CRP, C-reactive protein; WBC, white blood cell counts; BUN, blood urea nitrogen; LGBM, light gradient boosting machine.