| Literature DB >> 35787248 |
Liwei Peng1, Chi Peng2, Fan Yang3, Jian Wang1, Wei Zuo1, Chao Cheng1, Zilong Mao1, Zhichao Jin4, Weixin Li5.
Abstract
OBJECTIVE: Our study aimed to identify predictors as well as develop machine learning (ML) models to predict the risk of 30-day mortality in patients with sepsis-associated encephalopathy (SAE).Entities:
Keywords: Machine learning; Model interpretation; SAE; Sepsis-associated encephalopathy; Web-based calculator
Mesh:
Year: 2022 PMID: 35787248 PMCID: PMC9252033 DOI: 10.1186/s12874-022-01664-z
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.612
Fig. 1Overview of the methods used for data extraction, training, and testing. ICU, intensive care unit; MIMIC, Medical Information Mart for Intensive Care; ML, machine learning; NNET, artificial neural network; NB, naïve bayes; LR, logistic regression; GBM, gradient boosting machine; Ada, adapting boosting; RF, random forest; BT, bagged trees; XGB, eXtreme Gradient Boosting
Baseline characteristic of the MIMIC-IV cohorts
| Variables | Survival | Death | |
|---|---|---|---|
| Age (y), median [Q1, Q3] | 70.00 (58.00,81.00) | 77.00 (66.00,85.25) | < 0.001 |
| Male, n (%) | 3181 (55.21) | 659 (53.49) | 0.286 |
| Race, n (%) | 0.001 | ||
| Black | 490 (8.50) | 93 (7.55) | |
| White | 4000 (69.42) | 824 (66.88) | |
| Hispanic | 174 (3.02) | 31 (2.52) | |
| Asian | 193 (3.35) | 31 (2.52) | |
| Others | 905 (15.71) | 253 (20.54) | |
| Myocardial infarction | 1006 (17.46) | 265 (21.51) | 0.001 |
| Congestive heart failure | 1977 (34.31) | 540 (43.83) | < 0.001 |
| Peripheral vascular disease | 696 (12.08) | 174 (14.12) | 0.054 |
| Cerebrovascular disease | 259 (4.49) | 51 (4.14) | 0.636 |
| Dementia | 300 (5.21) | 81 (6.57) | 0.064 |
| Chronic pulmonary disease | 1728 (29.99) | 421 (34.17) | 0.004 |
| Rheumatic disease | 234 (4.06) | 79 (6.41) | < 0.001 |
| Peptic ulcer disease | 229 (3.97) | 64 (5.19) | 0.063 |
| Mild liver disease | 709 (12.30) | 242 (19.64) | < 0.001 |
| Diabetes without complication | 1308 (22.70) | 255 (20.70) | 0.135 |
| Diabetes with complication | 507 (8.80) | 105 (8.52) | 0.798 |
| Paraplegia | 124 (2.15) | 17 (1.38) | 0.101 |
| Renal disease | 1349 (23.41) | 387 (31.41) | < 0.001 |
| Malignant cancer | 839 (14.56) | 342 (27.76) | < 0.001 |
| Severe liver disease | 225 (3.90) | 93 (7.55) | < 0.001 |
| Metastatic solid tumor | 343 (5.95) | 205 (16.64) | < 0.001 |
| AIDS | 41 (0.71) | 10 (0.81) | 0.849 |
| CCI, median [Q1, Q3] | 6.00 (4.00,8.00) | 7.00 (6.00,9.00) | < 0.001 |
| Temperature (°C), median [Q1, Q3] | 36.90 (36.60,37.30) | 36.70 (36.40,37.10) | < 0.001 |
| MAP (mmHg), median [Q1, Q3] | 75.00 (70.00,82.00) | 73.00 (68.00,80.00) | < 0.001 |
| Heart rate (min), median [Q1, Q3] | 88.00 (77.00,100.00) | 91.00 (80.00,104.00) | < 0.001 |
| Respiratory rate (min), median [Q1, Q3] | 20.00 (17.00,23.00) | 22.00 (19.00,25.00) | < 0.001 |
| RBC (× 109/L), median [Q1, Q3] | 3.41 (3.02,3.92) | 3.28 (2.88,3.80) | < 0.001 |
| WBC (× 109/L), median [Q1, Q3] | 11.80 (8.60,16.03) | 12.62 (8.77,17.50) | 0.002 |
| MCH (pg), median [Q1, Q3] | 30.20 (28.73,31.50) | 30.12 (28.70,31.63) | 0.925 |
| MCHC (%), median [Q1, Q3] | 33.00 (31.85,34.00) | 32.40 (31.30,33.50) | < 0.001 |
| MCV (fL), median [Q1, Q3] | 91.00 (87.00,95.00) | 92.75 (88.00,97.33) | < 0.001 |
| PLT (× 109/L), median [Q1, Q3] | 197.00 (139.00,268.50) | 189.33 (116.71,272.75) | 0.001 |
| RDW (%), median [Q1, Q3] | 14.77 (13.73,16.27) | 16.00 (14.58,18.00) | < 0.001 |
| HCT (%), median [Q1, Q3] | 31.30 (27.78,35.70) | 30.38 (26.80,35.07) | < 0.001 |
| APTT (s), median [Q1, Q3] | 31.90 (27.80,39.80) | 35.70 (29.30,49.50) | < 0.001 |
| PT (s), median [Q1, Q3] | 14.27 (12.85,16.50) | 15.70 (13.40,20.00) | < 0.001 |
| INR, median [Q1, Q3] | 1.30 (1.15,1.50) | 1.40 (1.20,1.85) | < 0.001 |
| pH, median [Q1, Q3] | 7.37 (7.32,7.42) | 7.36 (7.31,7.42) | < 0.001 |
| Bicarbonate (mmol/L), median [Q1, Q3] | 23.00 (20.50,26.00) | 21.90 (19.00,25.50) | < 0.001 |
| Lactate (mmol/L), median [Q1, Q3] | 1.65 (1.20,2.28) | 1.95 (1.40,2.89) | < 0.001 |
| BE (mEq/L), median [Q1, Q3] | -0.50 (-3.50,1.43) | -1.33 (-5.00,1.00) | < 0.001 |
| Aniongap (mmol/L), median [Q1, Q3] | 14.00 (12.00,16.50) | 15.50 (13.00,18.21) | < 0.001 |
| PaO2 (mmHg), median [Q1, Q3] | 112.00 (76.00,173.00) | 92.00 (68.00,131.00) | < 0.001 |
| PaCO2 (mmHg), median [Q1, Q3] | 41.00 (37.00,47.00) | 41.00 (35.00,48.00) | 0.014 |
| Chloride (mmol/L), median [Q1, Q3] | 104.50 (100.50,108.20) | 103.00 (98.75,107.50) | < 0.001 |
| Calcium (mmol/L), median [Q1, Q3] | 8.20 (7.73,8.63) | 8.10 (7.63,8.65) | 0.097 |
| Sodium (mmol/L), median [Q1, Q3] | 138.60 (136.00,141.00) | 138.00 (134.82,141.43) | 0.001 |
| Potassium (mmol/L), median [Q1, Q3] | 4.15 (3.83,4.55) | 4.28 (3.88,4.78) | < 0.001 |
| Glucose (mmol/L), median [Q1, Q3] | 128.50 (108.33,156.24) | 131.00 (107.00,163.54) | 0.130 |
| CRE (mg/dL), median [Q1, Q3] | 1.05 (0.75,1.68) | 1.35 (0.85,2.30) | < 0.001 |
| BUN (mg/dL), median [Q1, Q3] | 22.29 (15.00,37.67) | 32.58 (20.67,52.35) | < 0.001 |
| Vasopressor | 1799 (31.22) | 538 (43.67) | < 0.001 |
| GCS | 13.00 (9.00,14.00) | 8.00 (3.00,12.00) | < 0.001 |
| SOFA | 6.00 (4.00,9.00) | 9.00 (6.00,12.00) | < 0.001 |
| APSIII | 55.00 (41.00,72.00) | 80.50 (63.00,102.00) | < 0.001 |
| SIRS | 3.00 (2.00,3.00) | 3.00 (2.75,4.00) | < 0.001 |
AIDS Acquired Immunodeficiency Syndrome, CCI Charlson Comorbidity Index, MAP Mean Artery Pressure, RBC Red Blood Cell, WBC White Blood Cell, MCH Mean Corpuscular Hemoglobin, MCHC Mean Corpuscular Hemoglobin Concentration, MCV Mean Corpuscular Volume, PLT Platelet, RDW Red blood cell volume Distribution Width, HCT Hematocrit, APTT Activated Partial Thromboplastin Time, PT Prothrombin Time, INR International Normalized Ratio, pH potential of Hydrogen, BE Buffer Excess, CRE Creatinine, BUN Blood Urea Nitrogen, GCS Glasgow Coma Score, SOFA Sepsis related Organ Failure Assessment, APSIII Acute Physiology Score III, SIRS Systemic Inflammatory Response Syndrome
Fig. 2Association between the number of variables allowed to be considered at each split and the prediction accuracy in the REF algorithm. REF, recursive feature elimination
Fig. 3AUC of ROC curve by ML models in the validation cohort. AUC, area under the curve; ROC, receiver operate characteristics; ML, machine learning; NNET, artificial neural network; NB, naïve bayes; LR, logistic regression; GBM, gradient boosting machine; Ada, adapting boosting; RF, random forest; BT, bagged trees; XGB, eXtreme Gradient Boosting
Analysis of sensitivity and specificity
| Model | Accuracy | Sensitivity | Specificity | PPV | NPV | AUC | Operating threshold | 95% CI |
|---|---|---|---|---|---|---|---|---|
| NNET | 0.840 | 0.802 | 0.733 | 0.391 | 0.946 | 0.833 | 0.164 | (0.816, 0.849) |
| NB | 0.833 | 0.767 | 0.800 | 0.450 | 0.941 | 0.816 | 0.058 | (0.799, 0.833) |
| LR | 0.843 | 0.808 | 0.731 | 0.391 | 0.947 | 0.833 | 0.162 | (0.816, 0.848) |
| GBM | 0.844 | 0.805 | 0.699 | 0.360 | 0.944 | 0.824 | 0.141 | (0.807, 0.840) |
| Ada | 0.846 | 0.786 | 0.737 | 0.390 | 0.942 | 0.834 | 0.148 | (0.817, 0.849) |
| RF | 0.840 | 0.856 | 0.642 | 0.338 | 0.954 | 0.825 | 0.150 | (0.808, 0.841) |
| BT | 0.836 | 0.715 | 0.745 | 0.375 | 0.925 | 0.804 | 0.240 | (0.786, 0.820) |
| XGB | 0.844 | 0.808 | 0.712 | 0.374 | 0.945 | 0.830 | 0.157 | (0.814, 0.846) |
| CatBoost | 0.842 | 0.789 | 0.741 | 0.394 | 0.943 | 0.830 | 0.165 | (0.813, 0.846) |
PPV Positive Predictive Values, NPV Negative Predictive Values, AUC Area Under the Curve, CI Confidence Interval, NNET artificial Neural Network, NB Naïve Bayes, LR Logistic Regression, GBM Gradient Boosting Machine, Ada Adapting boosting, RF Random Forest, BT Bagged Trees, XGB eXtreme Gradient Boosting
Fig. 4Calibration curve in the validation cohort. NNET, artificial neural network; NB, naive bayes; LR, logistic regression; GBM, gradient boosting machine; Ada, adapting boosting; RF, random forest; BT, bagged trees; XGB, eXtreme Gradient Boosting
Fig. 5Variable importance in nine different ML models. ML, machine learning; NNET, artificial neural network; NB, naïve bayes; LR, logistic regression; GBM, gradient boosting machine; Ada, adapting boosting; RF, random forest; BT, bagged trees; XGB, eXtreme Gradient Boosting;
Fig. 6Examples of website usage. Entering the input value determined the mortality and displayed how each value contributed to the prediction. CCI, Charlson Comorbidity Index