| Literature DB >> 32704168 |
Nam K Tran1, Samer Albahra2, Tam N Pham3, James H Holmes4, David Greenhalgh5, Tina L Palmieri5, Jeffery Wajda6, Hooman H Rashidi7.
Abstract
Sepsis is the primary cause of burn-related mortality and morbidity. Traditional indicators of sepsis exhibit poor performance when used in this unique population due to their underlying hypermetabolic and inflammatory response following burn injury. To address this challenge, we developed the Machine Intelligence Learning Optimizer (MILO), an automated machine learning (ML) platform, to automatically produce ML models for predicting burn sepsis. We conducted a retrospective analysis of 211 adult patients (age ≥ 18 years) with severe burn injury (≥ 20% total body surface area) to generate training and test datasets for ML applications. The MILO approach was compared against an exhaustive "non-automated" ML approach as well as standard statistical methods. For this study, traditional multivariate logistic regression (LR) identified seven predictors of burn sepsis when controlled for age and burn size (OR 2.8, 95% CI 1.99-4.04, P = 0.032). The area under the ROC (ROC-AUC) when using these seven predictors was 0.88. Next, the non-automated ML approach produced an optimal model based on LR using 16 out of the 23 features from the study dataset. Model accuracy was 86% with ROC-AUC of 0.96. In contrast, MILO identified a k-nearest neighbor-based model using only five features to be the best performer with an accuracy of 90% and a ROC-AUC of 0.96. Machine learning augments burn sepsis prediction. MILO identified models more quickly, with less required features, and found to be analytically superior to traditional ML approaches. Future studies are needed to clinically validate the performance of MILO-derived ML models for sepsis prediction.Entities:
Mesh:
Year: 2020 PMID: 32704168 PMCID: PMC7378181 DOI: 10.1038/s41598-020-69433-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Comparison of sepsis criteria.
ABA American Burn Association, aPTT activated partial thromboplastin time, CRP c-reactive protein, FiO2 fraction of inspired oxygen, GCS Glascow Coma Score, INR international normalized ratio, IV intravenous, NS not significant, PaO2 partial pressure of oxygen, PCT procalcitonin, PLT platelet count, SBP systolic blood pressure, SD standard deviation, SIRS systemic inflammatory response syndrome, SOFA sequential organ failure assessment, SvO2 saturation of venous oxygen, temp temperature, UOP urine output, WBC white blood cell count.
Daily recorded variables for enrolled subjects.
| Vital sign | Blood gas | Chemistry | Heme/Coag | Microbiology | Calculated values | Clinical events |
|---|---|---|---|---|---|---|
| Heart rate | pH | Na+ | HGB | Blood culture | Anion gap | Surgery |
| Respiratory rate | pCO2 | K+ | HCT | Respiratory culture | BUN/creatinine ratio | Ventilatory status |
| SBP/DBP | pO2 | Cl− | WBC | Urine culture | MODS | Antibiotic therapy |
| CVP | HCO3− | TCO2 | Platelet count | Wound culture | SOFA | Dialysis status |
| MAP | FiO2 | Glucose | aPTT | PaO2/FiO2 | Survival status | |
| GCS | Creatinine | INR | ||||
| BUN | D-dimer | |||||
| Total bilirubin |
aPTT activated partial thromboplastin time, BUN blood urea nitrogen, CVP central venous pressure, DBP diastolic blood pressure, FiO2 fraction of inspired oxygen, HCT hematocrit, HGB hemoglobin, INR international normalized ratio, MAP mean arterial pressure, MODS multiple organ dysfunction score, PaO2 partial pressure of arterial oxygen, pCO2 partial pressure of CO2, pO2 partial pressure of oxygen, SBP systolic blood pressure, SOFA sequential organ failure assessment score, SO2 oxygen saturation, TCO2 total CO2, WBC white blood cell count.
Figure 1Machine intelligence learning optimizer: the MILO auto-machine learning (ML) infrastructure consists of begins with two datasets: (a) balanced data (Data Set 1) set used for training and validation, and (b) an unbalanced dataset (Data Set 2) for generalization. MILO removes missing values, assessed and scaled by the software. Unsupervised ML is then used for feature selection and engineering. The generated models are trained and then tested with the Data Set 1 during the supervised ML stage. Primary validation is then performed using Data Set 1 and followed by generalization using Data Set 2. Selected models can then be deployed thereafter as predictive model markup language (PMML) files.
Comparison of septic versus non-septic burn patients.
| Variable | Septic (n = 92) | Non-septic (n = 119) | P-value |
|---|---|---|---|
| Mean (SD) age (years) | 44.5 (18.1) | 38.6 (15.7) | NS |
| Mean (SD) TBSA (%) | 38.9 (16.9) | 21.7 (10.8) | 0.033 |
| Gender (M/F) | 59/33 | 80/39 | NS |
| Inhalation injury (%) | 14.1% | 13.9% | NS |
| ICU LOS (days) | 58.7 (25.6) | 40.2 (26.2) | 0.021 |
| Median (IQR) # infections per patient | |||
| Bloodstream | 5.5 (6.0) | N/A | N/A |
| Pneumonia | 7.2 (5.5) | N/A | N/A |
| Urinary tract | 3.6 (2.7) | N/A | N/A |
| Wound | 4.4 (3.6) | N/A | N/A |
| Mortality (%) | 31.5 | 11.8 | 0.001 |
| Mean GCS (SD) | 10.3 (2.8) | 11.4 (2.8) | 0.005 |
| Mean (SD) MODS | 4.4 (2.8) | 3.9 (2.7) | 0.006 |
| Mean (SD) SOFA | 4.1 (2.7) | 3.2 (2.3) | < 0.001 |
| Median Temperature (IQR) (ºC) | 39.2 (4.0) | 38.0 (3.5) | 0.010 |
| Median HCT (IQR) (%) | 24.0 (7.0) | 25.5 (8.6) | 0.003 |
| Median HGB (IQR) (g/dL) | 7.8 (2.2) | 8.5 (3.0) | < 0.001 |
| Median WBC (IQR) (cells/µL) | 13.1 (10.7) | 12.1 (9.0) | 0.001 |
| Median creatinine (IQR) (mg/dL) | 0.80 (0.91) | 0.71 (0.38) | < 0.001 |
| Median BUN (IQR) (mg/dL) | 21 (20.8) | 13 (9.8) | < 0.001 |
| Mean Na+ (SD) (mmol/L) | 139.2 (5.28) | 136.8 (4.53) | < 0.001 |
| Mean TCO2 (SD) (mmol/L) | 26.8 (4.9) | 24.8 (3.68) | 0.001 |
| Median PLT (IQR) (cells/µL) | 289 (300.5) | 352.5 (344.8) | < 0.001 |
BUN blood urea nitrogen, HCT hematocrit, HGB hemoglobin, ICU intensive care unit, F female, GCS Glascow Coma Score, IQR interquartile range, LOS length-of-stay, M male, MODS multiple organ dysfunction score, N/A not applicable, Na+ ionized sodium, NS not significant, PLT platelet count, SD standard deviation, SOFA sequential organ failure assessment score, TBSA total body surface area, TCO2 total CO2, and WBC white blood cell count.
Figure 2Receiver operator characteristic curves for statistically significant burn sepsis biomarkers: (A–J) represent receiver operator characteristic (ROC) curves and the area under the curve (AUC) analysis (in fractions) with 95% confidence intervals (CI) for statistically significant predictors of burn sepsis. (K) is the ROC curve for the multivariate model that best predicts sepsis using logistic regression. The tangent line for each ROC curve identifies the point where sensitivity and specificity are optimized. BUN blood urea nitrogen, GCS Glascow coma score, HCT hematocrit, HGB hemoglobin, Na+ sodium, PLT platelet, TCO2 total carbon dioxide, TCO2 total carbon dioxide.
Machine learning algorithm performance for the top 5 models identified by traditional programming versus MILO.
| Method | Accuracy (95% CI) | AUROC (95% CI)* | Sensitivity (95% CI) | Specificity (95% CI) | Features |
|---|---|---|---|---|---|
| Logistic regression | 86 (80–90) | 0.96 (0.88–1.00) | 98 (89–100) | 82 (75–88) | 16a |
| Deep neural network | 81 (75–86) | 0.96 (0.85–1.00) | 94 (83–99) | 77 (70–83) | 10b |
| k-nearest neighbor | 81 (75–86) | 0.92 (0.84–1.00) | 98 (89–100) | 76 (68–82) | 10b |
| Support vector machine | 85 (79–89) | 0.97 (0.86–1.00) | 98 (89–100) | 81 (74–87) | 14c |
| Random forest | 79 (73–85) | 0.92 (0.84–1.00) | 94 (83–99) | 75 (67–82) | 10b |
| k-nearest neighbor | 90 (85–94) | 0.96 (0.85–1.00) | 96 (86–99) | 88 (82–93) | 5e |
| Logistic regression | 87 (81–91) | 0.95 (0.83–1.00) | 98 (89–100) | 83 (77–89) | 23f |
| Naïve bayes | 89 (84–93) | 0.95 (0.84–1.00) | 94 (83–99) | 87 (81–92) | 11d |
| Random forest | 84 (79–89) | 0.94 (0.84–1.00) | 96 (86–99) | 81 (74–87) | 23f |
| Deep neural network | 84 (79–89) | 0.95 (0.85–1.00) | 100 (93–100) | 80 (72–86) | 17 g |
| Support vector machine | 86 (80–90) | 0.97 (0.87–1.00) | 98 (89–100) | 82 (75–88) | 11d |
| Gradient boosting machine | 81 (75–86) | 0.94 (0.88–1.00) | 96 (86–99) | 76 (69–83) | 5e |
BUN blood urea nitrogen, CI confidence interval, CVP central venous pressure, DBP diastolic blood pressure, GCS Glascow Coma Score, HCT hematocrit, HGB hemoglobin, HR heart rate, MAP mean arterial pressure, MODS multiple organ dysfunction score, PLT platelet count, RR respiratory rate, SBP systolic blood pressure, SO2 oxygen saturation, TCO2 total CO2, and WBC white blood cell count.
*Area under the ROC curves are reported in fractions.
aMAP, RR, body temperature, GCS, WBC, HGB, HCT, PLT, Na+ , K+ , BUN, creatinine, BUN/creatinine, glucose, TCO2, and MODS.
bBody temperature, WBC, HGB, HCT, Na+ , K+ , BUN, creatinine, BUN/creatinine, and TCO2.
cRR, body temperature, GCS, WBC, HGB, HCT, PLT, Na+ , K+ , BUN, creatinine, BUN/creatinine, TCO2, and MODS.
dSBP, MAP, HR, TEMP, HCT, Na+ , K+ , BUN, BUN/creatinine, anion gap, and TCO2.
eHR, body temperature, HGB, BUN, and TCO2.
fSBP, DBP, MAP, CVP, RR, HR, body temperature, GCS, SO2, WBC, HGB, HCT, PLT, Na+ , K+ , Cl−, anion gap, BUN, creatinine, BUN/creatinine, glucose, TCO2, and MODS.
gMAP, HR, RR, TEMP, WBC, HGB, HCT, PLT, Na+ , K+ , BUN, creatinine, BUN/creatinine, glucose, anion gap, TCO2, and MODS.
Figure 3MILO ROC for optimal ML model: screenshot of the optimal machine learning (ML) model generated by MILO based on logistic regression. (A) is the MILO read out for the receiver operator characteristic (ROC) curve using the selected ML model (i.e., logistic regression). (B) is the generalization/reliability plot for the selected ML model. (C) is the filtered list of ML models displaying other parameter such as average sensitivity and specificity (Sn + Sp “bar”), area under the curve (AUC) for the ROC analysis, F1 score, binary sensitivity and specificity, Brier Score, scaler used, feature selector used, scorer used, and searcher used.