| Literature DB >> 31703390 |
Jau-Woei Perng1, I-Hsi Kao1, Chia-Te Kung2, Shih-Chiang Hung2, Yi-Horng Lai3, Chih-Min Su2.
Abstract
In emergency departments, the most common cause of death associated with suspected infected patients is sepsis. In this study, deep learning algorithms were used to predict the mortality of suspected infected patients in a hospital emergency department. During January 2007 and December 2013, 42,220 patients considered in this study were admitted to the emergency department due to suspected infection. In the present study, a deep learning structure for mortality prediction of septic patients was developed and compared with several machine learning methods as well as two sepsis screening tools: the systemic inflammatory response syndrome (SIRS) and quick sepsis-related organ failure assessment (qSOFA). The mortality predictions were explored for septic patients who died within 72 h and 28 days. Results demonstrated that the accuracy rate of deep learning methods, especially Convolutional Neural Network plus SoftMax (87.01% in 72 h and 81.59% in 28 d), exceeds that of the other machine learning methods, SIRS, and qSOFA. We expect that deep learning can effectively assist medical staff in early identification of critical patients.Entities:
Keywords: deep learning; machine learning; mortality prediction; neural networks; sepsis
Year: 2019 PMID: 31703390 PMCID: PMC6912277 DOI: 10.3390/jcm8111906
Source DB: PubMed Journal: J Clin Med ISSN: 2077-0383 Impact factor: 4.241
Baseline features for machine learning (first reading on admission).
| No. | Clinical Variables | Survived | Non-Surviving |
| Miss (%) |
|---|---|---|---|---|---|
| 1 | Blood Pressure | 139.76 ± 32.48 | 121.17 ± 43.80 | <0.001 | 0 |
| 2 | Triage | <0.001 | 0 | ||
| 1 | 6.3% | 27.3% | |||
| 2 | 33.4% | 40.7% | |||
| 3 | 57.0% | 29.9% | |||
| 4 | 3.1% | 2.0% | |||
| 5 | 0.2% | 0.1% | |||
| 3 | GCS (E) | <0.001 | 0 | ||
| 1 | 2.1% | 14.2% | |||
| 2 | 2.5% | 8.1% | |||
| 3 | 3.4% | 7.9% | |||
| 4 | 92.0% | 69.8% | |||
| 4 | GCS (V) | <0.001 | 0 | ||
| 1 | 8.4% | 27.5% | |||
| 2 | 3.5% | 9.3% | |||
| 3 | 1.4% | 3.0% | |||
| 4 | 3.0% | 5.7% | |||
| 5 | 83.7% | 54.5% | |||
| 5 | GCS (M) | <0.001 | 0 | ||
| 1 | 0.9% | 10.2% | |||
| 2 | 1.1% | 4.4% | |||
| 3 | 2.8% | 6.8% | |||
| 4 | 4.8% | 10.9% | |||
| 5 | 5.5% | 11.6% | |||
| 6 | 84.9% | 56.2% | |||
| 6 | WBC | 11.32 ± 8.36 | 14.29 ± 16.95 | <0.001 | 0.017 |
| 7 | Hb | 12.01 ± 2.39 | 10.74 ± 2.61 | <0.001 | 0.008 |
| Seg | 76.98 ± 13.37 | 77.96 ± 16.84 | <0.001 | 0.064 | |
| 9 | Lymph | 14.81 ± 10.67 | 12.02 ± 12.57 | <0.001 | 0.067 |
| 10 | PT-INR | 1.21 ± 0.55 | 1.58 ± 0.93 | <0.001 | 21.94 |
| 11 | BUN | 24.76 ± 24.33 | 45.21 ± 37.56 | <0.001 | 0 |
| 12 | Cr | 1.47 ± 1.79 | 2.29 ± 2.30 | <0.001 | 0 |
| 13 | Bil | 2.42 ± 3.74 | 6.13 ± 8.51 | <0.001 | 20.16 |
| 14 | AST | 73.29 ± 258.18 | 258.10 ± 1099.87 | <0.001 | 16.46 |
| 15 | ALT | 47.24 ± 129.98 | 105.73 ± 364.09 | <0.001 | 0 |
| 16 | Troponin I | 0.33 ± 3.20 | 1.43 ± 8.52 | <0.001 | 24.21 |
| 17 | pH | 7.40 ± 0.11 | 7.33 ± 0.18 | <0.001 | 22.79 |
| 18 | HCO3 | 23.41 ± 6.49 | 20.49 ± 8.01 | <0.001 | 22.79 |
| 19 | Atypical lymphocyte | 0.078 ± 0.51 | 0.18 ± 0.66 | <0.001 | 0.067 |
| 20 | Promyelocyte | 0.0071 ± 0.45 | 0.044 ± 1.23 | <0.001 | 0.067 |
| 21 | Metamyelocyte | 0.11 ± 0.51 | 0.55 ± 1.60 | <0.001 | 0.067 |
| 22 | Myelocyte | 0.15 ± 0.71 | 0.61 ± 1.53 | <0.001 | 0.001 |
| 23 | Sodium ion | 135.48 ± 5.60 | 134.51 ± 8.67 | <0.001 | 0.067 |
| 24 | Potassium ion | 3.91 ± 0.71 | 4.30 ± 1.14 | <0.001 | 0 |
| 25 | Albumin | 2.99 ± 0.72 | 2.55 ± 0.63 | <0.001 | 25.79 |
| 26 | Sugar | 163.16 ± 106.24 | 192.04 ± 167.08 | <0.001 | 15.10 |
| 27 | RDW-SD | 46.46 ± 7.54 | 53.69 ± 11.54 | <0.001 | 0.012 |
| 28 | MCV | 88.44 ± 8.14 | 90.35 ± 9.26 | <0.001 | 0.011 |
| 29 | RDW-CV | 14.49 ± 22.24 | 16.50 ± 3.27 | <0.001 | 0.012 |
| 30 | Base excess | −1.12 ± 6.44 | −5.08 ± 9.15 | <0.001 | 22.79 |
| 31 | MCH | 29.53 ± 3.16 | 29.91 ± 3.35 | <0.001 | 0.010 |
| 32 | MCHC | 33.36 ± 1.39 | 33.11 ± 1.71 | <0.001 | 0.011 |
| 33 | MAP | 101.14 ± 26.83 | 88.25 ± 32.54 | <0.001 | 0.011 |
| 34 | RR | 19.58 ± 2.77 | 20.25 ± 6.04 | <0.001 | 0 |
| 35 | Temperature | 37.37 ± 1.26 | 36.46 ± 4.21 | <0.001 | 0.001 |
| 36 | Heart rate | 99.90 ± 23.00 | 102.66 ± 33.95 | <0.001 | 0 |
| 37 | Age | 61.05 ± 18.11 | 68.52 ± 15.02 | <0.001 | 0 |
| 38 | Sex (male%) | 52.6% | 61.3% | <0.001 | 0 |
| 39 | qSOFA Score | <0.001 | 0 | ||
| 0 | 69.7% | 33.1% | 0 | ||
| 1 | 23.7% | 38.1% | 0 | ||
| 2 | 6.0% | 23.9% | 0 | ||
| 3 | 0.6% | 4.8% | 0 | ||
| 40 | Shock episode | 2.5% | 26.8% | <0.001 | 0 |
| 41 | Liver cirrhosis | 6.9% | 17.6% | <0.001 | 0 |
| 42 | DM | 25.2% | 26.4% | 0.029 | 0 |
| 43 | CRF | 10.3% | 28.1% | <0.001 | 0 |
| 44 | CHF | 4.5% | 9.1% | <0.001 | 0 |
| 45 | CVA | 8.6% | 12.5% | <0.001 | 0 |
| 46 | Solid tumor | 18.0% | 43.6% | <0.001 | 0 |
| 47 | RI | 66.0% | 48.9% | <0.001 | 0 |
| 48 | UTI | 21.1% | 15.7% | <0.001 | 0 |
| 49 | Soft tissue infection | 13.7% | 4.7% | <0.001 | 0 |
| 50 | Intra-abdominal infection | 11.2% | 10.6% | 0.141 | 0 |
| 51 | Other infection | 35.7% | 33.4% | <0.001 | 0 |
| 52 | Bacteremia | 8.1% | 16.5% | <0.001 | 0 |
| 53 | Antibiotic used within 24 h | 77.9% | 85.5% | <0.001 | 0 |
Abbreviation: GCS (E), Glasgow Coma Scale eye opening; GCS (V), Glasgow Coma Scale verbal response; GCS (M), Glasgow Coma Scale motor response; WBC, white blood cell count; Hb, hemoglobin; Seg, segment; Lymph, lymphocyte; PT-INR, prothrombin time international normalized ratio; BUN, Blood urea nitrogen; Cr, creatinine; Bil, bilirubin; AST, glutamic-pyruvic transaminase; ALT, alanine aminotransferase; pH, pondus hydrogenii; HCO3, hydrogen carbonate Ion; RDW-SD, red cell distribution width standard deviation; MCV, mean corpuscular volume; RDW-CV, red cell distribution width coefficient of variation; MCH, mean corpuscular hemoglobin; MCHC, mean corpuscular hemoglobin concentration; MAP, mean arterial pressure; RR, respiratory rate; DM, diabetes mellitus; CRF, chronic renal failure; CVA, congestive heart failure; RI, Respiratory infection; UTI, urinary tract infection.
Figure 1Structure of the autoencoder (AE) in this experiment.
Figure 2Structure of Convolutional Neural Network (CNN) in this experiment.
Figure 3Structure of patient data management.
Figure 4Extracted feature visualization of mortality prediction of testing dataset within 72 h: (a) CNN, (b) AE, (c) PCA. Abbreviation: PCA, principal component analysis; AE, autoencoder; CNN, convolutional neural network.
Figure 5Extracted feature visualization of mortality prediction for the testing dataset in 28 d: (a) CNN, (b) AE, (c) PCA. Abbreviation: PCA, principal component analysis; AE, autoencoder; CNN, convolutional neural network.
Figure 6Area under curve (AUC) curve of the mortality prediction for the testing dataset at 72 h: (a) no feature extraction, (b) PCA, (c) AE, (d) CNN. Abbreviation: SIRS, systemic inflammatory response syndrome; qSOFA, quick sepsis-related organ failure assessment; RF, random forest; KNN, K nearest neighbor; SVM, support vector machine; PCA, principal component analysis; AE, autoencoder; CNN, convolutional neural network.
Figure 7AUC curve of the mortality prediction of the testing dataset in 28 d: (a) no feature extraction, (b) PCA, (c) AE, (d) CNN. Abbreviation: SIRS, systemic inflammatory response syndrome; qSOFA, quick sepsis-related organ failure assessment; RF, random forest; KNN, K nearest neighbor; SVM, support vector machine; PCA, principal component analysis; AE, autoencoder; CNN, convolutional neural network.
AUC with 95% confidence interval and the accuracy rate of various methods in predicting 72-h mortality and compared with CNN plus SoftMax by Delong test.
| Algorithms | AUC | SE | 95%CI | Compared with CNN + SoftMax | Acc (%) |
|---|---|---|---|---|---|
| SIRS | 0.67 | 0.0101 | 0.67–0.68 | 59.43 | |
| qSOFA | 0.74 | 0.0101 | 0.73–0.74 | 67.27 | |
| RF | 0.89 | 0.0067 | 0.88–0.89 | 62.56 | |
| KNN | 0.83 | 0.0087 | 0.83–0.84 | 77.31 | |
| SVM | 0.93 | 0.0044 | 0.92–0.93 | 74.33 | |
| SoftMax | 0.91 | 0.0052 | 0.91–0.92 | 82.73 | |
| PCA + RF | 0.90 | 0.0059 | 0.90–0.91 | 62.62 | |
| PCA + KNN | 0.88 | 0.0071 | 0.88–0.89 | 81.67 | |
| PCA + SVM | 0.91 | 0.0055 | 0.90–0.91 | 78.91 | |
| PCA + SoftMax | 0.92 | 0.0050 | 0.92–0.93 | 83.48 | |
| AE + RF | 0.77 | 0.0064 | 0.76–0.77 | 63.52 | |
| AE + KNN | 0.92 | 0.0053 | 0.91–0.92 | 80.64 | |
| AE + SVM | 0.85 | 0.0086 | 0.85–0.85 | 78.76 | |
| AE + SoftMax | 0.93 | 0.0042 | 0.92–0.93 | 84.17 | |
| CNN + RF | 0.87 | 0.0069 | 0.87–0.88 | 61.03 | |
| CNN + KNN | 0.86 | 0.0069 | 0.85–0.86 | 81.73 | |
| CNN + SVM | 0.92 | 0.0047 | 0.92–0.92 | 84.96 | |
|
|
|
|
|
|
|
Abbreviation: SIRS, systemic inflammatory response syndrome; qSOFA, quick sepsis-related organ failure assessment; RF, random forest; KNN, K nearest neighbor; SVM, support vector machine; PCA, principal component analysis; AE, autoencoder; CNN, convolutional neural network; AUC, area under the curve; SE standard error; CI, confidence interval; Acc, accuracy.
AUC with 95% confidence interval and the accuracy rate of various methods in predicting 28-days mortality and compared with CNN plus SoftMax by Delong test.
| Algorithms | AUC | SE | 95%CI | Compared with CNN + SoftMax | Acc (%) |
|---|---|---|---|---|---|
| SIRS | 0.59 | 0.0063 | 0.59–0.60 | 59.43 | |
| qSOFA | 0.68 | 0.0061 | 0.67–0.69 | 67.27 | |
| RF | 0.89 | 0.0032 | 0.89–0.89 | 62.56 | |
| KNN | 0.84 | 0.0047 | 0.83–0.84 | 77.31 | |
| SVM | 0.90 | 0.0031 | 0.89–0.90 | 74.33 | |
| SoftMax | 0.88 | 0.0034 | 0.90–0.89 | 82.73 | |
| PCA + RF | 0.89 | 0.0034 | 0.89–0.89 | 62.62 | |
| PCA + KNN | 0.84 | 0.0050 | 0.84–0.85 | 81.67 | |
| PCA + SVM | 0.89 | 0.0033 | 0.88–0.89 | 78.91 | |
| PCA + SoftMax | 0.91 | 0.0031 | 0.90–0.91 | 83.48 | |
| AE + RF | 0.84 | 0.0037 | 0.83–0.84 | 63.52 | |
| AE + KNN | 0.81 | 0.0042 | 0.81–0.82 | 80.64 | |
| AE + SVM | 0.89 | 0.0033 | 0.89–0.90 | 78.76 | |
| AE + SoftMax | 0.90 | 0.0032 | 0.89–0.90 | 84.17 | |
| CNN + RF | 0.90 | 0.0032 | 0.90–0.91 | 61.03 | |
| CNN + KNN | 0.86 | 0.0040 | 0.85–0.86 | 81.73 | |
| CNN + SVM | 0.92 | 0.0027 | 0.91–0.92 | 84.96 | |
|
|
|
|
|
|
|
Abbreviation: SIRS, systemic inflammatory response syndrome; qSOFA, quick sepsis-related organ failure assessment; RF, random forest; KNN, K nearest neighbor; SVM, support vector machine; PCA, principal component analysis; AE, autoencoder; CNN, convolutional neural network; AUC, area under the curve; SE standard error; CI, confidence interval; Acc, accuracy.
Feature importance of 72 h mortality prediction by Random Forest (RF) (%).
| Test 1 | Test 2 | Test 3 | Test 4 | ||||
|---|---|---|---|---|---|---|---|
| Feature | Importance | Feature | Importance | Feature | Importance | Feature | Importance |
| BE | 35.60 | BE | 39.50 | BE | 33.59 | BE | 36.50 |
| Shock episode | 12.89 | Shock episode | 11.86 | Shock episode | 13.89 | Shock episode | 13.00 |
| GCS (V) | 7.62 | ||||||
| ~ Lower than 5% ignored ~ | |||||||
Abbreviation: RF, random forest; BE, base excess; GCS (V), Glasgow Coma Scale- Verbal response.
Feature importance of 28 d mortality prediction by RF (%).
| Test 1 | Test 2 | Test 3 | Test 4 | ||||
|---|---|---|---|---|---|---|---|
| Feature | Importance | Feature | Importance | Feature | Importance | Feature | Importance |
| BE | 20.39 | BE | 23.38 | BE | 19.88 | BE | 20.29 |
| RDW-SD | 9.07 | Solid tumor | 6.00 | RDW-SD | 10.11 | RDW-CV | 8.55 |
| RDW-CV | 5.53 | RDW-CV | 5.80 | Solid tumor | 5.55 | ||
| Solid tumor | 5.35 | RDW-SD | 5.43 | RDW-SD | 5.54 | ||
| ~ Lower than 5% ignored ~ | |||||||
Abbreviation: RF, random forest; BE, base excess; RDW-SD, Red cell distribution width standard deviation; RDW-CV, Red cell distribution width coefficient of variation.