| Literature DB >> 35206537 |
Jae Kwan Kim1,2, Wonbin Ahn3, Sangin Park1, Soo-Hong Lee2, Laehyun Kim1,4.
Abstract
Sepsis is a life-threatening condition with a high mortality rate. Early prediction and treatment are the most effective strategies for increasing survival rates. This paper proposes a neural architecture search (NAS) model to predict the onset of sepsis with a low computational cost and high search performance by applying a genetic algorithm (GA). The proposed model shares the weights of all possible connection nodes internally within the neural network. Externally, the search cost is reduced through the weight-sharing effect between the genotypes of the GA. A predictive analysis was performed using the Medical Information Mart for Intensive Care III (MIMIC-III), a medical time-series dataset, with the primary objective of predicting sepsis onset 3 h before occurrence. In addition, experiments were conducted under various prediction times (0-12 h) for comparison. The proposed model exhibited an area under the receiver operating characteristic curve (AUROC) score of 0.94 (95% CI: 0.92-0.96) for 3 h, which is 0.31-0.26 higher than the scores obtained using the Sequential Organ Failure Assessment (SOFA), quick SOFA (qSOFA), and Simplified Acute Physiology Score (SAPS) II scoring systems. Furthermore, the proposed model exhibited a 12% improvement in the AUROC value over a simple model based on the long short-term memory neural network. Additionally, it is not only optimally searchable for sepsis onset prediction, but also outperforms conventional models that use similar predictive purposes and datasets. Notably, it is sufficiently robust to shape changes in the input data and has less structural dependence.Entities:
Keywords: genetic algorithm; intensive care unit; neural architecture search; sepsis
Mesh:
Year: 2022 PMID: 35206537 PMCID: PMC8872017 DOI: 10.3390/ijerph19042349
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1Target time of a prediction model using a sliding window scheme. The prediction time can vary from 0 to 12 h, and 3 h, the primary objective, is for illustrative purposes only.
Baseline characteristics and essential variables are presented as median values (Q1–Q3).
| Overall | |
|---|---|
| Admission | 58,976 |
| Adult patients | 38,425 |
| Age | 65.86 (52.72–77.97) |
| Gender (female) | 15,409 |
| HR 1 (bpm) | 84.00 (73.00–97.00) |
| MAP 2 (mmHg) | 76.00 (67.33–87.00) |
| RR 3 (cpm) | 18.00 (14.00–22.00) |
| Na (mmol/L) | 138.00 (136.00–141.00) |
| K (mmol/L) | 4.10 (3.80–4.60) |
| HCO3 (mmol/L) | 24.00 (21.00–26.00) |
| WBC 4 (×103/mm3) | 11.00 (7.90–14.90) |
| PaO2/FiO2 ratio | 267.50 (180.00–352.50) |
| Ht 5 (%) | 31.00 (26.00–36.00) |
| Urea (mmol/L) | 1577.00 (968.00–2415.00) |
| Bilirubin (mg/dL) | 0.7 (0.40–1.70) |
1 HR, heart rate; 2 MAP, mean arterial pressure; 3 RR, respiratory rate; 4 WBC, white blood cell count; 5 Ht, hematocrit.
Example of merging when feature id and feature name differ in the same feature.
| Feature | ID | Name |
|---|---|---|
| Heart rate | 211 | Heart Rate |
| 220045 | Heart Rate | |
| Temperature | 678 | Temperature F |
| 223761 | Temperature Fahrenheit | |
| 676 | Temperature C | |
| 223762 | Temperature Celsius | |
| Systolic blood pressure | 51 | Arterial BP (Systolic) |
| 442 | Manual BP (Systolic) | |
| 455 | NBP (Systolic) | |
| 6701 | Arterial BP #2 (Systolic) | |
| 220179 | Non-Invasive Blood Pressure systolic | |
| 220050 | Arterial Blood Pressure systolic | |
| PaO2/FiO2 ratio | 50821 | PO2 |
| 50816 | Oxygen | |
| 223835 | Inspired O2 Fraction | |
| 3420 | FiO2 | |
| 3422 | FiO2 (Meas) | |
| 190 | FiO2 Set | |
| White blood cells count | 51300 | WBC Count |
| 51301 | White Blood Cells | |
| Glasgow coma scale | 723 | Verbal Response |
| 454 | Motor Response | |
| 184 | Eye Opening | |
| 223900 | GCS—Verbal Response | |
| 223901 | GCS—Motor Response | |
| 220739 | GCS—Eye Opening |
Figure 2Process of creating the final dataset.
Features in SOFA, qSOFA, SAPS II, and InSight.
| SOFA | qSOFA | SAPS II | InSight | |
|---|---|---|---|---|
| Age | O | O | ||
| Heart rate | O | O | ||
| pH | O | |||
| Systolic blood pressure | O | O | O | |
| Pulse pressure | O | |||
| Temperature | O | O | ||
| Respiratory rate | O | |||
| Glasgow coma scale | O | O | O | |
| Mechanical ventilation or CPAP | O | |||
| PaO2 | O | O | O | |
| FiO2 | O | O | O | |
| Urine output | O | O | ||
| Blood urea nitrogen | O | |||
| Blood oxygen saturation | O | |||
| Sodium | O | |||
| Potassium | O | |||
| Bicarbonate | O | |||
| Bilirubin | O | O | ||
| White blood cell count | O | O | ||
| Chronic diseases | O | |||
| Type of admission | O | |||
| Platelets | O | |||
| Creatinine | O | |||
| Mean arterial pressure | O | |||
| Dopamine | O | |||
| Epinephrine | O | |||
| Norepinephrine | O |
Figure 3Structure and input data flow of the proposed model.
Figure 4Example of the process of translating from a genotype into a phenotype. The components of a diagonal matrix are the activation functions of each node. The solid arrows in the DAG represent the selected connections.
Figure 5Neural architecture search and training process with genetic algorithm.
Figure 6Example of a flow using actually vital data during a test.
Experimental results of the proposed model and summarized existing research results.
| Authors | Dataset | Model | Prediction Time | Sensitivity | Specificity | AUROC (95% CI) |
|---|---|---|---|---|---|---|
| Calvert et al. [ | MIMIC-II | InSight | 3 h | 0.90 | 0.81 | 0.83 |
| Desautels et al. [ | MIMIC-III | InSight | 4 h | 0.80 | 0.54 | 0.74 |
| Kam et al. [ | MIMIC-II | LSTM | 3 h | 0.91 | 0.94 | 0.93 |
| Nemati et al. [ | MIMIC-III | AISE | 4 h | 0.85 | 0.67 | 0.85 |
| Khojandi et al. [ | Oklahoma State University | RF | 0 h | 0.99 | 0.97 | 0.90 |
| Moor et al. [ | MIMIC-III | MGP-TCN | 7 h | - | - | 0.86 |
| Li et al. [ | 2019 PhysioNet/CinC Challenge dataset | CNN+RNN | 12 h | - | - | 0.75 |
| Scherpf et al. [ | MIMIC-III | RNN | 3 h | 0.90 | 0.47 | 0.81 |
| Lauritsen et al. [ | The Danish National Patient Registry | CNN+LSTM | 3 h | - | - | 0.86 |
| Yang et al. [ | 2019 PhysioNet/CinC Challenge dataset | XGBOOST | 1 h | 0.90 | 0.64 | 0.85 |
| Bedoya et al. [ | Duke University Hospital | MGP-RNN | 4 h | - | - | 0.88 |
| Li et al. [ | 2019 PhysioNet/CinC Challenge dataset | LightGBM | 6 h | 0.86 | 0.63 | 0.85 |
| Shashikumar et al. [ | MIMIC-III | DeepAISE | 4 h | 0.80 | 0.75 | 0.87 |
| Rafiei et al. [ | 2019 PhysioNet/CinC Challenge dataset | SSP | 4 h | 0.85 | 0.81 | 0.92 |
| This study | MIMIC-III | SOFA | 3 h | 0.65 | 0.58 | 0.63 |
| qSOFA | 3 h | 0.61 | 0.75 | 0.65 | ||
| SAPS II | 3 h | 0.65 | 0.77 | 0.68 | ||
| LSTM | 3 h | 0.83 | 0.74 | 0.84 | ||
| The proposed model | 3 h | 0.93 | 0.91 | 0.94 | ||
| 4 h | 0.91 | 0.86 | 0.93 | |||
| 8 h | 0.88 | 0.82 | 0.87 | |||
| 12 h | 0.86 | 0.81 | 0.83 |
Figure 7AUROC, sensitivity, and specificity of the proposed model, with the prediction time ranging from 0 to12 h.
Figure 8The RNN cell discovered in the proposed model.