| Literature DB >> 36001363 |
Ahmad Ayad1, Ahmed Hallawa2, Arne Peine2, Lukas Martin2, Lejla Begic Fazlic3, Guido Dartmann3, Gernot Marx2, Anke Schmeink1.
Abstract
BACKGROUND: In recent years, the volume of medical knowledge and health data has increased rapidly. For example, the increased availability of electronic health records (EHRs) provides accurate, up-to-date, and complete information about patients at the point of care and enables medical staff to have quick access to patient records for more coordinated and efficient care. With this increase in knowledge, the complexity of accurate, evidence-based medicine tends to grow all the time. Health care workers must deal with an increasing amount of data and documentation. Meanwhile, relevant patient data are frequently overshadowed by a layer of less relevant data, causing medical staff to often miss important values or abnormal trends and their importance to the progression of the patient's case.Entities:
Keywords: CNN; DNN; EHR; ICU; anomaly detection; lab values; lightGBM; machine learning; medical Informatics; time series classification
Year: 2022 PMID: 36001363 PMCID: PMC9453586 DOI: 10.2196/37658
Source DB: PubMed Journal: JMIR Med Inform
Figure 1Overall abnormality detection system in practice. DNN: deep neural network.
Clinical and demographic properties of the study population [16].
| Property | MIMIC-III data set | eICU data set |
| Number of ICUsa | 5 | 335 |
| Data acquisition timespan | 2001-2012 | 2014-2015 |
| Number of included patients (N) | 11,443 | 23,699 |
| Age (years), median (IQR) | 66.9 (56.3-77.5) | 65.0 (54-74) |
| Body weight in kg, mean (SD) | 85.7 (18.1) | 83.5 (22.0) |
| Sex, female, n(%) | 4329 (36.3%) | 10,546 (42%) |
| Sex, male, n (%) | 7614 (63.7%) | 14,540 (58%) |
| In-hospital mortality, % | 11.1 | 13.2 |
| LOSb in ICU (days), median (IQR) | 3.1 (1.6-6.1) | 3.0 (1.71-5.9) |
aICU: intensive care unit.
bLOS: length of stay.
Figure 2Overall system model used in our study when trained on the MIMIC-III data set and tested on the eICU data set. ICU: intensive care unit; Sigmoid is an activation function; L: lab value; t: time step.
Figure 3Moving window technique to extract sequences from intensive care unit stays. X and Y represent the input and output data respectively; W represents the windows extracted from the input sequences.
Figure 4LSTM architecture used in our experiments. LSTM: long short-term memory; ReLU: rectified linear unit; Tanh, ReLU and Sigmoid are activation functions.
Figure 5Multiple convolutional neural network model architecture used in our experiments. Conv1D: 1D convolutional layer; LeakyReLU: leaky rectified linear unit; ReLU: rectified linear unit; Sigmoid, LeakyReLU, and ReLU are activation functions.
Figure 6Transformer architecture used in our experiments. Conv1D: 1D convolutional layer; Time2Vec: time to vector transformation; ReLU: rectified linear unit; ReLU and Sigmoid are activation functions.
Figure 7TCN architecture used in our experiments. LeakyReLU: leaky rectified linear unit; ReLU: rectified linear unit; TCN: temporal convolutional network; LeakyReLU, ReLU, and Sigmoid are activation functions.
Sample counts for training, validation, and testing in both training methods.
| Method | Number of training samples | Number of validation samples | Number of first testing samples | Number of second testing samples |
| #1 | 73,190 (MIMIC-III) | 12,915 (MIMIC-III) | 21,526 (MIMIC-III) | 196,208 (eICU) |
| #2 | 166,776 (eICU) | 29,431 (eICU) | 49,052 (eICU) | 86,106 (MIMIC-III) |
Figure 8Validation loss of the different models. LSTM: long short-term network; M-CNN: multiple convolutional neural network; TCN: temporal convolutional network; Val.: validation; ICU: intensive care unit.
Testing results for the different models over all lab values (micro-average) on the MIMIC-III data seta.
| Training data set and model | Accuracy | Precision | Recall | F1 score | |
|
| |||||
|
| LSTMb | 0.85 | 0.83 | 0.87 | 0.85 |
|
| CNNc | 0.86 | 0.84 | 0.85 | 0.84 |
|
| M-CNNd | 0.88 | 0.87 | 0.89 | 0.88 |
|
| Transformer | 0.86 | 0.88 | 0.81 | 0.84 |
|
| TCNe | 0.86 | 0.87 | 0.85 | 0.86 |
|
| LightGBMf | 0.83 | 0.82 | 0.76 | 0.78 |
|
| |||||
|
| LSTM | 0.8 | 0.79 | 0.81 | 0.8 |
|
| CNN | 0.85 | 0.86 | 0.83 | 0.84 |
|
| M-CNN | 0.87 | 0.88 | 0.86 | 0.87 |
|
| Transformer | 0.86 | 0.86 | 0.84 | 0.85 |
|
| TCN | 0.83 | 0.82 | 0.84 | 0.83 |
|
| LightGBM | 0.82 | 0.77 | 0.78 | 0.77 |
aThe models listed under MIMIC-III were trained on the MIMIC-III data set and those under eICU were trained on the eICU data set.
bLSTM: long short-term memory.
cCNN: convolutional neural network.
dM-CNN: multiple convolutional neural network.
eTCN: temporal convolutional network.
fLightGBM: gradient boosting–based method.
Testing results for the different models over all lab values (micro-average) on the eICU data seta.
| Training data set and model | Accuracy | Precision | Recall | F1 score | |
|
| |||||
|
| LSTMb | 0.79 | 0.81 | 0.8 | 0.8 |
|
| CNNc | 0.78 | 0.8 | 0.8 | 0.8 |
|
| M-CNNd | 0.8 | 0.8 | 0.83 | 0.81 |
|
| Transformer | 0.75 | 0.82 | 0.69 | 0.75 |
|
| TCNe | 0.71 | 0.74 | 0.72 | 0.73 |
|
| LightGBMf | 0.75 | 0.78 | 0.75 | 0.76 |
|
| |||||
|
| LSTM | 0.82 | 0.85 | 0.83 | 0.84 |
|
| CNN | 0.85 | 0.86 | 0.83 | 0.84 |
|
| M-CNN | 0.89 | 0.9 | 0.91 | 0.9 |
|
| Transformer | 0.86 | 0.87 | 0.88 | 0.87 |
|
| TCN | 0.89 | 0.88 | 0.89 | 0.89 |
|
| LightGBM | 0.82 | 0.77 | 0.78 | 0.77 |
aThe models under MIMIC-III were trained on the MIMIC-III data set and those under eICU were trained on the eICU data set.
bLSTM: long short-term memory.
cCNN: convolutional neural network.
dM-CNN: multiple convolutional neural network.
eTCN: temporal convolutional network.
fLightGBM: gradient boosting–based method.
Inference time for the different models.
| Model name | Average inference time/batch |
| LSTMa | 654 ms |
| CNNb | 220 ms |
| M-CNNc | 285 ms |
| TCNd | 854 ms |
| Transformer | 598 ms |
| LightGBMe | 121 ms |
aLSTM: long short-term memory.
bCNN: convolutional neural network.
cM-CNN: multiple convolutional neural network.
dTCN: temporal convolutional network.
eLightGBM: gradient boosting–based method.