| Literature DB >> 32614874 |
Seyedmostafa Sheikhalishahi1,2, Vevake Balaraman1,2, Venet Osmani2.
Abstract
Progress of machine learning in critical care has been difficult to track, in part due to absence of public benchmarks. Other fields of research (such as computer vision and natural language processing) have established various competitions and public benchmarks. Recent availability of large clinical datasets has enabled the possibility of establishing public benchmarks. Taking advantage of this opportunity, we propose a public benchmark suite to address four areas of critical care, namely mortality prediction, estimation of length of stay, patient phenotyping and risk of decompensation. We define each task and compare the performance of both clinical models as well as baseline and deep learning models using eICU critical care dataset of around 73,000 patients. This is the first public benchmark on a multi-centre critical care dataset, comparing the performance of clinical gold standard with our predictive model. We also investigate the impact of numerical variables as well as handling of categorical variables on each of the defined tasks. The source code, detailing our methods and experiments is publicly available such that anyone can replicate our results and build upon our work.Entities:
Mesh:
Year: 2020 PMID: 32614874 PMCID: PMC7332047 DOI: 10.1371/journal.pone.0235424
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Characteristics and mortality outcome measures.
*LoS (Length of Stay). Continuous variables are presented as Median [Interquartile Range Q1–Q3]; binary or categorical variables as Count (%).
| Overall | Dead at Hospital | Alive at Hospital | |
|---|---|---|---|
| ICU Admissions | 73,718 | 6,167 | 67,551 |
| Age | 62.41 [52-75] | 68.12 [59-80] | 61.8 [52-75] |
| Gender (F) | 33,544 (45.5) | 2,830 (45.8) | 30,714 (45.4) |
| Caucasian | 56,973 (77.2) | 4,866 (78.9) | 52,107 (77.1) |
| African American | 7,982 (10.8) | 582 (9.4) | 7,400 (10.9) |
| Hispanic | 2,937 (3.98) | 226 (3.6) | 2,711 (4) |
| Asian | 1,174 (1.59) | 97 (1.5) | 1,077 (1.5) |
| Native American | 413 (0.56) | 42 (0.68) | 371 (0.54) |
| Unknown | 4,239 (5.7) | 354 (5.7) | 3,885 (5.7) |
| Hospital LoS* (days) | 5.29 [2.53-6.84] | 3.9 [1.42-5.22] | 5.41 [2.65-6.92] |
| ICU LoS* (days) | 2.32 [1.01-2.91] | 3.17 [1.19-4.43] | 2.24 [1-2.83] |
| Hospital Death | 6,167 (8.36) | 6,167 (100) | - |
| ICU Death | 4,575 (6.2) | 4,575 (74.1) | - |
Fig 1Cohort selection criteria.
Selected variables for all the four tasks.
| Variable | Data Type |
|---|---|
| Heart rate | Numerical |
| Mean arterial pressure | Numerical |
| Diastolic blood pressure | Numerical |
| Systolic blood pressure | Numerical |
| O2 | Numerical |
| Respiratory rate | Numerical |
| Temperature | Numerical |
| Glucose | Numerical |
| FiO2 | Numerical |
| pH | Numerical |
| Height | Numerical |
| Weight | Numerical |
| Age | Numerical |
| Admission diagnosis | Categorical |
| Ethnicity | Categorical |
| Gender | Categorical |
| Glasgow Coma Score Total | Categorical |
| Glasgow Coma Score Eyes | Categorical |
| Glasgow Coma Score Motor | Categorical |
| Glasgow Coma Score Verbal | Categorical |
Number of patients and records in four tasks.
| Task | No. of patients | Clinical records |
|---|---|---|
| In-hospital Mortality | 30,680 | 1,164,966 |
| Remaining LoS | 73,389 | 3,054,314 |
| Phenotyping | 49,299 | 2,172,346 |
| Physiologic Decompensation | 55,933 | 2,800,711 |
Phenotype categories.
| Type | Phenotype |
|---|---|
| Acute | 1. Respiratory failure; insufficiency; arrest 2. Fluid and electrolyte disorders 3. Septicemia 4. Acute and unspecified renal failure 5. Pneumonia 6. Acute cerebrovascular disease 7. Acute myocardial infarction 8. Gastrointestinal hemorrhage 9. Shock 10. Pleurisy; pneumothorax; pulmonary collapse 11. Other lower respiratory disease 12. Complications of surgical 13. Other upper respiratory disease |
| Chronic | 1. Hypertension with complications 2. Essential hypertension 3. Chronic kidney disease 4. Chronic obstructive pulmonary disease 5. Disorders of lipid metabolism 6. Coronary atherosclerosis and related 7. Diabetes mellitus without complication |
| Mixed | 1. Cardiac dysrhythmias 2. Congestive heart failure; non hypertensive 3. Diabetes mellitus with complications 4. Other liver diseases 5. Conduction disorders |
Fig 2Model architecture.
In-hospital mortality prediction during first 24 and 48 hours in ICU.
(Num. and Cat. indicate presence of numerical and categorical variables respectively. Repn. indicates representation of categorical variables, either One Hot Encoding (OHE) or embedding (EMB)).
| Data | Model | Num. | Cat. | Repn. | AUROC | AUPRC | Spec. | Sens. | PPV | NPV |
|---|---|---|---|---|---|---|---|---|---|---|
| First 24 hours | APACHE | ✓ | ✓ | Not spec. | 77.30 | 41.23 | 38.74 | 86 | 57.09 | 93.07 |
| LR | ✓ | ✓ | EMB | 79.88±0.67 | 40.50 | 46.01 | 90 | 64.53 | 90.06 | |
| ANN | ✓ | ✓ | EMB | 82.60±0.58 | 46.17 | 51.99 | 90 | 65.91 | 90.78 | |
| ✓ | ✓ | EMB | 90 | 90.82 | ||||||
| BiLSTM | ✓ | ✓ | OHE | 82.78±0.32 | 46.34 | 51.96 | 90 | 62.95 | 91.09 | |
| BiLSTM | ✕ | ✓ | EMB | 78.57±0.70 | 40.21 | 43.83 | 90 | 58.52 | 90.83 | |
| BiLSTM | ✓ | ✕ | ✕ | 76.63±0.75 | 38.56 | 36.30 | 90 | 66.00 | 90.82 | |
| First 48 hours | LR | ✓ | ✓ | EMB | 82.34±0.65 | 45.39 | 51.07 | 90 | 69.06 | 90.33 |
| ANN | ✓ | ✓ | EMB | 85.36±0.66 | 52.59 | 57.19 | 90 | 69.78 | 91.53 | |
| ✓ | ✓ | EMB | 90 | 68.98 | ||||||
| BiLSTM | ✓ | ✓ | OHE | 84.96±0.63 | 51.63 | 56.12 | 90 | 64.82 | 91.83 | |
| BiLSTM | ✕ | ✓ | EMB | 80.59±0.82 | 45.59 | 46.39 | 90 | 63.86 | 91.27 | |
| BiLSTM | ✓ | ✕ | ✕ | 80.77±1.29 | 45.48 | 44.02 | 90 | 67.94 | 90.63 |
Length of stay in hospital prediction, evaluated using Mean Absolute Error (MAE).
| Data | Model | Num. | Cat. | Repn. | MAE [Day] | |
|---|---|---|---|---|---|---|
| In ICU unit | LR | ✓ | ✓ | EMB | 0.024±0.001 | 1.292±0.008 |
| ANN | ✓ | ✓ | EMB | 0.048±0.003 | 1.267±0.014 | |
| ✓ | ✓ | EMB | 0.643±0.042 | 0.532±0.033 | ||
| BiLSTM | ✓ | ✓ | OHE | 0.623±0.025 | 0.511± 0.021 | |
| BiLSTM | ✕ | ✓ | EMB | 0.610±0.029 | 0.532±0.033 | |
| BiLSTM | ✓ | ✕ | ✕ | 0.610±0.042 |
Phenotyping task on eICU (reported scores are AUROC).
| Phenotype | Prevalence | Type | Num & cat | Num. | Cat. |
|---|---|---|---|---|---|
| Respiratory failure; insufficiency; arrest | 0.241 | acute | 83.31±0.32 | 73.09±0.41 | 81.24±0.19 |
| Fluid and electrolyte disorders | 0.156 | acute | 72.76±0.77 | 60.35±0.50 | 72.18±1.20 |
| Septicemia | 0.145 | acute | 91.54±0.15 | 71.43±0.50 | 90.86±0.50 |
| Acute and unspecified renal failure | 0.142 | acute | 75.93±0.68 | 65.41±0.66 | 74.14±1.32 |
| Pneumonia | 0.120 | acute | 89.34±0.51 | 70.28±0.77 | 88.47±0.24 |
| Acute cerebrovascular disease | 0.108 | acute | 94.24±0.58 | 74.37±0.75 | 93.63±0.49 |
| Acute myocardial infarction | 0.090 | acute | 91.35±0.67 | 70.56±0.74 | 91.18±0.87 |
| Gastrointestinal hemorrhage | 0.079 | acute | 91.38 ± 0.74 | 61.33 ± 1.33 | 90.66 ± 0.83 |
| Shock | 0.068 | acute | 85.75 ± 0.57 | 77.12 ± 0.41 | 82.74 ± 1.35 |
| Pleurisy; pneumothorax; pulmonary collapse | 0.039 | acute | 70.40 ± 2.23 | 61.15 ± 1.56 | 70.03 ± 0.90 |
| Other lower respiratory disease | 0.030 | acute | 80.42 ± 0.99 | 60.06 ± 1.24 | 79.60 ± 1.05 |
| Complications of surgical | 0.011 | acute | 68.45 ± 3.91 | 54.01 ± 4.79 | 65.43 ± 3.17 |
| Other upper respiratory disease | 0.007 | acute | 77.46 ± 5.46 | 53.56 ± 3.17 | 74.18 ± 4.52 |
| - | - | 82.49 ± 1.35 | 65.60 ± 1.30 | 81.10 ± 1.28 | |
| Hypertension with complications | 0.019 | chronic | 85.70 ± 2.59 | 81.27 ± 1.29 | 81.61 ± 2.97 |
| Essential hypertension | 0.203 | chronic | 72.16 ± 0.74 | 66.58 ± 0.31 | 68.31 ± 0.66 |
| Chronic kidney disease | 0.104 | chronic | 65.96 ± 1.66 | 62.06 ± 1.39 | 65.05 ± 0.90 |
| Chronic obstructive pulmonary disease | 0.093 | chronic | 75.62 ± 1.44 | 63.73 ± 0.60 | 74.48 ± 1.67 |
| Disorders of lipid metabolism | 0.054 | chronic | 72.95 ± 1.05 | 62.85 ± 1.03 | 71.56 ± 1.36 |
| Coronary atherosclerosis and related | 0.041 | chronic | 80.89 ± 0.45 | 64.03 ± 0.98 | 79.90 ± 1.34 |
| Diabetes mellitus without complication | 0.006 | chronic | 61.55 ± 4.52 | 58.89 ± 5.77 | 59.12 ± 3.56 |
| - | - | 73.55 ± 1.78 | 65.63 ± 1.63 | 70.72 ± 1.78 | |
| Cardiac dysrhythmias | 0.165 | mixed | 75.68 ± 0.86 | 66.24 ± 0.81 | 71.92 ± 1.49 |
| Congestive heart failure; non hypertensive | 0.106 | mixed | 78.87 ± 1.05 | 66.34 ± 0.76 | 76.56 ± 1.56 |
| Diabetes mellitus with complications | 0.047 | mixed | 93.59 ± 0.65 | 90.38 ± 1.41 | 89.59 ± 0.99 |
| Other liver diseases | 0.039 | mixed | 78.33 ± 1.71 | 68.20 ± 1.02 | 75.51 ± 2.32 |
| Conduction disorders | 0.013 | mixed | 83.58 ± 1.68 | 72.90 ± 2.43 | 80.81 ± 1.66 |
| - | - | 82.01 ± 1.19 | 72.81 ± 1.29 | 78.88 ± 1.60 | |
| - | - | 79.89 ± 1.44 | 67.05 ± 1.39 | 77.75 ± 1.48 |
Decompensation risk prediction in eICU.
| Data | Model | Num. | Cat. | Repn. | AUROC | AUPRC | Spec. | Sens. | PPV | NPV |
|---|---|---|---|---|---|---|---|---|---|---|
| In ICU unit | LR | ✓ | ✓ | EMB | 67.63 ±5.89 | 16.53 | 18.92 | 90.00 | Nan | 95.10 |
| ANN | ✓ | ✓ | EMB | 80.59±0.60 | 22.86 | 47.65 | 90.00 | 45.73 | 95.32 | |
| BiLSTM | ✓ | ✓ | EMB | 90.00 | 78.51 | 97.27 | ||||
| BiLSTM | ✕ | ✓ | EMB | 86.82±0.70 | 36.34 | 61.08 | 90.00 | 57.31 | 96.13 | |
| BiLSTM | ✓ | ✕ | ✕ | 95.15±0.16 | 68.28 | 85.60 | 90.00 |