| Literature DB >> 35806137 |
Vasiliki Danilatou1,2, Stylianos Nikolakakis3, Despoina Antonakaki4, Christos Tzagkarakis4, Dimitrios Mavroidis4, Theodoros Kostoulas5, Sotirios Ioannidis3,4.
Abstract
Intensive care unit (ICU) patients with venous thromboembolism (VTE) and/or cancer suffer from high mortality rates. Mortality prediction in the ICU has been a major medical challenge for which several scoring systems exist but lack in specificity. This study focuses on two target groups, namely patients with thrombosis or cancer. The main goal is to develop and validate interpretable machine learning (ML) models to predict early and late mortality, while exploiting all available data stored in the medical record. To this end, retrospective data from two freely accessible databases, MIMIC-III and eICU, were used. Well-established ML algorithms were implemented utilizing automated and purposely built ML frameworks for addressing class imbalance. Prediction of early mortality showed excellent performance in both disease categories, in terms of the area under the receiver operating characteristic curve (AUC-ROC): VTE-MIMIC-III 0.93, eICU 0.87, cancer-MIMIC-III 0.94. On the other hand, late mortality prediction showed lower performance, i.e., AUC-ROC: VTE 0.82, cancer 0.74-0.88. The predictive model of early mortality developed from 1651 VTE patients (MIMIC-III) ended up with a signature of 35 features and was externally validated in 2659 patients from the eICU dataset. Our model outperformed traditional scoring systems in predicting early as well as late mortality. Novel biomarkers, such as red cell distribution width, were identified.Entities:
Keywords: ICU; cancer; interpretable machine learning; mortality; venous thromboembolism
Mesh:
Year: 2022 PMID: 35806137 PMCID: PMC9266386 DOI: 10.3390/ijms23137132
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 6.208
Demographic and clinical characteristics of patients with VTE or cancer from eICU and MIMIC-III databases. SD denotes the standard deviation, and LOS length of stay. p-Values between “surviving” and “non-surviving” patients are reported.
| VTE | VTE | Cancer | ||||
|---|---|---|---|---|---|---|
| Characteristic | eICU | MIMIC-III |
| |||
| 3724 | 2468 | 5128 |
| |||
|
| ||||||
|
| 409 (9.3 %) | <0.001 | 605 (24.5%) | <0.001 | ||
|
| ||||||
|
| N/A |
Description of attributes selected for patients with thrombosis and/or cancer from the various tables of eICU dataset. NOF denotes the number of features, and MD the missing data.
| Group Name | Table in | Description | Number of | Missing | Most |
|---|---|---|---|---|---|
| Patient | Patient | Basic | 11 | 0 | Gender, age, ethnicity, |
| Diagnosis | Diagnosis | Diagnoses | 61 | 395 | PE, DVT, hypertension |
| APSIII | Apache- | Acute | 24 | 529 | Variables used in APS |
| APACHE | Apache- | APACHE | 34 | 529 | Variables used in |
| Labs | Lab | Laboratory | 80 | 442 | Hematocrit hemoglobin, |
| Vital signs | Physical | Vital signs | 8 | 597 | Blood pressure diastolic |
| Vital | Vital signs | 10 | 433 | Heart rate, respiration, | |
| Medications | Admission | Medications | 36 | 3263 | Aspirin, furosemide |
| Drugs | Infusion | Medications | 30 | 2503 | Heparin, epinephrine, |
| Treatment | Medications | 52 | 1040 | Medications in groups, | |
| Medical | Pasthistory | Past history of | 111 | 422 | Hypertension, |
Figure 1Custom machine learning pipeline: XGBoost.
Figure 2Receiver operating characteristic (ROC) curves for early mortality in ICU patients with thrombosis from MIMIC-III database using all features: (a) curves and comparison with traditional scores (APS, Acute Physiology Score; LODS, Logistic Organ Dysfunction Score; MLODS, Multiple Logistic Organ Dysfunction Score; OASIS, Outcome and Assessment Information Set; SAPS, Simplified Acute Physiology Score; SIRS, Systemic Inflammatory Response Syndrome; SOFA, Sequential Organ Failure), (b) Precision–recall (PR) curve.
Analytical metrics of performance of predictive models for early mortality in patients with thrombosis from MIMIC-III database (Abbreviations: APS, Acute Physiology Score; LODS, Logistic Organ Dysfunction Score; OASIS, Outcome and Assessment Information Set; SAPS, Simplified Acute Physiology Score; SIRS, Systemic Inflammatory Response Syndrome; SOFA, Sequential Organ Failure).
|
| F1 Score | Accuracy | Specificity | Sensitivity | |
|---|---|---|---|---|---|
| Early/Late | Early/Late | Early/Late | Early/Late | Early/Late | |
|
| 0.94 (0.91, 0.96) | 0.87 | 0.83 | 0.93 | 0.79 |
|
| 0.85 (0.81, 0.88) | 0.6 | 0.81 | 0.85 | 0.7 |
|
| 0.81 (0.76, 0.85) | 0.55 | 0.8 | 0.85 | 0.6 |
|
| 0.78 (0.74, 0.82) | 0.5 | 0.736 | 0.76 | 0.65 |
|
| 0.77 (0.72, 0.81) | 0.5 | 0.77 | 0.8 | 0.6 |
|
| 0.8 (0.76, 0.84) | 0.55 | 0.76 | 0.77 | 0.7 |
|
| 0.8 (0.76, 0.84) | 0.53 | 0.78 | 0.82 | 0.6 |
|
| 0.85 (0.81, 0.89) | 0.61 | 0.81 | 0.83 | 0.73 |
|
| 0.64 (0.59, 0.68) | 0.39 | 0.53 | 0.5 | 0.71 |
|
| 0.76 (0.72, 0.81) | 0.5 | 0.7 | 0.7 | 0.71 |
Figure 3ROC curves for early mortality in ICU patients with VTE from eICU dataset: (a) curves, feature discriminative analysis and comparison with clinical scores (APS, Acute Physiology Score; APACHE, Acute Physiology Age Chronic Health Evaluation), (b) Precision–recall curve.
Detailed metrics of performance for the predictive models of early mortality in patients with thrombosis from the eICU dataset (JADBio).
|
| F1 Score | Accuracy | Specificity | Sensitivity | |
|---|---|---|---|---|---|
|
| 0.87 (0.84, 0.9) | 0.42 | 0.92 | 0.95 | 0.42 |
|
| 0.87 (0.83, 0.9) | 0.43 | 0.92 | 0.95 | 0.50 |
|
| 0.83 (0.78, 0.87) | 0.37 | 0.92 | 0.95 | 0.38 |
|
| 0.79 (0.75, 0.84) | 0.32 | 0.92 | 0.96 | 0.29 |
|
| 0.78 (0.73, 0.82) | 0.30 | 0.89 | 0.92 | 0.38 |
|
| 0.82 (0.77, 0.86) | 0.53 | 0.94 | 0.99 | 0.23 |
|
| 0.73 (0.68, 0.77) | 0.26 | 0.82 | 0.92 | 0.30 |
|
| 0.60 (0.56, 0.64) | 0.27 | 0.93 | 0.98 | 0.06 |
|
| 0.55 (0.5, 0.59) | 0.41 | 0.92 | 0.99 | 0.01 |
Figure 4Plot showing predictive features of early mortality in ICU patients with thrombosis, using (a) and (b) datasets. Green bars represent the average percentage drop in predictive performance when the feature is removed from the model. (Abbreviations: avg, average; bun, blood urea nitrogen; fiO2: fraction of inspired oxygen; rdw, red cell distribution width; peep, positive end expiratory pressure; wbc, white blood cells; 0–6, 6–12, 24–30, 42–48 is the time in hours after admission).
Most important selected features with predictive performance for early and late mortality in ICU patients with cancer. (Abbreviations: LOS, length of stay; SBP, systolic blood pressure; RDW, red cell distribution width; RR, respiratory rate; AST, aspartate transaminase; GCS, Glasgow Coma Scale; PaO2, partial pressure of arterial oxygen; FFP, fresh frozen plasma).
| 1st | Endotrachial | Min | Metastatic | Albumin | Mean | SBP | RDW |
|---|---|---|---|---|---|---|---|
|
| Metastatic | OASIS | LOS | Renal | 1st day | SAPSII | PaO2 |
|
| Metastatic | Open | ALP | OASIS | GCS | 1st day | Age |
|
| Metastatic | AST | Albumin | RDW | Aorto- | OASIS | |
|
| Metastatic | Age | SAPSII | Etoposide | PaO2 | FFP | Lung |
| > | Trans- |
Figure 5ROC curves for early mortality in ICU patients with cancer from MIMIC-III database using all features. (a) curve and comparison with clinical scores (OASIS, Outcome and Assessment Information Set; SAPS, Simplified Acute Physiology Score; SOFA, Sequential Organ Failure), (b) Precision-Recall curve.
Detailed metrics of performance for prediction of early and late mortality in ICU patients with cancer using all features.
| Mortality Prediction |
| Accuracy | F1 Score | Specificity | Sensitivity |
|---|---|---|---|---|---|
|
| 0.94 (0.92, 0.96) | 0.88 | 0.82 | 0.91 | 0.82 |
|
| 0.88 (0.85, 0.91) | 0.85 | 0.89 | 0.84 | 0.78 |
|
| 0.84 (0.8, 0.87) | 0.79 | 0.85 | 0.74 | 0.79 |
|
| 0.78 (0.74, 0.82) | 0.72 | 0.49 | 0.72 | 0.71 |
|
| 0.76 (0.71, 0.8) | 0.72 | 0.5 | 0.72 | 0.7 |
|
| 0.74 (0.69, 0.74) | 0.68 | 0.76 | 0.64 | 0.74 |
Figure 6ROC curves for late mortality in ICU patients with thrombosis from MIMIC-III database using all features: (a) curves and comparison with clinical scores. (b) PR curve.
Detailed metrics of performance for prediction of early and late mortality of ICU patients with thrombosis using all features.
| Early Mortality | Late Mortality | ||
|---|---|---|---|
|
|
|
| |
|
| 0.93 | 0.87 | 0.82 |
|
| 0.89 | 0.92 | 0.76 |
|
| 0.72 | 0.97 | 0.60 |
|
| 0.67 | 0.99 | 0.49 |
|
| 0.95 | 0.1 | 0.90 |
Figure 7ROC curves for late mortality in ICU patients with cancer from MIMIC-III database, stratified based on the time of death after admission expressed in months (, , , , , ). Comparison with traditional clinical scores.
External validation of a predictive model for early mortality in patients with VTE based on a signature of 35 features.
|
| F1 Score | Accuracy | Specificity | Sensitivity | |
|---|---|---|---|---|---|
|
| 0.93 [0.91, 0.96] | 0.74 | 0.77 | 0.93 | 0.74 |
|
| 0.89 [0.87, 0.91] | 0.56 | 0.86 | 0.89 | 0.7 |
Figure 8ROC curves for early mortality in ICU patients with thrombosis. Validation of the model in the two datasets.
Figure 9ROC curve for early mortality in ICU patients with VTE from eICU dataset (XGBoost).
Detailed metrics of performance for prediction of early mortality of ICU patients with thrombosis using all features.
| XGBoost | All Features |
|---|---|
|
| 0.84 (0.83–0.85) |
|
| 0.87 |
|
| 0.63 |
|
| 0.29 |
|
| 0.95 |
Analytical metrics of performance of predictive models for late mortality in patients with thrombosis from the MIMIC-III database (Abbreviations: APS, Acute Physiology Score; LODS, Logistic Organ Dysfunction Score; OASIS, Outcome and Assessment Information Set; SAPS, Simplified Acute Physiology Score; SIRS, Systemic Inflammatory Response Syndrome; SOFA, Sequential Organ Failure).
|
| F1 Score | Accuracy | Specificity | Sensitivity | |
|---|---|---|---|---|---|
|
|
|
|
|
| |
|
| 0.83 (0.79, 0.87) | 0.72 | 0.74 | 0.75 | 0.78 |
|
| 0.76 (0.71, 0.81) | 0.64 | 0.69 | 0.7 | 0.7 |
|
| 0.65 (0.59, 0.7) | 0.56 | 0.62 | 0.53 | 0.69 |
|
| 0.74 (0.69, 0.79) | 0.62 | 0.69 | 0.69 | 0.68 |
|
| 0.57 (0.51, 0.62) | 0.46 | 0.62 | 0.56 | 0.53 |
|
| 0.68 (0.63, 0.69) | 0.59 | 0.63 | 0.51 | 0.77 |
|
| 0.68 (0.63, 0.72) | 0.59 | 0.62 | 0.55 | 0.73 |
|
| 0.76 (0.72, 0.81) | 0.63 | 0.7 | 0.78 | 0.63 |
|
| 0.49 (0.48, 0.5) | 0.54 | 0.62 | 0.34 | 0.65 |
|
| 0.58 (0.52, 0.63) | 0.52 | 0.62 | 0.35 | 0.73 |