| Literature DB >> 34103544 |
José Castela Forte1,2,3, Galiya Yeshmagambetova4, Maureen L van der Grinten4, Bart Hiemstra5, Thomas Kaufmann5, Ruben J Eck6, Frederik Keus7, Anne H Epema5, Marco A Wiering4, Iwan C C van der Horst8.
Abstract
Critically ill patients constitute a highly heterogeneous population, with seemingly distinct patients having similar outcomes, and patients with the same admission diagnosis having opposite clinical trajectories. We aimed to develop a machine learning methodology that identifies and provides better characterization of patient clusters at high risk of mortality and kidney injury. We analysed prospectively collected data including co-morbidities, clinical examination, and laboratory parameters from a minimally-selected population of 743 patients admitted to the ICU of a Dutch hospital between 2015 and 2017. We compared four clustering methodologies and trained a classifier to predict and validate cluster membership. The contribution of different variables to the predicted cluster membership was assessed using SHapley Additive exPlanations values. We found that deep embedded clustering yielded better results compared to the traditional clustering algorithms. The best cluster configuration was achieved for 6 clusters. All clusters were clinically recognizable, and differed in in-ICU, 30-day, and 90-day mortality, as well as incidence of acute kidney injury. We identified two high mortality risk clusters with at least 60%, 40%, and 30% increased. ICU, 30-day and 90-day mortality, and a low risk cluster with 25-56% lower mortality risk. This machine learning methodology combining deep embedded clustering and variable importance analysis, which we made publicly available, is a possible solution to challenges previously encountered by clustering analyses in heterogeneous patient populations and may help improve the characterization of risk groups in critical care.Entities:
Year: 2021 PMID: 34103544 PMCID: PMC8187398 DOI: 10.1038/s41598-021-91297-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Complete list of input features. List including all patient characteristics, co-morbidities, time-series of laboratory parameters, and clinical examination parameters.
| Patient characteristics | Age, sex, APACHE IV Score, SAPS II Score, BMI, surgical admission, previous admission to ICU |
| Clinical examination | Cardiac index, mottling, atrial fibrillation, heart rate at admission, urine output in previous 6 h, prolonged capillary refill time, central venous pressure (CVP), diastolic blood pressure (DBP), systolic blood pressure (SBP), mean arterial pressure (MAP) Worsened respiratory condition after 24 h assessed by physician, tidal volume, Respiratory rate of ventilator, positive end-expiratory pressure (PEEP) of ventilator, mechanical ventilation after 24 h (binary), mechanical ventilation at admission (binary), respiratory rate, lowest FiO2 (%) during ICU stay EMV score |
| Co-morbidities and medical history | History of cardiovascular disease (CVD), history of chronic kidney disease (CKD), history of cirrhosis, history of chronic obstructive pulmonary disease (COPD), history of diabetes, history of hematological malignancy, history of metastatic disease, history of myocardial infarction, history of respiratory insufficiency, history of acquired immunodeficiency syndrome (AIDS), history of immune insufficiency, previous dialysis |
| Laboratory variables (for which the mean and variance were taken) | ALAT, ASAT, albumin, amylase, ALP, bilirubin (total), gamma-GT, CK, CRP, calcium, chloride, magnesium, MCV, sodium, phosphate, potassium, fibrinogen, hemoglobin, hematocrit, creatinine, LDH, leukocytes, thrombocytes, troponin T, total protein, urea Ionized calcium, glucose, hemoglobin, potassium, lactate, sodium, arterial HCO3, arterial pCO2, arterial pH, arterial pO2, arterial saturation, methylated hemoglobin, HbCO |
ALAT alanine transaminase, ASAT aspartate transaminase, CK creatine kinase, CRP C-reactive protein, LDH lactate dehydrogenase, POC point of care, HCO bicarbonate, pCO arterial CO2 pressure, pO arterial O2 pressure, HbMet methemoglobin, HbCO carboxyhemoglobin.
Figure 1Schematic overview of the different steps in the analysis. Patient selection, integration of different data sources, data processing with feature extraction (FE) or dynamic time warping (DTW), comparison of the four clustering algorithms, selection of the best algorithm based on patient distribution and internal validity measures, training of the classifier for attributing true labels to the clusters and calculating feature importance with SHAP, and cluster characterization based on input data from diagnoses, feature importance, and differences in outcomes including mortality, AKI, and other clinical events.
Clinical characteristics for the 743 patients, including patient characteristics, clinical examination data, co-morbidities and medical history, and outcomes.
| Age (years) | 62 [61, 63] |
| Gender (% female) | 63.3 |
| APACHE IV Score | 76.9 [74.8, 79.2] |
| SAPS II Score | 46.9 [45.7, 48.0] |
| BMI | 26.7 [26.4, 27.1] |
| Surgical patient (%) | 35.0 |
| Previous admission to ICU (%) | 11.6 |
| Clinical examination | |
| Cardiac index > 2.2 (%) | 42.7 |
| Mottling (% with severe, > 4) | 3.1 |
| Mechanically ventilated at admission (%) | 61.8 |
| Worsened respiratory condition after 24 h (%) | 12.4 |
| Urine output in previous 6 h (ml/kg/h) | 0.9 [0.84, 0.95] |
| CRT prolonged (%) | 29.9 |
| EMV score | 11.27 [10.9, 11.63] |
| History of CVD (%) | 5.0 |
| History of CKD (%) | 6.9 |
| History of cirrhosis (%) | 3.4 |
| History of COPD (%) | 12.4 |
| Previous dialysis (%) | 1.4 |
| History of diabetes (%) | 20.3 |
| History of hematological malignancy (%) | 4.0 |
| History of metastatic disease (%) | 3.6 |
| History of myocardial infarction (%) | 8.2 |
| History of respiratory failure (%) | 4.9 |
| Cardiovascular | 32.8 |
| Gastrointestinal | 14.2 |
| Genito-urinary | 1.2 |
| Haematological | 1.4 |
| Metabolic | 2.3 |
| Musculoskeletal/skin | 0.9 |
| Neurological | 15.0 |
| Respiratory | 19.5 |
| Transplant | 5.1 |
| Trauma | 7.7 |
| In-ICU mortality (%) | 19.4 |
| 30-day mortality (%) | 22.3 |
| 90-day mortality (%) | 27.6 |
| No AKI (%) | 79.7 |
| AKI stage 1 (%) | 59.8 |
| AKI stage 2 or 3 (%) | 44.3 |
| Required vasoactive medication (%) | 48.6 |
| Any type of shock (%) | 51.6 |
| Required RRT (%) | 10.1 |
| ICU length of stay (days) | 6.0 [5.4, 6.6] |
aDenotes variables not included as input for clustering analysis.
Figure 3Heatmap of outcomes and clinical end-points per cluster. Bars on the right show the colour scale representing the proportion of patients within the cluster with the outcome (upper panel) or the discharge diagnosis (lower panel).
Figure 2Heatmap of patient characteristics, clinical examination and co-morbidity data per cluster. Bars on the right show the colour scale representing the proportion of patients with each characteristic regarding demographics, clinical examination, and co-morbidities. For continuous variables, such as SBP or urine output, it represents a scaled value from highest cluster mean (1.0) to lowest cluster mean (0.0).
Mortality rates, and hazard ratios for mortality and acute kidney injury per cluster.
| Outcome/cluster | % | Hazard ratio with 95%CI | p-value |
|---|---|---|---|
| In-ICU Mortality | % | ||
| Cluster 1 | 19.1 | 0.99 [0.77–1.26] | 0.916 |
| Cluster 2 | 30.0 | 1.55 [1.26–1.91] | < 0.001 |
| Cluster 3 | 34.8 | 1.80 [1.33–2.42] | < 0.001 |
| Cluster 4 | 11.1 | 0.57 [0.48–0.68] | < 0.001 |
| Cluster 5 | 17.9 | 0.92 [0.81,1.06] | 0.254 |
| Cluster 6 | 16.8 | 0.87 [0.70,1.07] | 0.191 |
| Cluster 1 | 19.1 | 0.85 [0.67–1.10] | 0.218 |
| Cluster 2 | 31.0 | 1.39 [1.12–1.71] | 0.002 |
| Cluster 3 | 37.0 | 1.66 [1.23–2.23] | < 0.001 |
| Cluster 4 | 16.7 | 0.75 [0.63–0.89] | 0.001 |
| Cluster 5 | 22.4 | 1.00 [0.86–1.15] | 0.972 |
| Cluster 6 | 17.9 | 0.80 [0.65–0.99] | 0.042 |
| Cluster 1 | 26.5 | 0.96 [0.75–1.23] | 0.763 |
| Cluster 2 | 36.0 | 1.30 [1.06–1.61] | 0.012 |
| Cluster 3 | 43.5 | 1.58 [1.17–2.12] | 0.002 |
| Cluster 4 | 18.1 | 0.66 [0.55–0.78] | < 0.001 |
| Cluster 5 | 27.6 | 1.00 [0.88–1.15] | 0.996 |
| Cluster 6 | 26.3 | 0.95 [0.77–1.18] | 0.673 |
| Cluster 1 | 50 | 1,13 [0.88–1.45] | 0.343 |
| Cluster 2 | 56 | 1.26 [1.03–1.59] | 0.027 |
| Cluster 3 | 73.9 | 1.67 [1.24–2.25] | < 0.001 |
| Cluster 4 | 19.4 | 0.44 [0.37–0.52] | < 0.001 |
| Cluster 5 | 45.2 | 1.02 [0.89–1.17] | 0.780 |
| Cluster 6 | 48.4 | 1.09 [0.88–1.35] | 0.421 |
Hazard ratios were compared using the log-rank test, with the full cohort used as reference.
AKI acute kidney injury.
Figure 4Kaplan–Meier curves stratified per cluster for mortality during and after ICU stay. Survival curves for all six clusters, with the number of patients at risk at 30 and 90 days per cluster.