| Literature DB >> 22027016 |
Geert Meyfroidt1, Fabian Güiza, Dominiek Cottem, Wilfried De Becker, Kristien Van Loon, Jean-Marie Aerts, Daniël Berckmans, Jan Ramon, Maurice Bruynooghe, Greet Van den Berghe.
Abstract
BACKGROUND: The intensive care unit (ICU) length of stay (LOS) of patients undergoing cardiac surgery may vary considerably, and is often difficult to predict within the first hours after admission. The early clinical evolution of a cardiac surgery patient might be predictive for his LOS. The purpose of the present study was to develop a predictive model for ICU discharge after non-emergency cardiac surgery, by analyzing the first 4 hours of data in the computerized medical record of these patients with Gaussian processes (GP), a machine learning technique.Entities:
Mesh:
Year: 2011 PMID: 22027016 PMCID: PMC3228706 DOI: 10.1186/1472-6947-11-64
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Guidelines for ICU discharge after cardiac surgery at the university hospitals Leuven
| 1. Respiratory criteria | The patient is extubated and weaned from mechanical ventilation or other forms of mechanical respiratory support. |
| 2. Hemodynamic criteria | Inotropic support and vasopressor therapy have to be stopped upon discharge. |
| 3. Neurologic criteria | Patient is awake, capable of communication and has sufficient pain control with his current analgesic therapy regimen. |
| 4. Bleeding | No major bleeding, defined as a persistent need of transfusion of more than 2 units of packed cells per day. |
| 5. Other organ systems | No vital threats to other organ systems (such as the kidneys, the central nervous system,...) are present |
Baseline characteristics
| Test cohort (n = 462) | Validation cohort (n = 499) | p-value | |
|---|---|---|---|
| ICU length of stay in days (continuous): median (P25-P75) | 1.9 (1.2-3.6) | 2.0 (1.2-3.9) | 0.277 |
| ICU length of stay in days (discrete): median (P25-P75) | 3 (2-5) | 3 (2-5) | 0.360 |
| Patients discharged on the day after surgery: n (%) | 147 (31.8) | 149 (29.9) | 0.511 |
| ICU mortality: n (%) | 8 (1.7) | 11 (2.2) | 0.599 |
| Hospital mortality: n (%) | 15 (3.2) | 16 (3.2) | 0.972 |
| EuroSCORE (additive): median (P25-P75) | 5 (3-7) | 5 (3-7) | 0.202 |
| EuroSCORE (logistic)%: median (P25-P75) | 3.9 (2.1-7.4) | 4.5 (2.2-8.1) | 0.828 |
| Type of surgery | |||
| Isolated OPCAB: n (%) | 190 (41.1) | 179 (35.9) | 0.094 |
| Isolated on-pump CAB: n (%) | 1 (0.2) | 0 (0) | 0.298 |
| Valvular surgery (single, multiple and/or combined with CAB or other surgery): n (%) | 250 (54.1) | 298 (59.7) | 0.079 |
| Other cardiac surgery (congenital, aorta ascendens, myxoma ...): n (%) | 21 (4.6) | 22 (4.4) | 0.919 |
| Repeat cardiac surgery: n (%) | 27 (5.8) | 49 (9.8) | 0.023 |
| Surgery post-endocarditis: n (%) | 7 (1.5) | 7 (1.4) | 0.885 |
Classification task
| GP (4 h) | EuroSCORE (4 h) | EuroSCORE vs. GP | Nurses (6 h) | Nurses vs. GP | ICU physicians (6 h) | ICU physicians vs. GP | |
|---|---|---|---|---|---|---|---|
| aROC | 0.758 | 0.726 | p = 0.286 | X | X | X | X |
| Brier Score | 0.179 | 0.324 | p < 0.001 | X | X | X | X |
| Brier Score Scaled | 11% | 0% | p < 0.001 | X | X | X | X |
| Hosmer Lemeshow p-value | 0.382 | < 0.001 | X | X | X | X | X |
| aROC | 0.769 | 0.726 | p = 0.124 | 0.695 | p = 0.018 | X | X |
| Brier Score | 0.177 | 0.326 | p < 0.001 | 0.245 | p < 0.001 | X | X |
| Brier Score Scaled | 13% | 0% | p < 0.001 | 1.35% | p < 0.001 | X | X |
| Hosmer Lemeshow p-value | 0.405 | < 0.001 | X | < 0.001 | X | X | X |
| aROC | 0.777 | 0.726 | p = 0.334 | X | X | 0.758 | p = 0.719 |
| Brier Score | 0.166 | 0.328 | p < 0.001 | X | X | 0.216 | p = 0.055 |
| Brier Score Scaled | 14.2% | 0% | p < 0.001 | X | X | 12.5% | p < 0.001 |
| Hosmer Lemeshow p-value | 0.696 | < 0.001 | X | X | X | < 0.001 | X |
The interpretation of the different validation measures can be found in the methodology section.
Figure 1ROC of the classification task. Panel A. Validation cohort of 499 patients. ROC of the GP model (aROC = 0.758) (dark interrupted line), with a circle indicating the cutoff for best discrimination and calibration. ROC of the EuroSCORE (aROC = 0.726) (grey dotted line), with a star indicating the cutoff for best discrimination and calibration. There was no statistically significant difference between GP and EuroSCORE (p = 0.286). Panel B. Validation subcohort of 396 patients. ROC of the GP model (aROC = 0.769) (dark interrupted line), with a circle indicating the cutoff for best discrimination and calibration. ROC of the EuroSCORE (aROC = 0.726) (grey dotted line), with a star indicating the cutoff for best discrimination and calibration. ROC of the predictions by nurses (aROC = 0.695) (black uninterrupted line), with a triangle indicating the cutoff for best discrimination and calibration. The aROC of the predictions by nurses was significantly lower than the GP model (p = 0.018). Panel C. Validation subcohort of 159 patients. ROC of the GP model (aROC = 0.777) (dark interrupted line), with a circle indicating the cutoff for best discrimination and calibration. ROC of the EuroSCORE (aROC = 0.726) (grey dotted line), with a star indicating the cutoff for best discrimination and calibration. ROC of the predictions by ICU physicians (aROC = 0.758) (black uninterrupted line), with a triangle indicating the cutoff for best discrimination and calibration. The aROC of the predictions by ICU physicians was not significantly different from the GP model (p = 0.719).
Figure 2Reliability diagrams of the classification task. The reliability curve plots the observed fraction of positives against the predicted fraction of positives. The diagonal indicates a perfect reliability. The dotted horizontal line is the no resolution line, indicating the mean prevalence of the outcome in the population. The Brier score can be expressed as the sum of three terms related to the components of a reliability diagram.
The first term, reliability, is the mean squared difference of the reliability curve to the diagonal. The second term, resolution, is the mean squared difference of the reliability curve to the no resolution line. The third term is a measure of uncertainty. N is the number of instances, s is the fraction of positives in the dataset, and for the kth bin, nis the number of examples, pis the predicted probability, and ois the fraction of positives. Upper right panel. Validation cohort, 499 patients. Brier score and reliability diagram of the GP model. Upper left panel. Validation cohort, 499 patients. Brier score and reliability diagram of the EuroSCORE. Brier score was above the threshold of 0.25, and significantly higher (worse) than the GP models (p < 0.001). Lower right panel. Validation subcohort, 396 patients. Brier score and reliability diagram of the predictions by ICU nurses. Brier score was significantly higher (worse) than the GP models (p < 0.001). Lower left panel. Validation subcohort, 159 patients. Brier score and reliability diagram of the predictions by ICU doctors. Brier score was not significantly higher (worse) than the GP models (p = 0.055).
Regression task
| Actual | GP (4 h) | EuroSCORE (4 h) | EuroSCORE vs. GP | Nurses (6 h) | Nurses vs. GP | ICU physicians (6 h) | ICU physicians vs. GP | |
|---|---|---|---|---|---|---|---|---|
| LOS: Median (P25-P75)(days) | 3 (2-5) | 3 (2-3)* | 4 (3-5)* | p < 0.001 | X | X | X | X |
| LPF: Median (P25-P75) | X | 0 (0-0.4) | -0.3 (-0.5-0) | p = 0.003 | X | X | X | X |
| Patients with LPF = 0: n(%) | X | 200 (40) | 94 (19) | P < 0.001 | X | X | X | X |
| RMSRE | X | 0.408 | 0.643 | X | X | X | X | |
| LOS: Median (P25-P75)(days) | 3 (2-4) | 3 (2-3)* | 4 (3-5)* | p < 0.001 | 3 (2-3)* | p = 0.012 | X | X |
| LPF: Median (P25-P75) | X | 0 (0-0.4) | -0.3 (-0.5-0) | p = 0.002 | 0 (0-0.3) | p = 0.567 | X | X |
| Patients with LPF = 0: n(%) | X | 181 (46) | 86 (22) | P < 0.001 | 152 (38) | P = 0.044 | X | X |
| RMSRE | X | 0.389 | 0.635 | 0.522 | X | X | ||
| LOS: Median (P25-P75)(days) | 3 (2-5) | 3 (2-3)* | 4 (4-5)* | p < 0.001 | X | X | 3 (2-3)* | p = 0.578 |
| LPF: Median (P25-P75) | X | 0.2 (0-0.5) | -0.25 (-0.5-0.2) | p < 0.001 | X | X | 0.2 (0-0.4) | p = 0.755 |
| Patients with LPF = 0: n(%) | X | 59 (37) | 27 (17) | p < 0.001 | X | X | 49 (31) | P = 0.234 |
| RMSRE | X | 0.439 | 0.631 | X | X | 0.612 | ||
*p-value < 0.001 (as compared to actual LOS)
LOS= Length of stay
LPF= Loss Penalty Function
RMSRE= Root Mean Squared Relative Error
The interpretation of the different validation measures can be found in the methodology section.
Figure 3Distribution of the predictions of the regression task (validation cohort, 499 patients. GP versus EuroSCORE). The black bar indicates the actual number of patients discharged on each discrete day. Day 1 is the day of surgery (no patients were discharged on day 1), day 2 is the day after surgery, and so on. The other bars indicate the number of patients predicted to be discharged on that day. The subdivision in these bars indicates the number of true positives (predicted and actually discharged on that day).