| Literature DB >> 34384206 |
Danielle Jeddah1,2, Ofer Chen2, Ari M Lipsky2,3, Andrea Forgacs2, Gershon Celniker2, Craig M Lilly4,5,6, Itai M Pessach1.
Abstract
OBJECTIVE: Predictive models for critical events in the intensive care unit (ICU) might help providers anticipate patient deterioration. At the heart of predictive model development lies the ability to accurately label significant events, thereby facilitating the use of machine learning and similar strategies. We conducted this study to establish the validity of an automated system for tagging respiratory and hemodynamic deterioration by comparing automatic tags to tagging by expert reviewers.Entities:
Keywords: Artificial Intelligence; Big Data; Clinical Deterioration; Critical Care; Respiratory Insufficiency
Year: 2021 PMID: 34384206 PMCID: PMC8369051 DOI: 10.4258/hir.2021.27.3.241
Source DB: PubMed Journal: Healthc Inform Res ISSN: 2093-3681
Characteristics of the study population
| UMass dataset | Study population | Events used for analysis[ | |
|---|---|---|---|
| Total stays | 72,650 | 500 | 500 |
|
| |||
| Sex, male | 41,472 (57.1) | 280 (56.0) | - |
|
| |||
| Age (yr) | 64 (52–76) | 65 (53–76) | - |
|
| |||
| Ethnicity | - | ||
| White | 63,144 (86.9) | 426 (85.2) | |
| Non-White | 4,188 (5.8) | 33 (6.6) | |
| Unknown | 5,241 (7.2) | 41 (8.2) | |
|
| |||
| ICU type | - | ||
| Medical | 30,730 (42.4) | 239 (47.8) | |
| Surgical | 14,644 (20.2) | 86 (17.2) | |
| Cardiac | 13,402 (18.5) | 77 (15.4) | |
| Neuro | 13,770 (19.0) | 98 (19.6) | |
|
| |||
| LOS (hr) | 59 (34–110) | 114 (45–250) | - |
|
| |||
| Mortality[ | 8,613 (11.9) | 113 (22.6) | - |
|
| |||
| Vasopressor[ | 10,429 (14.4) | 240 (48.0) | 219 |
|
| |||
| Mechanical ventilation[ | 7,386 (10.2) | 218 (43.6) | 146 |
Values are presented as number of patients (%) or median (25th–75th percentile).
LOS: length of stay in intensive care unit.
There is no statistically significant difference in the parameters compared between UMass dataset and our study population, except for the characteristics that were part of the case stratification process, which improves statistical efficiency while preserving randomization (p < 0.05). The diversity of the patient population helps mitigate concerns associated with model development based on data from a single center.
To avoid mis-tagging some events occurring during the first hours of the stay or very near to a previous event were excluded as explained in the method section.
Figure 1Process of patient selection and stratified over-sampling of critical events. To increase the prevalence of events of interest and improve statistical efficiency, stratified over-sampling of deterioration events was implemented. The dataset was divided into three subsets: patients with presumptive respiratory events, patients with presumptive hemodynamic events, and patients with neither of those events. To avoid duplications, each stay could only belong to one category; hence, stays with both a respiratory and a hemodynamic event were grouped according to the event that occurred earlier during the stay. Each set was then randomly sampled to yield 166–167 stays, stratified for basic demographic and clinical characteristics. In this manner, the 500-stay validation cohort included a higher proportion of patients with significant events than the general patient population, but with similar baseline demographics.
Figure 2Distribution of events between stays. The distribution of events between stays as tagged by the expert reviewers or by the automatic tagging system. More than one intubation event was defined as occurring if an intubation event occurred at least 12 hours after a previous extubation event during that intensive care unit (ICU) stay. Similarly, a vasopressor initiation event was included if it was the first vasopressor initiation during a particular ICU stay, or if it was initiated at least 6 hours after ending previous vasopressor administration.
Confusion matrix for the respiratory deterioration tag
| Expert reviewers | |||
|---|---|---|---|
|
| |||
| Positive | Negative | ||
| Automated system | Positive | 117 | 29 |
| Negative | 25 | 339 | |
Accuracy analysis for the respiratory deterioration tag
| Accuracy (%) | 95% CI | ||
|---|---|---|---|
| Lower | Upper | ||
| Sensitivity | 82.4 | 75.1 | 88.3 |
| Specificity | 92.1 | 88.9 | 94.7 |
| Positive predicted value | 80.1 | 72.7 | 86.3 |
| Negative predicted value | 93.1 | 90.0 | 95.5 |
| Overall agreement | 89.4 | 86.4 | 92.0 |
CI: confidence interval.
Confusion matrix for the hemodynamic deterioration tag
| Expert reviewers | |||
|---|---|---|---|
|
| |||
| Positive | Negative | ||
| Automated system | Positive | 167 | 52 |
| Negative | 16 | 292 | |
Accuracy analysis for the hemodynamic deterioration tag
| Accuracy (%) | 95% CI | ||
|---|---|---|---|
| Lower | Upper | ||
| Sensitivity | 91.3 | 86.2 | 94.9 |
| Specificity | 84.9 | 80.7 | 88.5 |
| Positive predicted value | 76.3 | 70.1 | 81.7 |
| Negative predicted value | 94.8 | 91.7 | 97.0 |
| Overall agreement | 87.1 | 83.9 | 89.8 |
CI: confidence interval.