| Literature DB >> 36117237 |
Jim M Smit1,2, Jesse H Krijthe3, Andrei N Tintu4, Henrik Endeman5, Jeroen Ludikhuize6,7, Michel E van Genderen5, Shermarke Hassan8, Rachida El Moussaoui9, Peter E Westerweel10, Robbert J Goekoop11, Geeke Waverijn12, Tim Verheijen13, Jan G den Hollander9, Mark G J de Boer14, Diederik A M P J Gommers5, Robin van der Vlies15, Mark Schellings16, Regina A Carels17, Cees van Nieuwkoop18, Sesmu M Arbous19, Jasper van Bommel5, Rachel Knevel13,20, Yolanda B de Rijke4, Marcel J T Reinders3.
Abstract
BACKGROUND: Timely identification of deteriorating COVID-19 patients is needed to guide changes in clinical management and admission to intensive care units (ICUs). There is significant concern that widely used Early warning scores (EWSs) underestimate illness severity in COVID-19 patients and therefore, we developed an early warning model specifically for COVID-19 patients.Entities:
Keywords: Artificial intelligence; COVID-19; Dynamic model updating; Early warning score; Intensive care; Machine learning; Medical prediction model
Year: 2022 PMID: 36117237 PMCID: PMC9482891 DOI: 10.1186/s40635-022-00465-4
Source DB: PubMed Journal: Intensive Care Med Exp ISSN: 2197-425X
Fig. 1Study design. a Schematic representation of the dynamic model updating procedure. For example, to predict deterioration for patients admitted to hospital A in October 2020, the model is fitted using patient data collected up to that date in the remaining hospitals, and a calibrator is fitted using patient data collected up to that date in hospital A. These two combined result in calibrated predictions. This process is repeated each month, for each hospital, from August 2020 until May 2021. b Flowchart of patient inclusion. ICU = intensive care unit, ED = emergency department, EoLC = end-of-life care, LOS = length-of-stay
Pathway and population characteristics
| DA ( | ICU ( | Died ( | Transfer ( | SA ( | Total ( | |
|---|---|---|---|---|---|---|
| Sex male, % | 55.3 | 64.7 | 100.0 | 56.9 | 56.2 | 57.0 |
| Female, % | 43.1 | 34.9 | 0.0 | 39.8 | 43.8 | 41.3 |
| Unknown, % | 1.6 | 0.4 | 0.0 | 3.3 | 0.0 | 1.6 |
| Age, years med (IQR) | 61.0 (51.0–70.0) | 63.0 (55.0–70.0) | 76.0 (74.5–77.5) | 60.0 (53.2–69.0) | 66.5 (55.0–75.5) | 61.0 (52.0–70.0) |
| Mean (SD) | 59.6 (14.2) | 61.5 (11.7) | 76.0 (3.0) | 59.8 (11.9) | 64.5 (11.6) | 60.0 (13.5) |
| Ward LOS, days med (IQR) | 3.7 (1.9–6.4) | 2.3 (1.1–3.9) | 7.6 (7.2–7.9) | 1.1 (0.8–2.0) | 4.7 (1.2–14.5) | 2.9 (1.5–5.5) |
| Mean (SD) | 5.3 (6.8) | 3.4 (4.4) | 7.6 (0.7) | 1.8 (2.3) | 8.6 (8.9) | 4.5 (6.2) |
| RR, breaths/min med (IQR) | 18.0 (16.0–22.0) | 22.0 (19.0–26.0) | 20.0 (20.0–20.0) | 20.0 (16.8–24.0) | 18.0 (16.0–23.5) | 20.0 (16.0–24.0) |
| Mean (SD) | 19.3 (5.0) | 22.9 (6.0) | 20.0 (0.0) | 20.8 (5.1) | 20.7 (6.2) | 20.1 (5.4) |
| SpO2, % med (IQR) | 96.0 (95.0–98.0) | 95.0 (94.0–97.0) | 95.5 (95.2–95.8) | 95.0 (94.0–97.0) | 95.0 (93.5–97.0) | 96.0 (94.2–97.0) |
| Mean (SD) | 96.0 (3.5) | 95.1 (4.8) | 95.5 (0.5) | 95.5 (2.2) | 94.5 (3.4) | 95.8 (3.6) |
| SBP, mmHg med (IQR) | 125.0 (113.0–137.0) | 125.0 (114.0–137.0) | 137.5 (136.8–138.2) | 123.0 (113.0–133.5) | 119.0 (107.2–126.8) | 124.0 (113.0–136.0) |
| Mean (SD) | 126.4 (18.8) | 127.4 (19.3) | 137.5 (1.5) | 124.1 (16.5) | 120.0 (17.3) | 126.2 (18.6) |
| T, °C med (IQR) | 37.1 (36.6–37.8) | 37.3 (36.7–38.0) | 37.0 (37.0–37.1) | 37.0 (36.6–37.7) | 36.8 (36.2–37.0) | 37.1 (36.6–37.8) |
| Mean (SD) | 37.2 (0.9) | 37.4 (1.0) | 37.0 (0.1) | 37.2 (0.9) | 36.8 (0.7) | 37.2 (0.9) |
| HR, bpm med (IQR) | 81.0 (71.0–91.0) | 83.0 (73.0–92.0) | 91.0 (84.0–98.0) | 81.0 (72.0–90.0) | 80.0 (73.8–84.8) | 81.0 (72.0–91.0) |
| Mean (SD) | 82.0 (15.2) | 83.5 (15.2) | 91.0 (14.0) | 81.6 (13.7) | 81.2 (13.6) | 82.2 (15.0) |
| O2, yes/no, % | 57.4 | 76.4 | 0.0 | 82.5 | 68.8 | 63.8 |
| O2, L/min med (IQR) | 3.0 (2.0–4.0) | 6.0 (3.0–12.0) | – | 4.0 (2.0–5.0) | 3.0 (2.0–7.5) | 3.0 (2.0–5.0) |
| Mean (SD) | 3.6 (3.0) | 7.5 (5.4) | – | 4.4 (3.0) | 5.5 (5.1) | 4.5 (3.9) |
| SpO2/O2, 1/(L/min) med (IQR) | 32.7 (23.5–48.5) | 15.8 (8.1–31.0) | – | 24.2 (18.6–47.0) | 30.7 (14.1–48.5) | 31.7 (18.6–48.0) |
| Mean (SD) | 41.4 (26.5) | 23.0 (21.3) | – | 32.4 (21.1) | 35.0 (24.1) | 36.4 (25.7) |
DA discharged alive, ICU intensive care unit, SA still admitted, IQR interquartile range, SD standard deviation, LOS length-of-stay, RR respiratory rate, SBP systolic blood pressure, T temperature, HR heart rate
Fig. 2Model discrimination and decision curve analysis. a Overall ROC curves for the RF and LR models and the NEWS. We placed two landmarks for a NEW score of 5 and 7, i.e., the recommended trigger thresholds for an urgent and emergency response. We calculated both the pAUC between a false positive rate of 0 and 0.33 (grey area) and the complete AUC. Shaded areas around each point in the ROC curves represent the 95% bootstrap percentile CIs25 (with 1000 bootstrap replications stratified for positive and negative samples). b Hospital-specific pAUCs. The error bars represent the 95% bootstrap percentile CIs25 (with 1000 bootstrap replications stratified for positive and negative samples). P-values, calculated as described in Additional file 1: appendix F.4, are shown for the difference in pAUC between the RF models and NEWS (upper bar), between the RF and LR models (middle bar) and between the LR models and NEWS (lower bar). c Overall decision curve analysis results. The standardized net benefit is plotted over a range of clinically relevant probability thresholds with corresponding odds. The ‘Intervention for all’ line indicates the NB if a (urgent or emergency) response would always be triggered
Fig. 3Overall model calibration of the static and dynamic RF models (a) and LR models (b). Top left: smoothed flexible calibration curves. Top right: zoom-in of the calibration curve in the 0–0.2 probability range (grey area). Shaded areas around the curves represent the 95% CIs. Bottom: histogram of the predictions (logscale). Shaded areas around each point in the calibration curves (before smoothing) represent the 95% bootstrap percentile CIs25 (with 1000 bootstrap replications stratified for positive and negative samples). The smooth curves including CIs were estimated by locally weighted scatterplot smoothing (see https://github.com/jimmsmit/COVID-19_EWS for the implementation). a Overall model calibration of the static and dynamic RF models. b Overall model calibration of the static and dynamic LR models
Fig. 4Distribution of SHapley Additive exPlanations (SHAP) values of the included predictors (based on mean SHAP magnitude) for the random forest model. For each predictor, each dot represents the impact of that predictor for a single prediction. The colors of the dots correspond with the value for the specific predictor. Thus, pink dots with positive SHAP values indicate that high values of the predictor are associated with a high risk of clinical deterioration. Conversely, blue dots with positive SHAP values indicate that low values of the predictor are associated with a high risk of clinical deterioration