| Literature DB >> 34983764 |
Anneroos W Boerman1,2, Michiel Schinkel1,3, Lotta Meijerink4, Eva S van den Ende1, Lara Ca Pladet1, Martijn G Scholtemeijer4, Joost Zeeuw4, Anuschka Y van der Zaag1, Tanca C Minderhoud1, Paul W G Elbers5, W Joost Wiersinga3,6, Robert de Jonge2, Mark Hh Kramer7, Prabath W B Nanayakkara8.
Abstract
OBJECTIVES: To develop predictive models for blood culture (BC) outcomes in an emergency department (ED) setting.Entities:
Keywords: accident & emergency medicine; diagnostic microbiology; internal medicine
Mesh:
Year: 2022 PMID: 34983764 PMCID: PMC8728456 DOI: 10.1136/bmjopen-2021-053332
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Baseline characteristics of the study population stratified on blood culture outcomes
| Characteristic | Negative cultures* (N=4287) | Positive cultures (N=598) | Total (N=4885) |
| Age, years | |||
| Median (IQR) | 66 (51–75) | 70 (59–79) | 66 (52–76) |
| Sex | |||
| Male | 56.3% | 62.2% | 57.0% |
| Modified Early Warning Score | |||
| Median (IQR) | 3 (2–4) | 4 (2–5) | 3 (2–4) |
| Missing (N) | 2515 | 351 | 2866 |
| Heart rate (beats per minute) | |||
| Median (IQR) | 94 (82–107) | 100 (88–111) | 95 (82–108) |
| Missing (N) | 181 | 15 | 196 |
| Systolic blood pressure (mm Hg) | |||
| Median (IQR) | 126 (112–142) | 118 (104–136) | 125 (111–141) |
| Missing (N) | 372 | 36 | 408 |
| Respiratory rate (per minute) | |||
| Median (IQR) | 19 (15–23) | 21 (16–25) | 19 (15–24) |
| Missing (N) | 1310 | 149 | 1459 |
| Temperature (degree celsius) | |||
| Median (IQR) | 37.8 (37.0–38.5) | 38.1 (37.2–38.8) | 37.8 (37.0–38.5) |
| Missing (N) | 198 | 26 | 224 |
| C reactive protein (µmol/L) | |||
| Median (IQR) | 60 (25–134) | 104 (39–216) | 64 (25–144) |
| Missing (N) | 132 | 23 | 155 |
| Whitecell counts (109 /L) | |||
| Median (IQR) | 10 (6.8–13.8) | 11.9 (8.2–16.0) | 10.2 (6.9–14.2) |
| Missing (N) | 144 | 22 | 166 |
| Thrombocyte counts (109 /L) | |||
| Median (IQR) | 234 (174–311) | 211 (149–273) | 231 (171–307) |
| Missing (N) | 593 | 105 | 698 |
| Bilirubin (µmol/L) | |||
| Median (IQR) | 9 (6–13) | 13 (8–22) | 9 (6–14) |
| Missing (N) | 1205 | 163 | 1368 |
| Creatinine (µmol/L) | |||
| Median (IQR) | 82 (65–113) | 105 (73–160) | 84 (66–119) |
| Missing (N) | 171 | 27 | 198 |
| Length of ED stay (hours) | |||
| Median (IQR) | 4.3 (3.2–5.8) | 4.7 (3.3–6.3) | 4.4 (3.2–5.9) |
| Hospital admission | |||
| Admitted | 68.0% | 84.6% | 70.0% |
| 30-day mortality | |||
| Died | 6.7% | 11.5% | 7.3% |
*Likely contaminants are classified as negative cultures in this table.
ED, emergency department.
Figure 1Receiver operating characteristic (ROC) and precision recall (PR) curves for positive blood cultures in aggregated cross-validation sets and test set. GB, gradient boosted tree model; LR, logistic regression model.
Performance metrics of both models in the aggregated cross-validation sets and the test set
| Model | Modelling phase | AUROC | AUPRC | Brier score* | F1-score† |
| Gradient boosted trees | Cross-validation mean | 0.77 (SD=0.03) | 0.340 | 0.066 | 0.16 |
| Test | 0.77 (95% CI 0.73 to 0.82) | 0.37 | 0.092 | 0.17 | |
| Logistic regression | Cross-validation mean | 0.75 (SD=0.02) | 0.31 | 0.098 | 0.14 |
| Test | 0.78 (95% CI 0.73 to 0.82) | 0.37 | 0.092 | 0.16 |
*The Brier score is a cost function that measures performance of probabilistic predictions. The score ranges from 0 to 1. The lower the score, the more accurate the prediction.
†F1-scores present a balance between precision and recall. The higher the score, the more accurate the prediction.
AUPRC, area under the precision recall curve; AUROC, area under the curve of the receiver operating characteristics.
Figure 2SHAP-plot of feature importance in the gradient boosted tree model. SHAP, shapley additive explanation. The figure shows the 20 most important features in the gradient boosted tree model, in order of importance on the Y-axis. The relative effect of the feature on the risk of a positive blood culture is shown on the X-axis (right of 0.0 = increased risk, left of 0.0 = lower risk). The colours represent the actual values of the features themselves. Blue depicts a low actual value of the feature while red depicts a high actual value. With yes/no features, no is depicted as a low value (blue) and yes as a high value (red).
Figure 3Feature importances of the logistic regression model. The 20 most important features in the logistic regression model are shown. The features for which a high value is predictive of a positive BC are shown in red and those predictive of a negative culture in blue. The X-axis presents the relative importance of these features.
Performance metrics for both models at preselected thresholds in the aggregated cross-validation sets and the test set
| Model and metric | Optimal sensitivity-specificity | Sensitivity retained at over 90% | ||
| Cross-validation (n=3608) | Test | Cross-validation (n=3608) | Test | |
| Gradient boosted tree model | ||||
| Threshold for positive prediction | 10% | 12.5% | 6% | 6% |
| True negative (n (%)) | 2126 (58.9) | 829 (64.9) | 1369 (37.9) | 473 (37.0) |
| True positive (n (%)) | 322 (8.9) | 103 (8.1) | 400 (11.1) | 142 (1.1) |
| False negative (n (%)) | 121 (3.4) | 52 (4.1) | 43 (1.2) | 13 (1.0) |
| False positive (n (%)) | 1039 (28.8) | 293 (22.9) | 1796 (49.8) | 649 (50.8%) |
| Sensitivity (%) | 72.7 | 66.5 | 90.3 | 91.6 |
| Specificity (%) | 67.2 | 73.9 | 43.3 | 42.2 |
| Positive predictive value (%) | 23.7 | 26 | 18.2 | 18 |
| Negative predictive value (%) | 94.6 | 94.1 | 97 | 97.3 |
| Logistic regression model | ||||
| Threshold for positive prediction | 12.5% | 10%* | 6% | 6% |
| True negative (n (%)) | 2172 (60.2) | 680 (53.2) | 1144 (31.7) | 429 (33.6) |
| True positive (n (%)) | 308 (8.5) | 123 (9.6) | 405 (11.2) | 142 (11.1) |
| False negative (n (%)) | 135 (3.7) | 32 (2.5) | 38 (1.1) | 13 (1.0) |
| False positive (n (%)) | 993 (27.5) | 442 (34.6) | 2021 (56.0) | 693 (45.3) |
| Sensitivity (%) | 69.5 | 79.4 | 91.4 | 91.6 |
| Specificity (%) | 68.6 | 60.6 | 36.1 | 38.2 |
| Positive predictive value (%) | 23.7 | 21.8 | 16.7 | 17 |
| Negative predictive value (%) | 94.1 | 95.5 | 96.8 | 97.1 |
*This is the only scenario where the optimal threshold would be different when based on the maximum sum of sensitivity and specificity or on a minimal difference between sensitivity and specificity. In this case, the threshold was chosen based on the maximum sum of sensitivity and specificity.