| Literature DB >> 34115013 |
Christine Giang1, Jacob Calvert, Keyvan Rahmani, Gina Barnes, Anna Siefkas, Abigail Green-Saxena, Jana Hoffman, Qingqing Mao, Ritankar Das.
Abstract
ABSTRACT: Ventilator-associated pneumonia (VAP) is the most common and fatal nosocomial infection in intensive care units (ICUs). Existing methods for identifying VAP display low accuracy, and their use may delay antimicrobial therapy. VAP diagnostics derived from machine learning (ML) methods that utilize electronic health record (EHR) data have not yet been explored. The objective of this study is to compare the performance of a variety of ML models trained to predict whether VAP will be diagnosed during the patient stay.A retrospective study examined data from 6126 adult ICU encounters lasting at least 48 hours following the initiation of mechanical ventilation. The gold standard was the presence of a diagnostic code for VAP. Five different ML models were trained to predict VAP 48 hours after initiation of mechanical ventilation. Model performance was evaluated with regard to the area under the receiver operating characteristic (AUROC) curve on a 20% hold-out test set. Feature importance was measured in terms of Shapley values.The highest performing model achieved an AUROC value of 0.854. The most important features for the best-performing model were the length of time on mechanical ventilation, the presence of antibiotics, sputum test frequency, and the most recent Glasgow Coma Scale assessment.Supervised ML using patient EHR data is promising for VAP diagnosis and warrants further validation. This tool has the potential to aid the timely diagnosis of VAP.Entities:
Mesh:
Year: 2021 PMID: 34115013 PMCID: PMC8202554 DOI: 10.1097/MD.0000000000026246
Source DB: PubMed Journal: Medicine (Baltimore) ISSN: 0025-7974 Impact factor: 1.817
Figure 1Patient exclusion numbers for the intubation task. BUN = blood urea nitrogen, CAP = community acquired pneumonia, MV = mechanical ventilation.
Data included as input to the algorithm.
| Required vitals and labs | Boolean indicators | Optional measures |
| – Systolic BP – Diastolic BP – HR – Respiratory rate – Temperature – Hematocrit – SpO2 – BUN – GCS – Platelet count – WBC – Creatinine | – Antibiotics – Sputum labs – Blood culture labs – Any of cirrhosis, congestive heart failure, fever, bacteremia, intracranial hemorrhage, renal failure, respiratory distress, respiratory failure, sepsis, subarachnoid hemorrhage, shortness of breath – ARDS | – Age – Total urine output events – Number of blood culture tests – Number of sputum tests – Number of MV hours |
ARDS = acute respiratory distress syndrome, BP = blood pressure, BUN = blood urea nitrogen, GCS = Glasgow Coma Scale, HR = heart rate, MV = mechanical ventilation, SpO2 = oxygen saturation, WBC = white blood cell.
Demographic and comorbidity information for the experimental population.
| Characteristic | VAP positive n = 524 | VAP negative n = 5602 |
| Age | ||
| <30 | 27 (5.15%) | 215 (3.84%) |
| 30–49 | 81 (15.46%) | 735 (13.12%) |
| 50–59 | 105 (20.04%) | 972 (17.35%) |
| 60–69 | 120 (22.90%) | 1383 (24.69%) |
| 70–79 | 107 (20.42%) | 1185 (21.15%) |
| 80+ | 71 (13.55%) | 984 (17.57%) |
| ARDS | ||
| Yes | 35 (6.68%) | 284 (5.07%) |
| No | 489 (93.32%) | 5318 (94.93%) |
| Sputum test performed | ||
| Yes | 497 (94.85%) | 2644 (47.20%) |
| No | 27 (5.15%) | 2958 (52.80%) |
| Gender | ||
| Male | 313 (59.73%) | 3172 (56.62%) |
| Female | 211 (40.27%) | 2430 (43.38%) |
| Ethnicity | ||
| White | 354 (5.15%) | 4072 (72.69%) |
| Black/African-American | 44 (8.40%) | 484 (8.64%) |
| Asian | 16 (3.05%) | 134 (2.39%) |
| Hispanic/Latino | 11 (2.10%) | 209 (3.73%) |
| Unknown/other | 99 (18.89%) | 703 (12.55%) |
ARDS = acute respiratory distress syndrome, VAP = ventilator-associated pneumonia.
AUROC results on the hold-out test set of models trained to predict VAP 48 hours after intubation, using summary statistics from the previous k hours of patient data.
| 6 | 12 | 24 | 48 | |
| Logistic regression | 0.744 | 0.751 | 0.739 | 0.776 |
| Multilayer perceptron | 0.731 | 0.740 | 0.722 | 0.741 |
| Random forest | 0.771 | 0.767 | 0.780 | 0.777 |
| Support vector machines | 0.765 | 0.769 | 0.764 | 0.775 |
| XGBoost | 0.799 | 0.794 | 0.775 | 0.791 |
| CURB-65 | 0.503 | 0.498 | 0.498 | 0.498 |
| PIRO | 0.565 | 0.555 | 0.566 | 0.557 |
AUROC = the area under the receiver operating characteristic curve, ICU = intensive care unit, PIRO = predisposition, insult, response, organ dysfunction, VAP = ventilator-associated pneumonia.
Figure 2ROC curve comparison for intubation task models, with summary statistics calculated from the (A) 6 hours and (B) 48 hours of data preceding the time of prediction. K = number of hours used to make prediction, MIMC = multiparameter intelligent monitoring in intensive care, ROC = receiver operating characteristic.
AUROC results on the hold-out test set of models trained to predict VAP k hours after ICU admission.
| 6 | 12 | 24 | 48 | |
| Logistic regression | 0.772 | 0.788 | 0.812 | 0.822 |
| Multilayer perceptron | 0.587 | 0.712 | 0.753 | 0.820 |
| Random forest | 0.706 | 0.762 | 0.822 | 0.838 |
| Support vector machines | 0.766 | 0.799 | 0.812 | 0.829 |
| XGBoost | 0.733 | 0.796 | 0.820 | 0.854 |
| CURB-65 | 0.481 | 0.496 | 0.506 | 0.517 |
| PIRO | 0.584 | 0.595 | 0.599 | 0.622 |
AUROC = the area under the receiver operating characteristic curve, PIRO = predisposition, insult, response, organ dysfunction, VAP = ventilator-associated pneumonia.
Figure 3ROC curve comparison for admission task models with summary statistics calculated from the (A) 6 hours and (B) 48 hours of data preceding the time of prediction.