| Literature DB >> 30921400 |
Daniel Zeiberg1, Tejas Prahlad1, Brahmajee K Nallamothu2,3,4,5, Theodore J Iwashyna2,3,4,5,6, Jenna Wiens1,5, Michael W Sjoding2,3,5,7.
Abstract
BACKGROUND: Existing prediction models for acute respiratory distress syndrome (ARDS) require manual chart abstraction and have only fair performance-limiting their suitability for driving clinical interventions. We sought to develop a machine learning approach for the prediction of ARDS that (a) leverages electronic health record (EHR) data, (b) is fully automated, and (c) can be applied at clinically relevant time points throughout a patient's stay. METHODS ANDEntities:
Mesh:
Year: 2019 PMID: 30921400 PMCID: PMC6438573 DOI: 10.1371/journal.pone.0214465
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Study cohort.
Flow diagram of 2016 model derivation cohort and 2017 testing cohort.
Fig 2Timeline for prediction.
At the patient’s time of eligibility (i.e., when they develop moderate hypoxia), the patient’s risk of future ARDS was predicted using the most recent six hours of data.
Fig 3Model training, validation and testing pipeline.
The data was split temporally into a training/validation dataset (2016) and testing dataset (2017). Step 1: 5-fold cross validation was performed using the 2016 dataset to identify the optimal model hyperparameter. Step 2: The model was re-trained using the optimal hyperparameter on the entire 2016 dataset to learn model parameters. Step 3: The model was evaluated on held-out test data from 2017.
Study population characteristics.
| Clinical characteristics | Training/validation | Test cohort |
|---|---|---|
| Year | 2016 | 2017 |
| Number (N) | 1621 | 1122 |
| Diagnosed with ARDS | 51 | 27 |
| Median age [IQR] | 62 [51–71] | 62 [51–72] |
| Female (%) | 45.5 | 42.9 |
| Race (%) | ||
| Caucasian | 85.5 | 74.4 |
| Black | 9.1 | 8.5 |
| Other | 5.4 | 17.1 |
| Admission Source (%) | ||
| ED | 66.3 | 54.1 |
| Post-op | 22 | 31.6 |
| Floor | 11.7 | 14.4 |
| Length of stay, d | 5 [3–9] | 6 [3–9] |
| ARDS onset, hr | 43.2 [19.1–72.1] | 38.0 [21.5–72.6] |
| In-hospital mortality (%) | 6.0 | 4.6 |
Fig 4Model Performance.
Performance of the ARDS risk prediction model (L2-regulized model) in the 2017 test cohort. A. ROC curve and 95% interval estimates. B. Confusion matrix with 95% interval estimates.
Fig 5Model Sensitivity stratified by ARDS time of onset.
Model performance in subgroups of ARDS patients based on time from ARDS risk stratification to ARDS onset.
Top predictive factors.
Top 10 risk factors and top 10 protective factors identified in the model to risk stratify patients for ARDS.
| 1 | Low minimum PaO2/FiO2a | 124–161 | 0.15 | |
| 2 | High minimum heart rate | > 95 | 0.15 | |
| 3 | Normal hemoglobin level | 12–16 g/dL | 0.14 | |
| 4 | High albumin level | > 5 g/dL | 0.14 | |
| 5 | Low minimum O2 saturationb | < 89% | 0.13 | |
| 6 | Very high median heart rate | > 104 | 0.13 | |
| 7 | Very high mean heart rate | > 104 | 0.12 | |
| 8 | Normal platelet count | 150–400 | 0.12 | |
| 9 | Low Interquartile range systolic BP | 0–4 | 0.12 | |
| 10 | High Standard deviation O2 saturation | 1.9–2.9 | 0.11 | |
| 1 | Missing lactate result | n/a | -0.12 | |
| 2 | Missing pH result | n/a | -0.12 | |
| 3 | Location: scheduled chemotherapy | n/a | -0.12 | |
| 4 | Middle age range | 47–58 | -0.11 | |
| 5 | Normal bicarbonate | 22–34 | -0.11 | |
| 6 | Low O2 saturation standard deviation | 0.6–1.3 | -0.1 | |
| 7 | Low minimum heart rate | 65–74 | -0.1 | |
| 8 | Low maximum heart rate | 48–80 | -0.1 | |
| 9 | Very high mean O2 saturation | > 98% | -0.1 | |
| 10 | Middle Mean heart rate | 83–92 | -0.1 | |
Model features were derived using data from the six hours preceding the time when the patient met eligibility criteria. Summary measures, including minimum value, maximum, mean, median, standard deviation, and interquartile range were calculated for each continuous variable recorded multiple times during the six-hour window (e.g. heart rate). Inter-quartile range is the difference between the 75th and 25th percentile value. These summary measures were then quantized as described in the methods.
aPaO2/FiO2 was directly calculated or derived based on recorded O2 saturation of an arterial blood gas measurement was absent [11].
bLowest minimum oxygen saturation may have occurred when the patient was on at least 3 liters of supplemental oxygen at the time of ARDS prediction or in the prior six hours when the patient was on less than this amount of oxygen.