| Literature DB >> 35750878 |
Khushbu Agarwal1, Sutanay Choudhury2, Sindhu Tipirneni3, Pritam Mukherjee4, Colby Ham1, Suzanne Tamang5, Matthew Baker6, Siyi Tang7, Veysel Kocaman8, Olivier Gevaert4,5, Robert Rallo1, Chandan K Reddy3.
Abstract
Developing prediction models for emerging infectious diseases from relatively small numbers of cases is a critical need for improving pandemic preparedness. Using COVID-19 as an exemplar, we propose a transfer learning methodology for developing predictive models from multi-modal electronic healthcare records by leveraging information from more prevalent diseases with shared clinical characteristics. Our novel hierarchical, multi-modal model ([Formula: see text]) integrates baseline risk factors from the natural language processing of clinical notes at admission, time-series measurements of biomarkers obtained from laboratory tests, and discrete diagnostic, procedure and drug codes. We demonstrate the alignment of [Formula: see text]'s predictions with well-established clinical knowledge about COVID-19 through univariate and multivariate risk factor driven sub-cohort analysis. [Formula: see text]'s superior performance over state-of-the-art methods shows that leveraging patient data across modalities and transferring prior knowledge from similar disorders is critical for accurate prediction of patient outcomes, and this approach may serve as an important tool in the early response to future pandemics.Entities:
Mesh:
Year: 2022 PMID: 35750878 PMCID: PMC9232529 DOI: 10.1038/s41598-022-13072-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1An illustration of multi-modal data sources observed over the course of a COVID-19 patient’s stay in the hospital. The colors indicate diagnosis (purple), drugs (green), procedures (gray), and numeric lab measurements (blue bars). Different data modalities are observed at varying frequency in raw patient data, with lab measurements being the most sparse across patients and across time. reduces the impact of sparsity by utilizing all modalities of data in a given time interval (e.g., 24 h), creating more informed patient state snapshots in time.
Figure 2Cohort selection process for COVID-19 and Severe Respiratory Disease (SRD) patients from the Stanford Hospital. *For the SRD cohort, the start year 2015 was chosen heuristically to ensure sufficient data.
Summary of the COVID and SRD cohorts.
| COVID | SRD | ||
|---|---|---|---|
| Number of patients | 1701 | 6892 | |
| Number of hospitalizations | 1701 | 9348 | |
| Before 2020 | 0 | 8574 | |
| 2020-Q1 | 35 | 559 | |
| 2020-Q2 | 329 | 202 | |
| 2020-Q3 | 350 | 13 | |
| 2020-Q4 | 566 | 0 | |
| 2021-Q1 | 421 | 0 | |
| Length of stay, median (Interquartile range) | 4.8 (2.8–8.8) | 5.0 (2.0–13.0) | .42 |
| Age at encounter, mean (SD) | 56.8 (18.6) | 38.14 (30.9) | |
| Age at encounter among adults, mean (SD) | 56.8 (18.7) | 60.11 (18.4) | |
| 0 (0%) | 3689 (39%) | ||
| 18–30 | 166 (10%) | 492 (5%) | |
| 30–65 | 903 (53%) | 2534 (27%) | |
| 632 (37%) | 2633 (28%) | ||
| White | 582 (34%) | 3522 (51%) | |
| Black or African American | 70 (4%) | 314 (5%) | |
| Asian | 228 (13%) | 1157 (17%) | |
| American Indian or Alaskan Native | 10 (1%) | 21 (0%) | |
| Native Hawaiian or Other Pacific Islander | 46 (3%) | 173 (3%) | |
| Other/unknown | 765 (45%) | 1705 (25%) | |
| Hispanic or Latino | 691 (41%) | 1687 (24%) | |
| Not Hispanic or Latino | 989 (58%) | 5099 (74%) | |
| Other/Unknown | 21 (1%) | 106 (2%) | |
| Male | 854 (50%) | 3108 (45%) | |
| Female | 847 (50%) | 3784 (55%) | |
| Ventilation (yes/no) | 194/1507 | 1365/7983 | |
| ICU admissions (yes/no) | 98/1603 | 2188/7160 | |
| Mortality (died/survived) | 110/1591 | 780/8568 | .01 |
| Diagnosis | 2310/2599 | 6293 | |
| Procedure | 1204/1905 | 5778 | |
| Drugs | 2147/2283 | 4592 | |
| Lab measurements | 1355/1496 | 2431 | |
Figure 3architecture. Patient context encoders are shown in (a) for static attributes and (b) for multi-modal temporal attributes. The proposed hierarchical transfer learning model is shown in (c). The transfer learning components take as input the patient’s multi-modal encoded state and produce a contextualized vector. The vectors for all time steps are combined along with static attributes to model patient’s (task-specific) evolution over time.
Performance comparison of with other methods.
| Model | 3 days | 7 days | ||
|---|---|---|---|---|
| AUROC | F1 | AUROC | F1 | |
| LR | 0.79 (0.77–0.81) | 0.67 (0.65–0.71) | 0.74 (0.71–0.77) | 0.68 (0.65–0.72) |
| BEHRT | 0.68 (0.64–0.71) | 0.43 (0.42–0.44) | 0.62 (0.58–0.65) | 0.43 (0.42–0.44) |
| GRU | 0.84 (0.81–0.86) | 0.80 (0.76–0.83) | 0.67 (0.61–0.72) | |
| 0.72 (0.70–0.74) | ||||
| LR | 0.64 (0.52–0.78) | 0.31 (0.28–0.35) | 0.68 (0.56–0.80) | 0.31 (0.28–0.34) |
| BEHRT | 0.63 (0.60–0.66) | 0.43 (0.42–0.44) | 0.66 (0.63–0.69) | 0.2 (0.19–0.22) |
| GRU | 0.62 (0.48–0.77) | 0.5 (0.5–0.5) | 0.72 (0.59–0.87) | 0.51 (0.49–0.54) |
Significant values are in bold.
The methods are evaluated for predicting patient stay and ventilation risk in short-term (next 3 days) as well as long-term (next 7 days). ’s best performance was observed using BERT as fine tuning layer for long term ventilation prediction while feed-forward layer did best for other tasks.
Ablation study results of the proposed model analyzing the impact of transfer learning and data modalities on the final performance.
| Method | Short-term patient stay | Long-term patient stay | Short-term ventilation | Long-term ventilation | ||||
|---|---|---|---|---|---|---|---|---|
| AUROC | F1 | AUROC | F1 | AUROC | F1 | AUROC | F1 | |
| 0.71 | 0.62 | 0.72 | 0.67 | 0.72 | 0.47 | 0.71 | 0.48 | |
| 0.81 | 0.69 | 0.78 | 0.70 | 0.79 | 0.50 | 0.74 | 0.49 | |
| 0.83 | 0.71 | 0.79 | 0.72 | 0.84 | 0.52 | 0.77 | 0.51 | |
| 0.82 | 0.70 | 0.76 | 0.69 | 0.81 | 0.54 | 0.74 | 0.50 | |
| 0.83 | 0.71 | 0.77 | 0.69 | 0.79 | 0.53 | 0.75 | 0.51 | |
| 0.83 | 0.72 | 0.77 | 0.70 | 0.81 | 0.53 | 0.75 | 0.51 | |
| 0.83 | 0.71 | 0.79 | 0.72 | 0.84 | 0.52 | 0.77 | 0.51 | |
Ablation study was performed with a fixed set of hyper-parameters and feed forward network fine tuning layer. See Supplementary “Methods: Implementation details” for description of hyperparameter tuning.
Figure 4(a) Risk factor prevalence in ventilated patients compared to the total population. (b) The distribution of model predicted risk scores for ventilation outcomes across the test cohort. The bars show the range while the mean score is showed as a line across the bar. All chronic conditions lead to a higher predicted risk while the mean scores were highest for patients with CAD and Diabetes [consistent with the ground truth trends observed for ventilated patients in (a)].
Figure 5The influence of chronic risk factors across sex and age groups on the predicted risk scores for ventilation outcome ().
Figure 6Size-3 comorbidities by prevalence in ventilated patients vs model predicted risk score. The top 5 (out of 30) comorbidities in the ground truth were found within the top 9 risk scores predicted by the model.