| Literature DB >> 30646310 |
Ben J Marafino1,2,3, Miran Park1,2, Jason M Davies1,2,4,5, Robert Thombley1,2, Harold S Luft6, David C Sing1,2,7, Dhruv S Kazi8,9,10, Colette DeJong1,2, W John Boscardin9, Mitzi L Dean1,2, R Adams Dudley1,2,10.
Abstract
Importance: Accurate prediction of outcomes among patients in intensive care units (ICUs) is important for clinical research and monitoring care quality. Most existing prediction models do not take full advantage of the electronic health record, using only the single worst value of laboratory tests and vital signs and largely ignoring information present in free-text notes. Whether capturing more of the available data and applying machine learning and natural language processing (NLP) can improve and automate the prediction of outcomes among patients in the ICU remains unknown.Entities:
Mesh:
Year: 2018 PMID: 30646310 PMCID: PMC6324323 DOI: 10.1001/jamanetworkopen.2018.5097
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Characteristics of the Cohort
| Characteristic | Value (N = 101 196) |
|---|---|
| Deaths, No. (%) | 10 505 (10.4) |
| Length of stay, mean (SD) [IQR], d | |
| First ICU | 3.5 (4.4) [1-3] |
| Hospital | 11.6 (17.1) [4-13] |
| Age, mean (SD) [IQR], y | 61.3 (17.1) [51-74] |
| Male, No. (%) | 51 899 (51.3) |
| Age categories, No. (%) | |
| <40 y | 12 197 (12.1) |
| 40-59 y | 30 567 (30.2) |
| 60-79 y | 42 828 (42.3) |
| >79 y | 15 604 (15.4) |
| Type of ICU at first admission, No. (%) | |
| Combined medical and surgical | 32 218 (31.8) |
| Medical | 19 110 (18.9) |
| Surgical | 21 910 (21.6) |
| Neurologic | 14 242 (14.1) |
| Coronary care | 13 716 (13.6) |
Abbreviations: ICU, intensive care unit; IQR, interquartile range.
External Validation of Models Built on Each Participating Site
| Participating Site | Type of Model by Test Site, AUC | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Baseline Model | Clinical Trajectory–Augmented Model | NLP-Augmented Model | |||||||
| UCSF | MPMC | BIDMC | UCSF | MPMC | BIDMC | UCSF | MPMC | BIDMC | |
| UCSF | NA | 0.604 | 0.838 | NA | 0.801 | 0.876 | NA | 0.878 | 0.897 |
| MPMC | 0.781 | NA | 0.714 | 0.823 | NA | 0.803 | 0.894 | NA | 0.854 |
| BIDMC | 0.867 | 0.729 | NA | 0.888 | 0.814 | NA | 0.923 | 0.857 | NA |
Abbreviations: AUC, area under the receiver operating characteristic curve; BIDMC, Beth Israel Deaconess Medical Center; MPMC, Mills-Peninsula Medical Center; NA, not applicable; NLP, natural language processing; UCSF, University of California, San Francisco.
Calculated using nested 10-fold cross-validation. All comparisons of the AUCs for each train and test pair between models (eg, trained on BIDMC, tested at UCSF for model 1 vs model 2: 0.867 vs 0.888) were statistically significant at P < .05.
Uses the highest and lowest of all laboratory values and vital signs.
Adds measures of distribution, variability, and trajectory of laboratory values and vital signs to models already using the highest and lowest values.
Adds NLP to models already using all observed values and measures of distribution, variability, and trajectory of laboratory values and vital signs.
Model Discrimination for Multicenter Models Using Different Data and Analytic Methods
| Modeling Approach | AUC (95% CI) |
|---|---|
| Using highest and lowest of all laboratory values and vital signs, logistic regression (baseline) | 0.831 (0.830-0.832) |
| Adding information from all observed laboratory values and vital signs | 0.899 (0.896-0.902) |
| Adding NLP of clinical text | 0.922 (0.916-0.924) |
Abbreviations: AUC, area under the receiver operating characteristic curve; NLP, natural language processing.
Calculated using nested 10-fold cross-validation; 95% CIs were computed using bootstrapping.
Adds measures of distribution, variability, and trajectory of laboratory values and vital signs to models already using the highest and lowest values.
Adds NLP to models already using all observed values and measures of distribution, variability, and trajectory of laboratory values and vital signs.
Examples of Influential Predictive Terms Derived From Clinical Text
| Clinical Term | Weight |
|---|---|
| Pupils (fixed) | 7.78 |
| Gag | 6.74 |
| ECMO | 6.18 |
| Coagulopathy | 4.67 |
| Shock | 4.41 |
| Intubated | 4.28 |
| PEA | 3.68 |
| Chemotherapy | 3.49 |
| Ascites | 3.27 |
| CVVH | 2.78 |
| Sepsis | 2.27 |
| Meropenem | 2.09 |
| EtOH | −1.14 |
| OHNS | −1.15 |
| Alert | −1.51 |
| EBL | −2.10 |
| Diet | −2.68 |
| Awake | −3.11 |
| PERRL | −4.28 |
| Denies (pain) | −4.56 |
| POD | −4.70 |
| Extubated | −7.64 |
Abbreviations: CVVH, continuous venovenous hemofiltration; EBL, expected blood loss; ECMO, extracorporeal membrane oxygenation; EtOH, ethanol (alcohol); OHNS, otolaryngology–head and neck surgery; PEA, pulseless electrical activity; PERRL, pupils equal, round, and reactive to light; POD, postoperative day.
Each term is associated with a β coefficient or weight in the logistic regression model, which represents its relative association with mortality. Positive weights indicate increased odds of mortality when the term is included in a clinical note. Negative weights indicate decreased odds of mortality.