| Literature DB >> 34796719 |
Sheng-Feng Sung1,2,3, Chih-Hao Chen4, Ru-Chiou Pan1, Ya-Han Hu5, Jiann-Shing Jeng4.
Abstract
Background Conventional prognostic scores usually require predefined clinical variables to predict outcome. The advancement of natural language processing has made it feasible to derive meaning from unstructured data. We aimed to test whether using unstructured text in electronic health records can improve the prediction of functional outcome after acute ischemic stroke. Methods and Results Patients hospitalized for acute ischemic stroke were identified from 2 hospital stroke registries (3847 and 2668 patients, respectively). Prediction models developed using the first cohort were externally validated using the second cohort, and vice versa. Free text in the history of present illness and computed tomography reports was used to build machine learning models using natural language processing to predict poor functional outcome at 90 days poststroke. Four conventional prognostic models were used as baseline models. The area under the receiver operating characteristic curves of the model using history of present illness in the internal and external validation sets were 0.820 and 0.792, respectively, which were comparable to the National Institutes of Health Stroke Scale score (0.811 and 0.807). The model using computed tomography reports achieved area under the receiver operating characteristic curves of 0.758 and 0.658. Adding information from clinical text significantly improved the predictive performance of each baseline model in terms of area under the receiver operating characteristic curves, net reclassification improvement, and integrated discrimination improvement indices (all P<0.001). Swapping the study cohorts led to similar results. Conclusions By using natural language processing, unstructured text in electronic health records can provide an alternative tool for stroke prognostication, and even enhance the performance of existing prognostic scores.Entities:
Keywords: acute ischemic stroke; machine learning; natural language processing; outcome prediction; risk score
Mesh:
Year: 2021 PMID: 34796719 PMCID: PMC9075227 DOI: 10.1161/JAHA.121.023486
Source DB: PubMed Journal: J Am Heart Assoc ISSN: 2047-9980 Impact factor: 6.106
Figure 1Process of model development and validation.
AIS indicates acute ischemic stroke; and ML, machine learning.
Characteristics of the Study Cohorts
| CYCH (n=3847) | NTUH (n=2668) |
| |
|---|---|---|---|
| Age, mean (SD) | 69.5 (12.3) | 69.8 (13.9) | 0.528 |
| Female | 1583 (41.1) | 1118 (41.9) | 0.543 |
| Hypertension | 3098 (80.5) | 2090 (78.3) | 0.031 |
| Diabetes | 1602 (41.6) | 1024 (38.4) | 0.008 |
| Hyperlipidemia | 2195 (57.1) | 1369 (51.3) | <0.001 |
| Atrial fibrillation | 684 (17.8) | 790 (29.6) | <0.001 |
| Congestive heart failure | 196 (5.1) | 223 (8.4) | <0.001 |
| Cancer | 249 (6.5) | 424 (15.9) | <0.001 |
| Preadmission dependence (mRS >2) | 419 (10.9) | 407 (15.3) | <0.001 |
| Onset‐to‐admission delay >3 h | 2763 (71.8) | 1913 (71.7) | 0.915 |
| NIHSS, median (IQR) | 5 (3–10) | 5 (2–13) | 0.267 |
| Glucose, mean (SD), mg/dL | 163.2 (82.6) | 146.8 (67.7) | <0.001 |
| PLAN score, median (IQR) | 8 (6–12) | 9 (7–12) | 0.001 |
| ASTRAL score, median (IQR) | 21 (18–27) | 22 (18–30) | 0.178 |
| Word count in HPI, median (IQR) | 132 (109–161) | 268 (209–342) | <0.001 |
| BERT tokens in HPI, median (IQR) | 192 (156–240) | 420 (329–535) | <0.001 |
| Word count in CT reports, median (IQR) | 127 (93–189) | 42 (34–52) | <0.001 |
| BERT tokens in CT reports, median (IQR) | 225 (164–351) | 86 (68–106) | <0.001 |
| Poor outcome (mRS >2) | 1674 (43.5) | 1118 (41.9) | 0.196 |
Data are expressed in number (percentage) unless specified otherwise. ASTRAL indicates Acute Stroke Registry and Analysis of Lausanne; BERT, bidirectional encoder representations from transformers; CT, computed tomography; CYCH, Chia‐Yi Christian Hospital; HPI, history of present illness; IQR, interquartile range; mRS, modified Rankin Scale; NIHSS, National Institutes of Health Stroke Scale; NTUH, National Taiwan University Hospital; and PLAN, preadmission comorbidities, level of consciousness, age, and neurological deficit.
Figure 2Receiver operating characteristic curves for predicting a poor functional outcome in the internal (A) and external (B) validation sets.
ASTRAL indicates Acute Stroke Registry and Analysis of Lausanne; AUC, area under the receiver operating characteristic curve; CT, computed tomography; HPI, history of present illness; NIHSS, National Institutes of Health Stroke Scale; and PLAN, preadmission comorbidities, level of consciousness, age, and neurological deficit.
Comparison of the Predictive Ability of Baseline Models With or Without Adding Information From Clinical Text
| Baseline AUC (95% CI) | Text‐enhanced AUC (95% CI) |
| NRI (95% CI) |
| IDI (95% CI) |
| |
|---|---|---|---|---|---|---|---|
| Internal validation | |||||||
| NIHSS | 0.811 (0.783–0.839) | 0.869 (0.846–0.891) | <0.001 | 0.766 (0.648–0.884) | <0.001 | 0.109 (0.089–0.129) | <0.001 |
| Age and NIHSS | 0.841 (0.815–0.866) | 0.872 (0.850–0.895) | <0.001 | 0.514 (0.391–0.637) | <0.001 | 0.065 (0.049–0.080) | <0.001 |
| PLAN score | 0.837 (0.811–0.863) | 0.870 (0.847–0.893) | <0.001 | 0.593 (0.471–0.715) | <0.001 | 0.061 (0.046–0.077) | <0.001 |
| ASTRAL score | 0.840 (0.814–0.866) | 0.871 (0.849–0.894) | <0.001 | 0.527 (0.405–0.650) | <0.001 | 0.070 (0.054–0.086) | <0.001 |
| External validation | |||||||
| NIHSS | 0.807 (0.790–0.823) | 0.843 (0.828–0.858) | <0.001 | 0.719 (0.648–0.791) | <0.001 | 0.089 (0.078–0.100) | <0.001 |
| Age and NIHSS | 0.838 (0.823–0.853) | 0.854 (0.840–0.868) | <0.001 | 0.556 (0.482–0.630) | <0.001 | 0.043 (0.035–0.052) | <0.001 |
| PLAN score | 0.834 (0.818–0.849) | 0.852 (0.838–0.867) | <0.001 | 0.561 (0.488–0.635) | <0.001 | 0.045 (0.037–0.054) | <0.001 |
| ASTRAL score | 0.839 (0.824–0.854) | 0.854 (0.840–0.868) | <0.001 | 0.572 (0.499–0.646) | <0.001 | 0.052 (0.043–0.061) | <0.001 |
ASTRAL indicates Acute Stroke Registry and Analysis of Lausanne; AUC, area under the receiver operating characteristic curve; IDI, integrated discrimination improvement; NIHSS, National Institutes of Health Stroke Scale; NRI, net reclassification improvement; and PLAN, preadmission comorbidities, level of consciousness, age, and neurological deficit.
Figure 3Calibration plots of the baseline and text‐enhanced models.
ASTRAL indicates Acute Stroke Registry and Analysis of Lausanne; NIHSS, National Institutes of Health Stroke Scale; and PLAN, preadmission comorbidities, level of consciousness, age, and neurological deficit.