| Literature DB >> 34326805 |
Christina M Lineback1, Ravi Garg2, Elissa Oh2, Andrew M Naidech1,2, Jane L Holl2, Shyam Prabhakaran3.
Abstract
Background and Purpose: This study aims to determine whether machine learning (ML) and natural language processing (NLP) from electronic health records (EHR) improve the prediction of 30-day readmission after stroke.Entities:
Keywords: bioinformatics; machine learning; natural language processing; readmission; stroke
Year: 2021 PMID: 34326805 PMCID: PMC8315788 DOI: 10.3389/fneur.2021.649521
Source DB: PubMed Journal: Front Neurol ISSN: 1664-2295 Impact factor: 4.003
List of discrete features extracted from enterprise data warehouse.
| Demographics | Age, gender, race, ethnicity, marital status, and insurance status |
| Risk factors | Hypertension, diabetes mellitus, atrial fibrillation, prior stroke, coronary artery disease, congestive heart failure, valvular heart disease, coronary artery bypass graft/stent, end-stage renal disease, hypothyroidism, dementia, cancer, chronic lung disease, and smoking status |
| Index stroke encounter characteristics | Primary stroke type, initial NIHSS score, initial GCS score, in-hospital pneumonia, medications (e.g., anticoagulants) at discharge, percutaneous endoscopic gastrostomy, mechanical ventilation, intensive care unit stay, and discharge destination |
| Other baseline factors | Miles from residence to hospital, frequency of hospital admissions in preceding year, and frequency of stroke admissions in preceding year |
Figure 1Description of feature ensemble method.
Figure 2Description of classifier ensemble method.
Baseline characteristics of the training cohort (n = 2,305) and testing cohort (n = 550).
| Mean age in years (SD) | 64.4 (16.4) | 64.8 (15.1) | 0.90 |
| Male sex [ | 1,156 (50.2) | 297 (54) | 0.11 |
| White | 1,156 (50.2) | 284 (51.6) | 0.09 |
| Black | 613 (26.6) | 138 (25) | |
| Asian | 78 (3.4) | 13 (2.4) | |
| American Indian or Alaskan Native | 4 (0.2) | 4 (0.7) | |
| Native Hawaiian/Pacific Islander | 4 (0.2) | 3 (0.5) | |
| Declined, missing, or unknown | 233 (10.1) | 63 (11.4) | |
| Other | 217 (9.41) | 45 (8.1) | |
| Hispanic [ | 164 (7.1) | 63 (11.4) | <0.01 |
| Married | 1,001 (43.4) | 265 (48.1) | 0.02 |
| Widowed | 253 (11.0) | 45 (8.1) | |
| Single | 759 (32.9) | 157 (28.5) | |
| Divorced | 142 (6.2) | 33 (6) | |
| Separated | 8 (0.3) | 1 (0.2) | |
| Unknown, other, or missing | 142 (6.2) | 49 (8.9) | |
| Private | 833 (36.1) | 173 (31.5) | <0.01 |
| Medicare | 1,060 (46.0) | 278 (50.5) | |
| Medicaid | 182 (7.9) | 63 (11.5) | |
| Other or self-pay | 230 (10.0) | 36 (6.5) | |
| Ischemic stroke | 1,825 (79.1) | 416 (75.6) | <0.01 |
| Intracerebral hemorrhage | 257 (11.1) | 94 (17) | |
| Subarachnoid hemorrhage | 223 (9.7) | 40 (7.3) | |
| Hypertension [ | 1,853 (78.8) | 466 (84.7) | 0.01 |
| Diabetes mellitus [ | 629 (27.3) | 179 (32.6) | 0.13 |
| Atrial fibrillation [ | 430 (18.7) | 111 (20.2) | 0.42 |
| Coronary artery disease [ | 189 (8.2) | 30 (5.5) | 0.03 |
| Congestive heart failure [ | 232 (10.1) | 67 (12.2) | 0.15 |
| Valvular heart disease [ | 42 (1.8) | 36 (6.5) | <0.01 |
| Prior stroke [ | 218 (9.5) | 57 (10.3) | 0.57 |
| Chronic lung disease [ | 236 (10.2) | 48 (8.7) | 0.29 |
| Dementia [ | 149 (6.5) | 37 (6.7) | 0.87 |
| Cancer [ | 180 (7.8) | 45 (8.2) | 0.75 |
| End-stage renal disease [ | 39 (1.7) | 13 (2.3) | 0.34 |
| Hypothyroidism [ | 270 (11.7) | 56 (10.2) | 0.32 |
| Current | 363 (15.7) | 76 (13.8) | 0.03 |
| Former | 595 (25.8) | 115 (20.9) | |
| Non-smoker | 1,224 (53.1) | 328 (59.6) | |
| Missing or other | 123 (5.3) | 31 (5.6) | |
| Any prior hospitalization [ | 1,428 (61.0) | 324 (58.9) | 0.37 |
| Median initial NIHSS score (IQR) | 2 (0–6) | 2 (0–6) | 0.09 |
| Median initial GCS (IQR) | 15 (14–15) | 15 (14–15) | 0.10 |
| Missing [ | 83 (3.6) | 22 (4) | 0.65 |
| Intensive care unit stay [ | 1,166 (50.6) | 306 (55.64) | 0.04 |
| Inhospital pneumonia [ | 108 (4.7) | 24 (4.4) | 0.76 |
| Mechanical ventilation [ | 226 (9.8) | 49 (8.9) | 0.52 |
| Gastrostomy [ | 153 (6.6) | 35 (6.3) | 0.80 |
| Home | 1,659 (72.0) | 350 (63.6) | <0.01 |
| Acute inpatient rehabilitation | 429 (18.6) | 148 (26.9) | |
| Skilled nursing facility or long-term facility | 153 (6.6) | 33 (6) | |
| Other hospital or against medical advice | 64 (2.8) | 19 (3.45) | |
| Any unplanned readmission within 30 days [ | 337 (14.6) | 62 (11.5) | 0.04 |
| Stroke readmission within 30 days [ | 124 (5.4) | 24 (4.5) | 0.33 |
Figure 3Comparison of models to predict 30-day all-cause readmissions.
Figure 4Comparison of models to predict 30-day stroke readmissions.