Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Automating Ischemic Stroke Subtype Classification Using Machine Learning and Natural Language Processing.

Literature DB >> 31103549

Automating Ischemic Stroke Subtype Classification Using Machine Learning and Natural Language Processing.

Ravi Garg¹, Elissa Oh¹, Andrew Naidech¹, Konrad Kording², Shyam Prabhakaran³.

Abstract

OBJECTIVE: The manual adjudication of disease classification is time-consuming, error-prone, and limits scaling to large datasets. In ischemic stroke (IS), subtype classification is critical for management and outcome prediction. This study sought to use natural language processing of electronic health records (EHR) combined with machine learning methods to automate IS subtyping.
METHODS: Among IS patients from an observational registry with TOAST subtyping adjudicated by board-certified vascular neurologists, we analyzed unstructured text-based EHR data including neurology progress notes and neuroradiology reports using natural language processing. We performed several feature selection methods to reduce the high dimensionality of the features and 5-fold cross validation to test generalizability of our methods and minimize overfitting. We used several machine learning methods and calculated the kappa values for agreement between each machine learning approach to manual adjudication. We then performed a blinded testing of the best algorithm against a held-out subset of 50 cases.
RESULTS: Compared to manual classification, the best machine-based classification achieved a kappa of .25 using radiology reports alone, .57 using progress notes alone, and .57 using combined data. Kappa values varied by subtype being highest for cardioembolic (.64) and lowest for cryptogenic cases (.47). In the held-out test subset, machine-based classification agreed with rater classification in 40 of 50 cases (kappa .72).
CONCLUSIONS: Automated machine learning approaches using textual data from the EHR shows agreement with manual TOAST classification. The automated pipeline, if externally validated, could enable large-scale stroke epidemiology research.

Entities: Disease Species

Keywords: Ischemic stroke; cardioembolism; cryptogenic; machine learning; natural language processing

Mesh：

Year: 2019 PMID： 31103549 DOI： 10.1016/j.jstrokecerebrovasdis.2019.02.004

Source DB: PubMed Journal: J Stroke Cerebrovasc Dis ISSN： 1052-3057 Impact factor: 2.136

Keyword Cloud
Cited

22 in total

1. Rule-based natural language processing for automation of stroke data extraction: a validation study.

Authors: Dane Gunter; Paulo Puac-Polanco; Olivier Miguel; Rebecca E Thornhill; Amy Y X Yu; Zhongyu A Liu; Muhammad Mamdani; Chloe Pou-Prom; Richard I Aviv
Journal: Neuroradiology Date: 2022-08-01 Impact factor: 2.995

Review 2. Multimodal biomedical AI.

Authors: Julián N Acosta; Guido J Falcone; Pranav Rajpurkar; Eric J Topol
Journal: Nat Med Date: 2022-09-15 Impact factor: 87.241

Review 3. Artificial Intelligence for Large-Vessel Occlusion Stroke: A Systematic Review.

Authors: Nathan A Shlobin; Ammad A Baig; Muhammad Waqas; Tatsat R Patel; Rimal H Dossani; Megan Wilson; Justin M Cappuzzo; Adnan H Siddiqui; Vincent M Tutino; Elad I Levy
Journal: World Neurosurg Date: 2021-12-08 Impact factor: 2.210

4. MIMIC-SBDH: A Dataset for Social and Behavioral Determinants of Health.

Authors: Hiba Ahsan; Emmie Ohnuki; Avijit Mitra; Hong Yu
Journal: Proc Mach Learn Res Date: 2021-08

Review 5. Neurocritical Care: Bench to Bedside (Eds. Claude Hemphill, Michael James) Integrating and Using Big Data in Neurocritical Care.

Authors: Brandon Foreman
Journal: Neurotherapeutics Date: 2020-04 Impact factor: 7.620

6. Automated Electronic Phenotyping of Cardioembolic Stroke.

Authors: Wyliena Guan; Darae Ko; Shaan Khurshid; Ana T Trisini Lipsanopoulos; Jeffrey M Ashburner; Lia X Harrington; Natalia S Rost; Steven J Atlas; Daniel E Singer; David D McManus; Christopher D Anderson; Steven A Lubitz
Journal: Stroke Date: 2020-12-10 Impact factor: 7.914

7. Prediction of Clinical Outcome in Patients with Large-Vessel Acute Ischemic Stroke: Performance of Machine Learning versus SPAN-100.

Authors: B Jiang; G Zhu; Y Xie; J J Heit; H Chen; Y Li; V Ding; A Eskandari; P Michel; G Zaharchuk; M Wintermark
Journal: AJNR Am J Neuroradiol Date: 2021-01-07 Impact factor: 3.825