Literature DB >> 31103549

Automating Ischemic Stroke Subtype Classification Using Machine Learning and Natural Language Processing.

Ravi Garg1, Elissa Oh1, Andrew Naidech1, Konrad Kording2, Shyam Prabhakaran3.   

Abstract

OBJECTIVE: The manual adjudication of disease classification is time-consuming, error-prone, and limits scaling to large datasets. In ischemic stroke (IS), subtype classification is critical for management and outcome prediction. This study sought to use natural language processing of electronic health records (EHR) combined with machine learning methods to automate IS subtyping.
METHODS: Among IS patients from an observational registry with TOAST subtyping adjudicated by board-certified vascular neurologists, we analyzed unstructured text-based EHR data including neurology progress notes and neuroradiology reports using natural language processing. We performed several feature selection methods to reduce the high dimensionality of the features and 5-fold cross validation to test generalizability of our methods and minimize overfitting. We used several machine learning methods and calculated the kappa values for agreement between each machine learning approach to manual adjudication. We then performed a blinded testing of the best algorithm against a held-out subset of 50 cases.
RESULTS: Compared to manual classification, the best machine-based classification achieved a kappa of .25 using radiology reports alone, .57 using progress notes alone, and .57 using combined data. Kappa values varied by subtype being highest for cardioembolic (.64) and lowest for cryptogenic cases (.47). In the held-out test subset, machine-based classification agreed with rater classification in 40 of 50 cases (kappa .72).
CONCLUSIONS: Automated machine learning approaches using textual data from the EHR shows agreement with manual TOAST classification. The automated pipeline, if externally validated, could enable large-scale stroke epidemiology research.
Copyright © 2019. Published by Elsevier Inc.

Entities:  

Keywords:  Ischemic stroke; cardioembolism; cryptogenic; machine learning; natural language processing

Mesh:

Year:  2019        PMID: 31103549     DOI: 10.1016/j.jstrokecerebrovasdis.2019.02.004

Source DB:  PubMed          Journal:  J Stroke Cerebrovasc Dis        ISSN: 1052-3057            Impact factor:   2.136


  22 in total

1.  Rule-based natural language processing for automation of stroke data extraction: a validation study.

Authors:  Dane Gunter; Paulo Puac-Polanco; Olivier Miguel; Rebecca E Thornhill; Amy Y X Yu; Zhongyu A Liu; Muhammad Mamdani; Chloe Pou-Prom; Richard I Aviv
Journal:  Neuroradiology       Date:  2022-08-01       Impact factor: 2.995

Review 2.  Multimodal biomedical AI.

Authors:  Julián N Acosta; Guido J Falcone; Pranav Rajpurkar; Eric J Topol
Journal:  Nat Med       Date:  2022-09-15       Impact factor: 87.241

Review 3.  Artificial Intelligence for Large-Vessel Occlusion Stroke: A Systematic Review.

Authors:  Nathan A Shlobin; Ammad A Baig; Muhammad Waqas; Tatsat R Patel; Rimal H Dossani; Megan Wilson; Justin M Cappuzzo; Adnan H Siddiqui; Vincent M Tutino; Elad I Levy
Journal:  World Neurosurg       Date:  2021-12-08       Impact factor: 2.210

4.  MIMIC-SBDH: A Dataset for Social and Behavioral Determinants of Health.

Authors:  Hiba Ahsan; Emmie Ohnuki; Avijit Mitra; Hong Yu
Journal:  Proc Mach Learn Res       Date:  2021-08

Review 5.  Neurocritical Care: Bench to Bedside (Eds. Claude Hemphill, Michael James) Integrating and Using Big Data in Neurocritical Care.

Authors:  Brandon Foreman
Journal:  Neurotherapeutics       Date:  2020-04       Impact factor: 7.620

6.  Automated Electronic Phenotyping of Cardioembolic Stroke.

Authors:  Wyliena Guan; Darae Ko; Shaan Khurshid; Ana T Trisini Lipsanopoulos; Jeffrey M Ashburner; Lia X Harrington; Natalia S Rost; Steven J Atlas; Daniel E Singer; David D McManus; Christopher D Anderson; Steven A Lubitz
Journal:  Stroke       Date:  2020-12-10       Impact factor: 7.914

7.  Prediction of Clinical Outcome in Patients with Large-Vessel Acute Ischemic Stroke: Performance of Machine Learning versus SPAN-100.

Authors:  B Jiang; G Zhu; Y Xie; J J Heit; H Chen; Y Li; V Ding; A Eskandari; P Michel; G Zaharchuk; M Wintermark
Journal:  AJNR Am J Neuroradiol       Date:  2021-01-07       Impact factor: 3.825

8.  Analysis of Stroke Detection during the COVID-19 Pandemic Using Natural Language Processing of Radiology Reports.

Authors:  M D Li; M Lang; F Deng; K Chang; K Buch; S Rincon; W A Mehan; T M Leslie-Mazwi; J Kalpathy-Cramer
Journal:  AJNR Am J Neuroradiol       Date:  2020-12-17       Impact factor: 3.825

9.  Natural Language Processing Enhances Prediction of Functional Outcome After Acute Ischemic Stroke.

Authors:  Sheng-Feng Sung; Chih-Hao Chen; Ru-Chiou Pan; Ya-Han Hu; Jiann-Shing Jeng
Journal:  J Am Heart Assoc       Date:  2021-11-19       Impact factor: 6.106

10.  Developing automated methods for disease subtyping in UK Biobank: an exemplar study on stroke.

Authors:  Kristiina Rannikmäe; Honghan Wu; Steven Tominey; William Whiteley; Naomi Allen; Cathie Sudlow
Journal:  BMC Med Inform Decis Mak       Date:  2021-06-15       Impact factor: 2.796

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.