Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Rule-based natural language processing for automation of stroke data extraction: a validation study.

Literature DB >> 35913525

Rule-based natural language processing for automation of stroke data extraction: a validation study.

Dane Gunter¹, Paulo Puac-Polanco², Olivier Miguel², Rebecca E Thornhill³, Amy Y X Yu⁴, Zhongyu A Liu⁴, Muhammad Mamdani⁵, Chloe Pou-Prom⁶, Richard I Aviv^7,8.

Abstract

PURPOSE: Data extraction from radiology free-text reports is time consuming when performed manually. Recently, more automated extraction methods using natural language processing (NLP) are proposed. A previously developed rule-based NLP algorithm showed promise in its ability to extract stroke-related data from radiology reports. We aimed to externally validate the accuracy of CHARTextract, a rule-based NLP algorithm, to extract stroke-related data from free-text radiology reports.
METHODS: Free-text reports of CT angiography (CTA) and perfusion (CTP) studies of consecutive patients with acute ischemic stroke admitted to a regional stroke center for endovascular thrombectomy were analyzed from January 2015 to 2021. Stroke-related variables were manually extracted as reference standard from clinical reports, including proximal and distal anterior circulation occlusion, posterior circulation occlusion, presence of ischemia or hemorrhage, Alberta stroke program early CT score (ASPECTS), and collateral status. These variables were simultaneously extracted using a rule-based NLP algorithm. The NLP algorithm's accuracy, specificity, sensitivity, positive predictive value (PPV), and negative predictive value (NPV) were assessed.
RESULTS: The NLP algorithm's accuracy was > 90% for identifying distal anterior occlusion, posterior circulation occlusion, hemorrhage, and ASPECTS. Accuracy was 85%, 74%, and 79% for proximal anterior circulation occlusion, presence of ischemia, and collateral status respectively. The algorithm confirmed the absence of variables from radiology reports with an 87-100% accuracy.
CONCLUSIONS: Rule-based NLP has a moderate to good performance for stroke-related data extraction from free-text imaging reports. The algorithm's accuracy was affected by inconsistent report styles and lexicon among reporting radiologists.

Entities: Chemical

Keywords: Data extraction; Natural language processing; Rule-based; Stroke; Stroke surveillance

Year: 2022 PMID： 35913525 DOI： 10.1007/s00234-022-03029-1

Source DB: PubMed Journal: Neuroradiology ISSN： 0028-3940 Impact factor: 2.995

Keyword Cloud
References

16 in total

1. Coding neuroradiology reports for the Northern Manhattan Stroke Study: a comparison of natural language processing and manual review.

Authors: J S Elkins; C Friedman; B Boden-Albala; R L Sacco; G Hripcsak
Journal: Comput Biomed Res Date: 2000-02

2. Effect of definition and methods on estimates of prevalence of large vessel occlusion in acute ischemic stroke: a systematic review and meta-analysis.

Authors: Muhammad Waqas; Ansaar T Rai; Kunal Vakharia; Felix Chin; Adnan H Siddiqui
Journal: J Neurointerv Surg Date: 2019-08-23 Impact factor: 5.836

Review 3. Natural Language Processing in Radiology: A Systematic Review.

Authors: Ewoud Pons; Loes M M Braun; M G Myriam Hunink; Jan A Kors
Journal: Radiology Date: 2016-05 Impact factor: 11.105

Review 4. Use and Utility of Administrative Health Data for Stroke Research and Surveillance.

Authors: Amy Y X Yu; Jessalyn K Holodinsky; Charlotte Zerna; Lawrence W Svenson; Nathalie Jetté; Hude Quan; Michael D Hill
Journal: Stroke Date: 2016-05-12 Impact factor: 7.914

5. Automating Ischemic Stroke Subtype Classification Using Machine Learning and Natural Language Processing.

Authors: Ravi Garg; Elissa Oh; Andrew Naidech; Konrad Kording; Shyam Prabhakaran
Journal: J Stroke Cerebrovasc Dis Date: 2019-05-15 Impact factor: 2.136

6. Analysis of Stroke Detection during the COVID-19 Pandemic Using Natural Language Processing of Radiology Reports.

Authors: M D Li; M Lang; F Deng; K Chang; K Buch; S Rincon; W A Mehan; T M Leslie-Mazwi; J Kalpathy-Cramer
Journal: AJNR Am J Neuroradiol Date: 2020-12-17 Impact factor: 3.825

7. Machine learning and natural language processing methods to identify ischemic stroke, acuity and location from radiology reports.

Authors: Charlene Jennifer Ong; Agni Orfanoudaki; Rebecca Zhang; Francois Pierre M Caprasse; Meghan Hutch; Liang Ma; Darian Fard; Oluwafemi Balogun; Matthew I Miller; Margaret Minnig; Hanife Saglam; Brenton Prescott; David M Greer; Stelios Smirnakis; Dimitris Bertsimas
Journal: PLoS One Date: 2020-06-19 Impact factor: 3.240

8. A systematic review of natural language processing applied to radiology reports.

Authors: Arlene Casey; Emma Davidson; Michael Poon; Hang Dong; Daniel Duma; Andreas Grivas; Claire Grover; Víctor Suárez-Paniagua; Richard Tobin; William Whiteley; Honghan Wu; Beatrice Alex
Journal: BMC Med Inform Decis Mak Date: 2021-06-03 Impact factor: 2.796

9. Automating Stroke Data Extraction From Free-Text Radiology Reports Using Natural Language Processing: Instrument Validation Study.

Authors: Amy Y X Yu; Zhongyu A Liu; Chloe Pou-Prom; Kaitlyn Lopes; Moira K Kapral; Richard I Aviv; Muhammad Mamdani
Journal: JMIR Med Inform Date: 2021-05-04