Literature DB >> 30940752

Using natural language processing to extract structured epilepsy data from unstructured clinic letters: development and validation of the ExECT (extraction of epilepsy clinical text) system.

Beata Fonferko-Shadrach1, Arron S Lacey1,2, Angus Roberts3, Ashley Akbari2, Simon Thompson2, David V Ford2, Ronan A Lyons2, Mark I Rees1,4, William Owen Pickrell1.   

Abstract

OBJECTIVE: Routinely collected healthcare data are a powerful research resource but often lack detailed disease-specific information that is collected in clinical free text, for example, clinic letters. We aim to use natural language processing techniques to extract detailed clinical information from epilepsy clinic letters to enrich routinely collected data.
DESIGN: We used the general architecture for text engineering (GATE) framework to build an information extraction system, ExECT (extraction of epilepsy clinical text), combining rule-based and statistical techniques. We extracted nine categories of epilepsy information in addition to clinic date and date of birth across 200 clinic letters. We compared the results of our algorithm with a manual review of the letters by an epilepsy clinician.
SETTING: De-identified and pseudonymised epilepsy clinic letters from a Health Board serving half a million residents in Wales, UK.
RESULTS: We identified 1925 items of information with overall precision, recall and F1 score of 91.4%, 81.4% and 86.1%, respectively. Precision and recall for epilepsy-specific categories were: epilepsy diagnosis (88.1%, 89.0%), epilepsy type (89.8%, 79.8%), focal seizures (96.2%, 69.7%), generalised seizures (88.8%, 52.3%), seizure frequency (86.3%-53.6%), medication (96.1%, 94.0%), CT (55.6%, 58.8%), MRI (82.4%, 68.8%) and electroencephalogram (81.5%, 75.3%).
CONCLUSIONS: We have built an automated clinical text extraction system that can accurately extract epilepsy information from free text in clinic letters. This can enhance routinely collected data for research in the UK. The information extracted with ExECT such as epilepsy type, seizure frequency and neurological investigations are often missing from routinely collected data. We propose that our algorithm can bridge this data gap enabling further epilepsy research opportunities. While many of the rules in our pipeline were tailored to extract epilepsy specific information, our methods can be applied to other diseases and also can be used in clinical practice to record patient information in a structured manner. © Author(s) (or their employer(s)) 2019. Re-use permitted under CC BY. Published by BMJ.

Entities:  

Keywords:  epilepsy; information extraction; natural language processing; validation

Mesh:

Year:  2019        PMID: 30940752      PMCID: PMC6500195          DOI: 10.1136/bmjopen-2018-023232

Source DB:  PubMed          Journal:  BMJ Open        ISSN: 2044-6055            Impact factor:   2.692


  20 in total

1.  Using UMLS Concept Unique Identifiers (CUIs) for word sense disambiguation in the biomedical domain.

Authors:  Bridget T McInnes; Ted Pedersen; John Carlis
Journal:  AMIA Annu Symp Proc       Date:  2007-10-11

2.  Automated extraction of family history information from clinical notes.

Authors:  Robert Bill; Serguei Pakhomov; Elizabeth S Chen; Tamara J Winden; Elizabeth W Carter; Genevieve B Melton
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

3.  Harnessing electronic medical records to advance research on multiple sclerosis.

Authors:  Vincent Damotte; Antoine Lizée; Matthew Tremblay; Alisha Agrawal; Pouya Khankhanian; Adam Santaniello; Refujia Gomez; Robin Lincoln; Wendy Tang; Tiffany Chen; Nelson Lee; Pablo Villoslada; Jill A Hollenbach; Carolyn D Bevan; Jennifer Graves; Riley Bove; Douglas S Goodin; Ari J Green; Sergio E Baranzini; Bruce Ac Cree; Roland G Henry; Stephen L Hauser; Jeffrey M Gelfand; Pierre-Antoine Gourraud
Journal:  Mult Scler       Date:  2018-01-09       Impact factor: 6.312

4.  Epilepsy, suicidality, and psychiatric disorders: a bidirectional association.

Authors:  Dale C Hesdorffer; Lianna Ishihara; Lakshmi Mynepalli; David J Webb; John Weil; W Allen Hauser
Journal:  Ann Neurol       Date:  2012-08-07       Impact factor: 10.422

5.  ConText: an algorithm for determining negation, experiencer, and temporal status from clinical reports.

Authors:  Henk Harkema; John N Dowling; Tyler Thornblade; Wendy W Chapman
Journal:  J Biomed Inform       Date:  2009-05-10       Impact factor: 6.317

6.  Natural language processing to extract symptoms of severe mental illness from clinical text: the Clinical Record Interactive Search Comprehensive Data Extraction (CRIS-CODE) project.

Authors:  Richard G Jackson; Rashmi Patel; Nishamali Jayatilleke; Anna Kolliakou; Michael Ball; Genevieve Gorrell; Angus Roberts; Richard J Dobson; Robert Stewart
Journal:  BMJ Open       Date:  2017-01-17       Impact factor: 2.692

7.  ADEPt, a semantically-enriched pipeline for extracting adverse drug events from free-text electronic health records.

Authors:  Ehtesham Iqbal; Robbie Mallah; Daniel Rhodes; Honghan Wu; Alvin Romero; Nynn Chang; Olubanke Dzahini; Chandra Pandey; Matthew Broadbent; Robert Stewart; Richard J B Dobson; Zina M Ibrahim
Journal:  PLoS One       Date:  2017-11-09       Impact factor: 3.240

8.  Early recognition of multiple sclerosis using natural language processing of the electronic health record.

Authors:  Herbert S Chase; Lindsey R Mitrani; Gabriel G Lu; Dominick J Fulgieri
Journal:  BMC Med Inform Decis Mak       Date:  2017-02-28       Impact factor: 2.796

9.  Getting more out of biomedical documents with GATE's full lifecycle open source text analytics.

Authors:  Hamish Cunningham; Valentin Tablan; Angus Roberts; Kalina Bontcheva
Journal:  PLoS Comput Biol       Date:  2013-02-07       Impact factor: 4.475

10.  Development of phenotype algorithms using electronic medical records and incorporating natural language processing.

Authors:  Katherine P Liao; Tianxi Cai; Guergana K Savova; Shawn N Murphy; Elizabeth W Karlson; Ashwin N Ananthakrishnan; Vivian S Gainer; Stanley Y Shaw; Zongqi Xia; Peter Szolovits; Susanne Churchill; Isaac Kohane
Journal:  BMJ       Date:  2015-04-24
View more
  9 in total

1.  Natural Language Processing Applications in the Clinical Neurosciences: A Machine Learning Augmented Systematic Review.

Authors:  Quinlan D Buchlak; Nazanin Esmaili; Christine Bennett; Farrokh Farrokhi
Journal:  Acta Neurochir Suppl       Date:  2022

2.  Development of a natural language processing algorithm to extract seizure types and frequencies from the electronic health record.

Authors:  Barbara M Decker; Alexandra Turco; Jian Xu; Samuel W Terman; Nikitha Kosaraju; Alisha Jamil; Kathryn A Davis; Brian Litt; Colin A Ellis; Pouya Khankhanian; Chloe E Hill
Journal:  Seizure       Date:  2022-07-20       Impact factor: 3.414

3.  Use of Natural Language Processing to Improve Identification of Patients With Peripheral Artery Disease.

Authors:  E Hope Weissler; Jikai Zhang; Steven Lippmann; Shelley Rusincovitch; Ricardo Henao; W Schuyler Jones
Journal:  Circ Cardiovasc Interv       Date:  2020-10-12       Impact factor: 6.546

Review 4.  Can antiepileptic efficacy and epilepsy variables be studied from electronic health records? A review of current approaches.

Authors:  Barbara M Decker; Chloé E Hill; Steven N Baldassano; Pouya Khankhanian
Journal:  Seizure       Date:  2021-01-13       Impact factor: 3.184

5.  Toward the Development of Data Governance Standards for Using Clinical Free-Text Data in Health Research: Position Paper.

Authors:  Kerina H Jones; Elizabeth M Ford; Nathan Lea; Lucy J Griffiths; Lamiece Hassan; Sharon Heys; Emma Squires; Goran Nenadic
Journal:  J Med Internet Res       Date:  2020-06-29       Impact factor: 5.428

Review 6.  The role of machine learning in clinical research: transforming the future of evidence generation.

Authors:  E Hope Weissler; Tristan Naumann; Tomas Andersson; Rajesh Ranganath; Olivier Elemento; Yuan Luo; Daniel F Freitag; James Benoit; Michael C Hughes; Faisal Khan; Paul Slater; Khader Shameer; Matthew Roe; Emmette Hutchison; Scott H Kollins; Uli Broedl; Zhaoling Meng; Jennifer L Wong; Lesley Curtis; Erich Huang; Marzyeh Ghassemi
Journal:  Trials       Date:  2021-08-16       Impact factor: 2.279

7.  The Potential of Research Drawing on Clinical Free Text to Bring Benefits to Patients in the United Kingdom: A Systematic Review of the Literature.

Authors:  Elizabeth Ford; Keegan Curlewis; Emma Squires; Lucy J Griffiths; Robert Stewart; Kerina H Jones
Journal:  Front Digit Health       Date:  2021-02-10

8.  Extracting seizure frequency from epilepsy clinic notes: a machine reading approach to natural language processing.

Authors:  Kevin Xie; Ryan S Gallagher; Erin C Conrad; Chadric O Garrick; Steven N Baldassano; John M Bernabei; Peter D Galer; Nina J Ghosn; Adam S Greenblatt; Tara Jennings; Alana Kornspun; Catherine V Kulick-Soper; Jal M Panchal; Akash R Pattnaik; Brittany H Scheid; Danmeng Wei; Micah Weitzman; Ramya Muthukrishnan; Joongwon Kim; Brian Litt; Colin A Ellis; Dan Roth
Journal:  J Am Med Inform Assoc       Date:  2022-04-13       Impact factor: 4.497

9.  Natural language processing in clinical neuroscience and psychiatry: A review.

Authors:  Claudio Crema; Giuseppe Attardi; Daniele Sartiano; Alberto Redolfi
Journal:  Front Psychiatry       Date:  2022-09-14       Impact factor: 5.435

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.