
Comparing Different Methods for Named Entity Recognition in Portuguese Neurology Text.

Fábio Lopes, César Teixeira, Hugo Gonçalo Oliveira.

Abstract

Electronic Medical Records (EMRs) are written in an unstructured way, often in natural language. Information Extraction (IE) can be used to acquire knowledge from such texts, including the automatic recognition of meaningful entities through Named Entity Recognition (NER) models. However, most previous work on this task targeted English. This study tested different methods on Portuguese text, specifically in the domain of Neurology, and drew conclusions from the results. It compared Conditional Random Fields (CRF), bidirectional Long Short-Term Memory with Conditional Random Fields (BiLSTM-CRF), and a BiLSTM-CRF with residual learning connections, using not only Portuguese texts from medical journals but also texts from the Neurology Service of the Coimbra Hospital and University Centre (CHUC). Furthermore, the performance of BiLSTM-CRF models using word embeddings (WEs) trained on clinical text was compared with that of models using WEs trained on general language text. Deep learning models achieved F1-scores of nearly 83% and 75% for relaxed and strict evaluation, respectively, on texts extracted from the medical journal. On texts collected from the Hospital, the same models achieved F1-scores of nearly 71% and 62%. This work concludes that deep learning models outperform shallow learning models and that in-domain WEs yield better results than general language WEs, even when the latter are trained on much more text than the former. Furthermore, the results show that it is possible to extract information from Hospital clinical texts with models trained on clinical cases extracted from medical journals, which are openly available. Nevertheless, such results still require a healthcare technician to check whether the information was correctly extracted.
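
The abstract reports F1-scores under both relaxed and strict evaluation but does not define the two modes. A minimal sketch follows, assuming a common convention: strict matching requires identical entity boundaries and label, while relaxed matching accepts any overlap with the same label. The span representation, the function names, and the labels `Condition` and `Therapeutic` are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple

# Hypothetical span format: (start token, end token exclusive, label).
Span = Tuple[int, int, str]


def overlaps(gold: Span, pred: Span) -> bool:
    """Relaxed match (assumed): same label and at least one shared token."""
    return gold[2] == pred[2] and gold[0] < pred[1] and pred[0] < gold[1]


def exact(gold: Span, pred: Span) -> bool:
    """Strict match (assumed): identical boundaries and label."""
    return gold == pred


def f1_score(gold: List[Span], pred: List[Span], strict: bool) -> float:
    """Entity-level F1 under strict or relaxed matching."""
    match = exact if strict else overlaps
    # Predicted entities that match some gold entity (for precision).
    hit_pred = sum(any(match(g, p) for g in gold) for p in pred)
    # Gold entities that are matched by some prediction (for recall).
    hit_gold = sum(any(match(g, p) for p in pred) for g in gold)
    precision = hit_pred / len(pred) if pred else 0.0
    recall = hit_gold / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Toy data: the first prediction covers only part of the gold entity.
gold = [(0, 2, "Condition"), (5, 6, "Therapeutic")]
pred = [(0, 1, "Condition"), (5, 6, "Therapeutic")]

strict_f1 = f1_score(gold, pred, strict=True)    # partial span is rejected
relaxed_f1 = f1_score(gold, pred, strict=False)  # partial span is accepted
```

On this toy data the partially overlapping prediction counts only under relaxed matching, which illustrates why relaxed scores are at least as high as strict ones on the same predictions.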

Keywords:  Machine learning; Named entity recognition; Natural language processing; Portuguese clinical text


Year:  2020        PMID: 32112285     DOI: 10.1007/s10916-020-1542-8

Source DB:  PubMed          Journal:  J Med Syst        ISSN: 0148-5598            Impact factor:   4.460


