Literature DB >> 31865900

Recent advances in Swedish and Spanish medical entity recognition in clinical texts using deep neural approaches.

Rebecka Weegar1, Alicia Pérez2, Arantza Casillas2, Maite Oronoz2.   

Abstract

BACKGROUND: Text mining and natural language processing of clinical text, such as notes from electronic health records, requires specific consideration of the specialized characteristics of these texts. Deep learning methods could potentially mitigate domain specific challenges such as limited access to in-domain tools and data sets.
METHODS: A bi-directional Long Short-Term Memory network is applied to clinical notes in Spanish and Swedish for the task of medical named entity recognition. Several types of embeddings, both generated from in-domain and out-of-domain text corpora, and a number of generation and combination strategies for embeddings have been evaluated in order to investigate different input representations and the influence of domain on the final results.
RESULTS: For Spanish, a micro averaged F1-score of 75.25 was obtained and for Swedish, the corresponding score was 76.04. The best results for both languages were achieved using embeddings generated from in-domain corpora extracted from electronic health records, but embeddings generated from related domains were also found to be beneficial.
CONCLUSIONS: A recurrent neural network with in-domain embeddings improved the medical named entity recognition compared to shallow learning methods, showing this combination to be suitable for entity recognition in clinical text for both languages.

Entities:  

Keywords:  Clinical text mining; Medical named entity recognition; Recurrent neural network; Unstructured electronic health records

Mesh:

Year:  2019        PMID: 31865900      PMCID: PMC6927099          DOI: 10.1186/s12911-019-0981-y

Source DB:  PubMed          Journal:  BMC Med Inform Decis Mak        ISSN: 1472-6947            Impact factor:   2.796


  15 in total

1.  2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.

Authors:  Özlem Uzuner; Brett R South; Shuying Shen; Scott L DuVall
Journal:  J Am Med Inform Assoc       Date:  2011-06-16       Impact factor: 4.497

2.  Long short-term memory.

Authors:  S Hochreiter; J Schmidhuber
Journal:  Neural Comput       Date:  1997-11-15       Impact factor: 2.026

3.  On the creation of a clinical gold standard corpus in Spanish: Mining adverse drug reactions.

Authors:  Maite Oronoz; Koldo Gojenola; Alicia Pérez; Arantza Díaz de Ilarraza; Arantza Casillas
Journal:  J Biomed Inform       Date:  2015-06-30       Impact factor: 6.317

4.  Incorporating dictionaries into deep neural networks for the Chinese clinical named entity recognition.

Authors:  Qi Wang; Yangming Zhou; Tong Ruan; Daqi Gao; Yuhang Xia; Ping He
Journal:  J Biomed Inform       Date:  2019-02-25       Impact factor: 6.317

5.  Character-level neural network for biomedical named entity recognition.

Authors:  Mourad Gridach
Journal:  J Biomed Inform       Date:  2017-05-11       Impact factor: 6.317

6.  Semi-supervised medical entity recognition: A study on Spanish and Swedish clinical corpora.

Authors:  Alicia Pérez; Rebecka Weegar; Arantza Casillas; Koldo Gojenola; Maite Oronoz; Hercules Dalianis
Journal:  J Biomed Inform       Date:  2017-05-16       Impact factor: 6.317

7.  Corpus domain effects on distributional semantic modeling of medical terms.

Authors:  Serguei V S Pakhomov; Greg Finley; Reed McEwan; Yan Wang; Genevieve B Melton
Journal:  Bioinformatics       Date:  2016-08-16       Impact factor: 6.937

8.  Named Entity Recognition in Chinese Clinical Text Using Deep Neural Network.

Authors:  Yonghui Wu; Min Jiang; Jianbo Lei; Hua Xu
Journal:  Stud Health Technol Inform       Date:  2015

9.  Structured prediction models for RNN based sequence labeling in clinical text.

Authors:  Abhyuday N Jagannatha; Hong Yu
Journal:  Proc Conf Empir Methods Nat Lang Process       Date:  2016-11

10.  Clinical Named Entity Recognition From Chinese Electronic Health Records via Machine Learning Methods.

Authors:  Yu Zhang; Xuwen Wang; Zhen Hou; Jiao Li
Journal:  JMIR Med Inform       Date:  2018-12-17
View more
  1 in total

1.  Automated Coding of Under-Studied Medical Concept Domains: Linking Physical Activity Reports to the International Classification of Functioning, Disability, and Health.

Authors:  Denis Newman-Griffis; Eric Fosler-Lussier
Journal:  Front Digit Health       Date:  2021-03-10
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.