Literature DB >> 29146561

Recurrent neural networks with specialized word embeddings for health-domain named-entity recognition.

Iñigo Jauregi Unanue1, Ehsan Zare Borzeshi2, Massimo Piccardi3.   

Abstract

BACKGROUND: Previous state-of-the-art systems on Drug Name Recognition (DNR) and Clinical Concept Extraction (CCE) have focused on a combination of text "feature engineering" and conventional machine learning algorithms such as conditional random fields and support vector machines. However, developing good features is inherently heavily time-consuming. Conversely, more modern machine learning approaches such as recurrent neural networks (RNNs) have proved capable of automatically learning effective features from either random assignments or automated word "embeddings".
OBJECTIVES: (i) To create a highly accurate DNR and CCE system that avoids conventional, time-consuming feature engineering. (ii) To create richer, more specialized word embeddings by using health domain datasets such as MIMIC-III. (iii) To evaluate our systems over three contemporary datasets.
METHODS: Two deep learning methods, namely the Bidirectional LSTM and the Bidirectional LSTM-CRF, are evaluated. A CRF model is set as the baseline to compare the deep learning systems to a traditional machine learning approach. The same features are used for all the models.
RESULTS: We have obtained the best results with the Bidirectional LSTM-CRF model, which has outperformed all previously proposed systems. The specialized embeddings have helped to cover unusual words in DrugBank and MedLine, but not in the i2b2/VA dataset.
CONCLUSIONS: We present a state-of-the-art system for DNR and CCE. Automated word embeddings has allowed us to avoid costly feature engineering and achieve higher accuracy. Nevertheless, the embeddings need to be retrained over datasets that are adequate for the domain, in order to adequately cover the domain-specific vocabulary.
Copyright © 2017 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Artificial intelligence [MeSH]; Clinical concept extraction; Drug name recognition; Machine learning [MeSH]; Neural networks (computer) [MeSH]

Mesh:

Year:  2017        PMID: 29146561     DOI: 10.1016/j.jbi.2017.11.007

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  12 in total

1.  Enhancing clinical concept extraction with contextual embeddings.

Authors:  Yuqi Si; Jingqi Wang; Hua Xu; Kirk Roberts
Journal:  J Am Med Inform Assoc       Date:  2019-11-01       Impact factor: 4.497

2.  Combine Factual Medical Knowledge and Distributed Word Representation to Improve Clinical Named Entity Recognition.

Authors:  Yonghui Wu; Xi Yang; Jiang Bian; Yi Guo; Hua Xu; William Hogan
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

3.  Comparing Different Methods for Named Entity Recognition in Portuguese Neurology Text.

Authors:  Fábio Lopes; César Teixeira; Hugo Gonçalo Oliveira
Journal:  J Med Syst       Date:  2020-02-28       Impact factor: 4.460

Review 4.  Clinical concept extraction: A methodology review.

Authors:  Sunyang Fu; David Chen; Huan He; Sijia Liu; Sungrim Moon; Kevin J Peterson; Feichen Shen; Liwei Wang; Yanshan Wang; Andrew Wen; Yiqing Zhao; Sunghwan Sohn; Hongfang Liu
Journal:  J Biomed Inform       Date:  2020-08-06       Impact factor: 6.317

5.  Are synthetic clinical notes useful for real natural language processing tasks: A case study on clinical entity recognition.

Authors:  Jianfu Li; Yujia Zhou; Xiaoqian Jiang; Karthik Natarajan; Serguei Vs Pakhomov; Hongfang Liu; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2021-09-18       Impact factor: 7.942

6.  Precursor-induced conditional random fields: connecting separate entities by induction for improved clinical named entity recognition.

Authors:  Wangjin Lee; Jinwook Choi
Journal:  BMC Med Inform Decis Mak       Date:  2019-07-15       Impact factor: 2.796

Review 7.  Medical Information Extraction in the Age of Deep Learning.

Authors:  Udo Hahn; Michel Oleynik
Journal:  Yearb Med Inform       Date:  2020-08-21

8.  Comparing general and specialized word embeddings for biomedical named entity recognition.

Authors:  Rigo E Ramos-Vargas; Israel Román-Godínez; Sulema Torres-Ramos
Journal:  PeerJ Comput Sci       Date:  2021-02-18

9.  Concept recognition as a machine translation problem.

Authors:  Mayla R Boguslav; Negacy D Hailu; Michael Bada; William A Baumgartner; Lawrence E Hunter
Journal:  BMC Bioinformatics       Date:  2021-12-17       Impact factor: 3.169

10.  Named Entity Recognition of Medical Text Based on the Deep Neural Network.

Authors:  Tianjiao Yang; Ying He; Ning Yang
Journal:  J Healthc Eng       Date:  2022-03-07       Impact factor: 2.682

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.