Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Recurrent neural networks with specialized word embeddings for health-domain named-entity recognition.

Literature DB >> 29146561

Recurrent neural networks with specialized word embeddings for health-domain named-entity recognition.

Iñigo Jauregi Unanue¹, Ehsan Zare Borzeshi², Massimo Piccardi³.

Abstract

BACKGROUND: Previous state-of-the-art systems on Drug Name Recognition (DNR) and Clinical Concept Extraction (CCE) have focused on a combination of text "feature engineering" and conventional machine learning algorithms such as conditional random fields and support vector machines. However, developing good features is inherently heavily time-consuming. Conversely, more modern machine learning approaches such as recurrent neural networks (RNNs) have proved capable of automatically learning effective features from either random assignments or automated word "embeddings".
OBJECTIVES: (i) To create a highly accurate DNR and CCE system that avoids conventional, time-consuming feature engineering. (ii) To create richer, more specialized word embeddings by using health domain datasets such as MIMIC-III. (iii) To evaluate our systems over three contemporary datasets.
METHODS: Two deep learning methods, namely the Bidirectional LSTM and the Bidirectional LSTM-CRF, are evaluated. A CRF model is set as the baseline to compare the deep learning systems to a traditional machine learning approach. The same features are used for all the models.
RESULTS: We have obtained the best results with the Bidirectional LSTM-CRF model, which has outperformed all previously proposed systems. The specialized embeddings have helped to cover unusual words in DrugBank and MedLine, but not in the i2b2/VA dataset.
CONCLUSIONS: We present a state-of-the-art system for DNR and CCE. Automated word embeddings has allowed us to avoid costly feature engineering and achieve higher accuracy. Nevertheless, the embeddings need to be retrained over datasets that are adequate for the domain, in order to adequately cover the domain-specific vocabulary.

Entities: Disease

Keywords: Artificial intelligence [MeSH]; Clinical concept extraction; Drug name recognition; Machine learning [MeSH]; Neural networks (computer) [MeSH]

Mesh：

Year: 2017 PMID： 29146561 DOI： 10.1016/j.jbi.2017.11.007

Source DB: PubMed Journal: J Biomed Inform ISSN： 1532-0464 Impact factor: 6.317

Keyword Cloud
Cited

12 in total

1. Enhancing clinical concept extraction with contextual embeddings.

Authors: Yuqi Si; Jingqi Wang; Hua Xu; Kirk Roberts
Journal: J Am Med Inform Assoc Date: 2019-11-01 Impact factor: 4.497

2. Combine Factual Medical Knowledge and Distributed Word Representation to Improve Clinical Named Entity Recognition.

Authors: Yonghui Wu; Xi Yang; Jiang Bian; Yi Guo; Hua Xu; William Hogan
Journal: AMIA Annu Symp Proc Date: 2018-12-05

3. Comparing Different Methods for Named Entity Recognition in Portuguese Neurology Text.

Authors: Fábio Lopes; César Teixeira; Hugo Gonçalo Oliveira
Journal: J Med Syst Date: 2020-02-28 Impact factor: 4.460

Review 4. Clinical concept extraction: A methodology review.

Authors: Sunyang Fu; David Chen; Huan He; Sijia Liu; Sungrim Moon; Kevin J Peterson; Feichen Shen; Liwei Wang; Yanshan Wang; Andrew Wen; Yiqing Zhao; Sunghwan Sohn; Hongfang Liu
Journal: J Biomed Inform Date: 2020-08-06 Impact factor: 6.317

5. Are synthetic clinical notes useful for real natural language processing tasks: A case study on clinical entity recognition.

Authors: Jianfu Li; Yujia Zhou; Xiaoqian Jiang; Karthik Natarajan; Serguei Vs Pakhomov; Hongfang Liu; Hua Xu
Journal: J Am Med Inform Assoc Date: 2021-09-18 Impact factor: 7.942

6. Precursor-induced conditional random fields: connecting separate entities by induction for improved clinical named entity recognition.

Authors: Wangjin Lee; Jinwook Choi
Journal: BMC Med Inform Decis Mak Date: 2019-07-15 Impact factor: 2.796

Recurrent neural networks with specialized word embeddings for health-domain named-entity recognition.

1. Enhancing clinical concept extraction with contextual embeddings.

2. Combine Factual Medical Knowledge and Distributed Word Representation to Improve Clinical Named Entity Recognition.

3. Comparing Different Methods for Named Entity Recognition in Portuguese Neurology Text.

Review 4. Clinical concept extraction: A methodology review.

5. Are synthetic clinical notes useful for real natural language processing tasks: A case study on clinical entity recognition.

6. Precursor-induced conditional random fields: connecting separate entities by induction for improved clinical named entity recognition.

Review 7. Medical Information Extraction in the Age of Deep Learning.

8. Comparing general and specialized word embeddings for biomedical named entity recognition.

9. Concept recognition as a machine translation problem.

10. Named Entity Recognition of Medical Text Based on the Deep Neural Network.