Literature DB >> 31719024

Combining Contextualized Embeddings and Prior Knowledge for Clinical Named Entity Recognition: Evaluation Study.

Min Jiang1, Todd Sanger1, Xiong Liu1.   

Abstract

BACKGROUND: Named entity recognition (NER) is a key step in clinical natural language processing (NLP). Traditionally, rule-based systems leverage prior knowledge to define rules to identify named entities. Recently, deep learning-based NER systems have become more and more popular. Contextualized word embedding, as a new type of representation of the word, has been proposed to dynamically capture word sense using context information and has proven successful in many deep learning-based systems in either general domain or medical domain. However, there are very few studies that investigate the effects of combining multiple contextualized embeddings and prior knowledge on the clinical NER task.
OBJECTIVE: This study aims to improve the performance of NER in clinical text by combining multiple contextual embeddings and prior knowledge.
METHODS: In this study, we investigate the effects of combining multiple contextualized word embeddings with classic word embedding in deep neural networks to predict named entities in clinical text. We also investigate whether using a semantic lexicon could further improve the performance of the clinical NER system.
RESULTS: By combining contextualized embeddings such as ELMo and Flair, our system achieves the F-1 score of 87.30% when only training based on a portion of the 2010 Informatics for Integrating Biology and the Bedside NER task dataset. After incorporating the medical lexicon into the word embedding, the F-1 score was further increased to 87.44%. Another finding was that our system still could achieve an F-1 score of 85.36% when the size of the training data was reduced to 40%.
CONCLUSIONS: Combined contextualized embedding could be beneficial for the clinical NER task. Moreover, the semantic lexicon could be used to further improve the performance of the clinical NER system. ©Min Jiang, Todd Sanger, Xiong Liu. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 13.11.2019.

Entities:  

Keywords:  contextualized word embedding; deep learning; named entity recognition; natural language processing; prior knowledge; semantic embedding

Year:  2019        PMID: 31719024     DOI: 10.2196/14850

Source DB:  PubMed          Journal:  JMIR Med Inform


  8 in total

1.  TAX-Corpus: Taxonomy based Annotations for Colonoscopy Evaluation.

Authors:  Shorabuddin Syed; Adam Jackson Angel; Hafsa Bareen Syeda; Carole Franc Jennings; Joseph VanScoy; Mahanazuddin Syed; Melody Greer; Sudeepa Bhattacharyya; Shaymaa Al-Shukri; Meredith Zozus; Fred Prior; Benjamin Tharian
Journal:  Biomed Eng Syst Technol Int Jt Conf BIOSTEC Revis Sel Pap       Date:  2022-02

2.  DeIDNER Model: A Neural Network Named Entity Recognition Model for Use in the De-identification of Clinical Notes.

Authors:  Mahanazuddin Syed; Kevin Sexton; Melody Greer; Shorabuddin Syed; Joseph VanScoy; Farhan Kawsar; Erica Olson; Karan Patel; Jake Erwin; Sudeepa Bhattacharyya; Meredith Zozus; Fred Prior
Journal:  Biomed Eng Syst Technol Int Jt Conf BIOSTEC Revis Sel Pap       Date:  2022-02

3.  The h-ANN Model: Comprehensive Colonoscopy Concept Compilation Using Combined Contextual Embeddings.

Authors:  Shorabuddin Syed; Adam Jackson Angel; Hafsa Bareen Syeda; Carole France Jennings; Joseph VanScoy; Mahanazuddin Syed; Melody Greer; Sudeepa Bhattacharyya; Meredith Zozus; Benjamin Tharian; Fred Prior
Journal:  Biomed Eng Syst Technol Int Jt Conf BIOSTEC Revis Sel Pap       Date:  2022-02

4.  Health Communication through Positive and Solidarity Messages Amid the COVID-19 Pandemic: Automated Content Analysis of Facebook Uses.

Authors:  Angela Chang; Xuechang Xian; Matthew Tingchi Liu; Xinshu Zhao
Journal:  Int J Environ Res Public Health       Date:  2022-05-19       Impact factor: 4.614

5.  Multi-Level Representation Learning for Chinese Medical Entity Recognition: Model Development and Validation.

Authors:  Zhichang Zhang; Lin Zhu; Peilin Yu
Journal:  JMIR Med Inform       Date:  2020-05-04

6.  Hybrid Deep Learning for Medication-Related Information Extraction From Clinical Texts in French: MedExt Algorithm Development Study.

Authors:  Jordan Jouffroy; Sarah F Feldman; Ivan Lerner; Bastien Rance; Anita Burgun; Antoine Neuraz
Journal:  JMIR Med Inform       Date:  2021-03-16

7.  Comparing general and specialized word embeddings for biomedical named entity recognition.

Authors:  Rigo E Ramos-Vargas; Israel Román-Godínez; Sulema Torres-Ramos
Journal:  PeerJ Comput Sci       Date:  2021-02-18

8.  Extracting clinical named entity for pituitary adenomas from Chinese electronic medical records.

Authors:  An Fang; Jiahui Hu; Wanqing Zhao; Ming Feng; Ji Fu; Shanshan Feng; Pei Lou; Huiling Ren; Xianlai Chen
Journal:  BMC Med Inform Decis Mak       Date:  2022-03-23       Impact factor: 2.796

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.