Literature DB >> 35430042

Chinese clinical named entity recognition via multi-head self-attention based BiLSTM-CRF.

Ying An1, Xianyun Xia2, Xianlai Chen3, Fang-Xiang Wu4, Jianxin Wang5.   

Abstract

Clinical named entity recognition (CNER) is a fundamental step for many clinical Natural Language Processing (NLP) systems, which aims to recognize and classify clinical entities such as diseases, symptoms, exams, body parts and treatments in clinical free texts. In recent years, with the development of deep learning technology, deep neural networks (DNNs) have been widely used in Chinese clinical named entity recognition and many other clinical NLP tasks. However, these state-of-the-art models failed to make full use of the global information and multi-level semantic features in clinical texts. We design an improved character-level representation approach which integrates the character embedding and the character-label embedding to enhance the specificity and diversity of feature representations. Then, a multi-head self-attention based Bi-directional Long Short-Term Memory Conditional Random Field (MUSA-BiLSTM-CRF) model is proposed. By introducing the multi-head self-attention and combining a medical dictionary, the model can more effectively capture the weight relationships between characters and multi-level semantic feature information, which is expected to greatly improve the performance of Chinese clinical named entity recognition. We evaluate our model on two CCKS challenge (CCKS2017 Task 2 and CCKS2018 Task 1) benchmark datasets and the experimental results show that our proposed model achieves the best performance competing with the state-of-the-art DNN based methods.
Copyright © 2022 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Bidirection long-short term memory; Chinese clinical named entity recognition; Conditional random field; Multi-head attention; Self-attention mechanism

Mesh:

Year:  2022        PMID: 35430042     DOI: 10.1016/j.artmed.2022.102282

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  2 in total

1.  A conditional random field based feature learning framework for battery capacity prediction.

Authors:  Hai-Kun Wang; Yang Zhang; Mohong Huang
Journal:  Sci Rep       Date:  2022-08-02       Impact factor: 4.996

2.  A Multigranularity Text Driven Named Entity Recognition CGAN Model for Traditional Chinese Medicine Literatures.

Authors:  Yuekun Ma; Yun Liu; Dezheng Zhang; Jiye Zhang; He Liu; Yonghong Xie
Journal:  Comput Intell Neurosci       Date:  2022-09-24
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.