Leveraging Multi-source knowledge for Chinese clinical named entity recognition via relational graph convolutional network.

Ying Xiong, Hao Peng, Yang Xiang, Ka-Chun Wong, Qingcai Chen, Jun Yan, Buzhou Tang.

Abstract

OBJECTIVE: External knowledge, such as a lexicon of Chinese words or a domain knowledge graph (KG) of concepts, has recently been adopted to improve the performance of machine learning methods for named entity recognition (NER), as it provides information beyond the context. However, most existing studies draw on only one knowledge source (either a lexicon or a knowledge graph), each in its own way, and treat lexicon words or KG concepts separately from their boundaries. In this paper, we focus on leveraging multi-source knowledge in a unified manner, in which lexicon words and KG concepts are combined with their boundaries, for Chinese clinical NER (CNER).
MATERIAL AND METHODS: We propose a novel method based on a relational graph convolutional network (RGCN), called MKRGCN, that utilizes multi-source knowledge in a unified manner for CNER. For each sentence, a relational graph is constructed for each knowledge source: lexicon words or KG concepts that appear in the sentence are linked to the tokens they contain, together with the boundary information of those words or concepts. An RGCN models all relational graphs constructed from the multi-source knowledge, and the resulting knowledge-based token representations are integrated into the contextual token representations via an attention mechanism. On top of the knowledge-enhanced token representations, we deploy a conditional random field (CRF) layer to predict named entity labels. In this study, a lexicon of words and a medical knowledge graph serve as the knowledge sources for CNER.
RESULTS: Our proposed method achieves the best performance on the Chinese datasets CCKS2017 and CCKS2018, with F1-scores of 91.88% and 89.91%, respectively, significantly outperforming existing methods. Extended experiments on the English datasets NCBI-Disease and BC2GM also demonstrate the effectiveness of our method when only one knowledge source is considered via RGCN.
CONCLUSION: The MKRGCN model effectively integrates knowledge from an external lexicon and a knowledge graph for CNER and has the potential to be applied to English NER.
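
The graph construction described under MATERIAL AND METHODS can be pictured with a small sketch. This is not the authors' implementation: the function name `build_relational_edges`, the `Edge` type, the relation labels ("begin", "inside", "end", "single"), the `max_len` cutoff, and the toy lexicon and KG entries are all assumptions made for illustration, since the abstract does not specify how boundary information is encoded. The sketch only shows how entries from one knowledge source might be matched in a sentence and linked to their tokens with a boundary-typed relation, producing one edge list per source.

```python
from typing import List, NamedTuple, Set


class Edge(NamedTuple):
    source: str       # matched lexicon word or KG concept (hypothetical node id)
    token_index: int  # index of the token the entry is linked to
    relation: str     # assumed boundary label: "begin", "inside", "end", "single"


def build_relational_edges(
    tokens: List[str], entries: Set[str], max_len: int = 4
) -> List[Edge]:
    """Link every entry found in the sentence to the tokens it covers.

    The relation on each edge records where the token sits inside the
    matched entry, which is one possible way to keep boundary information.
    """
    edges: List[Edge] = []
    n = len(tokens)
    for start in range(n):
        # Try every span of up to max_len tokens beginning at `start`.
        for end in range(start + 1, min(n, start + max_len) + 1):
            span = "".join(tokens[start:end])
            if span not in entries:
                continue
            for i in range(start, end):
                if end - start == 1:
                    relation = "single"
                elif i == start:
                    relation = "begin"
                elif i == end - 1:
                    relation = "end"
                else:
                    relation = "inside"
                edges.append(Edge(span, i, relation))
    return edges


# One edge list per knowledge source, built over the same character tokens.
tokens = list("左肺炎症")
lexicon_edges = build_relational_edges(tokens, {"左肺", "肺炎", "炎症"})
kg_edges = build_relational_edges(tokens, {"肺炎"})
```

In a full model, each edge list would define the typed adjacency consumed by a relational graph convolution layer, with a separate graph per knowledge source before the attention-based fusion step.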

Entities: Chemical

Keywords: Clinical named entity recognition; Graph neural network; Multi-source knowledge

Year: 2022; PMID: 35217186; DOI: 10.1016/j.jbi.2022.104035

Source DB: PubMed; Journal: J Biomed Inform; ISSN: 1532-0464; Impact factor: 8.000

Cited

1 in total

1. A Multigranularity Text Driven Named Entity Recognition CGAN Model for Traditional Chinese Medicine Literatures.

Authors: Yuekun Ma; Yun Liu; Dezheng Zhang; Jiye Zhang; He Liu; Yonghong Xie
Journal: Comput Intell Neurosci Date: 2022-09-24
