Literature DB >> 28254090

Detecting negation and scope in Chinese clinical notes using character and word embedding.

Tian Kang1, Shaodian Zhang1, Nanfang Xu2, Dong Wen3, Xingting Zhang3, Jianbo Lei4.   

Abstract

BACKGROUND AND OBJECTIVES: Researchers have developed effective methods to index free-text clinical notes into structured database, in which negation detection is a critical but challenging step. In Chinese clinical records, negation detection is particularly challenging because it may depend on upstream Chinese information processing components such as word segmentation [1]. Traditionally, negation detection was carried out mostly using rule-based methods, whose comprehensiveness and portability were usually limited. Our objectives in this paper are to: 1) Construct a large Chinese clinical notes corpus with negation annotated; 2) develop a negation detection tool for Chinese clinical notes; 3) evaluate the performance of character and word embedding features in Chinese clinical natural language processing.
METHODS: In this paper, we construct a Chinese clinical corpus consisting of admission and discharge summaries, and propose sequence labeling based systems for negation and scope detection. Our systems rely on features from bag of characters, bag of words, character embedding and word embedding. For scopes, we introduce an additional feature to handle nested scopes with multiple negations.
RESULTS: The two annotators reached an agreement of 0.79 measured by Kappa in manual annotation. In cue detection, our systems are able to achieve a performance as high as 99.0% measured by F score, which significantly outperform its rule-based counterpart (79% F). The best system uses word embedding as features, which yields precision of 99.0% and recall of 99.1%. In scope detection, our system is able to achieve a performance of 94.6% measured by F score.
CONCLUSIONS: Our study provides a state-of-the-art negation-detecting tool for Chinese clinical free-text notes; Experimental results demonstrate that word embedding is effective in identifying negations, and that nested scopes can be identified effectively by our method.
Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

Keywords:  Chinese natural language processing; Clinical natural language processing; Clinical notes; Negation detection; Word embedding

Mesh:

Year:  2016        PMID: 28254090     DOI: 10.1016/j.cmpb.2016.11.009

Source DB:  PubMed          Journal:  Comput Methods Programs Biomed        ISSN: 0169-2607            Impact factor:   5.428


  3 in total

1.  A cascaded approach for Chinese clinical text de-identification with less annotation effort.

Authors:  Zhe Jian; Xusheng Guo; Shijian Liu; Handong Ma; Shaodian Zhang; Rui Zhang; Jianbo Lei
Journal:  J Biomed Inform       Date:  2017-07-26       Impact factor: 6.317

2.  A Novel Approach towards Medical Entity Recognition in Chinese Clinical Text.

Authors:  Jun Liang; Xuemei Xian; Xiaojun He; Meifang Xu; Sheng Dai; Jun'yi Xin; Jie Xu; Jian Yu; Jianbo Lei
Journal:  J Healthc Eng       Date:  2017-07-05       Impact factor: 2.682

3.  MLEE: A method for extracting object-level medical knowledge graph entities from Chinese clinical records.

Authors:  Genghong Zhao; Wenjian Gu; Wei Cai; Zhiying Zhao; Xia Zhang; Jiren Liu
Journal:  Front Genet       Date:  2022-07-22       Impact factor: 4.772

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.