Literature DB >> 23934949

Joint segmentation and named entity recognition using dual decomposition in Chinese discharge summaries.

Yan Xu1, Yining Wang, Tianren Liu, Jiahua Liu, Yubo Fan, Yi Qian, Junichi Tsujii, Eric I Chang.   

Abstract

OBJECTIVE: In this paper, we focus on three aspects: (1) to annotate a set of standard corpus in Chinese discharge summaries; (2) to perform word segmentation and named entity recognition in the above corpus; (3) to build a joint model that performs word segmentation and named entity recognition.
DESIGN: Two independent systems of word segmentation and named entity recognition were built based on conditional random field models. In the field of natural language processing, while most approaches use a single model to predict outputs, many works have proved that performance of many tasks can be improved by exploiting combined techniques. Therefore, in this paper, we proposed a joint model using dual decomposition to perform both the two tasks in order to exploit correlations between the two tasks. Three sets of features were designed to demonstrate the advantage of the joint model we proposed, compared with independent models, incremental models and a joint model trained on combined labels. MEASUREMENTS: Micro-averaged precision (P), recall (R), and F-measure (F) were used to evaluate results.
RESULTS: The gold standard corpus is created using 336 Chinese discharge summaries of 71 355 words. The framework using dual decomposition achieved 0.2% improvement for segmentation and 1% improvement for recognition, compared with each of the two tasks alone.
CONCLUSIONS: The joint model is efficient and effective in both segmentation and recognition compared with the two individual tasks. The model achieved encouraging results, demonstrating the feasibility of the two tasks.

Keywords:  Chinese Discharge Summary; Conditional Random Fields; Dual Decomposition; Named entity; Segmentation

Mesh:

Year:  2013        PMID: 23934949      PMCID: PMC3957392          DOI: 10.1136/amiajnl-2013-001806

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  11 in total

Review 1.  Evaluating the state of the art in coreference resolution for electronic medical records.

Authors:  Ozlem Uzuner; Andreea Bodnari; Shuying Shen; Tyler Forbush; John Pestian; Brett R South
Journal:  J Am Med Inform Assoc       Date:  2012-02-24       Impact factor: 4.497

2.  Named entity recognition of follow-up and time information in 20,000 radiology reports.

Authors:  Yan Xu; Junichi Tsujii; Eric I-Chao Chang
Journal:  J Am Med Inform Assoc       Date:  2012-07-06       Impact factor: 4.497

3.  Extracting medication information from clinical text.

Authors:  Ozlem Uzuner; Imre Solti; Eithon Cadag
Journal:  J Am Med Inform Assoc       Date:  2010 Sep-Oct       Impact factor: 4.497

4.  Feature engineering combined with machine learning and rule-based methods for structured information extraction from narrative clinical discharge summaries.

Authors:  Yan Xu; Kai Hong; Junichi Tsujii; Eric I-Chao Chang
Journal:  J Am Med Inform Assoc       Date:  2012-05-14       Impact factor: 4.497

5.  2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.

Authors:  Özlem Uzuner; Brett R South; Shuying Shen; Scott L DuVall
Journal:  J Am Med Inform Assoc       Date:  2011-06-16       Impact factor: 4.497

6.  Overcoming barriers to NLP for clinical text: the role of shared tasks and the need for additional creative solutions.

Authors:  Wendy W Chapman; Prakash M Nadkarni; Lynette Hirschman; Leonard W D'Avolio; Guergana K Savova; Ozlem Uzuner
Journal:  J Am Med Inform Assoc       Date:  2011 Sep-Oct       Impact factor: 4.497

7.  A context-blocks model for identifying clinical relationships in patient records.

Authors:  Rezarta Islamaj Doğan; Aurélie Névéol; Zhiyong Lu
Journal:  BMC Bioinformatics       Date:  2011-06-09       Impact factor: 3.169

8.  Machine learning and word sense disambiguation in the biomedical domain: design and evaluation issues.

Authors:  Hua Xu; Marianthi Markatou; Rositsa Dimova; Hongfang Liu; Carol Friedman
Journal:  BMC Bioinformatics       Date:  2006-07-05       Impact factor: 3.169

9.  Building large collections of Chinese and English medical terms from semi-structured and encyclopedia websites.

Authors:  Yan Xu; Yining Wang; Jian-Tao Sun; Jianwen Zhang; Junichi Tsujii; Eric Chang
Journal:  PLoS One       Date:  2013-07-09       Impact factor: 3.240

10.  Matching health information seekers' queries to medical terms.

Authors:  Lina F Soualmia; Elise Prieur-Gaston; Zied Moalla; Thierry Lecroq; Stéfan J Darmoni
Journal:  BMC Bioinformatics       Date:  2012-09-07       Impact factor: 3.169

View more
  17 in total

1.  A comprehensive study of named entity recognition in Chinese clinical text.

Authors:  Jianbo Lei; Buzhou Tang; Xueqin Lu; Kaihua Gao; Min Jiang; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2013-12-17       Impact factor: 4.497

2.  Electronic health records-driven phenotyping: challenges, recent advances, and perspectives.

Authors:  Jyotishman Pathak; Abel N Kho; Joshua C Denny
Journal:  J Am Med Inform Assoc       Date:  2013-12       Impact factor: 4.497

3.  Speculation detection for Chinese clinical notes: Impacts of word segmentation and embedding models.

Authors:  Shaodian Zhang; Tian Kang; Xingting Zhang; Dong Wen; Noémie Elhadad; Jianbo Lei
Journal:  J Biomed Inform       Date:  2016-02-26       Impact factor: 6.317

4.  A cascaded approach for Chinese clinical text de-identification with less annotation effort.

Authors:  Zhe Jian; Xusheng Guo; Shijian Liu; Handong Ma; Shaodian Zhang; Rui Zhang; Jianbo Lei
Journal:  J Biomed Inform       Date:  2017-07-26       Impact factor: 6.317

5.  Automatic approach for constructing a knowledge graph of knee osteoarthritis in Chinese.

Authors:  Xin Li; Haoyang Liu; Xu Zhao; Guigang Zhang; Chunxiao Xing
Journal:  Health Inf Sci Syst       Date:  2020-02-27

6.  Named Entity Recognition in Chinese Clinical Text Using Deep Neural Network.

Authors:  Yonghui Wu; Min Jiang; Jianbo Lei; Hua Xu
Journal:  Stud Health Technol Inform       Date:  2015

7.  Bilingual term alignment from comparable corpora in English discharge summary and Chinese discharge summary.

Authors:  Yan Xu; Luoxin Chen; Junsheng Wei; Sophia Ananiadou; Yubo Fan; Yi Qian; Eric I-Chao Chang; Junichi Tsujii
Journal:  BMC Bioinformatics       Date:  2015-05-09       Impact factor: 3.169

8.  A Novel Approach towards Medical Entity Recognition in Chinese Clinical Text.

Authors:  Jun Liang; Xuemei Xian; Xiaojun He; Meifang Xu; Sheng Dai; Jun'yi Xin; Jie Xu; Jian Yu; Jianbo Lei
Journal:  J Healthc Eng       Date:  2017-07-05       Impact factor: 2.682

9.  Development and validation of method for defining conditions using Chinese electronic medical record.

Authors:  Yuan Xu; Ning Li; Mingshan Lu; Robert P Myers; Elijah Dixon; Robin Walker; Libo Sun; Xiaofei Zhao; Hude Quan
Journal:  BMC Med Inform Decis Mak       Date:  2016-08-20       Impact factor: 2.796

Review 10.  Clinical Natural Language Processing in languages other than English: opportunities and challenges.

Authors:  Aurélie Névéol; Hercules Dalianis; Sumithra Velupillai; Guergana Savova; Pierre Zweigenbaum
Journal:  J Biomed Semantics       Date:  2018-03-30
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.