Literature DB >> 28369171

A transition-based joint model for disease named entity recognition and normalization.

Yinxia Lou1,2, Yue Zhang3, Tao Qian4, Fei Li1, Shufeng Xiong5, Donghong Ji1.   

Abstract

MOTIVATION: Disease named entities play a central role in many areas of biomedical research, and automatic recognition and normalization of such entities have received increasing attention in biomedical research communities. Existing methods typically used pipeline models with two independent phases: (i) a disease named entity recognition (DER) system is used to find the boundaries of mentions in text and (ii) a disease named entity normalization (DEN) system is used to connect the mentions recognized to concepts in a controlled vocabulary. The main problems of such models are: (i) there is error propagation from DER to DEN and (ii) DEN is useful for DER, but pipeline models cannot utilize this.
METHODS: We propose a transition-based model to jointly perform disease named entity recognition and normalization, casting the output construction process into an incremental state transition process, learning sequences of transition actions globally, which correspond to joint structural outputs. Beam search and online structured learning are used, with learning being designed to guide search. Compared with the only existing method for joint DEN and DER, our method allows non-local features to be used, which significantly improves the accuracies.
RESULTS: We evaluate our model on two corpora: the BioCreative V Chemical Disease Relation (CDR) corpus and the NCBI disease corpus. Experiments show that our joint framework achieves significantly higher performances compared to competitive pipeline baselines. Our method compares favourably to other state-of-the-art approaches.
AVAILABILITY AND IMPLEMENTATION: Data and code are available at https://github.com/louyinxia/jointRN. CONTACT: dhji@whu.edu.cn.
© The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

Entities:  

Mesh:

Year:  2017        PMID: 28369171     DOI: 10.1093/bioinformatics/btx172

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  10 in total

1.  Applying citizen science to gene, drug and disease relationship extraction from biomedical abstracts.

Authors:  Ginger Tsueng; Max Nanis; Jennifer T Fouquier; Michael Mayers; Benjamin M Good; Andrew I Su
Journal:  Bioinformatics       Date:  2020-02-15       Impact factor: 6.937

Review 2.  Recent advances in biomedical literature mining.

Authors:  Sendong Zhao; Chang Su; Zhiyong Lu; Fei Wang
Journal:  Brief Bioinform       Date:  2021-05-20       Impact factor: 11.622

3.  Parallel sequence tagging for concept recognition.

Authors:  Lenz Furrer; Joseph Cornelius; Fabio Rinaldi
Journal:  BMC Bioinformatics       Date:  2022-03-24       Impact factor: 3.169

4.  Fine-Tuning Bidirectional Encoder Representations From Transformers (BERT)-Based Models on Large-Scale Electronic Health Record Notes: An Empirical Study.

Authors:  Fei Li; Yonghao Jin; Weisong Liu; Bhanu Pratap Singh Rawat; Pengshan Cai; Hong Yu
Journal:  JMIR Med Inform       Date:  2019-09-12

Review 5.  Medical Information Extraction in the Age of Deep Learning.

Authors:  Udo Hahn; Michel Oleynik
Journal:  Yearb Med Inform       Date:  2020-08-21

6.  Multi-task learning for Chinese clinical named entity recognition with external knowledge.

Authors:  Ming Cheng; Shufeng Xiong; Fei Li; Pan Liang; Jianbo Gao
Journal:  BMC Med Inform Decis Mak       Date:  2021-12-31       Impact factor: 2.796

7.  Natural Language Processing Algorithms for Normalizing Expressions of Synonymous Symptoms in Traditional Chinese Medicine.

Authors:  Lu Zhou; Shuangqiao Liu; Caiyan Li; Yuemeng Sun; Yizhuo Zhang; Yuda Li; Huimin Yuan; Yan Sun; Fengqin Xu; Yuhang Li
Journal:  Evid Based Complement Alternat Med       Date:  2021-10-11       Impact factor: 2.629

8.  A Multitask Deep Learning Framework for DNER.

Authors:  Ran Jin; Tengda Hou; Tongrui Yu; Min Luo; Haoliang Hu
Journal:  Comput Intell Neurosci       Date:  2022-04-16

9.  PASCAL: a pseudo cascade learning framework for breast cancer treatment entity normalization in Chinese clinical text.

Authors:  Yang An; Jianlin Wang; Liang Zhang; Hanyu Zhao; Zhan Gao; Haitao Huang; Zhenguang Du; Zengtao Jiao; Jun Yan; Xiaopeng Wei; Bo Jin
Journal:  BMC Med Inform Decis Mak       Date:  2020-08-28       Impact factor: 2.796

10.  BioBERT: a pre-trained biomedical language representation model for biomedical text mining.

Authors:  Jinhyuk Lee; Wonjin Yoon; Sungdong Kim; Donghyeon Kim; Sunkyu Kim; Chan Ho So; Jaewoo Kang
Journal:  Bioinformatics       Date:  2020-02-15       Impact factor: 6.937

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.