Literature DB >> 34920699

Improving deep learning method for biomedical named entity recognition by using entity definition information.

Ying Xiong1,2, Shuai Chen1, Buzhou Tang3,4, Qingcai Chen1,2, Xiaolong Wang1, Jun Yan5, Yi Zhou6.   

Abstract

BACKGROUND: Biomedical named entity recognition (NER) is a fundamental task of biomedical text mining that finds the boundaries of entity mentions in biomedical text and determines their entity type. To accelerate the development of biomedical NER techniques in Spanish, the PharmaCoNER organizers launched a competition to recognize pharmacological substances, compounds, and proteins. Biomedical NER is usually recognized as a sequence labeling task, and almost all state-of-the-art sequence labeling methods ignore the meaning of different entity types. In this paper, we investigate some methods to introduce the meaning of entity types in deep learning methods for biomedical NER and apply them to the PharmaCoNER 2019 challenge. The meaning of each entity type is represented by its definition information. MATERIAL AND
METHOD: We investigate how to use entity definition information in the following two methods: (1) SQuad-style machine reading comprehension (MRC) methods that treat entity definition information as query and biomedical text as context and predict answer spans as entities. (2) Span-level one-pass (SOne) methods that predict entity spans of one type by one type and introduce entity type meaning, which is represented by entity definition information. All models are trained and tested on the PharmaCoNER 2019 corpus, and their performance is evaluated by strict micro-average precision, recall, and F1-score.
RESULTS: Entity definition information brings improvements to both SQuad-style MRC and SOne methods by about 0.003 in micro-averaged F1-score. The SQuad-style MRC model using entity definition information as query achieves the best performance with a micro-averaged precision of 0.9225, a recall of 0.9050, and an F1-score of 0.9137, respectively. It outperforms the best model of the PharmaCoNER 2019 challenge by 0.0032 in F1-score. Compared with the state-of-the-art model without using manually-crafted features, our model obtains a 1% improvement in F1-score, which is significant. These results indicate that entity definition information is useful for deep learning methods on biomedical NER.
CONCLUSION: Our entity definition information enhanced models achieve the state-of-the-art micro-average F1 score of 0.9137, which implies that entity definition information has a positive impact on biomedical NER detection. In the future, we will explore more entity definition information from knowledge graph.
© 2021. The Author(s).

Entities:  

Keywords:  Biomedical named entity recognition; Entity definition information; Machine reading comprehension; Span-level one-pass method

Mesh:

Year:  2021        PMID: 34920699      PMCID: PMC8680061          DOI: 10.1186/s12859-021-04236-y

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  16 in total

1.  The KnowledgeMap project: development of a concept-based medical school curriculum database.

Authors:  Joshua C Denny; Plomarz R Irani; Firas H Wehbe; Jeffrey D Smithers; Anderson Spickard
Journal:  AMIA Annu Symp Proc       Date:  2003

2.  An overview of MetaMap: historical perspective and recent advances.

Authors:  Alan R Aronson; François-Michel Lang
Journal:  J Am Med Inform Assoc       Date:  2010 May-Jun       Impact factor: 4.497

3.  Cross-type biomedical named entity recognition with deep multi-task learning.

Authors:  Xuan Wang; Yu Zhang; Xiang Ren; Yuhao Zhang; Marinka Zitnik; Jingbo Shang; Curtis Langlotz; Jiawei Han
Journal:  Bioinformatics       Date:  2019-05-15       Impact factor: 6.937

4.  Automatic de-identification of electronic medical records using token-level and character-level conditional random fields.

Authors:  Zengjian Liu; Yangxin Chen; Buzhou Tang; Xiaolong Wang; Qingcai Chen; Haodi Li; Jingfeng Wang; Qiwen Deng; Suisong Zhu
Journal:  J Biomed Inform       Date:  2015-06-26       Impact factor: 6.317

5.  Overview of BioCreAtIvE: critical assessment of information extraction for biology.

Authors:  Lynette Hirschman; Alexander Yeh; Christian Blaschke; Alfonso Valencia
Journal:  BMC Bioinformatics       Date:  2005-05-24       Impact factor: 3.169

6.  Evaluating word representation features in biomedical named entity recognition tasks.

Authors:  Buzhou Tang; Hongxin Cao; Xiaolong Wang; Qingcai Chen; Hua Xu
Journal:  Biomed Res Int       Date:  2014-03-06       Impact factor: 3.411

7.  Long short-term memory RNN for biomedical named entity recognition.

Authors:  Chen Lyu; Bo Chen; Yafeng Ren; Donghong Ji
Journal:  BMC Bioinformatics       Date:  2017-10-30       Impact factor: 3.169

8.  A neural network multi-task learning approach to biomedical named entity recognition.

Authors:  Gamal Crichton; Sampo Pyysalo; Billy Chiu; Anna Korhonen
Journal:  BMC Bioinformatics       Date:  2017-08-15       Impact factor: 3.169

9.  CNN-based ranking for biomedical entity normalization.

Authors:  Haodi Li; Qingcai Chen; Buzhou Tang; Xiaolong Wang; Hua Xu; Baohua Wang; Dong Huang
Journal:  BMC Bioinformatics       Date:  2017-10-03       Impact factor: 3.169

10.  Transfer learning for biomedical named entity recognition with neural networks.

Authors:  John M Giorgi; Gary D Bader
Journal:  Bioinformatics       Date:  2018-12-01       Impact factor: 6.937

View more
  1 in total

1.  Probiotics Inhibit Cartilage Damage and Progression of Osteoarthritis in Mice.

Authors:  Antonia Sophocleous; Asim Azfer; Carmen Huesa; Eleni Stylianou; Stuart H Ralston
Journal:  Calcif Tissue Int       Date:  2022-10-19       Impact factor: 4.000

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.