Literature DB >> 31003175

Document-level attention-based BiLSTM-CRF incorporating disease dictionary for disease named entity recognition.

Kai Xu1, Zhenguo Yang2, Peipei Kang3, Qi Wang4, Wenyin Liu5.   

Abstract

BACKGROUND: Disease named entity recognition (NER) plays an important role in biomedical research. There are a significant number of challenging issues to be addressed; among these, the identification of rare diseases and complex disease names and the problem of tagging inconsistency (i.e., if an entity is tagged differently in a document) are attracting substantial research attention.
METHODS: We propose a new neural network method named Dic-Att-BiLSTM-CRF (DABLC) for disease NER. DABLC applies an efficient exact string matching method to match disease entities with a disease dictionary; here, the dictionary is constructed based on the Disease Ontology. Furthermore, DABLC constructs a dictionary attention layer by incorporating a disease dictionary matching method and document-level attention mechanism. Finally, a bidirectional long short-term memory network and conditional random field (BiLSTM-CRF) with a dictionary attention layer is proposed to combine the disease dictionary to develop disease NER.
RESULTS: Extensive experiments are conducted on two widely-used corpora: the NCBI disease corpus and the BioCreative V CDR corpus. We apply each test on 10 executions of each model, with a 95% confidence interval. DABLC achieves the highest F1 scores (NCBI: Precision = 0.883, Recall = 0.89, F1 = 0.886; BioCreative V CDR: Precision = 0.891, Recall = 0.875, F1 = 0.883), outperforming the state-of-the-art methods.
CONCLUSION: DABLC combines the advantages of both external dictionary resources and deep attention neural networks. This aids the identification of rare diseases and complex disease names; moreover, it reduces the impact of tagging inconsistency. Special disease NER and deep learning models addressing long sentences are noteworthy areas for future examination.
Copyright © 2019 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Biomedical informatics; Machine learning; Named entity recognition; Neural network; String matching

Mesh:

Year:  2019        PMID: 31003175     DOI: 10.1016/j.compbiomed.2019.04.002

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


  8 in total

1.  Model-based clinical note entity recognition for rheumatoid arthritis using bidirectional encoder representation from transformers.

Authors:  Meiting Li; Feifei Liu; Jia'an Zhu; Ran Zhang; Yi Qin; Dongping Gao
Journal:  Quant Imaging Med Surg       Date:  2022-01

Review 2.  Medical Information Extraction in the Age of Deep Learning.

Authors:  Udo Hahn; Michel Oleynik
Journal:  Yearb Med Inform       Date:  2020-08-21

3.  Deep learning model for multi-classification of infectious diseases from unstructured electronic medical records.

Authors:  Mengying Wang; Zhenhao Wei; Mo Jia; Lianzhong Chen; Hong Ji
Journal:  BMC Med Inform Decis Mak       Date:  2022-02-16       Impact factor: 2.796

4.  Deep learning-based methods for natural hazard named entity recognition.

Authors:  Junlin Sun; Yanrong Liu; Jing Cui; Handong He
Journal:  Sci Rep       Date:  2022-03-17       Impact factor: 4.996

5.  A machine learning framework for discovery and enrichment of metagenomics metadata from open access publications.

Authors:  Maaly Nassar; Alexander B Rogers; Francesco Talo'; Santiago Sanchez; Zunaira Shafique; Robert D Finn; Johanna McEntyre
Journal:  Gigascience       Date:  2022-08-11       Impact factor: 7.658

Review 6.  Artificial Intelligence (AI) in Rare Diseases: Is the Future Brighter?

Authors:  Sandra Brasil; Carlota Pascoal; Rita Francisco; Vanessa Dos Reis Ferreira; Paula A Videira; And Gonçalo Valadão
Journal:  Genes (Basel)       Date:  2019-11-27       Impact factor: 4.096

7.  BioBERT: a pre-trained biomedical language representation model for biomedical text mining.

Authors:  Jinhyuk Lee; Wonjin Yoon; Sungdong Kim; Donghyeon Kim; Sunkyu Kim; Chan Ho So; Jaewoo Kang
Journal:  Bioinformatics       Date:  2020-02-15       Impact factor: 6.937

8.  AT-NeuroEAE: A Joint Extraction Model of Events With Attributes for Research Sharing-Oriented Neuroimaging Provenance Construction.

Authors:  Shaofu Lin; Zhe Xu; Ying Sheng; Lihong Chen; Jianhui Chen
Journal:  Front Neurosci       Date:  2022-03-07       Impact factor: 4.677

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.