Literature DB >> 33416851

BERT-GT: Cross-sentence n-ary relation extraction with BERT and graph transformer.

Po-Ting Lai1, Zhiyong Lu1.   

Abstract

MOTIVATION: A biomedical relation statement is commonly expressed in multiple sentences and consists of many concepts, including gene, disease, chemical, and mutation. To automatically extract information from biomedical literature, existing biomedical text-mining approaches typically formulate the problem as a cross-sentence n-ary relation-extraction task that detects relations among n entities across multiple sentences, and use either a graph neural network (GNN) with long short-term memory (LSTM) or an attention mechanism. Recently, Transformer has been shown to outperform LSTM on many natural language processing (NLP) tasks.
RESULTS: In this work, we propose a novel architecture that combines Bidirectional Encoder Representations from Transformers with Graph Transformer (BERT-GT), through integrating a neighbor-attention mechanism into the BERT architecture. Unlike the original Transformer architecture, which utilizes the whole sentence(s) to calculate the attention of the current token, the neighbor-attention mechanism in our method calculates its attention utilizing only its neighbor tokens. Thus, each token can pay attention to its neighbor information with little noise. We show that this is critically important when the text is very long, as in cross-sentence or abstract-level relation-extraction tasks. Our benchmarking results show improvements of 5.44% and 3.89% in accuracy and F1-measure over the state-of-the-art on n-ary and chemical-protein relation datasets, suggesting BERT-GT is a robust approach that is applicable to other biomedical relation extraction tasks or datasets.
AVAILABILITY AND IMPLEMENTATION: the source code of BERT-GT will be made freely available at https://github.com/ncbi-nlp/bert_gt upon publication. Published by Oxford University Press 2021.

Year:  2021        PMID: 33416851      PMCID: PMC8023679          DOI: 10.1093/bioinformatics/btaa1087

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  12 in total

1.  Evaluation of the therapeutic effect of theta burst stimulation on drug-resistant auditory hallucinations in a schizophrenic patient and its impact on cognitive function and neuronal excitability: a case study.

Authors:  Djedia Sidhoumi; Sonia Braha; Noomane Bouaziz; Jérôme Brunelin; René Benadhira; Dominique Januel
Journal:  Clin Neurophysiol       Date:  2010-02-11       Impact factor: 3.708

2.  How user intelligence is improving PubMed.

Authors:  Nicolas Fiorini; Robert Leaman; David J Lipman; Zhiyong Lu
Journal:  Nat Biotechnol       Date:  2018-10-01       Impact factor: 54.908

3.  Recent advances of automated methods for searching and extracting genomic variant information from biomedical literature.

Authors:  Kyubum Lee; Chih-Hsuan Wei; Zhiyong Lu
Journal:  Brief Bioinform       Date:  2021-05-20       Impact factor: 11.622

4.  CD-REST: a system for extracting chemical-induced disease relation in literature.

Authors:  Jun Xu; Yonghui Wu; Yaoyun Zhang; Jingqi Wang; Hee-Jin Lee; Hua Xu
Journal:  Database (Oxford)       Date:  2016-03-25       Impact factor: 3.451

5.  BioCreative V CDR task corpus: a resource for chemical disease relation extraction.

Authors:  Jiao Li; Yueping Sun; Robin J Johnson; Daniela Sciaky; Chih-Hsuan Wei; Robert Leaman; Allan Peter Davis; Carolyn J Mattingly; Thomas C Wiegers; Zhiyong Lu
Journal:  Database (Oxford)       Date:  2016-05-09       Impact factor: 3.451

6.  Exploiting syntactic and semantics information for chemical-disease relation extraction.

Authors:  Huiwei Zhou; Huijie Deng; Long Chen; Yunlong Yang; Chen Jia; Degen Huang
Journal:  Database (Oxford)       Date:  2016-04-14       Impact factor: 3.451

7.  Chemical-induced disease relation extraction via convolutional neural network.

Authors:  Jinghang Gu; Fuqing Sun; Longhua Qian; Guodong Zhou
Journal:  Database (Oxford)       Date:  2017-01-01       Impact factor: 3.451

8.  Extracting chemical-protein relations with ensembles of SVM and deep learning models.

Authors:  Yifan Peng; Anthony Rios; Ramakanth Kavuluru; Zhiyong Lu
Journal:  Database (Oxford)       Date:  2018-01-01       Impact factor: 3.451

9.  Chemical-induced disease extraction via recurrent piecewise convolutional neural networks.

Authors:  Haodi Li; Ming Yang; Qingcai Chen; Buzhou Tang; Xiaolong Wang; Jun Yan
Journal:  BMC Med Inform Decis Mak       Date:  2018-07-23       Impact factor: 2.796

10.  Assessing the state of the art in biomedical relation extraction: overview of the BioCreative V chemical-disease relation (CDR) task.

Authors:  Chih-Hsuan Wei; Yifan Peng; Robert Leaman; Allan Peter Davis; Carolyn J Mattingly; Jiao Li; Thomas C Wiegers; Zhiyong Lu
Journal:  Database (Oxford)       Date:  2016-03-19       Impact factor: 3.451

View more
  2 in total

1.  A sequence labeling framework for extracting drug-protein relations from biomedical literature.

Authors:  Ling Luo; Po-Ting Lai; Chih-Hsuan Wei; Zhiyong Lu
Journal:  Database (Oxford)       Date:  2022-07-19       Impact factor: 4.462

2.  GeMI: interactive interface for transformer-based Genomic Metadata Integration.

Authors:  Giuseppe Serna Garcia; Michele Leone; Anna Bernasconi; Mark J Carman
Journal:  Database (Oxford)       Date:  2022-06-03       Impact factor: 4.462

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.