Literature DB >> 32726411

Large-scale entity representation learning for biomedical relationship extraction.

Mario Sänger1, Ulf Leser1.   

Abstract

MOTIVATION: The automatic extraction of published relationships between molecular entities has important applications in many biomedical fields, ranging from Systems Biology to Personalized Medicine. Existing works focused on extracting relationships described in single articles or in single sentences. However, a single record is rarely sufficient to judge upon the biological correctness of a relation, as experimental evidence might be weak or only valid in a certain context. Furthermore, statements may be more speculative than confirmative, and different articles often contradict each other. Experts therefore always take the complete literature into account to take a reliable decision upon a relationship. It is an open research question how to do this effectively in an automatic manner.
RESULTS: We propose two novel relation extraction approaches which use recent representation learning techniques to create comprehensive models of biomedical entities or entity-pairs, respectively. These representations are learned by considering all publications from PubMed mentioning an entity or a pair. They are used as input for a neural network for classifying relations globally, i.e. the derived predictions are corpus-based, not sentence- or article based as in prior art. Experiments on the extraction of mutation-disease, drug-disease and drug-drug relationships show that the learned embeddings indeed capture semantic information of the entities under study and outperform traditional methods by 4-29% regarding F1 score.
AVAILABILITY AND IMPLEMENTATION: Source codes are available at: https://github.com/mariosaenger/bio-re-with-entity-embeddings. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Year:  2021        PMID: 32726411     DOI: 10.1093/bioinformatics/btaa674

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  2 in total

1.  Hierarchical network analysis of co-occurring bioentities in literature.

Authors:  Heejung Yang; Namgil Lee; Beomjun Park; Jinyoung Park; Jiho Lee; Hyeon Seok Jang; Hojin Yoo
Journal:  Sci Rep       Date:  2022-05-12       Impact factor: 4.996

2.  YTLR: Extracting yeast transcription factor-gene associations from the literature using automated literature readers.

Authors:  Tzu-Hsien Yang; Chung-Yu Wang; Hsiu-Chun Tsai; Ya-Chiao Yang; Cheng-Tse Liu
Journal:  Comput Struct Biotechnol J       Date:  2022-08-24       Impact factor: 6.155

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.