Literature DB >> 30822516

Exploring semi-supervised variational autoencoders for biomedical relation extraction.

Yijia Zhang1, Zhiyong Lu2.   

Abstract

The biomedical literature provides a rich source of knowledge such as protein-protein interactions (PPIs), drug-drug interactions (DDIs) and chemical-protein interactions (CPIs). Biomedical relation extraction aims to automatically extract biomedical relations from biomedical text for various biomedical research. State-of-the-art methods for biomedical relation extraction are primarily based on supervised machine learning and therefore depend on (sufficient) labeled data. However, creating large sets of training data is prohibitively expensive and labor-intensive, especially so in biomedicine as domain knowledge is required. In contrast, there is a large amount of unlabeled biomedical text available in PubMed. Hence, computational methods capable of employing unlabeled data to reduce the burden of manual annotation are of particular interest in biomedical relation extraction. We present a novel semi-supervised approach based on variational autoencoder (VAE) for biomedical relation extraction. Our model consists of the following three parts, a classifier, an encoder and a decoder. The classifier is implemented using multi-layer convolutional neural networks (CNNs), and the encoder and decoder are implemented using both bidirectional long short-term memory networks (Bi-LSTMs) and CNNs, respectively. The semi-supervised mechanism allows our model to learn features from both the labeled and unlabeled data. We evaluate our method on multiple public PPI, DDI and CPI corpora. Experimental results show that our method effectively exploits the unlabeled data to improve the performance and reduce the dependence on labeled data. To our best knowledge, this is the first semi-supervised VAE-based method for (biomedical) relation extraction. Our results suggest that exploiting such unlabeled data can be greatly beneficial to improved performance in various biomedical relation extraction, especially when only limited labeled data (e.g. 2000 samples or less) is available in such tasks. Published by Elsevier Inc.

Entities:  

Keywords:  Biomedical literature; Relation extraction; Semi-supervised learning; Variational autoencoder

Mesh:

Year:  2019        PMID: 30822516      PMCID: PMC6708455          DOI: 10.1016/j.ymeth.2019.02.021

Source DB:  PubMed          Journal:  Methods        ISSN: 1046-2023            Impact factor:   3.608


  21 in total

1.  The DDI corpus: an annotated corpus with pharmacological substances and drug-drug interactions.

Authors:  María Herrero-Zazo; Isabel Segura-Bedmar; Paloma Martínez; Thierry Declerck
Journal:  J Biomed Inform       Date:  2013-07-29       Impact factor: 6.317

2.  Rapidly Mixing Gibbs Sampling for a Class of Factor Graphs Using Hierarchy Width.

Authors:  Christopher De Sa; Ce Zhang; Kunle Olukotun; Christopher Ré
Journal:  Adv Neural Inf Process Syst       Date:  2015-12

3.  Overcoming barriers to NLP for clinical text: the role of shared tasks and the need for additional creative solutions.

Authors:  Wendy W Chapman; Prakash M Nadkarni; Lynette Hirschman; Leonard W D'Avolio; Guergana K Savova; Ozlem Uzuner
Journal:  J Am Med Inform Assoc       Date:  2011 Sep-Oct       Impact factor: 4.497

4.  Walk-weighted subsequence kernels for protein-protein interaction extraction.

Authors:  Seonho Kim; Juntae Yoon; Jihoon Yang; Seog Park
Journal:  BMC Bioinformatics       Date:  2010-02-25       Impact factor: 3.169

5.  A hybrid model based on neural networks for biomedical relation extraction.

Authors:  Yijia Zhang; Hongfei Lin; Zhihao Yang; Jian Wang; Shaowu Zhang; Yuanyuan Sun; Liang Yang
Journal:  J Biomed Inform       Date:  2018-03-27       Impact factor: 6.317

6.  Comparative analysis of five protein-protein interaction corpora.

Authors:  Sampo Pyysalo; Antti Airola; Juho Heimonen; Jari Björne; Filip Ginter; Tapio Salakoski
Journal:  BMC Bioinformatics       Date:  2008-04-11       Impact factor: 3.169

7.  Text Mining Genotype-Phenotype Relationships from Biomedical Literature for Database Curation and Precision Medicine.

Authors:  Ayush Singhal; Michael Simmons; Zhiyong Lu
Journal:  PLoS Comput Biol       Date:  2016-11-30       Impact factor: 4.475

8.  Extracting chemical-protein relations with ensembles of SVM and deep learning models.

Authors:  Yifan Peng; Anthony Rios; Ramakanth Kavuluru; Zhiyong Lu
Journal:  Database (Oxford)       Date:  2018-01-01       Impact factor: 3.451

9.  A single kernel-based approach to extract drug-drug interactions from biomedical literature.

Authors:  Yijia Zhang; Hongfei Lin; Zhihao Yang; Jian Wang; Yanpeng Li
Journal:  PLoS One       Date:  2012-11-01       Impact factor: 3.240

10.  All-paths graph kernel for protein-protein interaction extraction with evaluation of cross-corpus learning.

Authors:  Antti Airola; Sampo Pyysalo; Jari Björne; Tapio Pahikkala; Filip Ginter; Tapio Salakoski
Journal:  BMC Bioinformatics       Date:  2008-11-19       Impact factor: 3.169

View more
  6 in total

1.  A hybrid method based on semi-supervised learning for relation extraction in Chinese EMRs.

Authors:  Chunming Yang; Dan Xiao; Yuanyuan Luo; Bo Li; Xujian Zhao; Hui Zhang
Journal:  BMC Med Inform Decis Mak       Date:  2022-06-27       Impact factor: 3.298

Review 2.  On the road to explainable AI in drug-drug interactions prediction: A systematic review.

Authors:  Thanh Hoa Vo; Ngan Thi Kim Nguyen; Quang Hien Kha; Nguyen Quoc Khanh Le
Journal:  Comput Struct Biotechnol J       Date:  2022-04-19       Impact factor: 6.155

3.  Extraction of chemical-protein interactions from the literature using neural networks and narrow instance representation.

Authors:  Rui Antunes; Sérgio Matos
Journal:  Database (Oxford)       Date:  2019-01-01       Impact factor: 3.451

Review 4.  Constructing knowledge graphs and their biomedical applications.

Authors:  David N Nicholson; Casey S Greene
Journal:  Comput Struct Biotechnol J       Date:  2020-06-02       Impact factor: 7.271

5.  COVID-19 Surveiller: toward a robust and effective pandemic surveillance system basedon social media mining.

Authors:  Jyun-Yu Jiang; Yichao Zhou; Xiusi Chen; Yan-Ru Jhou; Liqi Zhao; Sabrina Liu; Po-Chun Yang; Jule Ahmar; Wei Wang
Journal:  Philos Trans A Math Phys Eng Sci       Date:  2021-11-22       Impact factor: 4.226

Review 6.  Computational systems biology in disease modeling and control, review and perspectives.

Authors:  Rongting Yue; Abhishek Dutta
Journal:  NPJ Syst Biol Appl       Date:  2022-10-03
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.