Literature DB >> 29982330

Compound-protein interaction prediction with end-to-end learning of neural networks for graphs and sequences.

Masashi Tsubaki1, Kentaro Tomii1,2, Jun Sese1,2.   

Abstract

Motivation: In bioinformatics, machine learning-based methods that predict the compound-protein interactions (CPIs) play an important role in the virtual screening for drug discovery. Recently, end-to-end representation learning for discrete symbolic data (e.g. words in natural language processing) using deep neural networks has demonstrated excellent performance on various difficult problems. For the CPI problem, data are provided as discrete symbolic data, i.e. compounds are represented as graphs where the vertices are atoms, the edges are chemical bonds, and proteins are sequences in which the characters are amino acids. In this study, we investigate the use of end-to-end representation learning for compounds and proteins, integrate the representations, and develop a new CPI prediction approach by combining a graph neural network (GNN) for compounds and a convolutional neural network (CNN) for proteins.
Results: Our experiments using three CPI datasets demonstrated that the proposed end-to-end approach achieves competitive or higher performance as compared to various existing CPI prediction methods. In addition, the proposed approach significantly outperformed existing methods on an unbalanced dataset. This suggests that data-driven representations of compounds and proteins obtained by end-to-end GNNs and CNNs are more robust than traditional chemical and biological features obtained from databases. Although analyzing deep learning models is difficult due to their black-box nature, we address this issue using a neural attention mechanism, which allows us to consider which subsequences in a protein are more important for a drug compound when predicting its interaction. The neural attention mechanism also provides effective visualization, which makes it easier to analyze a model even when modeling is performed using real-valued representations instead of discrete features. Availability and implementation: https://github.com/masashitsubaki. Supplementary information: Supplementary data are available at Bioinformatics online.

Mesh:

Substances:

Year:  2019        PMID: 29982330     DOI: 10.1093/bioinformatics/bty535

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  53 in total

1.  Neural networks for protein structure and function prediction and dynamic analysis.

Authors:  Yuko Tsuchiya; Kentaro Tomii
Journal:  Biophys Rev       Date:  2020-03-12

2.  Overview of the big data bioinformatics symposium (2SCA) at BSJ2019.

Authors:  Tsuyoshi Shirai; Tohru Terada
Journal:  Biophys Rev       Date:  2020-02-14

3.  GCRNN: graph convolutional recurrent neural network for compound-protein interaction prediction.

Authors:  Ermal Elbasani; Soualihou Ngnamsie Njimbouom; Tae-Jin Oh; Eung-Hee Kim; Hyun Lee; Jeong-Dong Kim
Journal:  BMC Bioinformatics       Date:  2022-01-11       Impact factor: 3.169

4.  DeepH-DTA: Deep Learning for Predicting Drug-Target Interactions: A Case Study of COVID-19 Drug Repurposing.

Authors:  Mohamed Abdel-Basset; Hossam Hawash; Mohamed Elhoseny; Ripon K Chakrabortty; Michael Ryan
Journal:  IEEE Access       Date:  2020-09-15       Impact factor: 3.367

5.  AttentionSiteDTI: an interpretable graph-based model for drug-target interaction prediction using NLP sentence-level relation classification.

Authors:  Mehdi Yazdani-Jahromi; Niloofar Yousefi; Aida Tayebi; Elayaraja Kolanthai; Craig J Neal; Sudipta Seal; Ozlem Ozmen Garibay
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

6.  Cross-modality and self-supervised protein embedding for compound-protein affinity and contact prediction.

Authors:  Yuning You; Yang Shen
Journal:  Bioinformatics       Date:  2022-09-16       Impact factor: 6.931

7.  CSConv2d: A 2-D Structural Convolution Neural Network with a Channel and Spatial Attention Mechanism for Protein-Ligand Binding Affinity Prediction.

Authors:  Xun Wang; Dayan Liu; Jinfu Zhu; Alfonso Rodriguez-Paton; Tao Song
Journal:  Biomolecules       Date:  2021-04-27

8.  Graph-based machine learning interprets and predicts diagnostic isomer-selective ion-molecule reactions in tandem mass spectrometry.

Authors:  Jonathan Fine; Judy Kuan-Yu Liu; Armen Beck; Kawthar Z Alzarieni; Xin Ma; Victoria M Boulos; Hilkka I Kenttämaa; Gaurav Chopra
Journal:  Chem Sci       Date:  2020-10-05       Impact factor: 9.825

Review 9.  Representation learning applications in biological sequence analysis.

Authors:  Hitoshi Iuchi; Taro Matsutani; Keisuke Yamada; Natsuki Iwano; Shunsuke Sumi; Shion Hosoda; Shitao Zhao; Tsukasa Fukunaga; Michiaki Hamada
Journal:  Comput Struct Biotechnol J       Date:  2021-05-23       Impact factor: 7.271

10.  Using drug descriptions and molecular structures for drug-drug interaction extraction from literature.

Authors:  Masaki Asada; Makoto Miwa; Yutaka Sasaki
Journal:  Bioinformatics       Date:  2021-07-19       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.