Literature DB >> 34152393

Node Similarity Based Graph Convolution for Link Prediction in Biological Networks.

Mustafa Coşkun1,2, Mehmet Koyutürk3,4.   

Abstract

BACKGROUND: Link prediction is an important and well-studied problem in network biology. Recently, graph representation learning methods, including Graph Convolutional Network (GCN)-based node embedding have drawn increasing attention in link prediction.
MOTIVATION: An important component of GCN-based network embedding is the convolution matrix, which is used to propagate features across the network. Existing algorithms use the degree-normalized adjacency matrix for this purpose, as this matrix is closely related to the graph Laplacian, capturing the spectral properties of the network. In parallel, it has been shown that GCNs with a single layer can generate more robust embeddings by reducing the number of parameters. Laplacian-based convolution is not well suited to single layered GCNs, as it limits the propagation of information to immediate neighbors of a node.
RESULTS: Capitalizing on the rich literature on unsupervised link prediction, we propose using node similarity based convolution matrices in GCNs to compute node embeddings for link prediction. We consider eight representative node similarity measures (Common Neighbors, Jaccard Index, Adamic-Adar, Resource Allocation, Hub Depressed Index, Hub Promoted Index, Sorenson Index, Salton Index) for this purpose. We systematically compare the performance of the resulting algorithms against GCNs that use the degree-normalized adjacency matrix for convolution, as well as other link prediction algorithms. In our experiments, we use three link prediction tasks involving biomedical networks: drug-disease association (DDA) prediction, drug-drug interaction (DDI) prediction, protein-protein interaction (PPI) prediction. Our results show that node similarity-based convolution matrices significantly improve the link prediction performance of GCN-based embeddings.
CONCLUSION: As sophisticated machine learning frameworks are increasingly employed in biological applications, historically well-established methods can be useful in making a head-start. AVAILABILITY: Our method, SiGraC, is implemented as a Python library and is freely available at https://github.com/mustafaCoskunAgu/SiGraC. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) (2021). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Year:  2021        PMID: 34152393      PMCID: PMC8652026          DOI: 10.1093/bioinformatics/btab464

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.931


  22 in total

1.  Predicting protein-protein interactions from protein sequences by a stacked sparse autoencoder deep neural network.

Authors:  Yan-Bin Wang; Zhu-Hong You; Xiao Li; Tong-Hai Jiang; Xing Chen; Xi Zhou; Lei Wang
Journal:  Mol Biosyst       Date:  2017-06-27

2.  LRSSL: predict and interpret drug-disease associations based on data integration using sparse subspace learning.

Authors:  Xujun Liang; Pengfei Zhang; Lu Yan; Ying Fu; Fang Peng; Lingzhi Qu; Meiying Shao; Yongheng Chen; Zhuchu Chen
Journal:  Bioinformatics       Date:  2017-04-15       Impact factor: 6.937

3.  Vavien: an algorithm for prioritizing candidate disease genes based on topological similarity of proteins in interaction networks.

Authors:  Sinan Erten; Gurkan Bebek; Mehmet Koyutürk
Journal:  J Comput Biol       Date:  2011-10-28       Impact factor: 1.479

4.  Compact Integration of Multi-Network Topology for Functional Analysis of Genes.

Authors:  Hyunghoon Cho; Bonnie Berger; Jian Peng
Journal:  Cell Syst       Date:  2016-11-23       Impact factor: 10.304

5.  PREDICT: a method for inferring novel drug indications with application to personalized medicine.

Authors:  Assaf Gottlieb; Gideon Y Stein; Eytan Ruppin; Roded Sharan
Journal:  Mol Syst Biol       Date:  2011-06-07       Impact factor: 11.429

6.  STRING v10: protein-protein interaction networks, integrated over the tree of life.

Authors:  Damian Szklarczyk; Andrea Franceschini; Stefan Wyder; Kristoffer Forslund; Davide Heller; Jaime Huerta-Cepas; Milan Simonovic; Alexander Roth; Alberto Santos; Kalliopi P Tsafou; Michael Kuhn; Peer Bork; Lars J Jensen; Christian von Mering
Journal:  Nucleic Acids Res       Date:  2014-10-28       Impact factor: 16.971

7.  Drug Response Prediction as a Link Prediction Problem.

Authors:  Zachary Stanfield; Mustafa Coşkun; Mehmet Koyutürk
Journal:  Sci Rep       Date:  2017-01-09       Impact factor: 4.379

8.  STRING v9.1: protein-protein interaction networks, with increased coverage and integration.

Authors:  Andrea Franceschini; Damian Szklarczyk; Sune Frankild; Michael Kuhn; Milan Simonovic; Alexander Roth; Jianyi Lin; Pablo Minguez; Peer Bork; Christian von Mering; Lars J Jensen
Journal:  Nucleic Acids Res       Date:  2012-11-29       Impact factor: 16.971

9.  The Comparative Toxicogenomics Database: update 2019.

Authors:  Allan Peter Davis; Cynthia J Grondin; Robin J Johnson; Daniela Sciaky; Roy McMorran; Jolene Wiegers; Thomas C Wiegers; Carolyn J Mattingly
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

10.  Graph embedding on biomedical networks: methods, applications and evaluations.

Authors:  Xiang Yue; Zhen Wang; Jingong Huang; Srinivasan Parthasarathy; Soheil Moosavinasab; Yungui Huang; Simon M Lin; Wen Zhang; Ping Zhang; Huan Sun
Journal:  Bioinformatics       Date:  2020-02-15       Impact factor: 6.937

View more
  2 in total

1.  Topsy-Turvy: integrating a global view into sequence-based PPI prediction.

Authors:  Rohit Singh; Kapil Devkota; Samuel Sledzieski; Bonnie Berger; Lenore Cowen
Journal:  Bioinformatics       Date:  2022-06-24       Impact factor: 6.931

2.  Causal reasoning over knowledge graphs leveraging drug-perturbed and disease-specific transcriptomic signatures for drug discovery.

Authors:  Daniel Domingo-Fernández; Yojana Gadiya; Abhishek Patel; Sarah Mubeen; Daniel Rivas-Barragan; Chris W Diana; Biswapriya B Misra; David Healey; Joe Rokicki; Viswa Colluru
Journal:  PLoS Comput Biol       Date:  2022-02-25       Impact factor: 4.475

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.