Literature DB >> 33719344

An Efficient Computational Model for Large-Scale Prediction of Protein-Protein Interactions Based on Accurate and Scalable Graph Embedding.

Xiao-Rui Su1,2,3, Zhu-Hong You1,2,3, Lun Hu1,2,3, Yu-An Huang1, Yi Wang1,2,3, Hai-Cheng Yi1,2,3.   

Abstract

Protein-protein interaction (PPI) is the basis of the whole molecular mechanisms of living cells. Although traditional experiments are able to detect PPIs accurately, they often encounter high cost and require more time. As a result, computational methods have been used to predict PPIs to avoid these problems. Graph structure, as the important and pervasive data carriers, is considered as the most suitable structure to present biomedical entities and relationships. Although graph embedding is the most popular approach for graph representation learning, it usually suffers from high computational and space cost, especially in large-scale graphs. Therefore, developing a framework, which can accelerate graph embedding and improve the accuracy of embedding results, is important to large-scale PPIs prediction. In this paper, we propose a multi-level model LPPI to improve both the quality and speed of large-scale PPIs prediction. Firstly, protein basic information is collected as its attribute, including positional gene sets, motif gene sets, and immunological signatures. Secondly, we construct a weighted graph by using protein attributes to calculate node similarity. Then GraphZoom is used to accelerate the embedding process by reducing the size of the weighted graph. Next, graph embedding methods are used to learn graph topology features from the reconstructed graph. Finally, the linear Logistic Regression (LR) model is used to predict the probability of interactions of two proteins. LPPI achieved a high accuracy of 0.99997 and 0.9979 on the PPI network dataset and GraphSAGE-PPI dataset, respectively. Our further results show that the LPPI is promising for large-scale PPI prediction in both accuracy and efficiency, which is beneficial to other large-scale biomedical molecules interactions detection.
Copyright © 2021 Su, You, Hu, Huang, Wang and Yi.

Entities:  

Keywords:  GraphZoom; graph embedding; large-scale; protein-protein interaction; weighted graph

Year:  2021        PMID: 33719344      PMCID: PMC7953052          DOI: 10.3389/fgene.2021.635451

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


  15 in total

1.  Predicting protein-protein interactions based only on sequences information.

Authors:  Juwen Shen; Jian Zhang; Xiaomin Luo; Weiliang Zhu; Kunqian Yu; Kaixian Chen; Yixue Li; Hualiang Jiang
Journal:  Proc Natl Acad Sci U S A       Date:  2007-03-05       Impact factor: 11.205

2.  Functional organization of the yeast proteome by systematic analysis of protein complexes.

Authors:  Anne-Claude Gavin; Markus Bösche; Roland Krause; Paola Grandi; Martina Marzioch; Andreas Bauer; Jörg Schultz; Jens M Rick; Anne-Marie Michon; Cristina-Maria Cruciat; Marita Remor; Christian Höfert; Malgorzata Schelder; Miro Brajenovic; Heinz Ruffner; Alejandro Merino; Karin Klein; Manuela Hudak; David Dickson; Tatjana Rudi; Volker Gnau; Angela Bauch; Sonja Bastuck; Bettina Huhse; Christina Leutwein; Marie-Anne Heurtier; Richard R Copley; Angela Edelmann; Erich Querfurth; Vladimir Rybin; Gerard Drewes; Manfred Raida; Tewis Bouwmeester; Peer Bork; Bertrand Seraphin; Bernhard Kuster; Gitte Neubauer; Giulio Superti-Furga
Journal:  Nature       Date:  2002-01-10       Impact factor: 49.962

3.  PPI-Detect: A support vector machine model for sequence-based prediction of protein-protein interactions.

Authors:  Sandra Romero-Molina; Yasser B Ruiz-Blanco; Mirja Harms; Jan Münch; Elsa Sanchez-Garcia
Journal:  J Comput Chem       Date:  2019-02-15       Impact factor: 3.376

4.  Contextual Correlation Preserving Multiview Featured Graph Clustering.

Authors:  Tiantian He; Yang Liu; Tobey H Ko; Keith C C Chan; Yew-Soon Ong
Journal:  IEEE Trans Cybern       Date:  2019-07-19       Impact factor: 11.448

5.  Predicting protein-protein interactions from protein sequences by a stacked sparse autoencoder deep neural network.

Authors:  Yan-Bin Wang; Zhu-Hong You; Xiao Li; Tong-Hai Jiang; Xing Chen; Xi Zhou; Lei Wang
Journal:  Mol Biosyst       Date:  2017-06-27

6.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

7.  PCVMZM: Using the Probabilistic Classification Vector Machines Model Combined with a Zernike Moments Descriptor to Predict Protein-Protein Interactions from Protein Sequences.

Authors:  Yanbin Wang; Zhuhong You; Xiao Li; Xing Chen; Tonghai Jiang; Jingting Zhang
Journal:  Int J Mol Sci       Date:  2017-05-11       Impact factor: 5.923

8.  Detection of Interactions between Proteins by Using Legendre Moments Descriptor to Extract Discriminatory Information Embedded in PSSM.

Authors:  Yan-Bin Wang; Zhu-Hong You; Li-Ping Li; Yu-An Huang; Hai-Cheng Yi
Journal:  Molecules       Date:  2017-08-18       Impact factor: 4.411

9.  A domain-based approach to predict protein-protein interactions.

Authors:  Mudita Singhal; Haluk Resat
Journal:  BMC Bioinformatics       Date:  2007-06-13       Impact factor: 3.169

10.  Graph embedding on biomedical networks: methods, applications and evaluations.

Authors:  Xiang Yue; Zhen Wang; Jingong Huang; Srinivasan Parthasarathy; Soheil Moosavinasab; Yungui Huang; Simon M Lin; Wen Zhang; Ping Zhang; Huan Sun
Journal:  Bioinformatics       Date:  2020-02-15       Impact factor: 6.937

View more
  1 in total

1.  Multi-view heterogeneous molecular network representation learning for protein-protein interaction prediction.

Authors:  Xiao-Rui Su; Lun Hu; Zhu-Hong You; Peng-Wei Hu; Bo-Wei Zhao
Journal:  BMC Bioinformatics       Date:  2022-06-16       Impact factor: 3.307

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.