Literature DB >> 33560405

A novel method for data fusion over Entity-Relation graphs and its application to protein-protein interaction prediction.

Daniele Raimondi1, Jaak Simm1, Adam Arany1, Yves Moreau1.   

Abstract

MOTIVATION: Modern Bioinformatics is facing increasingly complex problems to solve, and we are indeed rapidly approaching an era in which the ability to seamlessly integrate heterogeneous sources of information will be crucial for the scientific progress. Here we present a novel non-linear data fusion framework that generalizes the conventional Matrix Factorization paradigm allowing inference over arbitrary Entity-Relation graphs, and we applied it to the prediction of Protein-Protein Interactions (PPIs). Improving our knowledge of Protein Protein Interaction (PPI) networks at the proteome scale is indeed crucial to understand protein function, physiological and disease states and cell life in general.
RESULTS: We devised three data-fusion based models for the proteome-level prediction of PPIs, and we show that our method outperforms state of the art approaches on common benchmarks. Moreover, we investigate its predictions on newly published PPIs, showing that this new data has a clear shift in its underlying distributions and we thus train and test our models on this extended dataset. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) (2021). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Year:  2021        PMID: 33560405     DOI: 10.1093/bioinformatics/btab092

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  4 in total

1.  From genotype to phenotype in Arabidopsis thaliana: in-silico genome interpretation predicts 288 phenotypes from sequencing data.

Authors:  Daniele Raimondi; Massimiliano Corso; Piero Fariselli; Yves Moreau
Journal:  Nucleic Acids Res       Date:  2022-02-22       Impact factor: 16.971

2.  A virus-target host proteins recognition method based on integrated complexes data and seed extension.

Authors:  Shengrong Xia; Yingchun Xia; Chulei Xiang; Hui Wang; Chao Wang; Jin He; Guolong Shi; Lichuan Gu
Journal:  BMC Bioinformatics       Date:  2022-06-28       Impact factor: 3.307

3.  HPMPdb: A machine learning-ready database of protein molecular phenotypes associated to human missense variants.

Authors:  Daniele Raimondi; Francesco Codicè; Gabriele Orlando; Joost Schymkowitz; Frederic Rousseau; Yves Moreau
Journal:  Curr Res Struct Biol       Date:  2022-05-13

Review 4.  Deep learning frameworks for protein-protein interaction prediction.

Authors:  Xiaotian Hu; Cong Feng; Tianyi Ling; Ming Chen
Journal:  Comput Struct Biotechnol J       Date:  2022-06-15       Impact factor: 6.155

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.