Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Graph2GO: a multi-modal attributed network embedding method for inferring protein functions.

Literature DB >> 32770210

Graph2GO: a multi-modal attributed network embedding method for inferring protein functions.

Kunjie Fan¹, Yuanfang Guan², Yan Zhang^1,3.

Abstract

BACKGROUND: Identifying protein functions is important for many biological applications. Since experimental functional characterization of proteins is time-consuming and costly, accurate and efficient computational methods for predicting protein functions are in great demand for generating the testable hypotheses guiding large-scale experiments."
RESULTS: Here, we propose Graph2GO, a multi-modal graph-based representation learning model that can integrate heterogeneous information, including multiple types of interaction networks (sequence similarity network and protein-protein interaction network) and protein features (amino acid sequence, subcellular location, and protein domains) to predict protein functions on gene ontology. Comparing Graph2GO to BLAST, as a baseline model, and to two popular protein function prediction methods (Mashup and deepNF), we demonstrated that our model can achieve state-of-the-art performance. We show the robustness of our model by testing on multiple species. We also provide a web server supporting function query and downstream analysis on-the-fly.
CONCLUSIONS: Graph2GO is the first model that has utilized attributed network representation learning methods to model both interaction networks and protein features for predicting protein functions, and achieved promising performance. Our model can be easily extended to include more protein features to further improve the performance. Besides, Graph2GO is also applicable to other application scenarios involving biological networks, and the learned latent representations can be used as feature inputs for machine learning tasks in various downstream analyses.

Entities: Chemical Disease Gene Species

Keywords: attributed network embedding; graph neural network; multi-modal model; protein function prediction; representation learning

Year: 2020 PMID： 32770210 PMCID： PMC7414417 DOI： 10.1093/gigascience/giaa081

Source DB: PubMed Journal: Gigascience ISSN： 2047-217X Impact factor: 6.524

31 in total

1. Prediction of human protein function from post-translational modifications and localization features.

Authors: L J Jensen; R Gupta; N Blom; D Devos; J Tamames; C Kesmir; H Nielsen; H H Staerfeldt; K Rapacki; C Workman; C A F Andersen; S Knudsen; A Krogh; A Valencia; S Brunak
Journal: J Mol Biol Date: 2002-06-21 Impact factor: 5.469

2. UniProtKB/Swiss-Prot, the Manually Annotated Section of the UniProt KnowledgeBase: How to Use the Entry View.

Authors: Emmanuel Boutet; Damien Lieberherr; Michael Tognolli; Michel Schneider; Parit Bansal; Alan J Bridge; Sylvain Poux; Lydie Bougueleret; Ioannis Xenarios
Journal: Methods Mol Biol Date: 2016

Review 3. Deep learning.

Authors: Yann LeCun; Yoshua Bengio; Geoffrey Hinton
Journal: Nature Date: 2015-05-28 Impact factor: 49.962

4. COFACTOR: improved protein function prediction by combining structure, sequence and protein-protein interaction information.

Authors: Chengxin Zhang; Peter L Freddolino; Yang Zhang
Journal: Nucleic Acids Res Date: 2017-07-03 Impact factor: 16.971

5. Predicting RNA-protein interactions using only sequence information.

Authors: Usha K Muppirala; Vasant G Honavar; Drena Dobbs
Journal: BMC Bioinformatics Date: 2011-12-22 Impact factor: 3.169

Review 6. Network-based prediction of protein function.

Authors: Roded Sharan; Igor Ulitsky; Ron Shamir
Journal: Mol Syst Biol Date: 2007-03-13 Impact factor: 11.429

7. Gene annotation bias impedes biomedical research.

Authors: Winston A Haynes; Aurelie Tomczak; Purvesh Khatri
Journal: Sci Rep Date: 2018-01-22 Impact factor: 4.379

8. A large-scale evaluation of computational protein function prediction.

Authors: Predrag Radivojac; Wyatt T Clark; Tal Ronnen Oron; Alexandra M Schnoes; Tobias Wittkop; Artem Sokolov; Kiley Graim; Christopher Funk; Karin Verspoor; Asa Ben-Hur; Gaurav Pandey; Jeffrey M Yunes; Ameet S Talwalkar; Susanna Repo; Michael L Souza; Damiano Piovesan; Rita Casadio; Zheng Wang; Jianlin Cheng; Hai Fang; Julian Gough; Patrik Koskinen; Petri Törönen; Jussi Nokso-Koivisto; Liisa Holm; Domenico Cozzetto; Daniel W A Buchan; Kevin Bryson; David T Jones; Bhakti Limaye; Harshal Inamdar; Avik Datta; Sunitha K Manjari; Rajendra Joshi; Meghana Chitale; Daisuke Kihara; Andreas M Lisewski; Serkan Erdin; Eric Venner; Olivier Lichtarge; Robert Rentzsch; Haixuan Yang; Alfonso E Romero; Prajwal Bhat; Alberto Paccanaro; Tobias Hamp; Rebecca Kaßner; Stefan Seemayer; Esmeralda Vicedo; Christian Schaefer; Dominik Achten; Florian Auer; Ariane Boehm; Tatjana Braun; Maximilian Hecht; Mark Heron; Peter Hönigschmid; Thomas A Hopf; Stefanie Kaufmann; Michael Kiening; Denis Krompass; Cedric Landerer; Yannick Mahlich; Manfred Roos; Jari Björne; Tapio Salakoski; Andrew Wong; Hagit Shatkay; Fanny Gatzmann; Ingolf Sommer; Mark N Wass; Michael J E Sternberg; Nives Škunca; Fran Supek; Matko Bošnjak; Panče Panov; Sašo Džeroski; Tomislav Šmuc; Yiannis A I Kourmpetis; Aalt D J van Dijk; Cajo J F ter Braak; Yuanpeng Zhou; Qingtian Gong; Xinran Dong; Weidong Tian; Marco Falda; Paolo Fontana; Enrico Lavezzo; Barbara Di Camillo; Stefano Toppo; Liang Lan; Nemanja Djuric; Yuhong Guo; Slobodan Vucetic; Amos Bairoch; Michal Linial; Patricia C Babbitt; Steven E Brenner; Christine Orengo; Burkhard Rost; Sean D Mooney; Iddo Friedberg
Journal: Nat Methods Date: 2013-01-27 Impact factor: 28.547

5. Integration of Human Protein Sequence and Protein-Protein Interaction Data by Graph Autoencoder to Identify Novel Protein-Abnormal Phenotype Associations.

Authors: Yuan Liu; Ruirui He; Yingjie Qu; Yuan Zhu; Dianke Li; Xinping Ling; Simin Xia; Zhenqiu Li; Dong Li
Journal: Cells Date: 2022-08-10 Impact factor: 7.666

5 in total

Graph2GO: a multi-modal attributed network embedding method for inferring protein functions.

1. Prediction of human protein function from post-translational modifications and localization features.

2. UniProtKB/Swiss-Prot, the Manually Annotated Section of the UniProt KnowledgeBase: How to Use the Entry View.

Review 3. Deep learning.

4. COFACTOR: improved protein function prediction by combining structure, sequence and protein-protein interaction information.

5. Predicting RNA-protein interactions using only sequence information.

Review 6. Network-based prediction of protein function.

7. Gene annotation bias impedes biomedical research.

8. A large-scale evaluation of computational protein function prediction.

9. Using indirect protein interactions for the prediction of Gene Ontology functions.

10. DeepGO: predicting protein functions from sequence and interactions using a deep ontology-aware classifier.

1. TALE: Transformer-based protein function Annotation with joint sequence-Label Embedding.

2. PANNZER-A practical tool for protein function prediction.

Review 3. Artificial intelligence and machine learning methods in predicting anti-cancer drug combination effects.

4. Machine learning predicts nucleosome binding modes of transcription factors.

5. Integration of Human Protein Sequence and Protein-Protein Interaction Data by Graph Autoencoder to Identify Novel Protein-Abnormal Phenotype Associations.