Literature DB >> 31106361

NetGO: improving large-scale protein function prediction with massive network information.

Ronghui You1,2,3, Shuwei Yao1,2,3, Yi Xiong4, Xiaodi Huang5, Fengzhu Sun2,3,6, Hiroshi Mamitsuka7,8, Shanfeng Zhu1,2,3.   

Abstract

Automated function prediction (AFP) of proteins is of great significance in biology. AFP can be regarded as a problem of the large-scale multi-label classification where a protein can be associated with multiple gene ontology terms as its labels. Based on our GOLabeler-a state-of-the-art method for the third critical assessment of functional annotation (CAFA3), in this paper we propose NetGO, a web server that is able to further improve the performance of the large-scale AFP by incorporating massive protein-protein network information. Specifically, the advantages of NetGO are threefold in using network information: (i) NetGO relies on a powerful learning to rank framework from machine learning to effectively integrate both sequence and network information of proteins; (ii) NetGO uses the massive network information of all species (>2000) in STRING (other than only some specific species) and (iii) NetGO still can use network information to annotate a protein by homology transfer, even if it is not contained in STRING. Separating training and testing data with the same time-delayed settings of CAFA, we comprehensively examined the performance of NetGO. Experimental results have clearly demonstrated that NetGO significantly outperforms GOLabeler and other competing methods. The NetGO web server is freely available at http://issubmission.sjtu.edu.cn/netgo/.
© The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Year:  2019        PMID: 31106361      PMCID: PMC6602452          DOI: 10.1093/nar/gkz388

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  33 in total

1.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

2.  Guilt-by-association goes global.

Authors:  S Oliver
Journal:  Nature       Date:  2000-02-10       Impact factor: 49.962

3.  A network of protein-protein interactions in yeast.

Authors:  B Schwikowski; P Uetz; S Fields
Journal:  Nat Biotechnol       Date:  2000-12       Impact factor: 54.908

4.  Characterizing the state of the art in the computational assignment of gene function: lessons from the first critical assessment of functional annotation (CAFA).

Authors:  Jesse Gillis; Paul Pavlidis
Journal:  BMC Bioinformatics       Date:  2013       Impact factor: 3.169

5.  DcGO: database of domain-centric ontologies on functions, phenotypes, diseases and more.

Authors:  Hai Fang; Julian Gough
Journal:  Nucleic Acids Res       Date:  2012-11-17       Impact factor: 16.971

6.  CombFunc: predicting protein function using heterogeneous data sources.

Authors:  Mark N Wass; Geraint Barton; Michael J E Sternberg
Journal:  Nucleic Acids Res       Date:  2012-05-27       Impact factor: 16.971

7.  The GOA database: gene Ontology annotation updates for 2015.

Authors:  Rachael P Huntley; Tony Sawford; Prudence Mutowo-Meullenet; Aleksandra Shypitsyna; Carlos Bonilla; Maria J Martin; Claire O'Donovan
Journal:  Nucleic Acids Res       Date:  2014-11-06       Impact factor: 19.160

8.  Protein function prediction by massive integration of evolutionary analyses and multiple data sources.

Authors:  Domenico Cozzetto; Daniel W A Buchan; Kevin Bryson; David T Jones
Journal:  BMC Bioinformatics       Date:  2013-02-28       Impact factor: 3.169

9.  Homology-based inference sets the bar high for protein function prediction.

Authors:  Tobias Hamp; Rebecca Kassner; Stefan Seemayer; Esmeralda Vicedo; Christian Schaefer; Dominik Achten; Florian Auer; Ariane Boehm; Tatjana Braun; Maximilian Hecht; Mark Heron; Peter Hönigschmid; Thomas A Hopf; Stefanie Kaufmann; Michael Kiening; Denis Krompass; Cedric Landerer; Yannick Mahlich; Manfred Roos; Burkhard Rost
Journal:  BMC Bioinformatics       Date:  2013-02-28       Impact factor: 3.169

10.  A large-scale evaluation of computational protein function prediction.

Authors:  Predrag Radivojac; Wyatt T Clark; Tal Ronnen Oron; Alexandra M Schnoes; Tobias Wittkop; Artem Sokolov; Kiley Graim; Christopher Funk; Karin Verspoor; Asa Ben-Hur; Gaurav Pandey; Jeffrey M Yunes; Ameet S Talwalkar; Susanna Repo; Michael L Souza; Damiano Piovesan; Rita Casadio; Zheng Wang; Jianlin Cheng; Hai Fang; Julian Gough; Patrik Koskinen; Petri Törönen; Jussi Nokso-Koivisto; Liisa Holm; Domenico Cozzetto; Daniel W A Buchan; Kevin Bryson; David T Jones; Bhakti Limaye; Harshal Inamdar; Avik Datta; Sunitha K Manjari; Rajendra Joshi; Meghana Chitale; Daisuke Kihara; Andreas M Lisewski; Serkan Erdin; Eric Venner; Olivier Lichtarge; Robert Rentzsch; Haixuan Yang; Alfonso E Romero; Prajwal Bhat; Alberto Paccanaro; Tobias Hamp; Rebecca Kaßner; Stefan Seemayer; Esmeralda Vicedo; Christian Schaefer; Dominik Achten; Florian Auer; Ariane Boehm; Tatjana Braun; Maximilian Hecht; Mark Heron; Peter Hönigschmid; Thomas A Hopf; Stefanie Kaufmann; Michael Kiening; Denis Krompass; Cedric Landerer; Yannick Mahlich; Manfred Roos; Jari Björne; Tapio Salakoski; Andrew Wong; Hagit Shatkay; Fanny Gatzmann; Ingolf Sommer; Mark N Wass; Michael J E Sternberg; Nives Škunca; Fran Supek; Matko Bošnjak; Panče Panov; Sašo Džeroski; Tomislav Šmuc; Yiannis A I Kourmpetis; Aalt D J van Dijk; Cajo J F ter Braak; Yuanpeng Zhou; Qingtian Gong; Xinran Dong; Weidong Tian; Marco Falda; Paolo Fontana; Enrico Lavezzo; Barbara Di Camillo; Stefano Toppo; Liang Lan; Nemanja Djuric; Yuhong Guo; Slobodan Vucetic; Amos Bairoch; Michal Linial; Patricia C Babbitt; Steven E Brenner; Christine Orengo; Burkhard Rost; Sean D Mooney; Iddo Friedberg
Journal:  Nat Methods       Date:  2013-01-27       Impact factor: 28.547

View more
  18 in total

1.  NetGO 2.0: improving large-scale protein function prediction with massive sequence, text, domain, family and network information.

Authors:  Shuwei Yao; Ronghui You; Shaojun Wang; Yi Xiong; Xiaodi Huang; Shanfeng Zhu
Journal:  Nucleic Acids Res       Date:  2021-07-02       Impact factor: 16.971

2.  DeepGOWeb: fast and accurate protein function prediction on the (Semantic) Web.

Authors:  Maxat Kulmanov; Fernando Zhapa-Camacho; Robert Hoehndorf
Journal:  Nucleic Acids Res       Date:  2021-07-02       Impact factor: 16.971

3.  IsoResolve: predicting splice isoform functions by integrating gene and isoform-level features with domain adaptation.

Authors:  Hong-Dong Li; Changhuo Yang; Zhimin Zhang; Mengyun Yang; Fang-Xiang Wu; Gilbert S Omenn; Jianxin Wang
Journal:  Bioinformatics       Date:  2021-05-01       Impact factor: 6.937

4.  Accurate protein function prediction via graph attention networks with predicted structure information.

Authors:  Boqiao Lai; Jinbo Xu
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

5.  ContactPFP: Protein function prediction using predicted contact information.

Authors:  Yuki Kagaya; Sean T Flannery; Aashish Jain; Daisuke Kihara
Journal:  Front Bioinform       Date:  2022-06-02

6.  De Novo Protein Design for Novel Folds Using Guided Conditional Wasserstein Generative Adversarial Networks.

Authors:  Mostafa Karimi; Shaowen Zhu; Yue Cao; Yang Shen
Journal:  J Chem Inf Model       Date:  2020-09-30       Impact factor: 4.956

7.  TALE: Transformer-based protein function Annotation with joint sequence-Label Embedding.

Authors:  Yue Cao; Yang Shen
Journal:  Bioinformatics       Date:  2021-03-23       Impact factor: 6.937

8.  PANNZER-A practical tool for protein function prediction.

Authors:  Petri Törönen; Liisa Holm
Journal:  Protein Sci       Date:  2021-10-14       Impact factor: 6.725

9.  STS-NLSP: A Network-Based Label Space Partition Method for Predicting the Specificity of Membrane Transporter Substrates Using a Hybrid Feature of Structural and Semantic Similarity.

Authors:  Xiangeng Wang; Xiaolei Zhu; Mingzhi Ye; Yanjing Wang; Cheng-Dong Li; Yi Xiong; Dong-Qing Wei
Journal:  Front Bioeng Biotechnol       Date:  2019-11-06

10.  SDN2GO: An Integrated Deep Learning Model for Protein Function Prediction.

Authors:  Yideng Cai; Jiacheng Wang; Lei Deng
Journal:  Front Bioeng Biotechnol       Date:  2020-04-29
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.