Literature DB >> 29534977

MetaGO: Predicting Gene Ontology of Non-homologous Proteins Through Low-Resolution Protein Structure Prediction and Protein-Protein Network Mapping.

Chengxin Zhang1, Wei Zheng1, Peter L Freddolino2, Yang Zhang3.   

Abstract

Homology-based transferal remains the major approach to computational protein function annotations, but it becomes increasingly unreliable when the sequence identity between query and template decreases below 30%. We propose a novel pipeline, MetaGO, to deduce Gene Ontology attributes of proteins by combining sequence homology-based annotation with low-resolution structure prediction and comparison, and partner's homology-based protein-protein network mapping. The pipeline was tested on a large-scale set of 1000 non-redundant proteins from the CAFA3 experiment. Under the stringent benchmark conditions where templates with >30% sequence identity to the query are excluded, MetaGO achieves average F-measures of 0.487, 0.408, and 0.598, for Molecular Function, Biological Process, and Cellular Component, respectively, which are significantly higher than those achieved by other state-of-the-art function annotations methods. Detailed data analysis shows that the major advantage of the MetaGO lies in the new functional homolog detections from partner's homology-based network mapping and structure-based local and global structure alignments, the confidence scores of which can be optimally combined through logistic regression. These data demonstrate the power of using a hybrid model incorporating protein structure and interaction networks to deduce new functional insights beyond traditional sequence homology-based referrals, especially for proteins that lack homologous function templates. The MetaGO pipeline is available at http://zhanglab.ccmb.med.umich.edu/MetaGO/.
Copyright © 2018 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Gene Ontology; protein function prediction; protein structure prediction; protein–protein interaction; sequence profiles

Mesh:

Substances:

Year:  2018        PMID: 29534977      PMCID: PMC6014880          DOI: 10.1016/j.jmb.2018.03.004

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  38 in total

1.  Enzyme function less conserved than anticipated.

Authors:  Burkhard Rost
Journal:  J Mol Biol       Date:  2002-04-26       Impact factor: 5.469

2.  Scoring function for automated assessment of protein structure template quality.

Authors:  Yang Zhang; Jeffrey Skolnick
Journal:  Proteins       Date:  2004-12-01

3.  Phenotypic landscape of a bacterial cell.

Authors:  Robert J Nichols; Saunak Sen; Yoe Jin Choo; Pedro Beltrao; Matylda Zietek; Rachna Chaba; Sueyoung Lee; Krystyna M Kazmierczak; Karis J Lee; Angela Wong; Michael Shales; Susan Lovett; Malcolm E Winkler; Nevan J Krogan; Athanasios Typas; Carol A Gross
Journal:  Cell       Date:  2010-12-23       Impact factor: 41.582

4.  Inference of protein function from protein structure.

Authors:  Debnath Pal; David Eisenberg
Journal:  Structure       Date:  2005-01       Impact factor: 5.006

Review 5.  Computational methods for identification of functional residues in protein structures.

Authors:  Fuxiao Xin; Predrag Radivojac
Journal:  Curr Protein Pept Sci       Date:  2011-09       Impact factor: 3.272

6.  STRING v10: protein-protein interaction networks, integrated over the tree of life.

Authors:  Damian Szklarczyk; Andrea Franceschini; Stefan Wyder; Kristoffer Forslund; Davide Heller; Jaime Huerta-Cepas; Milan Simonovic; Alexander Roth; Alberto Santos; Kalliopi P Tsafou; Michael Kuhn; Peer Bork; Lars J Jensen; Christian von Mering
Journal:  Nucleic Acids Res       Date:  2014-10-28       Impact factor: 16.971

7.  PDBsum additions.

Authors:  Tjaart A P de Beer; Karel Berka; Janet M Thornton; Roman A Laskowski
Journal:  Nucleic Acids Res       Date:  2013-10-22       Impact factor: 16.971

8.  BioLiP: a semi-manually curated database for biologically relevant ligand-protein interactions.

Authors:  Jianyi Yang; Ambrish Roy; Yang Zhang
Journal:  Nucleic Acids Res       Date:  2012-10-18       Impact factor: 16.971

9.  LOMETS: a local meta-threading-server for protein structure prediction.

Authors:  Sitao Wu; Yang Zhang
Journal:  Nucleic Acids Res       Date:  2007-05-03       Impact factor: 16.971

10.  The Pfam protein families database: towards a more sustainable future.

Authors:  Robert D Finn; Penelope Coggill; Ruth Y Eberhardt; Sean R Eddy; Jaina Mistry; Alex L Mitchell; Simon C Potter; Marco Punta; Matloob Qureshi; Amaia Sangrador-Vegas; Gustavo A Salazar; John Tate; Alex Bateman
Journal:  Nucleic Acids Res       Date:  2015-12-15       Impact factor: 16.971

View more
  20 in total

1.  Structure and Protein Interaction-Based Gene Ontology Annotations Reveal Likely Functions of Uncharacterized Proteins on Human Chromosome 17.

Authors:  Chengxin Zhang; Xiaoqiong Wei; Gilbert S Omenn; Yang Zhang
Journal:  J Proteome Res       Date:  2018-10-16       Impact factor: 4.466

2.  NetGO: improving large-scale protein function prediction with massive network information.

Authors:  Ronghui You; Shuwei Yao; Yi Xiong; Xiaodi Huang; Fengzhu Sun; Hiroshi Mamitsuka; Shanfeng Zhu
Journal:  Nucleic Acids Res       Date:  2019-07-02       Impact factor: 16.971

3.  Functions of Essential Genes and a Scale-Free Protein Interaction Network Revealed by Structure-Based Function and Interaction Prediction for a Minimal Genome.

Authors:  Chengxin Zhang; Wei Zheng; Micah Cheng; Gilbert S Omenn; Peter L Freddolino; Yang Zhang
Journal:  J Proteome Res       Date:  2021-01-04       Impact factor: 4.466

4.  Escherichia coli YigI is a Conserved Gammaproteobacterial Acyl-CoA Thioesterase Permitting Metabolism of Unusual Fatty Acid Substrates.

Authors:  Michael Schmidt; Theresa Proctor; Rucheng Diao; Peter L Freddolino
Journal:  J Bacteriol       Date:  2022-07-25       Impact factor: 3.476

Review 5.  I-TASSER-MTD: a deep-learning-based platform for multi-domain protein structure and function prediction.

Authors:  Xiaogen Zhou; Wei Zheng; Yang Li; Robin Pearce; Chengxin Zhang; Eric W Bell; Guijun Zhang; Yang Zhang
Journal:  Nat Protoc       Date:  2022-08-05       Impact factor: 17.021

6.  US-align: universal structure alignments of proteins, nucleic acids, and macromolecular complexes.

Authors:  Chengxin Zhang; Morgan Shine; Anna Marie Pyle; Yang Zhang
Journal:  Nat Methods       Date:  2022-08-29       Impact factor: 47.990

7.  Blinded Testing of Function Annotation for uPE1 Proteins by I-TASSER/COFACTOR Pipeline Using the 2018-2019 Additions to neXtProt and the CAFA3 Challenge.

Authors:  Chengxin Zhang; Lydie Lane; Gilbert S Omenn; Yang Zhang
Journal:  J Proteome Res       Date:  2019-10-18       Impact factor: 4.466

8.  Using deep maxout neural networks to improve the accuracy of function prediction from protein interaction networks.

Authors:  Cen Wan; Domenico Cozzetto; Rui Fa; David T Jones
Journal:  PLoS One       Date:  2019-07-23       Impact factor: 3.240

9.  The sugarcane mitochondrial genome: assembly, phylogenetics and transcriptomics.

Authors:  Dyfed Lloyd Evans; Thandekile Thandiwe Hlongwane; Shailesh V Joshi; Diego M Riaño Pachón
Journal:  PeerJ       Date:  2019-09-24       Impact factor: 2.984

10.  SDN2GO: An Integrated Deep Learning Model for Protein Function Prediction.

Authors:  Yideng Cai; Jiacheng Wang; Lei Deng
Journal:  Front Bioeng Biotechnol       Date:  2020-04-29
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.