Literature DB >> 35875419

ContactPFP: Protein function prediction using predicted contact information.

Yuki Kagaya1, Sean T Flannery2, Aashish Jain2, Daisuke Kihara1,2.   

Abstract

Computational function prediction is one of the most important problems in bioinformatics as elucidating the function of genes is a central task in molecular biology and genomics. Most of the existing function prediction methods use protein sequences as the primary source of input information because the sequence is the most available information for query proteins. There are attempts to consider other attributes of query proteins. Among these attributes, the three-dimensional (3D) structure of proteins is known to be very useful in identifying the evolutionary relationship of proteins, from which functional similarity can be inferred. Here, we report a novel protein function prediction method, ContactPFP, which uses predicted residue-residue contact maps as input structural features of query proteins. Although 3D structure information is known to be useful, it has not been routinely used in function prediction because the 3D structure is not experimentally determined for many proteins. In ContactPFP, we overcome this limitation by using residue-residue contact prediction, which has become increasingly accurate due to rapid development in the protein structure prediction field. ContactPFP takes a query protein sequence as input and uses predicted residue-residue contact as a proxy for the 3D protein structure. To characterize how predicted contacts contribute to function prediction accuracy, we compared the performance of ContactPFP with several well-established sequence-based function prediction methods. The comparative study revealed the advantages and weaknesses of ContactPFP compared to contemporary sequence-based methods. There were many cases where it showed higher prediction accuracy. We examined factors that affected the accuracy of ContactPFP using several illustrative cases that highlight the strength of our method.

Entities:  

Keywords:  PFP; contact prediction; function annotation; function prediction; functional genomics; gene function; protein structure; residue contacts

Year:  2022        PMID: 35875419      PMCID: PMC9302406          DOI: 10.3389/fbinf.2022.896295

Source DB:  PubMed          Journal:  Front Bioinform        ISSN: 2673-7647


  68 in total

1.  UniProtKB/Swiss-Prot, the Manually Annotated Section of the UniProt KnowledgeBase: How to Use the Entry View.

Authors:  Emmanuel Boutet; Damien Lieberherr; Michael Tognolli; Michel Schneider; Parit Bansal; Alan J Bridge; Sylvain Poux; Lydie Bougueleret; Ioannis Xenarios
Journal:  Methods Mol Biol       Date:  2016

2.  Detecting local ligand-binding site similarity in nonhomologous proteins by surface patch comparison.

Authors:  Lee Sael; Daisuke Kihara
Journal:  Proteins       Date:  2012-01-24

Review 3.  Function prediction of uncharacterized proteins.

Authors:  Troy Hawkins; Daisuke Kihara
Journal:  J Bioinform Comput Biol       Date:  2007-02       Impact factor: 1.122

4.  DeepGOPlus: improved protein function prediction from sequence.

Authors:  Maxat Kulmanov; Robert Hoehndorf
Journal:  Bioinformatics       Date:  2020-01-15       Impact factor: 6.937

5.  NetGO: improving large-scale protein function prediction with massive network information.

Authors:  Ronghui You; Shuwei Yao; Yi Xiong; Xiaodi Huang; Fengzhu Sun; Hiroshi Mamitsuka; Shanfeng Zhu
Journal:  Nucleic Acids Res       Date:  2019-07-02       Impact factor: 16.971

6.  Database resources of the National Center for Biotechnology Information.

Authors:  Eric W Sayers; Jeffrey Beck; Evan E Bolton; Devon Bourexis; James R Brister; Kathi Canese; Donald C Comeau; Kathryn Funk; Sunghwan Kim; William Klimke; Aron Marchler-Bauer; Melissa Landrum; Stacy Lathrop; Zhiyong Lu; Thomas L Madden; Nuala O'Leary; Lon Phan; Sanjida H Rangwala; Valerie A Schneider; Yuri Skripchenko; Jiyao Wang; Jian Ye; Barton W Trawick; Kim D Pruitt; Stephen T Sherry
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

7.  Effective inter-residue contact definitions for accurate protein fold recognition.

Authors:  Chao Yuan; Hao Chen; Daisuke Kihara
Journal:  BMC Bioinformatics       Date:  2012-11-09       Impact factor: 3.169

8.  UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches.

Authors:  Baris E Suzek; Yuqi Wang; Hongzhan Huang; Peter B McGarvey; Cathy H Wu
Journal:  Bioinformatics       Date:  2014-11-13       Impact factor: 6.937

9.  UniProt: the universal protein knowledgebase in 2021.

Authors: 
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

10.  SMART: recent updates, new developments and status in 2020.

Authors:  Ivica Letunic; Supriya Khedkar; Peer Bork
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.