Literature DB >> 30165572

Phylo-PFP: improved automated protein function prediction using phylogenetic distance of distantly related sequences.

Aashish Jain1, Daisuke Kihara1,2.   

Abstract

MOTIVATION: Function annotation of proteins is fundamental in contemporary biology across fields including genomics, molecular biology, biochemistry, systems biology and bioinformatics. Function prediction is indispensable in providing clues for interpreting omics-scale data as well as in assisting biologists to build hypotheses for designing experiments. As sequencing genomes is now routine due to the rapid advancement of sequencing technologies, computational protein function prediction methods have become increasingly important. A conventional method of annotating a protein sequence is to transfer functions from top hits of a homology search; however, this approach has substantial short comings including a low coverage in genome annotation.
RESULTS: Here we have developed Phylo-PFP, a new sequence-based protein function prediction method, which mines functional information from a broad range of similar sequences, including those with a low sequence similarity identified by a PSI-BLAST search. To evaluate functional similarity between identified sequences and the query protein more accurately, Phylo-PFP reranks retrieved sequences by considering their phylogenetic distance. Compared to the Phylo-PFP's predecessor, PFP, which was among the top ranked methods in the second round of the Critical Assessment of Functional Annotation (CAFA2), Phylo-PFP demonstrated substantial improvement in prediction accuracy. Phylo-PFP was further shown to outperform prediction programs to date that were ranked top in CAFA2.
AVAILABILITY AND IMPLEMENTATION: Phylo-PFP web server is available for at http://kiharalab.org/phylo_pfp.php. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2018. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Substances:

Year:  2019        PMID: 30165572      PMCID: PMC6394400          DOI: 10.1093/bioinformatics/bty704

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  6 in total

1.  NetGO: improving large-scale protein function prediction with massive network information.

Authors:  Ronghui You; Shuwei Yao; Yi Xiong; Xiaodi Huang; Fengzhu Sun; Hiroshi Mamitsuka; Shanfeng Zhu
Journal:  Nucleic Acids Res       Date:  2019-07-02       Impact factor: 16.971

2.  ContactPFP: Protein function prediction using predicted contact information.

Authors:  Yuki Kagaya; Sean T Flannery; Aashish Jain; Daisuke Kihara
Journal:  Front Bioinform       Date:  2022-06-02

3.  CrowdGO: Machine learning and semantic similarity guided consensus Gene Ontology annotation.

Authors:  Maarten J M F Reijnders; Robert M Waterhouse
Journal:  PLoS Comput Biol       Date:  2022-05-13       Impact factor: 4.779

4.  Protein functional annotation of simultaneously improved stability, accuracy and false discovery rate achieved by a sequence-based deep learning.

Authors:  Jiajun Hong; Yongchao Luo; Yang Zhang; Junbiao Ying; Weiwei Xue; Tian Xie; Lin Tao; Feng Zhu
Journal:  Brief Bioinform       Date:  2020-07-15       Impact factor: 11.622

5.  In silico annotation of unreviewed acetylcholinesterase (AChE) in some lepidopteran insect pest species reveals the causes of insecticide resistance.

Authors:  Qudsia Yousafi; Ayesha Sarfaraz; Muhammad Saad Khan; Shahzad Saleem; Umbreen Shahzad; Azhar Abbas Khan; Mazhar Sadiq; Allah Ditta Abid; Muhammad Sohail Shahzad; Najam Ul Hassan
Journal:  Saudi J Biol Sci       Date:  2021-01-21       Impact factor: 4.219

Review 6.  Automatic Gene Function Prediction in the 2020's.

Authors:  Stavros Makrodimitris; Roeland C H J van Ham; Marcel J T Reinders
Journal:  Genes (Basel)       Date:  2020-10-27       Impact factor: 4.096

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.