Literature DB >> 31350877

DeepGOPlus: improved protein function prediction from sequence.

Maxat Kulmanov1, Robert Hoehndorf1.   

Abstract

MOTIVATION: Protein function prediction is one of the major tasks of bioinformatics that can help in wide range of biological problems such as understanding disease mechanisms or finding drug targets. Many methods are available for predicting protein functions from sequence based features, protein-protein interaction networks, protein structure or literature. However, other than sequence, most of the features are difficult to obtain or not available for many proteins thereby limiting their scope. Furthermore, the performance of sequence-based function prediction methods is often lower than methods that incorporate multiple features and predicting protein functions may require a lot of time.
RESULTS: We developed a novel method for predicting protein functions from sequence alone which combines deep convolutional neural network (CNN) model with sequence similarity based predictions. Our CNN model scans the sequence for motifs which are predictive for protein functions and combines this with functions of similar proteins (if available). We evaluate the performance of DeepGOPlus using the CAFA3 evaluation measures and achieve an Fmax of 0.390, 0.557 and 0.614 for BPO, MFO and CCO evaluations, respectively. These results would have made DeepGOPlus one of the three best predictors in CCO and the second best performing method in the BPO and MFO evaluations. We also compare DeepGOPlus with state-of-the-art methods such as DeepText2GO and GOLabeler on another dataset. DeepGOPlus can annotate around 40 protein sequences per second on common hardware, thereby making fast and accurate function predictions available for a wide range of proteins.
AVAILABILITY AND IMPLEMENTATION: http://deepgoplus.bio2vec.net/ . SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 31350877     DOI: 10.1093/bioinformatics/btz595

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  49 in total

1.  NetGO 2.0: improving large-scale protein function prediction with massive sequence, text, domain, family and network information.

Authors:  Shuwei Yao; Ronghui You; Shaojun Wang; Yi Xiong; Xiaodi Huang; Shanfeng Zhu
Journal:  Nucleic Acids Res       Date:  2021-07-02       Impact factor: 16.971

2.  DeepGOWeb: fast and accurate protein function prediction on the (Semantic) Web.

Authors:  Maxat Kulmanov; Fernando Zhapa-Camacho; Robert Hoehndorf
Journal:  Nucleic Acids Res       Date:  2021-07-02       Impact factor: 16.971

Review 3.  Machine learning: its challenges and opportunities in plant system biology.

Authors:  Mohsen Hesami; Milad Alizadeh; Andrew Maxwell Phineas Jones; Davoud Torkamaneh
Journal:  Appl Microbiol Biotechnol       Date:  2022-05-16       Impact factor: 4.813

4.  Blinded Testing of Function Annotation for uPE1 Proteins by I-TASSER/COFACTOR Pipeline Using the 2018-2019 Additions to neXtProt and the CAFA3 Challenge.

Authors:  Chengxin Zhang; Lydie Lane; Gilbert S Omenn; Yang Zhang
Journal:  J Proteome Res       Date:  2019-10-18       Impact factor: 4.466

5.  Genome wide identification and expression analysis of pepper C2H2 zinc finger transcription factors in response to anthracnose pathogen Colletotrichum truncatum.

Authors:  Richa Sharma; Bijayalaxmi Mahanty; Rukmini Mishra; Raj Kumar Joshi
Journal:  3 Biotech       Date:  2021-02-08       Impact factor: 2.406

6.  TALE: Transformer-based protein function Annotation with joint sequence-Label Embedding.

Authors:  Yue Cao; Yang Shen
Journal:  Bioinformatics       Date:  2021-03-23       Impact factor: 6.937

Review 7.  Genome annotation of disease-causing microorganisms.

Authors:  Yibo Dong; Chang Li; Kami Kim; Liwang Cui; Xiaoming Liu
Journal:  Brief Bioinform       Date:  2021-03-22       Impact factor: 11.622

8.  Multiple Protein Subcellular Locations Prediction Based on Deep Convolutional Neural Networks with Self-Attention Mechanism.

Authors:  Hanhan Cong; Hong Liu; Yi Cao; Yuehui Chen; Cheng Liang
Journal:  Interdiscip Sci       Date:  2022-01-23       Impact factor: 2.233

9.  PANDA2: protein function prediction using graph neural networks.

Authors:  Chenguang Zhao; Tong Liu; Zheng Wang
Journal:  NAR Genom Bioinform       Date:  2022-02-02

10.  Engineering cellular metabolite transport for biosynthesis of computationally predicted tropane alkaloid derivatives in yeast.

Authors:  Prashanth Srinivasan; Christina D Smolke
Journal:  Proc Natl Acad Sci U S A       Date:  2021-06-22       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.