DeepFunc: A Deep Learning Framework for Accurate Prediction of Protein Functions from Protein Sequences and Interactions.

Fuhao Zhang, Hong Song, Min Zeng, Yaohang Li, Lukasz Kurgan, Min Li.

Abstract

Annotation of protein functions plays an important role in understanding life at the molecular level. High-throughput sequencing produces massive numbers of raw protein sequences, and only about 1% of them have been manually annotated with functions. Experimental annotation of function is expensive and time-consuming, and does not keep up with the rapid growth in the number of sequences. This motivates the development of computational approaches that predict protein functions. A novel deep learning framework, DeepFunc, is proposed which accurately predicts protein functions from protein sequence- and network-derived information. More precisely, DeepFunc uses a long and sparse binary vector to encode information about the domains, families, and motifs associated with the input protein sequence, as collected with the InterPro tool. This vector is processed with two neural layers to obtain a low-dimensional vector, which is combined with topological information extracted from protein-protein interactions (PPIs) and functional linkages. The combined information is processed by a deep neural network that predicts protein functions. DeepFunc is empirically and comparatively tested on a benchmark testing dataset and the Critical Assessment of protein Function Annotation algorithms (CAFA) 3 dataset. The experimental results demonstrate that DeepFunc outperforms current methods on the testing dataset and that it secures the highest Fmax = 0.54 and AUC = 0.94 on the CAFA3 dataset.
© 2019 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
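
The data flow described in the abstract (sparse InterPro vector, two encoding layers, concatenation with network-derived features, then a deep predictor) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the layer sizes, the ReLU and sigmoid activations, the two-layer prediction head, and the names `N_INTERPRO`, `N_TOPO`, `N_TERMS`, and `predict` are all assumptions made for illustration, and the weights are random rather than trained.

```python
import numpy as np

rng = np.random.default_rng(0)

def dense(n_in, n_out):
    """Return randomly initialised (weights, bias) for one fully connected layer."""
    return rng.normal(0.0, 0.05, size=(n_in, n_out)), np.zeros(n_out)

def relu(x):
    return np.maximum(x, 0.0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Hypothetical sizes; the abstract does not state any of them.
N_INTERPRO = 2000  # length of the sparse binary InterPro vector
N_TOPO = 64        # length of the PPI / functional-linkage feature vector
N_TERMS = 50       # number of function labels to score

# Two layers that compress the sparse InterPro vector (activations assumed).
enc1 = dense(N_INTERPRO, 256)
enc2 = dense(256, 64)
# A small stand-in for the "deep neural network" applied to the combined vector.
head1 = dense(64 + N_TOPO, 128)
head2 = dense(128, N_TERMS)

def predict(interpro_bits, topo_features):
    """Score each function label for a batch of proteins (untrained weights)."""
    h = relu(interpro_bits @ enc1[0] + enc1[1])
    h = relu(h @ enc2[0] + enc2[1])                 # low-dimensional sequence vector
    z = np.concatenate([h, topo_features], axis=1)  # combine with network features
    z = relu(z @ head1[0] + head1[1])
    return sigmoid(z @ head2[0] + head2[1])         # one independent score per label

# Toy batch of 4 proteins with about 1% of InterPro bits set.
x_seq = (rng.random((4, N_INTERPRO)) < 0.01).astype(np.float64)
x_net = rng.normal(size=(4, N_TOPO))
scores = predict(x_seq, x_net)
print(scores.shape)
```

A sigmoid output per label is used here because a protein can carry several functions at once, so the labels are scored independently; whether DeepFunc uses the same output layer is not stated in the abstract.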

Keywords: deep learning; functional linkages; protein domains; protein functions; protein sequences; protein-protein interactions

Year: 2019    PMID: 30941889    DOI: 10.1002/pmic.201900019

Source DB: PubMed    Journal: Proteomics    ISSN: 1615-9853    Impact factor: 3.984


  12 in total

1.  DeepCleave: a deep learning predictor for caspase and matrix metalloprotease substrates and cleavage sites.

Authors:  Fuyi Li; Jinxiang Chen; André Leier; Tatiana Marquez-Lago; Quanzhong Liu; Yanze Wang; Jerico Revote; A Ian Smith; Tatsuya Akutsu; Geoffrey I Webb; Lukasz Kurgan; Jiangning Song
Journal:  Bioinformatics       Date:  2020-02-15       Impact factor: 6.937

2.  iLearnPlus: a comprehensive and automated machine-learning platform for nucleic acid and protein sequence analysis, prediction and visualization.

Authors:  Zhen Chen; Pei Zhao; Chen Li; Fuyi Li; Dongxu Xiang; Yong-Zi Chen; Tatsuya Akutsu; Roger J Daly; Geoffrey I Webb; Quanzhi Zhao; Lukasz Kurgan; Jiangning Song
Journal:  Nucleic Acids Res       Date:  2021-06-04       Impact factor: 16.971

3.  Modeling multi-scale data via a network of networks.

Authors:  Shawn Gu; Meng Jiang; Pietro Hiram Guzzi; Tijana Milenković
Journal:  Bioinformatics       Date:  2022-03-03       Impact factor: 6.931

4.  DeephageTP: a convolutional neural network framework for identifying phage-specific proteins from metagenomic sequencing data.

Authors:  Yunmeng Chu; Shun Guo; Dachao Cui; Xiongfei Fu; Yingfei Ma
Journal:  PeerJ       Date:  2022-06-08       Impact factor: 3.061

5.  TALE: Transformer-based protein function Annotation with joint sequence-Label Embedding.

Authors:  Yue Cao; Yang Shen
Journal:  Bioinformatics       Date:  2021-03-23       Impact factor: 6.937

6.  Improving protein domain classification for third-generation sequencing reads using deep learning.

Authors:  Nan Du; Jiayu Shang; Yanni Sun
Journal:  BMC Genomics       Date:  2021-04-09       Impact factor: 3.969

7.  PFP-WGAN: Protein function prediction by discovering Gene Ontology term correlations with generative adversarial networks.

Authors:  Seyyede Fatemeh Seyyedsalehi; Mahdieh Soleymani; Hamid R Rabiee; Mohammad R K Mofrad
Journal:  PLoS One       Date:  2021-02-25       Impact factor: 3.240

8.  Current progress and open challenges for applying deep learning across the biosciences. (Review)

Authors:  Nicolae Sapoval; Amirali Aghazadeh; Michael G Nute; Dinler A Antunes; Advait Balaji; Richard Baraniuk; C J Barberan; Ruth Dannenfelser; Chen Dun; Mohammadamin Edrisi; R A Leo Elworth; Bryce Kille; Anastasios Kyrillidis; Luay Nakhleh; Cameron R Wolfe; Zhi Yan; Vicky Yao; Todd J Treangen
Journal:  Nat Commun       Date:  2022-04-01       Impact factor: 14.919

9.  Deep learning in prediction of intrinsic disorder in proteins. (Review)

Authors:  Bi Zhao; Lukasz Kurgan
Journal:  Comput Struct Biotechnol J       Date:  2022-03-08       Impact factor: 7.271

10.  Deep learning program to predict protein functions based on sequence information.

Authors:  Chang Woo Ko; June Huh; Jong-Wan Park
Journal:  MethodsX       Date:  2022-01-15