Literature DB >> 34039967

Structure-based protein function prediction using graph convolutional networks.

Vladimir Gligorijević1, P Douglas Renfrew2, Tomasz Kosciolek3,4, Julia Koehler Leman2, Daniel Berenberg2,5, Tommi Vatanen6,7, Chris Chandler2, Bryn C Taylor8, Ian M Fisk9, Hera Vlamakis6, Ramnik J Xavier6,10,11,12, Rob Knight3,13,14, Kyunghyun Cho15,16, Richard Bonneau17,18,19,20.   

Abstract

The rapid increase in the number of proteins in sequence databases and the diversity of their functions challenge computational approaches for automated function prediction. Here, we introduce DeepFRI, a Graph Convolutional Network for predicting protein functions by leveraging sequence features extracted from a protein language model and protein structures. It outperforms current leading methods and sequence-based Convolutional Neural Networks and scales to the size of current sequence repositories. Augmenting the training set of experimental structures with homology models allows us to significantly expand the number of predictable functions. DeepFRI has significant de-noising capability, with only a minor drop in performance when experimental structures are replaced by protein models. Class activation mapping allows function predictions at an unprecedented resolution, allowing site-specific annotations at the residue-level in an automated manner. We show the utility and high performance of our method by annotating structures from the PDB and SWISS-MODEL, making several new confident function predictions. DeepFRI is available as a webserver at https://beta.deepfri.flatironinstitute.org/ .

Entities:  

Year:  2021        PMID: 34039967     DOI: 10.1038/s41467-021-23303-9

Source DB:  PubMed          Journal:  Nat Commun        ISSN: 2041-1723            Impact factor:   14.919


  42 in total

1.  The ENZYME database in 2000.

Authors:  A Bairoch
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Supra-domains: evolutionary units larger than single protein domains.

Authors:  Christine Vogel; Carlo Berzuini; Matthew Bashton; Julian Gough; Sarah A Teichmann
Journal:  J Mol Biol       Date:  2004-02-20       Impact factor: 5.469

3.  How representative are the known structures of the proteins in a complete genome? A comprehensive structural census.

Authors:  M Gerstein
Journal:  Fold Des       Date:  1998

4.  [Therapeutic use of cefotiam in bacterial infections in pediatrics].

Authors:  F Cerutti; R Gusmano; R C Pedrinazzi; U Ramenghi; A Spada; E Vassena
Journal:  Minerva Pediatr       Date:  1983-11-15       Impact factor: 1.312

5.  Protein structure determination using metagenome sequence data.

Authors:  Sergey Ovchinnikov; Hahnbeom Park; Neha Varghese; Po-Ssu Huang; Georgios A Pavlopoulos; David E Kim; Hetunandan Kamisetty; Nikos C Kyrpides; David Baker
Journal:  Science       Date:  2017-01-20       Impact factor: 47.728

6.  DISOPRED3: precise disordered region predictions with annotated protein-binding activity.

Authors:  David T Jones; Domenico Cozzetto
Journal:  Bioinformatics       Date:  2014-11-12       Impact factor: 6.937

7.  KEGG: new perspectives on genomes, pathways, diseases and drugs.

Authors:  Minoru Kanehisa; Miho Furumichi; Mao Tanabe; Yoko Sato; Kanae Morishima
Journal:  Nucleic Acids Res       Date:  2016-11-28       Impact factor: 16.971

8.  CATH: an expanded resource to predict protein function through structure and sequence.

Authors:  Natalie L Dawson; Tony E Lewis; Sayoni Das; Jonathan G Lees; David Lee; Paul Ashford; Christine A Orengo; Ian Sillitoe
Journal:  Nucleic Acids Res       Date:  2016-11-28       Impact factor: 16.971

9.  SWISS-MODEL: homology modelling of protein structures and complexes.

Authors:  Andrew Waterhouse; Martino Bertoni; Stefan Bienert; Gabriel Studer; Gerardo Tauriello; Rafal Gumienny; Florian T Heer; Tjaart A P de Beer; Christine Rempfer; Lorenza Bordoli; Rosalba Lepore; Torsten Schwede
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

10.  Archiving and disseminating integrative structure models.

Authors:  Brinda Vallat; Benjamin Webb; John Westbrook; Andrej Sali; Helen M Berman
Journal:  J Biomol NMR       Date:  2019-07-05       Impact factor: 2.835

View more
  39 in total

Review 1.  A guide to machine learning for biologists.

Authors:  Joe G Greener; Shaun M Kandathil; Lewis Moffat; David T Jones
Journal:  Nat Rev Mol Cell Biol       Date:  2021-09-13       Impact factor: 94.444

2.  GraphVAMPNet, using graph neural networks and variational approach to Markov processes for dynamical modeling of biomolecules.

Authors:  Mahdi Ghorbani; Samarjeet Prasad; Jeffery B Klauda; Bernard R Brooks
Journal:  J Chem Phys       Date:  2022-05-14       Impact factor: 3.488

3.  Accurate protein function prediction via graph attention networks with predicted structure information.

Authors:  Boqiao Lai; Jinbo Xu
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

4.  Protein-RNA interaction prediction with deep learning: structure matters.

Authors:  Junkang Wei; Siyuan Chen; Licheng Zong; Xin Gao; Yu Li
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

5.  ContactPFP: Protein function prediction using predicted contact information.

Authors:  Yuki Kagaya; Sean T Flannery; Aashish Jain; Daisuke Kihara
Journal:  Front Bioinform       Date:  2022-06-02

6.  Proteogenomics reveals sex-biased aging genes and coordinated splicing in cardiac aging.

Authors:  Yu Han; Sara A Wennersten; Julianna M Wright; R W Ludwig; Edward Lau; Maggie P Y Lam
Journal:  Am J Physiol Heart Circ Physiol       Date:  2022-08-05       Impact factor: 5.125

7.  Global diversity and distribution of prophages are lineage-specific within the Ralstonia solanacearum species complex.

Authors:  Samuel T E Greenrod; Martina Stoycheva; John Elphinstone; Ville-Petri Friman
Journal:  BMC Genomics       Date:  2022-10-06       Impact factor: 4.547

8.  LM-GVP: an extensible sequence and structure informed deep learning framework for protein property prediction.

Authors:  Zichen Wang; Steven A Combs; Ryan Brand; Miguel Romero Calvo; Panpan Xu; George Price; Nataliya Golovach; Emmanuel O Salawu; Colby J Wise; Sri Priya Ponnapalli; Peter M Clark
Journal:  Sci Rep       Date:  2022-04-27       Impact factor: 4.996

9.  PANDA2: protein function prediction using graph neural networks.

Authors:  Chenguang Zhao; Tong Liu; Zheng Wang
Journal:  NAR Genom Bioinform       Date:  2022-02-02

Review 10.  Differentiable biology: using deep learning for biophysics-based and data-driven modeling of molecular mechanisms.

Authors:  Mohammed AlQuraishi; Peter K Sorger
Journal:  Nat Methods       Date:  2021-10-04       Impact factor: 28.547

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.