Literature DB >> 31005579

End-to-End Differentiable Learning of Protein Structure.

Mohammed AlQuraishi1.   

Abstract

Predicting protein structure from sequence is a central challenge of biochemistry. Co-evolution methods show promise, but an explicit sequence-to-structure map remains elusive. Advances in deep learning that replace complex, human-designed pipelines with differentiable models optimized end to end suggest the potential benefits of similarly reformulating structure prediction. Here, we introduce an end-to-end differentiable model for protein structure learning. The model couples local and global protein structure via geometric units that optimize global geometry without violating local covalent chemistry. We test our model using two challenging tasks: predicting novel folds without co-evolutionary data and predicting known folds without structural templates. In the first task, the model achieves state-of-the-art accuracy, and in the second, it comes within 1-2 Å; competing methods using co-evolution and experimental templates have been refined over many years, and it is likely that the differentiable approach has substantial room for further improvement, with applications ranging from drug discovery to protein design.
Copyright © 2019 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  biophysics; co-evolution; deep learning; geometric deep learning; homology modeling; machine learning; protein design; protein folding; protein structure prediction; structural biology

Mesh:

Year:  2019        PMID: 31005579      PMCID: PMC6513320          DOI: 10.1016/j.cels.2019.03.006

Source DB:  PubMed          Journal:  Cell Syst        ISSN: 2405-4712            Impact factor:   10.304


  39 in total

1.  How significant is a protein structure similarity with TM-score = 0.5?

Authors:  Jinrui Xu; Yang Zhang
Journal:  Bioinformatics       Date:  2010-02-17       Impact factor: 6.937

2.  Practical conversion from torsion space to Cartesian space for in silico protein synthesis.

Authors:  Jerod Parsons; J Bradley Holmes; J Maurice Rojas; Jerry Tsai; Charlie E M Strauss
Journal:  J Comput Chem       Date:  2005-07-30       Impact factor: 3.376

3.  Long short-term memory.

Authors:  S Hochreiter; J Schmidhuber
Journal:  Neural Comput       Date:  1997-11-15       Impact factor: 2.026

4.  A large-scale experiment to assess protein structure prediction methods.

Authors:  J Moult; J T Pedersen; R Judson; K Fidelis
Journal:  Proteins       Date:  1995-11

5.  The Protein Data Bank: a computer-based archival file for macromolecular structures.

Authors:  F C Bernstein; T F Koetzle; G J Williams; E F Meyer; M D Brice; J R Rodgers; O Kennard; T Shimanouchi; M Tasumi
Journal:  J Mol Biol       Date:  1977-05-25       Impact factor: 5.469

6.  Predicting backbone Cα angles and dihedrals from protein sequences by stacked sparse auto-encoder deep neural network.

Authors:  James Lyons; Abdollah Dehzangi; Rhys Heffernan; Alok Sharma; Kuldip Paliwal; Abdul Sattar; Yaoqi Zhou; Yuedong Yang
Journal:  J Comput Chem       Date:  2014-09-12       Impact factor: 3.376

7.  Evaluation of the template-based modeling in CASP12.

Authors:  Andriy Kryshtafovych; Bohdan Monastyrskyy; Krzysztof Fidelis; John Moult; Torsten Schwede; Anna Tramontano
Journal:  Proteins       Date:  2017-12-04

8.  ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules.

Authors:  Andrew Leaver-Fay; Michael Tyka; Steven M Lewis; Oliver F Lange; James Thompson; Ron Jacak; Kristian Kaufman; P Douglas Renfrew; Colin A Smith; Will Sheffler; Ian W Davis; Seth Cooper; Adrien Treuille; Daniel J Mandell; Florian Richter; Yih-En Andrew Ban; Sarel J Fleishman; Jacob E Corn; David E Kim; Sergey Lyskov; Monica Berrondo; Stuart Mentzer; Zoran Popović; James J Havranek; John Karanicolas; Rhiju Das; Jens Meiler; Tanja Kortemme; Jeffrey J Gray; Brian Kuhlman; David Baker; Philip Bradley
Journal:  Methods Enzymol       Date:  2011       Impact factor: 1.600

9.  Assessment of contact predictions in CASP12: Co-evolution and deep learning coming of age.

Authors:  Joerg Schaarschmidt; Bohdan Monastyrskyy; Andriy Kryshtafovych; Alexandre M J J Bonvin
Journal:  Proteins       Date:  2017-11-07

10.  A vocabulary of ancient peptides at the origin of folded proteins.

Authors:  Vikram Alva; Johannes Söding; Andrei N Lupas
Journal:  Elife       Date:  2015-12-14       Impact factor: 8.140

View more
  71 in total

Review 1.  Machine Learning Force Fields and Coarse-Grained Variables in Molecular Dynamics: Application to Materials and Biological Systems.

Authors:  Paraskevi Gkeka; Gabriel Stoltz; Amir Barati Farimani; Zineb Belkacemi; Michele Ceriotti; John D Chodera; Aaron R Dinner; Andrew L Ferguson; Jean-Bernard Maillet; Hervé Minoux; Christine Peter; Fabio Pietrucci; Ana Silveira; Alexandre Tkatchenko; Zofia Trstanova; Rafal Wiewiora; Tony Lelièvre
Journal:  J Chem Theory Comput       Date:  2020-07-16       Impact factor: 6.006

2.  Distance-based protein folding powered by deep learning.

Authors:  Jinbo Xu
Journal:  Proc Natl Acad Sci U S A       Date:  2019-08-09       Impact factor: 11.205

3.  AlphaFold at CASP13.

Authors:  Mohammed AlQuraishi
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

4.  Machine Learning in a Molecular Modeling Course for Chemistry, Biochemistry, and Biophysics Students.

Authors:  Jacob M Remington; Jonathon B Ferrell; Marlo Zorman; Adam Petrucci; Severin T Schneebeli; Jianing Li
Journal:  Biophysicist (Rockv)       Date:  2020-08-13

5.  Structural diversity in the Mycobacteria DUF3349 superfamily.

Authors:  Garry W Buchko; Jan Abendroth; John I Robinson; Isabelle Q Phan; Peter J Myler; Thomas E Edwards
Journal:  Protein Sci       Date:  2019-11-21       Impact factor: 6.725

6.  Learning to Make Chemical Predictions: the Interplay of Feature Representation, Data, and Machine Learning Methods.

Authors:  Mojtaba Haghighatlari; Jie Li; Farnaz Heidar-Zadeh; Yuchen Liu; Xingyi Guan; Teresa Head-Gordon
Journal:  Chem       Date:  2020-06-16       Impact factor: 22.804

Review 7.  Experimentally-driven protein structure modeling.

Authors:  Nikolay V Dokholyan
Journal:  J Proteomics       Date:  2020-04-05       Impact factor: 4.044

8.  Unified rational protein engineering with sequence-based deep representation learning.

Authors:  Ethan C Alley; Grigory Khimulya; Surojit Biswas; Mohammed AlQuraishi; George M Church
Journal:  Nat Methods       Date:  2019-10-21       Impact factor: 28.547

9.  SidechainNet: An all-atom protein structure dataset for machine learning.

Authors:  Jonathan Edward King; David Ryan Koes
Journal:  Proteins       Date:  2021-07-12

10.  Guardians of the Cell: State-of-the-Art of Membrane Proteins from a Computational Point-of-View.

Authors:  Nícia Rosário-Ferreira; Catarina Marques-Pereira; Raquel P Gouveia; Joana Mourão; Irina S Moreira
Journal:  Methods Mol Biol       Date:  2021
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.