Literature DB >> 31399549

Distance-based protein folding powered by deep learning.

Jinbo Xu1.   

Abstract

Direct coupling analysis (DCA) for protein folding has made very good progress, but it is not effective for proteins that lack many sequence homologs, even coupled with time-consuming conformation sampling with fragments. We show that we can accurately predict interresidue distance distribution of a protein by deep learning, even for proteins with ∼60 sequence homologs. Using only the geometric constraints given by the resulting distance matrix we may construct 3D models without involving extensive conformation sampling. Our method successfully folded 21 of the 37 CASP12 hard targets with a median family size of 58 effective sequence homologs within 4 h on a Linux computer of 20 central processing units. In contrast, DCA-predicted contacts cannot be used to fold any of these hard targets in the absence of extensive conformation sampling, and the best CASP12 group folded only 11 of them by integrating DCA-predicted contacts into fragment-based conformation sampling. Rigorous experimental validation in CASP13 shows that our distance-based folding server successfully folded 17 of 32 hard targets (with a median family size of 36 sequence homologs) and obtained 70% precision on the top L/5 long-range predicted contacts. The latest experimental validation in CAMEO shows that our server predicted correct folds for 2 membrane proteins while all of the other servers failed. These results demonstrate that it is now feasible to predict correct fold for many more proteins lack of similar structures in the Protein Data Bank even on a personal computer.

Entities:  

Keywords:  deep learning; direct coupling analysis; protein contact prediction; protein distance prediction; protein folding

Mesh:

Substances:

Year:  2019        PMID: 31399549      PMCID: PMC6708335          DOI: 10.1073/pnas.1821309116

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  47 in total

1.  Tools for comparative protein structure modeling and analysis.

Authors:  Narayanan Eswar; Bino John; Nebojsa Mirkovic; Andras Fiser; Valentin A Ilyin; Ursula Pieper; Ashley C Stuart; Marc A Marti-Renom; M S Madhusudhan; Bozidar Yerkovich; Andrej Sali
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

2.  Protein structure prediction and analysis using the Robetta server.

Authors:  David E Kim; Dylan Chivian; David Baker
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

3.  PSICOV: precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments.

Authors:  David T Jones; Daniel W A Buchan; Domenico Cozzetto; Massimiliano Pontil
Journal:  Bioinformatics       Date:  2011-11-17       Impact factor: 6.937

4.  Direct-coupling analysis of residue coevolution captures native contacts across many protein families.

Authors:  Faruck Morcos; Andrea Pagnani; Bryan Lunt; Arianna Bertolino; Debora S Marks; Chris Sander; Riccardo Zecchina; José N Onuchic; Terence Hwa; Martin Weigt
Journal:  Proc Natl Acad Sci U S A       Date:  2011-11-21       Impact factor: 11.205

5.  Modeling errors in NOE data with a log-normal distribution improves the quality of NMR structures.

Authors:  Wolfgang Rieping; Michael Habeck; Michael Nilges
Journal:  J Am Chem Soc       Date:  2005-11-23       Impact factor: 15.419

6.  Version 1.2 of the Crystallography and NMR system.

Authors:  Axel T Brunger
Journal:  Nat Protoc       Date:  2007       Impact factor: 13.491

7.  Identification of direct residue contacts in protein-protein interaction by message passing.

Authors:  Martin Weigt; Robert A White; Hendrik Szurmant; James A Hoch; Terence Hwa
Journal:  Proc Natl Acad Sci U S A       Date:  2008-12-30       Impact factor: 11.205

8.  I-TASSER: a unified platform for automated protein structure and function prediction.

Authors:  Ambrish Roy; Alper Kucukural; Yang Zhang
Journal:  Nat Protoc       Date:  2010-03-25       Impact factor: 13.491

9.  Distance matrix-based approach to protein structure prediction.

Authors:  Andrzej Kloczkowski; Robert L Jernigan; Zhijun Wu; Guang Song; Lei Yang; Andrzej Kolinski; Piotr Pokarowski
Journal:  J Struct Funct Genomics       Date:  2009-02-18

10.  The HHpred interactive server for protein homology detection and structure prediction.

Authors:  Johannes Söding; Andreas Biegert; Andrei N Lupas
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

View more
  101 in total

1.  AlphaFold at CASP13.

Authors:  Mohammed AlQuraishi
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

2.  High-accuracy protein structures by combining machine-learning with physics-based refinement.

Authors:  Lim Heo; Michael Feig
Journal:  Proteins       Date:  2019-11-15

3.  Improved protein structure prediction using predicted interresidue orientations.

Authors:  Jianyi Yang; Ivan Anishchenko; Hahnbeom Park; Zhenling Peng; Sergey Ovchinnikov; David Baker
Journal:  Proc Natl Acad Sci U S A       Date:  2020-01-02       Impact factor: 11.205

4.  Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks.

Authors:  Yang Li; Chengxin Zhang; Eric W Bell; Wei Zheng; Xiaogen Zhou; Dong-Jun Yu; Yang Zhang
Journal:  PLoS Comput Biol       Date:  2021-03-26       Impact factor: 4.475

5.  Critical assessment of methods of protein structure prediction (CASP)-Round XIII.

Authors:  Andriy Kryshtafovych; Torsten Schwede; Maya Topf; Krzysztof Fidelis; John Moult
Journal:  Proteins       Date:  2019-10-23

6.  Protein inter-residue contact and distance prediction by coupling complementary coevolution features with deep residual networks in CASP14.

Authors:  Yang Li; Chengxin Zhang; Wei Zheng; Xiaogen Zhou; Eric W Bell; Dong-Jun Yu; Yang Zhang
Journal:  Proteins       Date:  2021-08-19

7.  Physics-based protein structure refinement in the era of artificial intelligence.

Authors:  Lim Heo; Giacomo Janson; Michael Feig
Journal:  Proteins       Date:  2021-06-29

8.  SidechainNet: An all-atom protein structure dataset for machine learning.

Authors:  Jonathan Edward King; David Ryan Koes
Journal:  Proteins       Date:  2021-07-12

9.  Study of Real-Valued Distance Prediction for Protein Structure Prediction with Deep Learning.

Authors:  Jin Li; Jinbo Xu
Journal:  Bioinformatics       Date:  2021-05-07       Impact factor: 6.937

Review 10.  The Protective A673T Mutation of Amyloid Precursor Protein (APP) in Alzheimer's Disease.

Authors:  Qing Xia; XinYu Yang; JiaBin Shi; ZiJie Liu; YaHui Peng; WenJing Wang; BoWen Li; Yu Zhao; JiaYing Xiao; Lei Huang; DaYong Wang; Xu Gao
Journal:  Mol Neurobiol       Date:  2021-04-29       Impact factor: 5.590

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.