Literature DB >> 31365149

Deep-learning contact-map guided protein structure prediction in CASP13.

Wei Zheng1, Yang Li1,2, Chengxin Zhang1, Robin Pearce1, S M Mortuza1, Yang Zhang1,3.   

Abstract

We report the results of two fully automated structure prediction pipelines, "Zhang-Server" and "QUARK", in CASP13. The pipelines were built upon the C-I-TASSER and C-QUARK programs, which in turn are based on I-TASSER and QUARK but with three new modules: (a) a novel multiple sequence alignment (MSA) generation protocol to construct deep sequence-profiles for contact prediction; (b) an improved meta-method, NeBcon, which combines multiple contact predictors, including ResPRE that predicts contact-maps by coupling precision-matrices with deep residual convolutional neural-networks; and (c) an optimized contact potential to guide structure assembly simulations. For 50 CASP13 FM domains that lacked homologous templates, average TM-scores of the first models produced by C-I-TASSER and C-QUARK were 28% and 56% higher than those constructed by I-TASSER and QUARK, respectively. For the first time, contact-map predictions demonstrated usefulness on TBM domains with close homologous templates, where TM-scores of C-I-TASSER models were significantly higher than those of I-TASSER models with a P-value <.05. Detailed data analyses showed that the success of C-I-TASSER and C-QUARK was mainly due to the increased accuracy of deep-learning-based contact-maps, as well as the careful balance between sequence-based contact restraints, threading templates, and generic knowledge-based potentials. Nevertheless, challenges still remain for predicting quaternary structure of multi-domain proteins, due to the difficulties in domain partitioning and domain reassembly. In addition, contact prediction in terminal regions was often unsatisfactory due to the sparsity of MSAs. Development of new contact-based domain partitioning and assembly methods and training contact models on sparse MSAs may help address these issues.
© 2019 Wiley Periodicals, Inc.

Entities:  

Keywords:  CASP13; ab initio folding; contact prediction; deep convolutional neural networks; deep multiple sequence alignment; protein structure prediction

Mesh:

Substances:

Year:  2019        PMID: 31365149      PMCID: PMC6851476          DOI: 10.1002/prot.25792

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  51 in total

1.  How significant is a protein structure similarity with TM-score = 0.5?

Authors:  Jinrui Xu; Yang Zhang
Journal:  Bioinformatics       Date:  2010-02-17       Impact factor: 6.937

2.  Automated protein structure modeling in CASP9 by I-TASSER pipeline combined with QUARK-based ab initio folding and FG-MD-based structure refinement.

Authors:  Dong Xu; Jian Zhang; Ambrish Roy; Yang Zhang
Journal:  Proteins       Date:  2011-08-23

3.  Protein homology detection by HMM-HMM comparison.

Authors:  Johannes Söding
Journal:  Bioinformatics       Date:  2004-11-05       Impact factor: 6.937

4.  Template-based modeling and free modeling by I-TASSER in CASP7.

Authors:  Yang Zhang
Journal:  Proteins       Date:  2007

5.  A method to identify protein sequences that fold into a known three-dimensional structure.

Authors:  J U Bowie; R Lüthy; D Eisenberg
Journal:  Science       Date:  1991-07-12       Impact factor: 47.728

6.  Assessing the utility of coevolution-based residue-residue contact predictions in a sequence- and structure-rich era.

Authors:  Hetunandan Kamisetty; Sergey Ovchinnikov; David Baker
Journal:  Proc Natl Acad Sci U S A       Date:  2013-09-05       Impact factor: 11.205

7.  Toward optimal fragment generations for ab initio protein structure assembly.

Authors:  Dong Xu; Yang Zhang
Journal:  Proteins       Date:  2012-10-16

8.  Evaluation of the template-based modeling in CASP12.

Authors:  Andriy Kryshtafovych; Bohdan Monastyrskyy; Krzysztof Fidelis; John Moult; Torsten Schwede; Anna Tramontano
Journal:  Proteins       Date:  2017-12-04

9.  UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches.

Authors:  Baris E Suzek; Yuqi Wang; Hongzhan Huang; Peter B McGarvey; Cathy H Wu
Journal:  Bioinformatics       Date:  2014-11-13       Impact factor: 6.937

10.  Evaluation of free modeling targets in CASP11 and ROLL.

Authors:  Lisa N Kinch; Wenlin Li; Bohdan Monastyrskyy; Andriy Kryshtafovych; Nick V Grishin
Journal:  Proteins       Date:  2016-01-20
View more
  54 in total

1.  Whole genome analysis of more than 10 000 SARS-CoV-2 virus unveils global genetic diversity and target region of NSP6.

Authors:  Indrajit Saha; Nimisha Ghosh; Ayan Pradhan; Nikhil Sharma; Debasree Maity; Kaushik Mitra
Journal:  Brief Bioinform       Date:  2021-03-22       Impact factor: 11.622

2.  Fold recognition by scoring protein maps using the congruence coefficient.

Authors:  Pietro Di Lena; Pierre Baldi
Journal:  Bioinformatics       Date:  2021-05-01       Impact factor: 6.937

3.  Improved protein structure prediction using predicted interresidue orientations.

Authors:  Jianyi Yang; Ivan Anishchenko; Hahnbeom Park; Zhenling Peng; Sergey Ovchinnikov; David Baker
Journal:  Proc Natl Acad Sci U S A       Date:  2020-01-02       Impact factor: 11.205

4.  Functions of Essential Genes and a Scale-Free Protein Interaction Network Revealed by Structure-Based Function and Interaction Prediction for a Minimal Genome.

Authors:  Chengxin Zhang; Wei Zheng; Micah Cheng; Gilbert S Omenn; Peter L Freddolino; Yang Zhang
Journal:  J Proteome Res       Date:  2021-01-04       Impact factor: 4.466

5.  Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks.

Authors:  Yang Li; Chengxin Zhang; Eric W Bell; Wei Zheng; Xiaogen Zhou; Dong-Jun Yu; Yang Zhang
Journal:  PLoS Comput Biol       Date:  2021-03-26       Impact factor: 4.475

6.  Endoplasmic reticulum-associated degradation is required for nephrin maturation and kidney glomerular filtration function.

Authors:  Sei Yoshida; Xiaoqiong Wei; Gensheng Zhang; Christopher L O'Connor; Mauricio Torres; Zhangsen Zhou; Liangguang Lin; Rajasree Menon; Xiaoxi Xu; Wenyue Zheng; Yi Xiong; Edgar Otto; Chih-Hang Anthony Tang; Rui Hua; Rakesh Verma; Hiroyuki Mori; Yang Zhang; Chih-Chi Andrew Hu; Ming Liu; Puneet Garg; Jeffrey B Hodgin; Shengyi Sun; Markus Bitzer; Ling Qi
Journal:  J Clin Invest       Date:  2021-04-01       Impact factor: 14.808

7.  A New Protocol for Atomic-Level Protein Structure Modeling and Refinement Using Low-to-Medium Resolution Cryo-EM Density Maps.

Authors:  Biao Zhang; Xi Zhang; Robin Pearce; Hong-Bin Shen; Yang Zhang
Journal:  J Mol Biol       Date:  2020-08-06       Impact factor: 5.469

8.  Mechanism for DPY30 and ASH2L intrinsically disordered regions to modulate the MLL/SET1 activity on chromatin.

Authors:  Young-Tae Lee; Alex Ayoub; Sang-Ho Park; Liang Sha; Jing Xu; Fengbiao Mao; Wei Zheng; Yang Zhang; Uhn-Soo Cho; Yali Dou
Journal:  Nat Commun       Date:  2021-05-19       Impact factor: 14.919

Review 9.  Recent Applications of Deep Learning Methods on Evolution- and Contact-Based Protein Structure Prediction.

Authors:  Donghyuk Suh; Jai Woo Lee; Sun Choi; Yoonji Lee
Journal:  Int J Mol Sci       Date:  2021-06-02       Impact factor: 5.923

Review 10.  Macromolecular modeling and design in Rosetta: recent methods and frameworks.

Authors:  Julia Koehler Leman; Brian D Weitzner; Steven M Lewis; Jared Adolf-Bryfogle; Nawsad Alam; Rebecca F Alford; Melanie Aprahamian; David Baker; Kyle A Barlow; Patrick Barth; Benjamin Basanta; Brian J Bender; Kristin Blacklock; Jaume Bonet; Scott E Boyken; Phil Bradley; Chris Bystroff; Patrick Conway; Seth Cooper; Bruno E Correia; Brian Coventry; Rhiju Das; René M De Jong; Frank DiMaio; Lorna Dsilva; Roland Dunbrack; Alexander S Ford; Brandon Frenz; Darwin Y Fu; Caleb Geniesse; Lukasz Goldschmidt; Ragul Gowthaman; Jeffrey J Gray; Dominik Gront; Sharon Guffy; Scott Horowitz; Po-Ssu Huang; Thomas Huber; Tim M Jacobs; Jeliazko R Jeliazkov; David K Johnson; Kalli Kappel; John Karanicolas; Hamed Khakzad; Karen R Khar; Sagar D Khare; Firas Khatib; Alisa Khramushin; Indigo C King; Robert Kleffner; Brian Koepnick; Tanja Kortemme; Georg Kuenze; Brian Kuhlman; Daisuke Kuroda; Jason W Labonte; Jason K Lai; Gideon Lapidoth; Andrew Leaver-Fay; Steffen Lindert; Thomas Linsky; Nir London; Joseph H Lubin; Sergey Lyskov; Jack Maguire; Lars Malmström; Enrique Marcos; Orly Marcu; Nicholas A Marze; Jens Meiler; Rocco Moretti; Vikram Khipple Mulligan; Santrupti Nerli; Christoffer Norn; Shane Ó'Conchúir; Noah Ollikainen; Sergey Ovchinnikov; Michael S Pacella; Xingjie Pan; Hahnbeom Park; Ryan E Pavlovicz; Manasi Pethe; Brian G Pierce; Kala Bharath Pilla; Barak Raveh; P Douglas Renfrew; Shourya S Roy Burman; Aliza Rubenstein; Marion F Sauer; Andreas Scheck; William Schief; Ora Schueler-Furman; Yuval Sedan; Alexander M Sevy; Nikolaos G Sgourakis; Lei Shi; Justin B Siegel; Daniel-Adriano Silva; Shannon Smith; Yifan Song; Amelie Stein; Maria Szegedy; Frank D Teets; Summer B Thyme; Ray Yu-Ruei Wang; Andrew Watkins; Lior Zimmerman; Richard Bonneau
Journal:  Nat Methods       Date:  2020-06-01       Impact factor: 28.547

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.