Literature DB >> 34331351

Protein structure prediction using deep learning distance and hydrogen-bonding restraints in CASP14.

Wei Zheng1, Yang Li1,2, Chengxin Zhang1, Xiaogen Zhou1, Robin Pearce1, Eric W Bell1, Xiaoqiang Huang1, Yang Zhang1,3.   

Abstract

In this article, we report 3D structure prediction results by two of our best server groups ("Zhang-Server" and "QUARK") in CASP14. These two servers were built based on the D-I-TASSER and D-QUARK algorithms, which integrated four newly developed components into the classical protein folding pipelines, I-TASSER and QUARK, respectively. The new components include: (a) a new multiple sequence alignment (MSA) collection tool, DeepMSA2, which is extended from the DeepMSA program; (b) a contact-based domain boundary prediction algorithm, FUpred, to detect protein domain boundaries; (c) a residual convolutional neural network-based method, DeepPotential, to predict multiple spatial restraints by co-evolutionary features derived from the MSA; and (d) optimized spatial restraint energy potentials to guide the structure assembly simulations. For 37 FM targets, the average TM-scores of the first models produced by D-I-TASSER and D-QUARK were 96% and 112% higher than those constructed by I-TASSER and QUARK, respectively. The data analysis indicates noticeable improvements produced by each of the four new components, especially for the newly added spatial restraints from DeepPotential and the well-tuned force field that combines spatial restraints, threading templates, and generic knowledge-based potentials. However, challenges still exist in the current pipelines. These include difficulties in modeling multi-domain proteins due to low accuracy in inter-domain distance prediction and modeling protein domains from oligomer complexes, as the co-evolutionary analysis cannot distinguish inter-chain and intra-chain distances. Specifically tuning the deep learning-based predictors for multi-domain targets and protein complexes may be helpful to address these issues.
© 2021 Wiley Periodicals LLC.

Entities:  

Keywords:  CASP14; ab initio folding; deep learning; domain partition; multiple sequence alignment; protein structure prediction; residue-residue distance prediction

Mesh:

Substances:

Year:  2021        PMID: 34331351      PMCID: PMC8616857          DOI: 10.1002/prot.26193

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  53 in total

1.  SPICKER: a clustering approach to identify near-native protein folds.

Authors:  Yang Zhang; Jeffrey Skolnick
Journal:  J Comput Chem       Date:  2004-04-30       Impact factor: 3.376

2.  How significant is a protein structure similarity with TM-score = 0.5?

Authors:  Jinrui Xu; Yang Zhang
Journal:  Bioinformatics       Date:  2010-02-17       Impact factor: 6.937

3.  Protein homology detection by HMM-HMM comparison.

Authors:  Johannes Söding
Journal:  Bioinformatics       Date:  2004-11-05       Impact factor: 6.937

4.  Toward optimal fragment generations for ab initio protein structure assembly.

Authors:  Dong Xu; Yang Zhang
Journal:  Proteins       Date:  2012-10-16

5.  I-TASSER: a unified platform for automated protein structure and function prediction.

Authors:  Ambrish Roy; Alper Kucukural; Yang Zhang
Journal:  Nat Protoc       Date:  2010-03-25       Impact factor: 13.491

6.  ResQ: An Approach to Unified Estimation of B-Factor and Residue-Specific Error in Protein Structure Prediction.

Authors:  Jianyi Yang; Yan Wang; Yang Zhang
Journal:  J Mol Biol       Date:  2015-10-03       Impact factor: 5.469

7.  Improved protein structure prediction using predicted interresidue orientations.

Authors:  Jianyi Yang; Ivan Anishchenko; Hahnbeom Park; Zhenling Peng; Sergey Ovchinnikov; David Baker
Journal:  Proc Natl Acad Sci U S A       Date:  2020-01-02       Impact factor: 11.205

8.  UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches.

Authors:  Baris E Suzek; Yuqi Wang; Hongzhan Huang; Peter B McGarvey; Cathy H Wu
Journal:  Bioinformatics       Date:  2014-11-13       Impact factor: 6.937

9.  ROTAS: a rotamer-dependent, atomic statistical potential for assessment and prediction of protein structures.

Authors:  Jungkap Park; Kazuhiro Saitou
Journal:  BMC Bioinformatics       Date:  2014-09-18       Impact factor: 3.169

10.  Folding non-homologous proteins by coupling deep-learning contact maps with I-TASSER assembly simulations.

Authors:  Wei Zheng; Chengxin Zhang; Yang Li; Robin Pearce; Eric W Bell; Yang Zhang
Journal:  Cell Rep Methods       Date:  2021-06-21
View more
  11 in total

1.  Progressive assembly of multi-domain protein structures from cryo-EM density maps.

Authors:  Xiaogen Zhou; Yang Li; Chengxin Zhang; Wei Zheng; Guijun Zhang; Yang Zhang
Journal:  Nat Comput Sci       Date:  2022-04-28

Review 2.  I-TASSER-MTD: a deep-learning-based platform for multi-domain protein structure and function prediction.

Authors:  Xiaogen Zhou; Wei Zheng; Yang Li; Robin Pearce; Chengxin Zhang; Eric W Bell; Guijun Zhang; Yang Zhang
Journal:  Nat Protoc       Date:  2022-08-05       Impact factor: 17.021

Review 3.  Protein Function Analysis through Machine Learning.

Authors:  Chris Avery; John Patterson; Tyler Grear; Theodore Frater; Donald J Jacobs
Journal:  Biomolecules       Date:  2022-09-06

4.  DEMO2: Assemble multi-domain protein structures by coupling analogous template alignments with deep-learning inter-domain restraint prediction.

Authors:  Xiaogen Zhou; Chunxiang Peng; Wei Zheng; Yang Li; Guijun Zhang; Yang Zhang
Journal:  Nucleic Acids Res       Date:  2022-05-10       Impact factor: 19.160

5.  Deep learning geometrical potential for high-accuracy ab initio protein structure prediction.

Authors:  Yang Li; Chengxin Zhang; Dong-Jun Yu; Yang Zhang
Journal:  iScience       Date:  2022-05-18

6.  LOMETS3: integrating deep learning and profile alignment for advanced protein template recognition and function annotation.

Authors:  Wei Zheng; Qiqige Wuyun; Xiaogen Zhou; Yang Li; Peter L Freddolino; Yang Zhang
Journal:  Nucleic Acids Res       Date:  2022-04-14       Impact factor: 19.160

Review 7.  Using metagenomic data to boost protein structure prediction and discovery.

Authors:  Qingzhen Hou; Fabrizio Pucci; Fengming Pan; Fuzhong Xue; Marianne Rooman; Qiang Feng
Journal:  Comput Struct Biotechnol J       Date:  2022-01-03       Impact factor: 7.271

Review 8.  Protein Design with Deep Learning.

Authors:  Marianne Defresne; Sophie Barbe; Thomas Schiex
Journal:  Int J Mol Sci       Date:  2021-10-29       Impact factor: 5.923

9.  Improved Protein Structure Prediction Using a New Multi-Scale Network and Homologous Templates.

Authors:  Hong Su; Wenkai Wang; Zongyang Du; Zhenling Peng; Shang-Hua Gao; Ming-Ming Cheng; Jianyi Yang
Journal:  Adv Sci (Weinh)       Date:  2021-10-31       Impact factor: 16.806

10.  A-Prot: protein structure modeling using MSA transformer.

Authors:  Yiyu Hong; Juyong Lee; Junsu Ko
Journal:  BMC Bioinformatics       Date:  2022-03-16       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.