Literature DB >> 31588495

EvoEF2: accurate and fast energy function for computational protein design.

Xiaoqiang Huang1, Robin Pearce1, Yang Zhang1,2.   

Abstract

MOTIVATION: The accuracy and success rate of de novo protein design remain limited, mainly due to the parameter over-fitting of current energy functions and their inability to discriminate incorrect designs from correct designs.
RESULTS: We developed an extended energy function, EvoEF2, for efficient de novo protein sequence design, based on a previously proposed physical energy function, EvoEF. Remarkably, EvoEF2 recovered 32.5%, 47.9% and 22.3% of all, core and surface residues for 148 test monomers, and was generally applicable to protein-protein interaction design, as it recapitulated 30.9%, 42.4%, 31.3% and 21.4% of all, core, interface and surface residues for 88 test dimers, significantly outperforming EvoEF on the native sequence recapitulation. We further used I-TASSER to evaluate the foldability of the 148 designed monomer sequences, where all of them were predicted to fold into structures with high fold- and atomic-level similarity to their corresponding native structures, as demonstrated by the fact that 87.8% of the predicted structures shared a root-mean-square-deviation less than 2 Å to their native counterparts. The study also demonstrated that the usefulness of physical energy functions is highly correlated with the parameter optimization processes, and EvoEF2, with parameters optimized using sequence recapitulation, is more suitable for computational protein sequence design than EvoEF, which was optimized on thermodynamic mutation data.
AVAILABILITY AND IMPLEMENTATION: The source code of EvoEF2 and the benchmark datasets are freely available at https://zhanglab.ccmb.med.umich.edu/EvoEF. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Substances:

Year:  2020        PMID: 31588495      PMCID: PMC7144094          DOI: 10.1093/bioinformatics/btz740

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  33 in total

1.  Twilight zone of protein sequence alignments.

Authors:  B Rost
Journal:  Protein Eng       Date:  1999-02

2.  How significant is a protein structure similarity with TM-score = 0.5?

Authors:  Jinrui Xu; Yang Zhang
Journal:  Bioinformatics       Date:  2010-02-17       Impact factor: 6.937

3.  Optimizing energy functions for protein-protein interface design.

Authors:  Oz Sharabi; Chen Yanover; Ayelet Dekel; Julia M Shifman
Journal:  J Comput Chem       Date:  2011-01-15       Impact factor: 3.376

4.  Empirical solvent-mediated potentials hold for both intra-molecular and inter-molecular inter-residue interactions.

Authors:  O Keskin; I Bahar; A Y Badretdinov; O B Ptitsyn; R L Jernigan
Journal:  Protein Sci       Date:  1998-12       Impact factor: 6.725

5.  A smoothed backbone-dependent rotamer library for proteins derived from adaptive kernel density estimates and regressions.

Authors:  Maxim V Shapovalov; Roland L Dunbrack
Journal:  Structure       Date:  2011-06-08       Impact factor: 5.006

6.  Computational design of enzyme-ligand binding using a combined energy function and deterministic sequence optimization algorithm.

Authors:  Ye Tian; Xiaoqiang Huang; Yushan Zhu
Journal:  J Mol Model       Date:  2015-07-11       Impact factor: 1.810

7.  Improved prediction of protein side-chain conformations with SCWRL4.

Authors:  Georgii G Krivov; Maxim V Shapovalov; Roland L Dunbrack
Journal:  Proteins       Date:  2009-12

8.  CD-HIT: accelerated for clustering the next-generation sequencing data.

Authors:  Limin Fu; Beifang Niu; Zhengwei Zhu; Sitao Wu; Weizhong Li
Journal:  Bioinformatics       Date:  2012-10-11       Impact factor: 6.937

9.  Protein subunit interfaces: heterodimers versus homodimers.

Authors:  Cui Zhanhua; Jacob Gah-Kok Gan; Li Lei; Meena Kishore Sakharkar; Pandjassarame Kangueane
Journal:  Bioinformation       Date:  2005-08-11

10.  Predicting the Effect of Mutations on Protein-Protein Binding Interactions through Structure-Based Interface Profiles.

Authors:  Jeffrey R Brender; Yang Zhang
Journal:  PLoS Comput Biol       Date:  2015-10-27       Impact factor: 4.475

View more
  16 in total

1.  FASPR: an open-source tool for fast and accurate protein side-chain packing.

Authors:  Xiaoqiang Huang; Robin Pearce; Yang Zhang
Journal:  Bioinformatics       Date:  2020-06-01       Impact factor: 6.937

2.  SSIPe: accurately estimating protein-protein binding affinity change upon mutations using evolutionary profiles in combination with an optimized physical energy function.

Authors:  Xiaoqiang Huang; Wei Zheng; Robin Pearce; Yang Zhang
Journal:  Bioinformatics       Date:  2020-04-15       Impact factor: 6.937

3.  Endoplasmic reticulum-associated degradation is required for nephrin maturation and kidney glomerular filtration function.

Authors:  Sei Yoshida; Xiaoqiong Wei; Gensheng Zhang; Christopher L O'Connor; Mauricio Torres; Zhangsen Zhou; Liangguang Lin; Rajasree Menon; Xiaoxi Xu; Wenyue Zheng; Yi Xiong; Edgar Otto; Chih-Hang Anthony Tang; Rui Hua; Rakesh Verma; Hiroyuki Mori; Yang Zhang; Chih-Chi Andrew Hu; Ming Liu; Puneet Garg; Jeffrey B Hodgin; Shengyi Sun; Markus Bitzer; Ling Qi
Journal:  J Clin Invest       Date:  2021-04-01       Impact factor: 14.808

Review 4.  Protein engineering for natural product biosynthesis and synthetic biology applications.

Authors:  Miles A Calzini; Alexandra A Malico; Melissa M Mitchler; Gavin J Williams
Journal:  Protein Eng Des Sel       Date:  2021-02-15       Impact factor: 1.952

Review 5.  Data-driven computational protein design.

Authors:  Vincent Frappier; Amy E Keating
Journal:  Curr Opin Struct Biol       Date:  2021-04-25       Impact factor: 7.786

6.  ADDRESS: A Database of Disease-associated Human Variants Incorporating Protein Structure and Folding Stabilities.

Authors:  Jaie Woodard; Chengxin Zhang; Yang Zhang
Journal:  J Mol Biol       Date:  2021-02-02       Impact factor: 6.151

Review 7.  Deep learning techniques have significantly impacted protein structure prediction and protein design.

Authors:  Robin Pearce; Yang Zhang
Journal:  Curr Opin Struct Biol       Date:  2021-02-24       Impact factor: 7.786

8.  Rosetta:MSF:NN: Boosting performance of multi-state computational protein design with a neural network.

Authors:  Julian Nazet; Elmar Lang; Rainer Merkl
Journal:  PLoS One       Date:  2021-08-26       Impact factor: 3.240

9.  De novo design of protein peptides to block association of the SARS-CoV-2 spike protein with human ACE2.

Authors:  Xiaoqiang Huang; Robin Pearce; Yang Zhang
Journal:  Aging (Albany NY)       Date:  2020-06-16       Impact factor: 5.682

10.  Applications of Protein Secondary Structure Algorithms in SARS-CoV-2 Research.

Authors:  Alibek Kruglikov; Mohan Rakesh; Yulong Wei; Xuhua Xia
Journal:  J Proteome Res       Date:  2021-02-22       Impact factor: 4.466

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.