Literature DB >> 18707538

Minimizing and learning energy functions for side-chain prediction.

Chen Yanover1, Ora Schueler-Furman, Yair Weiss.   

Abstract

Side-chain prediction is an important subproblem of the general protein folding problem. Despite much progress in side-chain prediction, performance is far from satisfactory. As an example, the ROSETTA program that uses simulated annealing to select the minimum energy conformations, correctly predicts the first two side-chain angles for approximately 72% of the buried residues in a standard data set. Is further improvement more likely to come from better search methods, or from better energy functions? Given that exact minimization of the energy is NP hard, it is difficult to get a systematic answer to this question. In this paper, we present a novel search method and a novel method for learning energy functions from training data that are both based on Tree Reweighted Belief Propagation (TRBP). We find that TRBP can obtain the global optimum of the ROSETTA energy function in a few minutes of computation for approximately 85% of the proteins in a standard benchmark set. TRBP can also effectively bound the partition function which enables using the Conditional Random Fields (CRF) framework for learning. Interestingly, finding the global minimum does not significantly improve side-chain prediction for an energy function based on ROSETTA's default energy terms (less than 0:1%), while learning new weights gives a significant boost from 72% to 78%. Using a recently modified ROSETTA energy function with a softer Lennard-Jones repulsive term, the global optimum does improve prediction accuracy from 77% to 78%. Here again, learning new weights improves side-chain modeling even further to 80%. Finally, the highest accuracy (82.6%) is obtained using an extended rotamer library and CRF learned weights. Our results suggest that combining machine learning with approximate inference can improve the state-of-the-art in side-chain prediction.

Entities:  

Mesh:

Year:  2008        PMID: 18707538     DOI: 10.1089/cmb.2007.0158

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  12 in total

1.  Integrative structure modeling of macromolecular assemblies from proteomics data.

Authors:  Keren Lasker; Jeremy L Phillips; Daniel Russel; Javier Velázquez-Muriel; Dina Schneidman-Duhovny; Elina Tjioe; Ben Webb; Avner Schlessinger; Andrej Sali
Journal:  Mol Cell Proteomics       Date:  2010-05-27       Impact factor: 5.911

2.  Expanded explorations into the optimization of an energy function for protein design.

Authors:  Yao-Ming Huang; Christopher Bystroff
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2013 Sep-Oct       Impact factor: 3.710

3.  Greedy Direction Method of Multiplier for MAP Inference of Large Output Domain.

Authors:  Xiangru Huang; Qixing Huang; Ian E H Yen; Pradeep Ravikumar; Ruohan Zhang; Inderjit S Dhillon
Journal:  JMLR Workshop Conf Proc       Date:  2017-04

4.  Comparative evaluation of spin-label modeling methods for protein structural studies.

Authors:  Maxx H Tessmer; Elizabeth R Canarie; Stefan Stoll
Journal:  Biophys J       Date:  2022-08-10       Impact factor: 3.699

5.  Side-chain Packing Using SE(3)-Transformer.

Authors:  Akhil Jindal; Sergei Kotelnikov; Dzmitry Padhorny; Dima Kozakov; Yimin Zhu; Rezaul Chowdhury; Sandor Vajda
Journal:  Pac Symp Biocomput       Date:  2022

6.  Inferential optimization for simultaneous fitting of multiple components into a CryoEM map of their assembly.

Authors:  Keren Lasker; Maya Topf; Andrej Sali; Haim J Wolfson
Journal:  J Mol Biol       Date:  2009-02-20       Impact factor: 5.469

Review 7.  Template-based protein modeling: recent methodological advances.

Authors:  Pankaj R Daga; Ronak Y Patel; Robert J Doerksen
Journal:  Curr Top Med Chem       Date:  2010       Impact factor: 3.295

Review 8.  Principles of flexible protein-protein docking.

Authors:  Nelly Andrusier; Efrat Mashiach; Ruth Nussinov; Haim J Wolfson
Journal:  Proteins       Date:  2008-11-01

9.  Beyond rotamers: a generative, probabilistic model of side chains in proteins.

Authors:  Tim Harder; Wouter Boomsma; Martin Paluszewski; Jes Frellsen; Kristoffer E Johansson; Thomas Hamelryck
Journal:  BMC Bioinformatics       Date:  2010-06-05       Impact factor: 3.169

10.  Fast and accurate prediction of protein side-chain conformations.

Authors:  Shide Liang; Dandan Zheng; Chi Zhang; Daron M Standley
Journal:  Bioinformatics       Date:  2011-08-27       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.