
Sampling multiple scoring functions can improve protein loop structure prediction accuracy.

Yaohang Li, Ionel Rata, Eric Jakobsson.

Abstract

Accurately predicting loop structures is important for understanding the functions of many proteins. To obtain highly accurate loop models, efficiently sampling the loop conformation space to discover reasonable structures is a critical step. In loop conformation sampling, coarse-grained energy (scoring) functions coupled with reduced protein representations are often used to reduce both the number of degrees of freedom and the computational time of sampling. However, because reduced representations account for many factors only implicitly, coarse-grained scoring functions can be insensitive and inaccurate, which can mislead the sampling process and cause important loop conformations to be missed. In this paper, we present a new computational sampling approach for obtaining reasonable loop backbone models, called the Pareto optimal sampling (POS) method. The rationale of the POS method is to sample the function space of multiple, carefully selected scoring functions to discover an ensemble of diversified structures that are Pareto optimal with respect to all sampled conformations. The POS method can tolerate insensitivity and inaccuracy in individual scoring functions and thereby lead to significant accuracy improvement in loop structure prediction. We apply the POS method to a set of 4-12-residue loop targets using a function space composed of backbone-only Rosetta, distance-scaled finite ideal-gas reference (DFIRE), and a triplet backbone dihedral potential developed in our lab. Our computational results show that in 501 out of 502 targets, the model sets generated by POS contain structure models within subangstrom resolution. Moreover, the top-ranked models have a root mean square deviation (RMSD) of less than 1 Å in 96.8, 84.1, and 72.2% of the short (4-6 residues), medium (7-9 residues), and long (10-12 residues) targets, respectively, when the all-atom models are generated by local optimization from the backbone models and are ranked by our recently developed Pareto optimal consensus (POC) method. Similar sampling effectiveness is also found in a set of 13-residue loop targets.
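The abstract does not spell out how Pareto optimality is evaluated, so the following is only a minimal sketch of the underlying idea, not the authors' POS implementation. It assumes each conformation has one score per scoring function and that lower scores are better; the names `dominates`, `pareto_front`, and the example score values are hypothetical and chosen purely for illustration. A conformation is kept if no other conformation is at least as good under every scoring function and strictly better under at least one.

```python
from typing import List, Sequence


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Return True if score vector a dominates b.

    Assumption: lower is better for every scoring function.
    a dominates b when a is no worse everywhere and strictly better somewhere.
    """
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


def pareto_front(scores: List[Sequence[float]]) -> List[int]:
    """Return indices of conformations that no other conformation dominates."""
    return [
        i
        for i, s in enumerate(scores)
        if not any(dominates(t, s) for j, t in enumerate(scores) if j != i)
    ]


# Hypothetical scores for five loop conformations under three scoring functions
# (the columns stand in for three different potentials; values are made up).
example_scores = [
    (-10.0, -3.0, -1.0),
    (-8.0, -5.0, -2.0),
    (-7.0, -2.0, -0.5),   # worse than the first entry under all three functions
    (-10.0, -3.0, -1.0),  # identical to the first entry, so neither dominates
    (-12.0, -1.0, -3.0),
]

front = pareto_front(example_scores)
print(front)
```

Because no single scoring function decides which conformations survive, a structure that one function scores poorly can still be retained if another function favors it, which is the tolerance to individual scoring errors that the abstract describes. This pairwise check is quadratic in the number of conformations; it is meant to illustrate the definition rather than to be efficient.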

Year:  2011        PMID: 21702492      PMCID: PMC3211142          DOI: 10.1021/ci200143u

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


