| Literature DB >> 24688696 |
Abstract
Accurately modeling protein loops is an important step to predict three-dimensional structures as well as to understand functions of many proteins. Because of their high flexibility, modeling the three-dimensional structures of loops is difficult and is usually treated as a "mini protein folding problem" under geometric constraints. In the past decade, there has been remarkable progress in template-free loop structure modeling due to advances of computational methods as well as stably increasing number of known structures available in PDB. This mini review provides an overview on the recent computational approaches for loop structure modeling. In particular, we focus on the approaches of sampling loop conformation space, which is a critical step to obtain high resolution models in template-free methods. We review the potential energy functions for loop modeling, loop buildup mechanisms to satisfy geometric constraints, and loop conformation sampling algorithms. The recent loop modeling results are also summarized.Entities:
Year: 2013 PMID: 24688696 PMCID: PMC3962101 DOI: 10.5936/csbj.201302003
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Figure 1Distribution of loop lengths in the protein chain list generated by the PISCES server [21] on Aug. 28, 2012 containing 13255 chains with 2.0A resolution, 90% sequence identity, and 0.25 R-factor cutoff.
Figure 2φ-ψ propensity maps of Leucine in the loops in presence of hydrophobic neighbors (ILE and VAL): (a) LEU as a singlet; (b, c, d) LEU with ILE and VAL as the nearest, one position away, and two positions away neighbors in sequence. The nearest and second nearest neighbors have strong influences to the backbone torsion angle conformations of Leucine and the influences from further neighbors are significantly weakened.
Figure 3(a) Multiple energy functions coordinate plot of loop 1btkA(14:24) decoys in Jacobson loop decoy set using 5 energy functions, including Rosetta, DOPE, dDFIRE, backbone torsion potential using triplets, and OPLS-AA. All scores are linearly normalized in [0, 1]. RMSD is calculated for all backbone atoms in the loop. None of these energy functions can identify a near native decoy (< 1.0A) with the lowest energy value. (b) Native loop (gold) and loop decoys with lowest scores in Rosetta (blue, 2.73A), DOPE and dDFIRE (green, 2.85A), Triplet (red, 2.34A), and OPLS-AA (purple 2.27A).
Figure 4Addressing φ-ψ angles of a 4-residue loop to bridge the gap between the targeted anchored points
Energy functions, sampling methods, and loop closure mechanisms in recently published loop structure modeling works.
| Loop Modeling Methods | Energy Functions | Coarse-grained Sampling | Fine-grained Sampling | Loop Closure |
|---|---|---|---|---|
| Fiser et al. [ | Statistical potential integrating simple restraints or pseudo-energy terms | Random Buildup | Conjugate gradients – MD with simulated annealing – Conjugate gradients | Guaranteed in random buildup |
| Deane and Blundell [ | Statistical all-atom, distance-dependent conditional probability function | Search Polypeptide Fragment Database | - | Filtering based on closure gap |
| Deane and Blundell [ | ||||
| Xiang et al. [ | Colony energy | Random Buildup | Fast torsion minimizer | Random tweak |
| DePristo et al. [ | RAPDF-1 and RAPDF-2 (coarse) | Sample dihedral angles from fine-grained torsion angle state sets | Limited-memory BFGS | Gap-closure restraint |
| De Bakker et al. [ | AMBER-GBSA (fine) | |||
| Rohl et al. [ | Rosetta | MC, Simulated Annealing | MC energy minimization of all-atom Rosetta scoring function | Gap closure term in energy function [ |
| Mandell et al. [ | Kinematic closure [ | |||
| Jacobson et al. [ | OPLS-AA SGB (A hydrophobic term is added later in [ | Rotamer Library Buildup | PLOP (Truncated Newton Local Optimization) | Meet in the middle |
| Zhu et al. [ | ||||
| Zhao et al. [ | ||||
| (PLOP) | ||||
| Spassov et al. [ | CHARMM with polar hydrogen force field parameters | Sampling backbone torsion angles in low energy basins of iso-energy contour | Newton-Raphson Minimization | Meet in the middle |
| Soto et al. [ | DFIRE (coarse) | Random sampling (same as LOOPY) | PLOP | Direct tweak |
| OPLS/SBG-NP (fine) | ||||
| Cui et al. [ | Grid-based force field | Local move MC, Simulated Annealing | Steepest Descent Energy Minimization | Filter local moves with reverse proximity criterion |
| Jamroz and Kolinski [ | - | Hybrid Modeller, Rosetta, and CABS | - | |
| Lee et al. [ | DFIRE | Fragment Assembly | Side Chain Optimization | Analytical loop closure |
| Li et al. [ | Rosetta, DFIRE, Triplet | MOMCMC | PLOP | CCD |
| Liang et al. [ | Backbone potential, OSCAR force field, OPLS/SGB-NP | Random Buildup | Energy Minimization | CCD |
Loop prediction accuracy in recently published works. The number of loop targets is specified in curly brackets.
| Average RMSD (A) {Number of Loop Targets} | ||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Methods | Data Source | Loop Length | ||||||||||||||||
| 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 + | ||
| Fiser et al. [ | Figure 9 | 0.5 | 0.5 | 1 | 1 | 2 | 2 | 2.5 | 3.5 | 3.5 | 5.5 | 6 | 6.5 | 6 | - | - | - | - |
| {40} | {40} | {40} | {40} | {40} | {40} | {40} | {40} | {40} | {40} | {40} | {40} | {40} | ||||||
| Deane and Blundell [ | Table V | - | 0.78 | 1.09 | 1.96 | 2.36 | 3.29 | 3.5 | - | - | - | - | - | - | - | - | - | - |
| {153} | {184} | {162} | {97} | {78} | {60} | |||||||||||||
| Xiang et al. [ | Table I | - | - | - | 0.85 | 0.92 | 1.23 | 1.45 | 2.68 | 2.21 | 3.52 | 3.42 | - | - | - | - | - | - |
| {161} | {107} | {74} | {61} | {58} | {34} | {37} | {21} | |||||||||||
| De Bakker et al. [ | Table III | 0.35 | 0.37 | 0.47 | 0.9 | 0.95 | 1.37 | 2.28 | 2.41 | 3.48 | 4.94 | 4.99 | - | - | - | - | - | - |
| {34} | {34} | {35} | {35} | {36} | {38} | {32} | {37} | {37} | {33} | {34} | ||||||||
| Rohl et al. [ | Table II and Table VI | - | - | 0.69 | - | - | - | 1.45 | - | - | - | 3.62 | 5.15 | |||||
| {40} | {40} | {40} | {avg. over 10 13- to 35-residue loops} | |||||||||||||||
| Jacobson et al. [ | Table IX | - | - | 0.2 | 0.24 | 0.28 | 0.3 | 0.44 | 0.51 | 1.09 | 1.87 | 1.93 | - | - | - | - | - | - |
| {35} | {117} | {100} | {82} | {66} | {57} | {40} | {18} | {10} | ||||||||||
| Zhu et al. [ | Table II | - | - | - | - | - | - | - | - | - | 1 | 1.15 | 1.25 | - | - | - | - | - |
| {38} | {31} | {35} | ||||||||||||||||
| Spassov et al. [ | Table I | 0.26 | 0.31 | 0.42 | 0.49 | 0.81 | 1.07 | 1.33 | 1.63 | 2.66 | 3.35 | 4.08 | - | - | - | - | - | - |
| {40} | {40} | {40} | {40} | {40} | {40} | {40} | {40} | {40} | {40} | {40} | ||||||||
| Soto et al. [ | Table V | - | - | - | - | - | - | 1.31 | 1.88 | 1.93 | 2.5 | 2.65 | 3.74 | - | - | - | - | - |
| {63} | {56} | {40} | {54} | {40} | {40} | |||||||||||||
| Cui et al. [ | Table I | - | - | 0.75 | - | - | - | - | - | - | - | - | - | |||||
| {avg. over 14 4- to 9-residue loops} | ||||||||||||||||||
| Mandell et al. [ | Figure 2 | - | - | - | - | - | - | - | - | - | - | 0.8 | - | - | - | - | - | - |
| {63} | ||||||||||||||||||
| Jamroz and Kolinski [ | Table II | - | - | 1.07 | 2.23 | - | - | - | 7.87 | |||||||||
| {49} | {64} | {73} | ||||||||||||||||
| Lee et al. [ | Table IV | - | - | 0.54 | 0.92 | 1.36 | 1.17 | 1.87 | 2.08 | 3.09 | 3.43 | 3.84 | - | - | - | - | - | - |
| {35} | {35} | {36} | {38} | {32} | {37} | {37} | {33} | {34} | ||||||||||
| Li et al. [ | Tables II and III | - | - | 0.33 | 0.58 | 0.86 | 0.63 | - | - | - | - | - | ||||||
| {252} | {205} | {68} | {35} | |||||||||||||||
| Zhao et al. [ | Table III | - | - | - | - | - | - | - | - | - | - | - | - | 1.19 | 1.55 | 1.43 | 2.3 | - |
| {36} | {30} | {14} | {9} | |||||||||||||||
| Liang et al. [ | Table III | - | - | 0.4 | 0.52 | 0.7 | 0.83 | 1.1 | 1.6 | 2.08 | 2.73 | 3.58 | - | - | - | - | - | - |
| {2809} | {1863} | {1456} | {1053} | {862} | {634} | {528} | {392} | {325} | ||||||||||