| Literature DB >> 23565140 |
Lucas Gregorio Nivón1, Rocco Moretti, David Baker.
Abstract
Computational design of protein function involves a search for amino acids with the lowest energy subject to a set of constraints specifying function. In many cases a set of natural protein backbone structures, or "scaffolds", are searched to find regions where functional sites (an enzyme active site, ligand binding pocket, protein-protein interaction region, etc.) can be placed, and the identities of the surrounding amino acids are optimized to satisfy functional constraints. Input native protein structures almost invariably have regions that score very poorly with the design force field, and any design based on these unmodified structures may result in mutations away from the native sequence solely as a result of the energetic strain. Because the input structure is already a stable protein, it is desirable to keep the total number of mutations to a minimum and to avoid mutations resulting from poorly-scoring input structures. Here we describe a protocol using cycles of minimization with combined backbone/sidechain restraints that is Pareto-optimal with respect to RMSD to the native structure and energetic strain reduction. The protocol should be broadly useful in the preparation of scaffold libraries for functional site design.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23565140 PMCID: PMC3614904 DOI: 10.1371/journal.pone.0059004
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Restraint-protocol evaluation.
Average all-atom RMSD to the original structure versus per residue energies across the 51 protein test set for relax runs using different restraint parameters. The relax protocol conducts cycles of sidechain repacking and all-atom minimization while ramping the repulsive energy up and down – the protocol without additional restraints is the “unrestrained” set. The harmonic set adds all-atom restraints using a harmonic potential with a width specified by the sd parameter. The bounded sets employ restraints with a zero value within a set “width” parameter and a linear value specified by an sd term further from the origin. The sidechain-sidechain restraints place harmonic restraints between pairs of sidechain atoms at a specified cutoff distance, which can be varied to produce more or less restrictive restraints.
RMSD after restrained relax with various restraint protocols and sequence recovery after Rosetta design.
| Pre-treatment Protocol | Protocol Number | Seq. rec. in design | All-atom RMSD (Å) | Energy per residue (after relax) |
| No relax | 1 | 0.397 | 0 | −0.36 |
| relax no restraint | 2 | 0.534 | 0.956 | −2.13 |
| relax + backbone restraint (BB) | 3 | 0.471 | 1.002 | −1.98 |
| relax + BB + Sidechain-sidechain distance restraint (BB+scsc) | 4 | 0.432 | 0.338 | −1.60 |
| relax + BB + Sidechain coordinate restraint (BB + sccoord) | 5 | 0.431 | 0.176 | −1.54 |
| Relax + ex rotamers (6x slower) | 6 | 0.436 | 0.176 | −1.55 |
Sequence recovery is the mean over 50 separate runs. Calculations were performed with Rosetta r48056, no hydrogen-bond flags in relax, and using score12_w_corrections for relax. All protocols use the relax algorithm with various added restraints, indicated by the protocol number and description.
RMSD after restrained relax, and sequence recovery after Rosetta design over a 36-protein subset using electron density data.
| Pre-treatment Protocol | Protocol Number | Seq. rec. in design | All-atom RMSD (Å) | Energy per residue (after relax) |
| No relax | 1 | 0.389 | 0 | −0.61 |
| relax no restraint | 2 | 0.511 | 1.512 | −2.13 |
| relax + BB + electron density | 7 | 0.391 | 0.441 | −1.15 |
| relax + BB + sccoord | 5 | 0.418 | 0.176 | −1.62 |
The same tests as in Table 1 were run on the 36-protein subset for which electron density was available, but the less-optimal relax restraint protocols were not tested for this smaller set. Density data were taken from the http://eds.bmc.uu.se/eds/server.