| Literature DB >> 21941539 |
Jonas Winkler1, Giuliano Armano, J Nikolaj Dybowski, Oliver Kuhn, Filippo Ledda, Dominik Heider.
Abstract
Computational design of novel proteins with well-defined functions is an ongoing topic in computational biology. In this work, we generated and optimized a new synthetic fusion protein using an evolutionary approach. The optimization was guided by directed evolution based on hydrophobicity scores, molecular weight, and secondary structure predictions. Several methods were used to refine the models built from the resulting sequences. We have successfully combined two unrelated naturally occurring binding sites, the immunoglobin Fc-binding site of the Z domain and the DNA-binding motif of MyoD bHLH, into a novel stable protein.Entities:
Year: 2011 PMID: 21941539 PMCID: PMC3173724 DOI: 10.1155/2011/457578
Source DB: PubMed Journal: Adv Bioinformatics ISSN: 1687-8027
Figure 1Chart of the design process. We employed a genetic algorithm (GA) with a fitness function based on secondary structure alignments and hydrophobicity and molecular weight comparisons. The resulting sequence set of this iterative process was refined using ERIS to build and rank the models which were then simulated using molecular dynamics simulations in order to estimate stability according to [4]. Amber and Brownian dynamics simulations are applied for testing and refinement of the final optimized protein models.
Figure 21LP1: sequence of the Z domain. 1MDY: part of the sequence of MyoD. Red marked amino acids are used as part of the seed sequence. Seed: seed sequence for the optimization. The blue and magenta marked amino acids are fixed during optimization. The initial population was created by randomly mutating black marked amino acids. JW70: selected model of the optimization aligned to the seed sequence.
Figure 3RMSD plots of the best (solid line) and worst (dashed line) sequences ranked by ERIS after 1000 generation (a) and 2000 generations (b), respectively. Models after 20 ns MD simulations were aligned to the wild-type structure of the Z domain using residues 6–17 and 22–33 (Helix 1 and 2). C RMSD of Helix 3 (residue 39–53) was calculated and smoothed using spline interpolation.
Figure 4(a): JW70 after 20 ns MD simulation (blue) aligned to the structure of the Z domain from 1LP1 after 10 ns MD simulation (orange). (b): model of the seed sequence after 20 ns MD simulation (purple) aligned to the Z domain from 1LP1 (orange). Helix 3, which contains the new DNA-binding site, is shown on top.
Brownian dynamics simulation results.
| model |
| net charge | rel. |
|---|---|---|---|
| WT | 4.66 · 108 | +5 | 1.000 |
| Negative | 0 | −2 | 0.000 |
| Seed | 1.17 · 108 | +5 | 0.251 |
| JW15 | 3.06 · 108 | +7 | 0.657 |
| JW19 | 1.60 · 107 | +3 | 0.034 |
| JW56 | 4.61 · 107 | +5 | 0.099 |
| JW70 | 4.56 · 108 | +5 | 0.978 |
Method comparison.
| method | residues | individuals | generation | sequences | CPU time |
|---|---|---|---|---|---|
| GHH [ | 36 | 8 | 15 | 2 · 120 | 1 year |
| current study | 54 | 600 | 2000 | 1.2 mil | 2 months |