| Literature DB >> 19433514 |
Julien Maupetit1, Philippe Derreumaux, Pierre Tuffery.
Abstract
Rational peptide design and large-scale prediction of peptide structure from sequence remain a challenge for chemical biologists. We present PEP-FOLD, an online service, aimed at de novo modelling of 3D conformations for peptides between 9 and 25 amino acids in aqueous solution. Using a hidden Markov model-derived structural alphabet (SA) of 27 four-residue letters, PEP-FOLD first predicts the SA letter profiles from the amino acid sequence and then assembles the predicted fragments by a greedy procedure driven by a modified version of the OPEP coarse-grained force field. Starting from an amino acid sequence, PEP-FOLD performs series of 50 simulations and returns the most representative conformations identified in terms of energy and population. Using a benchmark of 25 peptides with 9-23 amino acids, and considering the reproducibility of the runs, we find that, on average, PEP-FOLD locates lowest energy conformations differing by 2.6 A Calpha root mean square deviation from the full NMR structures. PEP-FOLD can be accessed at http://bioserv.rpbs.univ-paris-diderot.fr/PEP-FOLD.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19433514 PMCID: PMC2703897 DOI: 10.1093/nar/gkp323
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.PEP-FOLD flowchart.
Figure 2.PEP-FOLD sample output for the peptide 1UAO. A 2D energy plot. For each cluster (x-axis), we show the energy of its centroid (y-axis). The size of the circle is proportional to the cluster population which is also given. The dashed lines help locate the LEC. (B) Left: LEC. Right: the representatives of the best clusters are superimposed on the LEC to illustrate conformational variability. Pictures are obtained using the PyMol software.
PEP-FOLD results on 25 peptides with 9–25 amino acids in aqueous solution
| PDB | Class | LEC energy | Max Δ | Native cRMSD | LEC cRMSD | |
|---|---|---|---|---|---|---|
| 1a13 | 14 | – | −26.5 | 0.0 | 1.8 | 0.0 |
| 1b03 | 18 | B | −23.6 | 1.4 | 2.0–3.1 | 2.1 |
| 1dep | 15 | A | −30.7 | 0.0 | 1.7 | 0.0 |
| 1du1 | 20 | A | −45.1 | 0.0 | 5.1 | 0.0 |
| 1e0q | 17 | B | −26.7 | 0.2 | 2.7–5.7 | 2.2 |
| 1egs | 9 | – | −3.9 | 0.0 | 1.5 | 0.0 |
| 1gjf | 14 | A | −24.7 | 0.0 | 2.4–2.5 | 0.1 |
| 1in3 | 12 | A | −25.8 | 0.0 | 2.3 | 0.0 |
| 1k43 | 14 | B | −20.9 | 0.4 | 1.5–1.7 | 0.8 |
| 1l2y | 20 | A | −28.5 | 0.1 | 2.1–2.3 | 0.2 |
| 1l3q | 12 | – | −6.1 | 0.7 | 3.3–5.3 | 2.5 |
| 1lcx | 13 | – | −28.0 | 0.3 | 2.8 | 0.0 |
| 1le1 | 12 | B | −18.2 | 0.0 | 1.0 | 0.0 |
| 1le3 | 16 | B | −24.2 | 3.9 | 1.2–2.7 | 1.1 |
| 1niz | 14 | B | −19.1 | 0.7 | 2.1–2.7 | 0.7 |
| 1nkf | 16 | A | −18.9 | 0.0 | 4.3–4.5 | 0.3 |
| 1pef | 18 | A | −58.2 | 0.0 | 0.9 | 0.0 |
| 1pei | 22 | A | −50.3 | 0.0 | 1.6 | 0.0 |
| 1pgbF | 16 | B | −21.6 | 2.7 | 2.1–2.4 | 2.4 |
| 1rpv | 17 | A | −28.9 | 0.0 | 0.6 | 0.1 |
| 1uao | 10 | B | −7.3 | 0.1 | 2.0 | 0.1 |
| 1wbr | 17 | – | −33.9 | 0.0 | 3.5 | 0.0 |
| 1wz4 | 23 | A | −31.2 | 0.6 | 5.6–6.4 | 2.8 |
| 2bta | 15 | – | −23.5 | 0.0 | 4.5 | 0.0 |
| 2evq | 12 | B | −16.9 | 0.0 | 0.9 | 0.0 |
PDB: Protein Data Bank identifier. L: peptide length. Class: structural class (A for α, B for β and - for none), according to STRIDE (29) program. Each target is subject to five independent runs, each of 50 greedy simulations. For each peptide, we report, the LEC (kcal/mole) from run 1 and the maximal energy difference (Max ΔE) between the five LECs from runs 1–5. We also give the minimal and maximal cRMSD (in angstroms) of the five LECs with respect to the NMR structure (native cRMSD). Finally, we report the average cRMSD between the five LECs (LEC cRMSD). Note that 1pgbF corresponds to the fragment 41–56 of 1pgb, and the NMR structure used for cRMSD calculation is model 1, except for 1a13, 1b03, 1e0q, 1egs, 1lcx, 1pei, 1rpv, 1wbr and 1wz4 where the reference models are 10, 4, 8, 3, 25, 10, 12, 23 and 2, respectively.
Figure 3.PEP-FOLD server-generated models. The lowest energy model (blue) is superposed on the experimental structure of 1UAO in aqueous solution and 2MAG in TFE/water (magenta) using iSuperpose online facility (http://mobyle.rpbs.univ-paris-diderot.fr/). For 2MAG, we also show the lowest RMSD structure (green) with respect to reference state. For 1UAO, we also show the all-atom predicted and experimental structures.