| Literature DB >> 18237407 |
Nicholas Furnham1, Paul Iw de Bakker, Swanand Gore, David F Burke, Tom L Blundell.
Abstract
BACKGROUND: Although comparative modelling is routinely used to produce three-dimensional models of proteins, very few automated approaches are formulated in a way that allows inclusion of restraints derived from experimental data as well as those from the structures of homologues. Furthermore, proteins are usually described as a single conformer, rather than an ensemble that represents the heterogeneity and inaccuracy of experimentally determined protein structures. Here we address these issues by exploring the application of the restraint-based conformational space search engine, RAPPER, which has previously been developed for rebuilding experimentally defined protein structures and for fitting models to electron density derived from X-ray diffraction analyses.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18237407 PMCID: PMC2275734 DOI: 10.1186/1472-6807-8-7
Source DB: PubMed Journal: BMC Struct Biol ISSN: 1472-6807
Templates for Each of the Targets Modelled.
| PID | ||||
| Family | 1 | 2 | 3 | 4 |
| Parv §* | 62.0 | 55.1 | 58.7 | 53.8 |
| 1pvaa | 5cpv | 1pal | 5pal | 1a75a |
| Ghf22 § | 69.9 | 35.8 | 38.2 | 45.5 |
| 1hml | 1hfx | 1lz3 | 1lmn | 1jug |
| Cyt3 § | 86.9 | 52.3 | 40.2 | 29.9 |
| 2cdv | 2cym | 1wad | 3cyr | 2cy3 |
| MHC § | 59.8 | 7.3 | 11.0 | 8.9 |
| 1fv1b | 1iakb | 1iaka | 1fnga | 1fv1a |
| Flav §* | 48.8 | 33.9 | 28.6 | 22.5 |
| 1flv | 1ag9a | 2fcr | 1akr | 5nll |
| Phos § | 36.9 | 40.2 | 40.7 | 34.7 |
| 1bp2 | 1vpi | 1aokb | 1vapa | 1ppa |
| Fabp § | 32.1 | 29.0 | 30.5 | 19.1 |
| 1ifc | 1hmt | 1lif | 1opba | 1mdc |
| Resp § | 32.2 | 25.0 | 22.6 | 23.5 |
| 1kgsa | 1b00a | 1dz3a | 1tmy | 1qkka |
| Asp §* | 54.8 | 40.2 | 26.0 | 27.6 |
| 3app | 4ape | 4apr | 1smra | 1mpp |
| Glob §* | 87.6 | 26.0 | 21.1 | 20.6 |
| 1ymb | 1emy | 2hhbb | 1spgb | 1ecd |
| Az | 59.4 | 69.8 | 53.5 | 61.7 |
| 2azzaa | 1joi | 1rkra | 1cuoa | 1jzga |
| Blm | 43 | 37.1 | 39.5 | 42.2 |
| 4blma | 3blm | 1btl | 1mfo | 1bul |
| Cytc | 57.7 | 53.2 | 45.8 | 32.1 |
| 1yea | 2pcbb | 1ccr | 1cry | 1cxc |
| Egf | 32.4 | 27 | 27 | 21.6 |
| 1esl | 1dan1 | 1dan2 | 1rfnb | 1hcgb |
| Fn3 | 34.8 | 25.3 | 20.9 | 22.5 |
| 1fnf2 | 1fnf3 | 1fnf1 | 1fnf4 | 1ten |
| Gluts | 31.4 | 27.6 | 28.6 | 28.1 |
| 17gsa | 1guka | 1gtua | 1guha | 1gula |
| Gtp | 64.7 | 48.8 | 29.9 | 32.3 |
| 1kao | 1guaa | 3p21 | 1mh1 | 1ftn |
| Igvar-h | 46 | 43.9 | 42.9 | 42.6 |
| 1faihv | 8fabhv | 2fbjhv | 2fb4hv | 7fabhv |
| Igvar-l | 73.7 | 68.7 | 57.3 | 45.4 |
| 2fb4lv | 2rhelv | 2mcglv | 8fablv | 3hflv |
| Ltn | 89.9 | 88.5 | 47.8 | 41.6 |
| 2ltn | 1len | 1loe | 1lte | 1avba |
| Phc | 54.4 | 55 | 37.3 | 33.8 |
| 1phnb | 1liab | 1b8db | 1allb | 1alla |
| Sh3 | 33.3 | 33.3 | 31.6 | 29.6 |
| 1shg | 1shfa | 2src | 1qcfa | 1lcka |
| Tim | 70.3 | 56.6 | 44.7 | 68.5 |
| 1amk | 5tima | 1ypia | 1ydva | 1tcda |
Members of the homologous families (given by their Homstrad abbreviation) used to model the target are shown with the target and its PDB code in the first column. The percentage sequence identity and the PDB code of the four templates used to model on is given in the subsequent columns. The four families used to parameterise are indicated by *, while the ten families that were used to explore the effect of PID are indicated by §.
Fifteen Different Combinations of Templates Used in Exploring the Effect of PID.
| Combination | Template 1 | Template 2 | Template 3 | Template 4 |
| 1 | ✓ | |||
| 2 | ✓ | |||
| 3 | ✓ | |||
| 4 | ✓ | |||
| 5 | ✓ | ✓ | ||
| 6 | ✓ | ✓ | ||
| 7 | ✓ | ✓ | ||
| 8 | · | ✓ | ✓ | |
| 9 | ✓ | ✓ | ||
| 10 | ✓ | ✓ | ||
| 11 | ✓ | ✓ | ✓ | |
| 12 | ✓ | ✓ | ✓ | |
| 13 | ✓ | ✓ | ✓ | |
| 14 | ✓ | ✓ | ✓ | |
| 15 | ✓ | ✓ | ✓ | ✓ |
To explore the effect of the number and diversity of templates on RAPPER building using the Standard mode of conformer generation, fifteen combinations as shown in the table where used. Template 1 was closest by PID to the target with Template 4 the most remote.
All-Atom RMSD for RAPPER Models for 15 Combinations of Template to Target.
| Family | ||||||||||
| Combination | Asp | Glob | Parv | Cyt3 | MHC_II_N | Fabp | Flav | Phoslip | Responce_reg | Ghf22 |
| 1 | 2.07 (99) | 1.37 (100) | 1.34 (100) | 1.11 (100) | 1.95 (96) | 2.22 (100) | 1.50 (100) | 2.85 (100) | 3.28 (100) | 2.61 (100) |
| 2 | 2.71 (100) | 2.87 (100) | 1.68 (99) | 2.25 (99) | 4.62 (98) | 2.10 (100) | 1.58 (99) | 2.73 (100) | 7.37 (100) | 2.42 (100) |
| 3 | 3.77 (100) | 2.88 (100) | 1.38 (100) | 2.20 (100) | 4.20 (98) | 2.56 (100) | 5.80 (100) | 2.57 (100) | 4.34 (100) | 2.26 (100) |
| 4 | 3.47 (100) | 2.95 (99) | 1.40 (100) | 3.94 (99) | 4.37 (96) | 2.60 (100) | 6.49 (98) | 2.85 (100) | 3.82 (99) | 2.51 (100) |
| 5 | 2.47 (99) | 1.62 (100) | 1.30 (96) | 1.70 (100) | 2.74 (98) | 2.16 (100) | 1.55 (100) | 2.70 (100) | 3.60 (98) | 2.51 (100) |
| 6 | 2.71 (99) | 1.68 (98) | 1.18 (95) | 1.25 (99) | 2.18 (98) | 2.16 (100) | 1.51 (99) | 2.64 (100) | 8.40 (88) | 2.43 (100) |
| 7 | 3.08 (99) | 1.59 (100) | 1.52 (100) | 1.46 (100) | 1.83 (95) | 2.13 (98) | 1.70 (100) | 2.67 (100) | 3.46 (100) | 2.52 (100) |
| 8 | 3.36 (100) | 2.94 (98) | 1.26 (96) | 1.94 (98) | 4.00 (95) | 2.28 (100) | 2.09 (100) | 2.50 (100) | 3.87 (99) | 2.31 (100) |
| 9 | 3.13 (99) | 2.68 (100) | 1.63 (93) | 2.35 (99) | 4.56 (98) | 2.19 (98) | 2.18 (99) | 2.83 (100) | 3.77 (100) | 2.17 (100) |
| 10 | 3.71 (99) | 2.63 (100) | 1.40 (100) | 2.27 (98) | 4.34 (98) | 2.30 (98) | 3.45 (93) | 2.52 (100) | 3.90 (100) | 2.17 (100) |
| 11 | 2.30 (99) | 1.74 (98) | 1.17 (95) | 1.75 (99) | 1.98 (95) | 2.20 (99) | 1.55 (99) | 2.65 (100) | 3.52 (98) | 2.41 (100) |
| 12 | 2.68 (99) | 1.68 (100) | 1.53 (93) | 2.01 (100) | 2.88 (98) | 2.05 (98) | 1.78 (100) | 2.64 (100) | 3.72 (99) | 2.56 (100) |
| 13 | 2.80 (99) | 2.08 (96) | 1.33 (93) | 1.77 (99) | 2.24 (98) | 2.17 (98) | 1.78 (99) | 2.64 (100) | 3.46 (100) | 2.50 (100) |
| 14 | 3.08 (99) | 2.52 (96) | 1.37 (93) | 2.16 (100) | 4.11 (95) | 2.14 (98) | 2.22 (100) | 2.78 (100) | 3.18 (67) | 2.20 (100) |
| 15 | 2.52 (99) | 1.99 (100) | 1.35 (93) | 1.75 (99) | 1.98 (95) | 2.01 (98) | 1.83 (99) | 2.98 (100) | 3.50 (100) | 2.43 (100) |
The all-atom RMSD in Ångstrom's for RAPPER models built in the standard mode for ten families. The combinations of template to target are as indicated in Table 3.2. To provide a fair comparison the percentage of residues used in the RMSD calculation is given in parentheses.
All-Atom RMSD for Models Built Using Different RAPPER Restraint Derivations and Templates and for Models Built by Modeller.
| 1.23 | 1.47 | 1.54 | 1.59 | 1.41 | 1.45 | 1.45 | 1.23 | |
| 1.74 | 1.89 | 1.97 | 2.05 | 1.86 | 1.90 | 1.92 | 1.74 | |
| 1.31 | 1.37 | 1.48 | 1.36 | 1.35 | 1.25 | 1.60 | 1.31 | |
| 1.54 | 1.55 | 2.00 | 1.67 | 1.83 | 1.95 | 2.00 | 1.47 | |
| 1.21 | 1.12 | 1.08 | 1.67 | 1.18 | 1.15 | 1.26 | 1.19 | |
| 1.45 | 1.52 | 1.48 | 2.16 | 1.54 | 1.46 | 1.48 | 1.48 | |
| 1.07 | 2.16 | 1.83 | 3.43 | 2.22 | 2.14 | 2.11 | 2.20 | |
| 1.37 | 1.98 | 2.20 | 2.05 | 2.35 | 2.21 | 2.14 | 2.13 | |
| 1.24 | 1.49 | 1.45 | 1.75 | 1.44 | 1.69 | 1.61 | 1.63 | |
| 1.37 | 1.60 | 2.11 | 1.98 | 2.14 | 2.24 | 2.03 | 1.97 | |
| 1.32 | 2.07 | 2.58 | 2.38 | 2.39 | 2.45 | 2.67 | 2.03 | |
| 1.33 | 1.42 | 1.36 | 1.97 | 1.38 | 1.43 | 1.35 | 1.54 | |
| 1.25 | 2.88 | 3.19 | 3.66 | 2.92 | 2.84 | 3.19 | 2.89 | |
| 1.33 | 2.31 | 2.44 | 2.62 | 2.38 | 2.50 | 2.44 | 2.30 | |
| 1.52 | 1.72 | 2.01 | 3.24 | 2.49 | 2.54 | 2.31 | 2.98 | |
| 1.44 | 1.47 | 1.60 | 2.04 | 1.54 | 1.61 | 1.50 | 1.51 | |
| 1.53 | 1.36 | 1.36 | 1.63 | 1.34 | 1.41 | 1.27 | 1.34 | |
| 1.58 | 1.77 | 1.93 | 1.91 | 1.76 | 1.82 | 2.04 | 2.12 | |
| 1.18 | 1.40 | 1.23 | 1.33 | 1.32 | 1.27 | 1.27 | 1.14 | |
| 1.14 | 1.56 | 1.68 | 1.59 | 1.53 | 1.58 | 1.99 | 1.74 | |
| 1.34 | 2.95 | 2.84 | 2.65 | 2.67 | 2.54 | 2.67 | 2.37 | |
| 1.23 | 2.69 | 3.14 | 3.41 | 3.06 | 3.35 | 3.03 | 3.26 | |
| 1.38 | 2.09 | 2.22 | 2.24 | 2.21 | 2.18 | 2.50 | 2.07 | |
| 1.40 | 1.24 | 1.42 | 1.63 | 1.26 | 1.33 | 1.41 | 1.37 | |
| 1.35 | 1.80 | 1.92 | 2.17 | 1.90 | 1.93 | 1.97 | 1.88 | |
The all-atom RMSD given in Ångstrom's for RAPPER models built using A. Cα-trace mode; B. Using optimal spatial restraints mode using based on all templates; C. stranded restraint derivation using just the closest template by PID; D. stranded restraint derivation using all templates; E. restraints predicted by CHORAL and ANDANTE using all templates; F. using restraints based on the PID of the templates using all templates; G. using MODELLER based on just the closest templates and H. using MODELLER based on all the templates.
Statistical Analysis of Modelling Methods.
| Relation | A < B | A < C | A < D | A < E | A < F | A < G | A < H | |
| Score | 3.20E-04 | 6.25E-05 | 5.53E-06 | 4.89E-05 | 7.61E-05 | 2.27E-05 | 1.72E-04 | |
| P 0.01 | TRUE | TRUE | TRUE | TRUE | TRUE | TRUE | TRUE | |
| P 0.05 | TRUE | TRUE | TRUE | TRUE | TRUE | TRUE | TRUE | |
| Relation | B < C | B < D | B < E | B < F | B < G | B < H | ||
| Score | 4.05E-03 | 9.63E-05 | 1.29E-02 | 1.84E-02 | 7.69E-04 | 1.28E-01 | ||
| P 0.01 | TRUE | TRUE | FALSE | FALSE | TRUE | FALSE | ||
| P 0.05 | TRUE | TRUE | FALSE | FALSE | TRUE | FALSE | ||
| Relation | C < D | C < E | C < F | G < H | F < H | C < G | C < H | |
| Score | 7.67E-03 | 4.00E-01 | 2.58E-01 | 5.28E-02 | 2.99E-01 | 6.43E-02 | 2.37E-01 | |
| P 0.01 | TRUE | FALSE | FALSE | FALSE | FALSE | FALSE | FALSE | |
| P 0.05 | TRUE | FALSE | FALSE | FALSE | FALSE | FALSE | FALSE | |
The statistical significance of the differences in all-atom RMSD between the modelling methods as reported in Table 3.4. The differences are assessed using a t-test of a paired sample between two means. The truth logic given is for p values of 0.01 and 0.05.
Figure 1Contribution to Overall RMSD by Individual Residue. The per-residue all-atom RMSD for models generated by RAPPER (solid red) and MODELLER (dotted green) for a target of the Ghf22 family. The greatest contribution to the overall RMSD can be seen to be from the C-terminal residues. If these are excluded from the overall RMSD the recalculated RMSD is the same for both modelling procedures.
All-Atom RMSD's for RAPPER Using Two Different Restraint Derivations Compared to Those for MODELLER.
| All-atom RAPPER PDF (Å) | All-atom RAPPER Standard (Å) | Modeller (Å) | |
| Az | 1.24 | 1.62 | 1.30 |
| Asp | 1.89 | 2.47 | 1.54 |
| Blm | 1.56 | 2.06 | 1.55 |
| Cyt3 | 1.50 | 1.99 | 1.19 |
| Cytc | 2.07 | 2.53 | 2.11 |
| Egf | 2.64 | 2.28 | 2.64 |
| Fabp | 2.18 | 2.36 | 2.15 |
| Flav‡ | 2.30 | 1.78 | 1.65 |
| Fn3 | 2.02 | 2.35 | 2.02 |
| Ghf22 | 1.73 | 2.50 | 1.88 |
| Glob‡ | 2.00 | 2.10 | 1.50 |
| Gluts | 2.24 | 3.66 | 2.84 |
| Gtp | 1.88 | 2.21 | 1.78 |
| Igvar-h | 2.32 | 2.79 | 2.36 |
| Igvar-l | 1.66 | 1.78 | 1.41 |
| Ltn | 1.66 | 1.78 | 1.41 |
| MHC_II_N | 2.61 | 2.62 | 2.45 |
| Parv | 1.14 | 1.73 | 1.21 |
| Phc | 1.46 | 1.81 | 1.52 |
| Phoslip | 1.87 | 2.56 | 1.90 |
| Response_reg | 2.41 | 2.64 | 2.26 |
| Sh3 | 1.76 | 2.34 | 1.88 |
| Tim | 1.18 | 1.58 | 1.18 |
| Mean | 1.87 | 2.23 | 1.81 |
The RMSD for models of each of the family targets generated using RAPPER with restraints derived from PDF's, RAPPER with the standard restraint derivation procedure and MODELLER. The RMSD's for MODLLER and Standard RAPPER differ from the equivalent RMSD's in Tables 3 and 4 as they are calculated over the equivalent number of residues for all models in each family. The two families with difficulties in PDF generation are indicated by ‡.
Figure 2Problems in Deriving PDF's for Flav Family. A. The superimposed templates in gray with the derived centres of the PDF's shown as yellow spheres. Note the divergent loop on the left. The target structure is shown in green. B. The resulting models from different modes of building in RAPPER: RAPPER-PDF in gray, RAPPER-CHORAL in blue, RAPPER-Standard in yellow and MODELLER in pink. The target structure is also shown in green.
Figure 3Comparison of RAPPER and NMR Ensembles to the Crystallographic Model. Comparison of a RAPPER ensemble of comparative models for the target 1PVA chain A from the Parvalbulmin family with an NMR ensemble, the crystal structure and the deposited representative NMR structure. A: The backbone trace of 9 models from the RAPPER ensemble (cyan) generated by comparative modelling on all targets and the equivalent models generated by NMR (blue). Also shown are the deposited crystal structure (red) and the representative NMR single model (orange). All models are superimposed with reference to the crystal structure. B: The plot of ensemble mean and mode for each residue in the RAPPER ensemble. C: The 1st derivative of the per residue ensemble mean for RAPPER (red) and the NMR ensemble (green). D: The all atom per residue RMSD for the RAPPER representative single model (red) compared to the equivalent single NMR representative model (green).
Figure 4Schematic of RAPPER Conformer Generation Applied to Comparative Modelling.
Figure 5Centres of PDF's Compared to the Target. PDF's and target for the Ltn family. The centres of each PDF shown as a space filled sphere with the ribbon trace of the target in red. Note that the size of the sphere does not represent the size of the PDF sphere enforced in RAPPER.