| Literature DB >> 18022190 |
Ilkka Lappalainen1, Michael G Hurley, Jane Clarke.
Abstract
A number of beta-sandwich immunoglobulin-like domains have been shown to fold using a set of structurally equivalent residues that form a folding nucleus deep within the core of the protein. Formation of this nucleus is sufficient to establish the complex Greek key topology of the native state. These nucleating residues are highly conserved within the immunoglobulin superfamily, but are less well conserved in the fibronectin type III (fnIII) superfamily, where the requirement is simply to have four interacting hydrophobic residues. However, there are rare examples where this nucleation pattern is absent. In this study, we have investigated the folding of a novel member of the fnIII superfamily whose nucleus appears to lack one of the four buried hydrophobic residues. We show that the folding mechanism is unaltered, but the folding nucleus has moved within the hydrophobic core.Entities:
Mesh:
Substances:
Year: 2007 PMID: 18022190 PMCID: PMC2291451 DOI: 10.1016/j.jmb.2007.09.088
Source DB: PubMed Journal: J Mol Biol ISSN: 0022-2836 Impact factor: 5.469
Figure 1The structure of the CAfn2 (left, green, PDB code 1K85) and TNfn3 (right, orange, PDB core 1TEN). The side-chains forming the putative folding nucleus in both structures are shown in blue. In most fnIII domains these residues form a ring of interactions deep within the core, as is shown for TNfn3. The polar side-chain of N40 in CAfn2 is not interacting with the other residues in the folding nucleus. The CAfn2 has the same topology as all other fnIII domains: seven β-strands that arrange into two β-sheets. The first sheet is formed of the A, B and E-strands, and the second sheet is formed of the C′, C, F and G-strands. Figures 1, 5 and6 were made using PyMol [http://pymol.sourceforge.net/].
Figure 2The fnIII sequences containing only a single aromatic residue within the predicted obligatory folding nucleus. The frequency of the appearance of a given amino acid in any position is shown on the y-axis. The majority of the sequences have either phenylalanine at the F-strand or tyrosine at the C-strand folding position.
Figure 3A simplified representation of the CAfn2 (left, green) and TNfn3 (right, orange) structures showing the mutated positions within each strand. The core of the proteins is divided into six layers, residues within the same layer pack against each other on opposite β-sheets. The differences in loop lengths between CAfn2 and TNfn3 are shown for each loop in the CAfn2 representation (+ means the CAfn2 loop is longer, –– means the CAfn2 loop is shorter). The Φ-values are marked in blue for each position. The obligatory folding positions are shown as a red oval.
Figure 4Chevron plots for each mutation according to the β-strand. The observed rate constants (k) are measured in s− 1 and the concentration of urea ([urea]) is measured in M.
Structural details at each position mutated in CAfn2
| Mutant | Position | SASA (%) | Deleted contacts: residue number (number
of contacts deleted) |
|---|---|---|---|
| L10A | A3 | 0 | 11(2), 22(12), 23(1), 24(17), 66(1), 67(2), 68(9), 81(6), 82(1), 84(9) |
| S12A | A2 | 11 | 11(1), 15(1), 20(2), 22(1), 84(3), 86(3) |
| I20A | B2 | 0 | 12(4), 15(10), 21(2), 22(6), 55(6), 58(5), 66(13), 86(7), 88(1) |
| L22A | B3 | 0 | 10(11), 12(3), 20(6), 24(3), 38(6), 53(3), 55(4), 66(13), 68(3), 84(8), 86(2) |
| V33A | BC-loop | 3 | 4(5), 27(4), 29(1), 30(2), 34(2), 36(6), 70(2), 72(2) |
| Y36L | C5 | 4 | 7(7), 24(20), 26(4), 27(8), 33(5), 51(3), 70(4) |
| Y36F | C5 | 4 | 7(1), 24(3), 26(1), 27(2), 33(2), 51(1), 70(1) |
| V38A | C4 | 0 | 22(7), 24(4), 39(1), 45(1), 46(4), 48(3), 53(2), 55(5), 66(6), 67(1), 68(4) |
| N40A | C3 | 18 | 45(3), 55(1), 58(3), 64(14), 65(2), 66(15) |
| L44A | C′3 | 25 | 37(11), 38(1), 39(15), 45(2), 47(5) |
| T46A | C′4 | 10 | 38(4), 45(1), 47(2), 48(4), 53(1), 54(2), 55(7) |
| V48A | C′5 | 8 | 24(9), 36(3), 38(3), 46(4), 52(1), 53(2), 54(2), 68(1) |
| A53G | E4 | 0 | 22(4), 24(4), 38(3), 46(2), 48(3), 52(2), 54(2), 55(2), 68(1) |
| I55A | E3 | 3 | 20(7), 22(5), 38(6), 40(2), 45(2), 46(8), 53(2), 54(1), 56(1), 58(5), 66(20) |
| L58A | E2 | 2 | 20(7), 40(3), 55(7), 64(23), 66(10), 88(7) |
| Y64L | F2 | 11 | 40(4), 58(15), 59(4), 62(12), 66(1), 88(9) |
| Y64F | F2 | 11 | 58(3), 59(1), 62(3), 88(2) |
| F66L | F3 | 0 | 20(9), 22(2), 38(2), 40(10), 45(1), 55(12), 58(10), 64(12), 86(4), 88(1) |
| V68A | F4 | 0 | 7(5), 10(7), 22(2), 24(30), 36(5), 38(4), 48(1), 53(1), 67(1), 81(4) |
| A70G | F5 | 0 | 7(3), 27(2), 33(3), 36(8), 4(3), 69(1), 71(1), 78(2) |
| S81A | G4 | 0 | 7(1), 10(4), 67(3), 68(3), 80(1), 82(1) |
| V84A | G3 | 7 | 10(8), 11(1), 12(4), 22(5), 66(3), 82(1), 86(4) |
| V86A | G2 | 6 | 12(4), 15(6), 20(6), 22(2), 64(3), 66(8), 84(5), 88(1) |
Each residue is described by strand and core layer (see Figure 3).
Side-chain — side chain contacts within 6 Å deleted on mutation.
Changes in stability and refolding kinetics for mutants of CAfn2
| Protein | Position | [urea]50% (M) | ΔΔ | Φ at 0 M urea | |||
|---|---|---|---|---|---|---|---|
| Wild-type | 1.08 ± 0.14 | 6.20 ± 0.14 | 4.23 ± 0.11 | 0.89 ± 0.01 | |||
| L10A | A3 | 1.19 ± 0.05 | 4.01 ± 0.03 | 2.52 ± 0.17 | 4.70 ± 0.04 | 1.00 ± 0.04 | 0.03 ± 0.01 |
| S12A | A2 | 1.00 ± 0.03 | 6.29 ± 0.05 | −0.10 ± 0.18 | ND | ||
| I20A | B2 | 1.15 ± 0.05 | 3.50 ± 0.03 | 3.11 ± 0.18 | 1.35 ± 0.10 | 1.12 ± 0.07 | 0.22 ± 0.02 |
| L22A | B3 | 1.16 ± 0.11 | 1.39 ± 0.12 | 5.54 ± 0.25 | 0.44 ± 0.03 | a | 0.24 ± 0.01 |
| V33A | BC-loop | 1.16 ± 0.07 | 4.66 ± 0.04 | 1.77 ± 0.18 | 3.12 ± 0.22 | 0.88 ± 0.03 | 0.10 ± 0.03 |
| Y36F | C5 | 1.25 ± 0.12 | 4.66 ± 0.06 | 1.77 ± 0.18 | 4.24 ± 0.18 | 1.04 ± 0.03 | 0.00 ± 0.02 |
| Y36L | C5 | 1.46 ± 0.34 | 1.02 ± 0.10 | 5.97 ± 0.24 | 0.92 ± 0.10 | a | 0.15 ± 0.01 |
| F36L | C5 | 1.46 ± 0.34 | 1.02 ± 0.10 | 4.20 ± 0.10 | 0.92 ± 0.10 | a | 0.22 ± 0.02 |
| V38A | C4 | 1.27 ± 0.06 | 2.56 ± 0.03 | 4.19 ± 0.19 | 0.25 ± 0.02 | 1.46 ± 0.05 | 0.40 ± 0.02 |
| N40A | C3 | 0.99 ± 0.08 | 5.33 ± 0.10 | 1.00 ± 0.20 | 4.29 ± 0.19 | 0.87 ± 0.82 | 0.01 ± 0.03 |
| L44A | C′3 | 1.25 ± 0.05 | 3.79 ± 0.03 | 2.78 ± 0.18 | 0.61 ± 0.03 | 1.17 ± 0.03 | 0.40 ± 0.03 |
| T46A | C′4 | 0.89 ± 0.03 | 7.49 ± 0.04 | − 1.49 ± 0.17 | 3.08 ± 0.52 | 0.88 ± 0.02 | 0.36 ± 0.05 |
| V48A | C′5 | 1.10 ± 0.06 | 4.83 ± 0.04 | 1.58 ± 0.17 | 1.28 ± 0.07 | 1.08 ± 0.02 | 0.45 ± 0.05 |
| A53G | E4 | 1.18 ± 0.03 | 4.18 ± 0.02 | 2.33 ± 0.17 | 2.48 ± 0.16 | 1.23 ± 0.04 | 0.14 ± 0.02 |
| I55A | E3 | 1.13 ± 0.08 | 3.05 ± 0.05 | 3.63 ± 0.19 | 0.37 ± 0.05 | 1.08 ± 0.07 | 0.40 ± 0.03 |
| L58A | E2 | 1.13 ± 0.10 | 1.63 ± 0.05 | 5.26 ± 0.21 | 1.06 ± 0.06 | a | 0.16 ± 0.01 |
| Y64L | F2 | 1.26 ± 0.07 | 2.40 ± 0.03 | 4.38 ± 0.19 | 3.09 ± 0.14 | 1.14 ± 0.04 | 0.04 ± 0.01 |
| Y64F | F2 | 1.31 ± 0.07 | 4.88 ± 0.03 | 1.52 ± 0.17 | 3.82 ± 0.17 | 0.94 ± 0.02 | 0.04 ± 0.02 |
| F64L | F2 | 1.26 ± 0.07 | 2.40 ± 0.03 | 2.86 ± 0.12 | 3.09 ± 0.14 | 1.14 ± 0.04 | 0.12 ± 0.03 |
| F66L | F3 | 1.08 ± 0.06 | 4.03 ± 0.05 | 2.50 ± 0.18 | 3.13 ± 0.08 | 1.01 ± 0.02 | 0.07 ± 0.01 |
| V68A | F4 | 1.19 ± 0.07 | 2.67 ± 0.05 | 4.07 ± 0.20 | 0.74 ± 0.04 | 1.36 ± 0.04 | 0.25 ± 0.01 |
| A70G | F5 | 1.22 ± 0.03 | 4.57 ± 0.02 | 1.88 ± 0.17 | 1.77 ± 0.06 | 1.04 ± 0.02 | 0.27 ± 0.03 |
| S81A | G4 | 1.13 ± 0.09 | 4.97 ± 0.06 | 1.42 ± 0.18 | 4.84 ± 0.22 | 0.92 ± 0.02 | −0.06 ± 0.02 |
| V84A | G3 | 1.22 ± 0.06 | 4.76 ± 0.03 | 1.66 ± 0.17 | 3.66 ± 0.17 | 0.94 ± 0.02 | 0.05 ± 0.02 |
| V86A | G2 | 1.15 ± 0.05 | 4.95 ± 0.15 | 1.44 ± 0.24 | 4.19 ± 0.14 | 0.96 ± 0.01 | 0.00 ± 0.02 |
ND Not done: No Φ-value was determined for S12A because ΔΔGD–N ∼ 0. Previous studies have indicated that where ΔΔGD–N > 0.6 kcal mol− 1, the corresponding Φ-value can be considered reliable.
The errors reported for mD–N and kf are standard errors from the fits of the data. Errors in ΔΔGD–N and Φ were determined by standard error propagation methods.
aWhere the mutation induced a large change in stability there were too few points to fit the refolding m-value with accuracy. These mutant chevrons were fit with a fixed average m-value of 1.06 M− 1.
Figure 5The CAfn2 structure showing the ϕ-values (high, blue, Φ ≥ 0.4; medium, magenta, 0.2 < Φ < 0.4; and low, red, Φ ≤ 0.2). (a) The front view of CAfn2 (the CAfn2 structure is oriented as in Figure 1). (b) The rear view of CAfn2.
Figure 6(a) and (c) The obligatory folding nucleus of CAfn2 has moved within the hydrophobic core in comparison to (b) and (d) the structurally conserved positions for TNfn3. The molecules are oriented as in Figure 1. The novel obligatory folding nucleus of CAfn2 is based on the Φ-values and contact maps between the residues in the hydrophobic nucleus.
Figure 7Brønsted analysis of TNfn3 and CAfn2 mutants, showing the plot of ΔΔGD–‡versus ΔΔGD–N. The separation of data points into two discrete populations shows that the folding nucleus of both proteins is not a uniformly expanded form of the native state. The central core of the protein forms early and the peripheral regions pack after the transition state for folding.
Figure 8Comparison of the Φ-value patterns in CAfn2 (red) and TNfn3 (blue). (a) The pattern of Φ-values in the A–B–E strand is the same with the highest Φ-values falling in layer 3. (b) In the C′-C-F-G sheet, however, the residues with the highest Φ-values in CAfn2 are in a lower layer than in TNfn3.