Huabing Wang1, Derek T Logan2, Jens Danielsson1, Mikael Oliveberg3. 1. Department of Biochemistry and Biophysics, Arrhenius Laboratories of Natural Sciences, Stockholm University, S-106 91 Stockholm, Sweden. 2. Department of Chemistry, Division of Biochemistry & Structural Biology, Lund University, 22100 Lund, Sweden. 3. Department of Biochemistry and Biophysics, Arrhenius Laboratories of Natural Sciences, Stockholm University, S-106 91 Stockholm, Sweden; mikael@dbb.su.se.
Abstract
Although folded proteins are commonly depicted as simplistic combinations of β-strands and α-helices, the actual properties and functions of these secondary-structure elements in their native contexts are just partly understood. The principal reason is that the behavior of individual β- and α-elements is obscured by the global folding cooperativity. In this study, we have circumvented this problem by designing frustrated variants of the mixed α/β-protein S6, which allow the structural behavior of individual β-strands and α-helices to be targeted selectively by stopped-flow kinetics, X-ray crystallography, and solution-state NMR. Essentially, our approach is based on provoking intramolecular "domain swap." The results show that the α- and β-elements have quite different characteristics: The swaps of β-strands proceed via global unfolding, whereas the α-helices are free to swap locally in the native basin. Moreover, the α-helices tend to hybridize and to promote protein association by gliding over to neighboring molecules. This difference in structural behavior follows directly from hydrogen-bonding restrictions and suggests that the protein secondary structure defines not only tertiary geometry, but also maintains control in function and structural evolution. Finally, our alternative approach to protein folding and native-state dynamics presents a generally applicable strategy for in silico design of protein models that are computationally testable in the microsecond-millisecond regime.
Although folded proteins are commonly depicted as simplistic combinations of β-strands and α-helices, the actual properties and functions of these secondary-structure elements in their native contexts are just partly understood. The principal reason is that the behavior of individual β- and α-elements is obscured by the global folding cooperativity. In this study, we have circumvented this problem by designing frustrated variants of the mixed α/β-protein S6, which allow the structural behavior of individual β-strands and α-helices to be targeted selectively by stopped-flow kinetics, X-ray crystallography, and solution-state NMR. Essentially, our approach is based on provoking intramolecular "domain swap." The results show that the α- and β-elements have quite different characteristics: The swaps of β-strands proceed via global unfolding, whereas the α-helices are free to swap locally in the native basin. Moreover, the α-helices tend to hybridize and to promote protein association by gliding over to neighboring molecules. This difference in structural behavior follows directly from hydrogen-bonding restrictions and suggests that the protein secondary structure defines not only tertiary geometry, but also maintains control in function and structural evolution. Finally, our alternative approach to protein folding and native-state dynamics presents a generally applicable strategy for in silico design of protein models that are computationally testable in the microsecond-millisecond regime.
One of the most intriguing properties of naturally occurring proteins is their high folding cooperativity. Generally, this cooperativity promotes a relatively slow (minutes to hours) and highly concerted unfolding process where only the fully native and fully denatured species populate (1). Although this two-state behavior plays a vital role in safeguarding native-state integrity and biological function, it presents a major challenge for the experimentalist who wants to map out the global dynamics: Structural fluctuations that are sufficiently large to report on the secondary-structure interplay are so sparsely populated that they generally defy detection (2). This intrinsic difficulty of isolating high-energy events has not only impeded our understanding of the folding cooperativity itself, but also prevented detailed insight into the more extensive conformational changes involved in biological function and protein-aggregation disease (2). Experimental insight into native-state dynamics has so far been provided by relaxation-dispersion NMR (3), fluorescence resonance energy transfer labeling (4), and various hydrogen-exchange techniques, monitoring the fluctuations of individual hydrogen bonds in the protein backbone (5). While these strategies have been instrumental in establishing breathing modes at the low free-energy level (6–8) and global exchange (8, 9), they fail to capture the rare events occurring at the high free-energy level. Due to their local foci, they tend also to be ambiguous about the degree of structural concertedness, i.e., it is not always clear to what extent the observed local fluctuations are indeed cooperatively coupled or report on more diffusely distributed motions (5). A notable exception is the pioneering hydrogen–deuterium (HD)-exchange studies by Englander and coworkers where the concerted local unfolding of individual α-helices in cytochrome c was inferred by GdmCl titration (10). In essence, these experiments reveal that the helices of cytochrome c maintain their own subglobal cooperativity, resulting in a stepwise loss of structure on the native side of the unfolding barrier (10). When it comes to the structural behavior of β-strands, however, the observations are more confounding. For example, native-like intermediates with locally unfolded β-strands are, in some cases, implicated as precursors for pathological aggregation (11–13), while the very elongation and growth of fibrillar structures proceed in other cases by the incorporation of globally disordered monomers (14–16). Moreover, the formation of individual β-hairpins in peptides is observed to occur on the microsecond timescale (17), while the exchange of individual β-strands in the domain swap is notably slow by following the global unfolding transition (18). The clue to understanding these differences in structural and dynamic behaviors seems, thus, to rest in how the properties of the secondary-structure elements change upon transfer from solution into the tertiary context of folded topologies. In this study, we target this issue by employing an unconventional engineering strategy based on intramolecular domain swap where the domain is an individual β-strand or α-helix (Fig. 1). The results show that the structural behaviors of the β-strands and α-helices in the ribosomal protein S6 are, indeed, very distinct: The β-strands cannot be dislodged by local fluctuations but require global unfolding to swap, whereas the α-helices are free to swap locally in the native basin, to hybridize in new tertiary alignments, and to promote intermolecular association by "sliding" onto neighboring protein molecules. From these observations, we conclude that the advantage of having two types of secondary-structure elements lies not only in shape and complementarity, but also in evolvability through functional optimization of rigidity versus flexibility.
Fig. 1.
Principle of construct design. (A) The protein is first circularized by joining the original N and C termini and then cut open next to the target secondary-structure element (here, α2). (B) An identical copy of the target element (α2copy) is added to the N terminus, producing a primary sequence with α2 at either ends. (C) The construct is now free to fold by incorporating either α2 or α2copy, leading to alternate folded states where the helical element is also allowed to swap locally. (D) Outline of the S6 pseudowild-type sequence showing permutant incisions and the secondary-structure elements analyzed in this study.
Principle of construct design. (A) The protein is first circularized by joining the original N and C termini and then cut open next to the target secondary-structure element (here, α2). (B) An identical copy of the target element (α2copy) is added to the N terminus, producing a primary sequence with α2 at either ends. (C) The construct is now free to fold by incorporating either α2 or α2copy, leading to alternate folded states where the helical element is also allowed to swap locally. (D) Outline of the S6 pseudowild-type sequence showing permutant incisions and the secondary-structure elements analyzed in this study.
Results
Design of Swap Constructs.
The principal idea of this study is to access new levels of structural behavior by engineering proteins that can freely swap between given secondary-structure elements in their folded states. Our design of such swap constructs is outlined in Fig. 1, showing the example where α-helix 2 (α2) is chosen as a target element. First, wild-type S6 (PDB code 1ris) was circularized and cut open in a loop on the C-terminal side of α2. Next, a second copy of the target secondary-structure element was added to the new N terminus (α2copy), yielding a construct with two identical secondary-structure elements at either end of the sequence. This construct can now fold by integrating either the original C-terminal secondary-structure element (foldedC) or the new N-terminal copy (foldedN). Of primary interest, the construct is also free to swap between the two identical secondary-structure elements in the native state (Fig. 1).Regarding nomenclature, we denote the "double element" construct targeting α2 as with the alternate folded states /foldedC and /foldedN. As independent references for the structures and folding behavior of the alternate /foldedC and /foldedN, we used the "clean" circular permutants P81-82 (foldedC) and P68-69 (foldedN) with incisions between sequence positions 81 and 82, and 68 and 69, respectively (Fig. 2 and ). Analogous strategy and nomenclature are used for the constructs targeting the secondary-structure elements β1 and β2 of the S6 sheet (Fig. 3). The complete set of constructs and PDB codes to their X-ray structures are listed in .
Fig. 2.
Nomenclature as exemplified for the α2-constructs. The full set of constructs is listed in . (A) Sequence outline of swapα2 containing dual α2-copies. This construct is able to form the alternate states foldedN and foldedC, according to Fig. 1. (B) The circular permutant P68-69 containing only the N-terminal α2-element. This variant is used as the control for the structure and folding pathway of foldedN. (C) Corresponding outline of the permutant P81-82 forming the control for foldedC. (D) Construct with partial extension at the C terminus, in this case, n amino acids of α2. Besides from accounting for the effects of minor C-terminal modifications, this type of construct is used below for determining the effects of stepwise titration of the full α2-segment. (E) Corresponding outline of constructs with partial N-terminal extensions.
Fig. 3.
Stopped-flow data showing the distinct global- and local-swap behaviors of β-strands and α-helices in the S6 structure. The sequences of the analyzed protein variants are outlined in and fitted parameters in . (A, Left) Constructs probing the swap behavior of β1. (Middle) The references foldedC (6aa-P13-14) and foldedN (P97-3-5aa) show archetypical two-state behavior, manifested in v-shaped chevron plots with single folding relaxations. The swap-competent construct swapβ1 shows, instead, double folding relaxations that superimpose with both of the single chevrons of the references plus a slow phase that represents conversion between the alternate ground states swapβ1/foldedC and swapβ1/foldedN via the globally unfolded state. Large markers show the consistent relaxations obtained from the double-jump analysis on the Right. Fits are from the three-state relaxation in Scheme 1 (). (B) Corresponding data for swapβ2, foldedC (8aa-P54-55), and foldedN (P33-34-3aa). (C) The contrasting behavior of α2, which shows rapid swapping in the folded ground states according to the triangular mechanism in Fig. 1. Constructs swap α2, foldedC (6aa-P81-82), and foldedN (P68-69-6aa).
Nomenclature as exemplified for the α2-constructs. The full set of constructs is listed in . (A) Sequence outline of swapα2 containing dual α2-copies. This construct is able to form the alternate states foldedN and foldedC, according to Fig. 1. (B) The circular permutant P68-69 containing only the N-terminal α2-element. This variant is used as the control for the structure and folding pathway of foldedN. (C) Corresponding outline of the permutant P81-82 forming the control for foldedC. (D) Construct with partial extension at the C terminus, in this case, n amino acids of α2. Besides from accounting for the effects of minor C-terminal modifications, this type of construct is used below for determining the effects of stepwise titration of the full α2-segment. (E) Corresponding outline of constructs with partial N-terminal extensions.Stopped-flow data showing the distinct global- and local-swap behaviors of β-strands and α-helices in the S6 structure. The sequences of the analyzed protein variants are outlined in and fitted parameters in . (A, Left) Constructs probing the swap behavior of β1. (Middle) The references foldedC (6aa-P13-14) and foldedN (P97-3-5aa) show archetypical two-state behavior, manifested in v-shaped chevron plots with single folding relaxations. The swap-competent construct swapβ1 shows, instead, double folding relaxations that superimpose with both of the single chevrons of the references plus a slow phase that represents conversion between the alternate ground states swapβ1/foldedC and swapβ1/foldedN via the globally unfolded state. Large markers show the consistent relaxations obtained from the double-jump analysis on the Right. Fits are from the three-state relaxation in Scheme 1 (). (B) Corresponding data for swapβ2, foldedC (8aa-P54-55), and foldedN (P33-34-3aa). (C) The contrasting behavior of α2, which shows rapid swapping in the folded ground states according to the triangular mechanism in Fig. 1. Constructs swap α2, foldedC (6aa-P81-82), and foldedN (P68-69-6aa).
β-Strand Swap Does Not Occur Locally but Proceeds via Global Unfolding.
To examine the properties of the S6 β-sheet, we opted for the constructs and (Fig. 3 and ). The central β1 composes, here, the structural overlap between the two competing folding nuclei of the S6 folding process (1, 9, 19–21), whereas β2 is the dynamic (22) and structurally dispensable (23) strand at the sheet edge (Fig. 3). Our constructs targeting the remaining strands β3 and β4 turned out to be unsuitable for swap analysis because the rate constants of the involved ground states are too poorly separated, cf. ref. 21 (). To optimize the analysis of swapβ1, the alternate ground states swapβ1/foldedC and swapβ1/foldedN were first poised to similar stabilities by the mutation L10A in the N-terminal β1 copy (). This was to achieve balanced ground-state populations at equilibrium to maximize the resolution of the strand-swap events. Second, we equipped the references for foldedC and foldedN with short N/C-termini extensions to account for disordered-end effects (cf. Fig. 2 ). Generally, such short extensions are slightly destabilizing (2) and were, here, made up of five to six amino acids from either half of the β1 element (). The resulting folding- and unfolding-kinetics of swapβ1 and the terminally modified references for foldedC (6aa-P13-14) and folded N (P97-3-5aa) are shown in Fig. 3. Most evidently, swapβ1 shows two relaxation phases that superimpose precisely with the chevron plots of either of the reference states (Fig. 3). Additionally, the swapβ1 construct reveals a characteristic slow refolding phase in the timescale of global unfolding (Fig. 3), analogous to that observed for strand-swap events in SOD1 (2). Taken together, these features suggest that the competing β1 elements in swapβ1 do not swap rapidly in the folded basin but need to await global unfolding according to (2)where the procedures for fitting are described in the . The swapβ1 relaxation phase that superimposes with the refolding limb of the foldedN reference (P97-3-5aa) is linked to heterogeneity in the denatured ensemble due to generic proline isomerization (Fig. 3). In essence, the fraction of denatured molecules with cis prolines in the linker to the C-terminal β1 segment is unable to fold rapidly via the foldedC pathway and becomes naturally biased to the somewhat slower foldedN alternative (). Consistently, stopped-flow double-jump analysis yields refolding- and unfolding-rate constants that superimpose precisely with the chevron data of swapβ1 (Fig. 3). Analysis of the edge strand β2 was undertaken in the same way, using the constructs swapβ2, foldedC (8aa-P54-55), and foldedN (P33-34-3aa) (). The result is analogous to that of β1: The behavior of swapβ2 complies with the global swap mechanism according to Scheme 1 (Fig. 3). A notable feature of the β2 swap event is that it involves the ∼10-residue loop connecting β2 to β3 (). Even if this loop is largely disordered in solution (9, 22), it folds over the sheet in the crystal lattice to form an external "minicore" at the monomer’s packing interface (24). To see if the global-swap behavior of β2 is somehow coupled to such minicore interactions persisting in the solution state, we produced a truncated swapβ2 variant and reference states where the loop is replaced by aGAG linker (). The control shows that the loop truncation has a negligible effect on the β2 swap behavior, which remains slaved to the global folding transition (). It is thus evident that neither of the two S6 strands are free to detach and reinsert locally in the native basin but are as slaved to global unfolding (2). Considering that β2 is partly dynamic in wild-type S6 (22) and can be cut away completely without impact on folding cooperativity (23), this a notable and somewhat surprising result.
The Contrasting Behavior of α-Helices.
The question is now to what extent the S6 helices behave differently. Our focus is on α2 as the analysis of α-helix 1 is precluded by the reference constructs having unsuitable refolding kinetics (21) (). Following the procedures of the strand-swap constructs, the competing states of swapα2 were first poised to matching stabilities by the mutation L75A in the N-terminal α2-copy of the swap construct (). The results show that the kinetic behavior of this construct is markedly different from swapβ1 and swapβ2: The relaxation of swapα2 displays one phase only, and its chevron plot matches neither of the reference permutants (Fig. 3). The single-phase relaxation is maintained in the double-jump controls, yielding the same single refolding- and unfolding-rate constants as the chevron analysis (Fig. 3). This apparent two-state behavior of swapα2 suggests that the helix-swap events occur rapidly in the native state. In the simplest case, the linear Scheme 1 extends, then, to a triangular three-state relaxation with fast interconversion in the native basin as outlined in Fig. 1 and . We also note that the data set as a whole resembles an ideal ϕ-value analysis where swapα2 is the wild type, and foldedC (6aa-P81-82) and foldedN (P68-69-6aa) are the mutant variants with ϕ = 0 and ϕ = 1, respectively (25). Before embarking on more detailed kinetic analysis, however, we characterized the folded ground states of the various α2-constructs by X-ray crystallography and solution-state NMR as described below.
X-ray Structures Reveal Helix-Packing Plasticity.
Further clues to the swapα2 behavior is provided from X-ray data (Fig. 4 and ). Starting with the foldedN reference (L75), this construct shows an overall wild-type-like structure (). The only notable difference is that the strain release following the incision between residues 68 and 69 causes α2 to relax and to elongate by half a turn at the N-terminal end (). A similar structural arrangement is found for swapα2 (L75) where the presence of a complete α2-segment at the C terminus is not enough to overcome the high stability of the foldedN state (). Upon destabilization of the foldedN reference by the mutation L75A, however, α2 moves closer to the core, indicating that its docking is to some extent frustrated and susceptible to readjustments (). This frustration manifests itself fully in the likewise mutated swapα2 where both copies of α2 are structured but adjusted into nonnative positions: The C-terminal terminal helix slides out to bind the neighboring molecule in the crystal lattice, allowing the N-terminal copy to seal the volume left behind (Fig. 4). Interestingly, this hybrid arrangement of the swapα2 helices leads to an expansion of the core around the original α2-position where the new N- and C-terminal copies occupy similar volumes (Fig. 4). Elongating instead the sequence from the N-terminal end, the clean foldedC reference (P81-82) fails to produce useful diffraction data. Even so, an overall wild-type arrangement of the P81-82 structure is verified by solution-state NMR (). The ability to form ordered crystals is restored by a nine-residue extension of the N terminus in the reference construct 9aa-P81-82, yielding a hybridized helical arrangement similar to that of swapα2 (Fig. 4). Although these lattice-stabilized structures of 9aa-P81-82 and swapα2 are not necessarily transferrable to solution conditions, they underpin the notion that the tertiary arrangement of α-helices — because of their self-contained cooperativity (26, 27) — is more flexible than that of globally fixed β-strands (28). Of particular interest is that the C-terminal α2 in several cases slides over to the neighboring S6 molecule and thereby contributes to extend the dimer-interface area (Fig. 4). The self-contained behavior of α2 is, thus, not only revealed by intramolecular packing flexibility, but also by "induced" intermolecular fit. Similar sliding of helices in the crystal packing is consistently observed in four-helix bundles (29).
Fig. 4.
X-ray structures showing helix hybridization and dimer-interface sliding. The C-terminal α2-element is in red, and the N-terminal α2copy is in blue. (A) The foldedN-reference (P68-69-6aa) shows a wild-type (gray)-like structure with the 6aa α2-extension protruding disordered in the crystal lattice. The foldedC-titration construct 9aa-P81-82 starts to undergo hybridization at an α2copy-addition of 9aa (cf. ). Similar hybridization is found for the full swapα2 construct. Reintroduction of the stabilizing L75 in the α2copy of swapα2, leads to complete expulsion of α2. (B) Overlay of representative α2-swap structures on S6 wild type (green). (C) Crystallographic protein—protein interface showing how α2 slides over to merge with the neighboring molecule.
X-ray structures showing helix hybridization and dimer-interface sliding. The C-terminal α2-element is in red, and the N-terminal α2copy is in blue. (A) The foldedN-reference (P68-69-6aa) shows a wild-type (gray)-like structure with the 6aa α2-extension protruding disordered in the crystal lattice. The foldedC-titration construct 9aa-P81-82 starts to undergo hybridization at an α2copy-addition of 9aa (cf. ). Similar hybridization is found for the full swapα2 construct. Reintroduction of the stabilizing L75 in the α2copy of swapα2, leads to complete expulsion of α2. (B) Overlay of representative α2-swap structures on S6 wild type (green). (C) Crystallographic protein—protein interface showing how α2 slides over to merge with the neighboring molecule.
NMR Shows Local Helix Exchange.
To determine to what extent helix hybridization occurs in the free swapα2 monomers, we employed solution-state NMR. The immediate observation is that the heteronuclear single quantum coherence (HSQC) spectra of swapα2, the foldedN, and foldedC references (P68-69-6aa and 6aa-P81-82) show exchange broadening at protein concentrations >100 μM, which is not present in wild-type S6 (9, 22). Examination by elution chromatography and stopped-flow analysis shows further that these spectral components coincide with reversible (low-level) oligomerization of the folded species (). It is reasonable to suspect that the oligomerization relates to the intermolecular interactions in the crystal packing, i.e., induced-interface fit by α2-sliding (Fig. 4). To test this possibility, we perturbed the crystallographic dimer interface by the surface mutation A92K, which does not significantly affect the system’s helix-swap behavior (). The mutation diminishes the solution oligomerization and renders the HSQC spectra characteristic of monomeric S6 species (). As an additional control, we verified the monomeric nature of the various A92K-containing swapα2 variants under NMR conditions with NMR relaxation analysis (). The new foldedN and foldedC references, i.e., P68-69-6aa (A92K) and 6aa-P81-82 (A92K), produce well-dispersed HSQC spectra typical for uniformly folded permutant structures (). These ideal spectral features are overall maintained upon complete helix duplication in swapα2 (A92K) but with the interesting difference that some cross peaks shift to further new positions (Fig. 5). The obvious first step is, here, to assume that the swapα2 (A92K) spectrum is the combination of foldedN and foldedC in rapid exchange (30), i.e., the helices swap dynamically with one another, generating a single cross peak that is the weighted average of the two competing folded states. However, this reductionist interpretation is challenged by several swapα2 (K92) cross peaks being offset from the expected foldedN and foldedC average (Fig. 5). The spectral divergence indicates that the helix duplication not just leads to competition for wild-type-like docking with the nonrecruited element protruding disordered into solvent but to a new helical arrangement of similar stability (Fig. 5). Judged by the single set of swapα2 (K92) cross peaks, this new helical arrangement seems further to be in rapid exchange with foldedN and foldedC, leaving a balanced three-state equilibrium on the native side of the folding barrier(s) (Fig. 5). Henceforth, we refer tentatively to the new swapα2 (K92) species as the foldedhybrid state since the present NMR data sheds no information of its actual helical arrangement: It can be either a true hybrid as observed in Fig. 4 or a variant where one helix adapts fully native-like packing and the other perturbs the chemical shifts by dynamic sampling of the folded surface ().
Fig. 5.
Solution-state NMR analysis indicating α2-hybridization of free monomers. All constructs contain the dimer-splitting mutation A92K. (A) Chemical-shift changes upon structural titration from the clean circular permutants P81-82 (foldedC) and P68-69 (foldedN) to swapα2. Step-by-step additions of native amino acids on the N-terminal end of P81-82 result in a linear response in chemical shift toward the chemical shift of swapα2. A corresponding response is seen upon addition of native amino acids on the P68-69 C terminus. Even so, the kinks in the combined chemical-shift contours indicate that the swapα2 state is not simply a linear combination of foldedC and foldedN but rather constitutes a third state. (B) Scheme showing cooperative folding of the alternate states foldedC and foldedN, and rapid (>200 s−1) α2 swapping in the native basin, involving the hybridized state foldedhybrid. The competing pathways for escaping and reentering the native basin are controlled by two transition states where the ‡C engages α2 (ϕ = 1) and ‡N lacks α2-influence (ϕ = 0). (C) Corresponding two-state projection of the free-energy profile, explaining the effects of N- and C-terminal α2-truncation on the swapα2 background (cf. Fig. 3) where foldedΣ is the sum of foldedC, foldedhybrid, and foldedN.
Solution-state NMR analysis indicating α2-hybridization of free monomers. All constructs contain the dimer-splitting mutation A92K. (A) Chemical-shift changes upon structural titration from the clean circular permutants P81-82 (foldedC) and P68-69 (foldedN) to swapα2. Step-by-step additions of native amino acids on the N-terminal end of P81-82 result in a linear response in chemical shift toward the chemical shift of swapα2. A corresponding response is seen upon addition of native amino acids on the P68-69 C terminus. Even so, the kinks in the combined chemical-shift contours indicate that the swapα2 state is not simply a linear combination of foldedC and foldedN but rather constitutes a third state. (B) Scheme showing cooperative folding of the alternate states foldedC and foldedN, and rapid (>200 s−1) α2 swapping in the native basin, involving the hybridized state foldedhybrid. The competing pathways for escaping and reentering the native basin are controlled by two transition states where the ‡C engages α2 (ϕ = 1) and ‡N lacks α2-influence (ϕ = 0). (C) Corresponding two-state projection of the free-energy profile, explaining the effects of N- and C-terminal α2-truncation on the swapα2 background (cf. Fig. 3) where foldedΣ is the sum of foldedC, foldedhybrid, and foldedN.
Clues from α2-Structural Titration: the Competing Ground States.
To shed mechanistic detail on the helix-swap events, we "titrated in" the competing α2-segments — one residue at the time — to gradually end up with the full swapα2 protein (Fig. 6 and ). The effects on folding, stability, and ground-state structures were monitored both by chevron analysis (cf. Fig. 3) and by NMR (). Starting with the clean reference foldedN (P68-69), appreciable effects on protein stability do not materialize before position V72 (i.e., construct P68-69-4aa with a C-terminal addition of four residues) (Fig. 6 and ). This slight destabilization is tentatively assigned to steric interference between the added residues and the folded helical element, possibly in combination with altered electrostatic interactions, including the C-terminal charge. The destabilization then maintains relatively constant between V72 and E78 (P68-69-10aa) after which the addition of the final two to three residues promotes a sharp drop to full swapα2 stability (Fig. 6 and ). The behavior suggests that, when titrated from the C-terminal end of the S6 sequence, a nearly complete copy of the α2-segment is required for the helix-swap events to commence. Starting conversely from the foldedC reference (P81-82), the extensions yield overall matching results. Mirroring the foldedN titration, the destabilization cuts in relatively sharply between residues 81–79 while the recovery of stability upon adding residues 75–69 is more gradual (Fig. 6). Regardless of how the titration is started, the decisive features for triggering the swap motions seem, thus, contained in the L75-I81 region. We also note that this region is likely to encompass the highest degree of helix–helix repulsion due the unmatched positive charges at positions R77 and R80 where the introduction of an addition positive terminal charge in the titration from the N-terminal end is expected to exacerbate this repulsion further. Corroborating the overall titration trend, the chemical-shift changes accompanying the helix titrations match the protein-stability profiles: The titration yields overall steeper response between positions 79 and 81 than between positions 69 and 75 (Fig. 6 and ). Stability and NMR data thus yield consistent results in the form of a population-averaged transition of the initial and final species in rapid exchange.
Fig. 6.
Stepwise addition of an α2-copy. (A) Progressive extension of α2-residues to the C terminus of foldedN (P68-69) to finally end up with the full swapα2 and the analogous procedure starting with foldedC (P81-82) (). (B) Effects on protein stability relative the full swapα2. The initial destabilizations show the penalty of adding extra C- or N-terminal residues to folded S6, and the final convergence to Δlog KD–N = 0 shows the formation of the full swapα2 species as detected by NMR in Fig. 5. Notably, the addition of residues 69–75 yields a more gradual stability response than addition of residues 79–81, regardless of titration direction. (C) Corresponding changes of the unfolding- (▲) and refolding- (▼) rate constants. The kinetic profiles suggest that the C-terminal α2-element catalyzes global folding by providing full native stabilization already in the folding transition state (ϕ = 1), whereas the N-terminal α2-copy forms first in the native basin (ϕ = 0).
Stepwise addition of an α2-copy. (A) Progressive extension of α2-residues to the C terminus of foldedN (P68-69) to finally end up with the full swapα2 and the analogous procedure starting with foldedC (P81-82) (). (B) Effects on protein stability relative the full swapα2. The initial destabilizations show the penalty of adding extra C- or N-terminal residues to folded S6, and the final convergence to Δlog KD–N = 0 shows the formation of the full swapα2 species as detected by NMR in Fig. 5. Notably, the addition of residues 69–75 yields a more gradual stability response than addition of residues 79–81, regardless of titration direction. (C) Corresponding changes of the unfolding- (▲) and refolding- (▼) rate constants. The kinetic profiles suggest that the C-terminal α2-element catalyzes global folding by providing full native stabilization already in the folding transition state (ϕ = 1), whereas the N-terminal α2-copy forms first in the native basin (ϕ = 0).
Clues from Structural Titration: the Competing Transition States.
An interesting outcome of the modular helical behavior in Figs. 5 and 6 is that it sheds lights on the ϕ-value character of the swapα2 chevron plots (Fig. 3). To pin down the details behind this orderly response, we followed the effects of structural titration on the individual unfolding (ku) and refolding (kf) -rate constants (Fig. 6). Starting from the C-terminal end (foldedN), ku remains relatively unaffected by the helix elongation, while kf undergoes a pronounced acceleration upon addition of the final L79-I81 residues. As such, the titration confirms that the C-terminal α2-element catalyzes global folding by providing full native stabilization already in the folding transition state, i.e., ϕ = 1. Starting instead from the N-terminal end (foldedC), the α2-titration shows only marginal impact on kf but a pronounced increase in ku (Fig. 6). This selective ku effect suggests that the N-terminal α2-copy does not take part in the folding transition state of , but structures first in the native basin, i.e., ϕ = 0. There are two implications of these results for the folding mechanism. First, the absence of the N-terminal α2-copy in the barrier crossing suggests that the preferred transition state of is similar to that of the clean foldedC construct (P81-82). We denote this dominant transition state ‡ϕ
=
1 = ‡C (Fig. 7). Likewise, there exists a parallel but less favorable transition state where none of the α2-elements contribute, and this seems similar to that of the clean foldedN construct (P68-69), i.e., ‡ϕ
=
0 = ‡N (Fig. 7). The two pathways (‡ϕ=
1 and ‡ϕ
=
0) are not just accessible to but also to the foldedC reference. This minimal extension of the triangular scheme in Fig. 1 explains why the refolding rate constant of () is identical to and not (+ ) (). Evidence for such dual-pathway behavior is also found in the V72-L79 region of the foldedN titration where ku and kf increase in concert as the faster ‡ϕ=
1 pathway gradually kicks in (Fig. 6). Second, the conclusion that and foldedC employ similar transition state (‡C) turns out to have implications for the folded structure. The clue is in the response to α2-perturbation. Mutation of the foldedC structure (P81-82) has previously been shown to yield overall low ϕ-values in the α2-region, i.e., ϕ < 0.2, indicating that the native interactions of the helix are largely missing in the folding transition state (20, 21). In stark contrast, perturbations of the C-terminal α2 in yield, here, ϕ = 1 (Fig. 3). Given that ‡C is still the same, these differences must stem from the helix mixing in the ground state (Figs. 5 and 6). One possibility is thus that the interactions of the C-terminal α2 in the structure (i.e., foldedhybrid) are weakened to level where they match those of the foldedC transition state. Whether this correspondence is just by chance, system specific, or report on the inherent features of structural self-organization remains to be tested.
Fig. 7.
Folding free-energy profiles illustrate the different swap behaviors of β-strands and α-helices, respectively. The arrows follow a typical refolding experiment as carried out in this study. (A) Proteins that first fold from denatured (D) to foldedC with the C-terminal β-strand cannot swap to the N-terminal β-strand unless they return to D and refold to foldedN. The mechanistic reason for this global-swap behavior has been assigned to the need to establish β-sheet register early in the folding process, i.e., in the rate-limiting step D to ‡ (2). (B) Swapping of α-helices can occur much more flexibly in the native basin, and this also increases the susceptibility to hybridization. The mechanistic reason for this local-swap behavior is assigned to the self-contained cooperativity of the α-helical structure itself. Accordingly, the β-strands and α-helices seem to provide opposing properties in the optimization of structural rigidity versus conformational freedom in protein evolution.
Folding free-energy profiles illustrate the different swap behaviors of β-strands and α-helices, respectively. The arrows follow a typical refolding experiment as carried out in this study. (A) Proteins that first fold from denatured (D) to foldedC with the C-terminal β-strand cannot swap to the N-terminal β-strand unless they return to D and refold to foldedN. The mechanistic reason for this global-swap behavior has been assigned to the need to establish β-sheet register early in the folding process, i.e., in the rate-limiting step D to ‡ (2). (B) Swapping of α-helices can occur much more flexibly in the native basin, and this also increases the susceptibility to hybridization. The mechanistic reason for this local-swap behavior is assigned to the self-contained cooperativity of the α-helical structure itself. Accordingly, the β-strands and α-helices seem to provide opposing properties in the optimization of structural rigidity versus conformational freedom in protein evolution.
Discussion
The Different Behaviors of β-Strands and α-Helices in Folded Proteins.
The deceptively simple organization of protein structures into bundled α-helices and β-strands is often taken to infer how the protein folds, i.e., the primary chain first forms the secondary-structure elements, which, then, assemble into the native tertiary topology. However, this view is not entirely true. Rather, the secondary structure and tertiary contacts are biased to form concomitantly since the principal role of the intramolecular hydrogen bonds is not in providing stability but in promoting specificity by compensating for lost water interactions upon hydrophobic collapse (31–33). For as yet poorly understood reasons, this conflict between satisfying hydrogen-bonding constraints and tertiary close packing is typically resolved at high-energy level, giving rise to substantial folding free-energy barriers and high folding cooperativity. Following microscopic reversibility, the native-state dynamics leading to global unfolding is the reverse, i.e., rupture of the tertiary integrity coupled to solvation of the backbone hydrogen bonds. Although this concerted two-state nature of structural transitions reduces complexity in the crowded interior of live cells (34, 35), it leaves us with difficulties in deconvoluting exactly what is going on (2). Some pressing examples are how secondary-structure properties direct structural evolution (36), the coupling between structural motions and protein function (37, 38), and to what extent proteins can locally unfold to trigger pathologic aggregation and misfolding disease (2, 39). We target, here, the problem by exposing in an alternative way the local and global behaviors of individual secondary-structure elements in the split β−α−β protein S6 (Fig. 1). Our approach is based on engineering sequence variants where the examined secondary-structure element is sterically allowed to swap place with an identical copy in the native state (Figs. 3 and 4). Taken together, the results from these secondary-structure duplications show that the β-strands and α-helices behave very differently. The β-strands are so firmly locked into their tertiary context that they cannot swap unless the protein globally ruptures and refolds, whereas the α-helices are free to swap dynamically on the native side of the unfolding barrier (Fig. 7).
Comparison with Other Systems.
Although the results in Fig. 7 essentially confirm what is predicted from long-range (β-sheets) and local (α-helices) hydrogen-bond patterns alone, the very experimental verification of distinct secondary-structure behavior in the context of a folded structure sets the basis for a more detailed examination. First, we note that the partitioning into globally slaved and locally free motions of the β- and α-segments is fully discrete and without intermediate behavior.Somewhat surprisingly, this partitioning includes also the edge strand β2, which is clearly dynamic as measured by HD-exchange (9) and NMR-relaxation data (22). The result indicates that the breathing and solvent accessibility of individual hydrogen bonds does not necessarily report on the concerted motions of the entire secondary-structure segment (2, 40). Mechanistically, one possibility is that the local breathing occurs just diffusely along the main chain without affecting the topology as a whole. Alternatively, the swap behavior is controlled by just a few globally fixed hydrogen bonds along the otherwise dynamic strand that, in this case, would be confined to the N-terminal end (9). In the case of helices, however, the agreement between our current data and the HD-exchange patterns appears more clear-cut: The individual helical elements of cytochrome c are found to undergo concerted unfolding in the native basin (10), consistent with the schematic outline in Fig. 7. This difference in local dynamic behavior stems most likely from the intrinsic cooperativity of helical segments, in general (26, 27). Independent support for the global nature of β-strand transitions is provided by domain swap where two or more proteins "entangle" by exchanging secondary-structure elements with one another (41, 42). The domain swap follows mechanistically from the funneled landscapes of globular proteins (43), and is employed naturally in functional oligomerization (44), i.e., the domain-swapped oligomers have higher activity (45–47). Domain swap also tends to occur spontaneously in refolding of monomeric proteins if the protein concentration is high enough (48) or loop restraints are relieved (49, 50). When the domain-swapped elements include β-strands, the dissociation into folded monomers is observed to require global unfolding (51–53). An illustrative example is provided by cyanovirin-N where the barrier for domain swap matches that for global unfolding of the free monomers (51). When the swapped element is instead a single α-helix, the domain swapping is reported to occur more rapidly by local unfolding in the native basin (54). Additional evidence for dynamic partitioning of β- and α-elements is provided from studies of a protein that can reversibly switch between two folded structures, i.e., protein GA (all-α) and protein GB (α/β) (55, 56). Following the constraints of the β-strands in Fig. 7, the transition between the GA and the GB topologies does not take place locally but proceeds via global unfolding (55, 56). To this end, the conspicuous insert of a single β-strand into a partners sheet, which governs the functional polymerization of serpins, is observed to rely on concerted changes in secondary and tertiary structures (57) and, similarly, the swapping of individual β-strands in the amyotrophic lateral sclerosis-associated protein SOD1 requires global unfolding to occur (2). The advantage of this mechanism has, in both cases, been linked to the avoidance of sticky intermediates, which could conceivably populate if the swaps were based on hybridizing α-helices (Fig. 7). Taken together, this overall consistent separation of β-strand and α-helix behaviors suggests that our present observations are not unique for S6 but reflect some degree of generality.
Notes on Structural Evolution.
Evolution of protein function is not just a matter of achieving suitable topology, but also more elusively about tuning the structural properties. The latter involves maintaining sufficient structural stability (58), balancing affinity and interaction promiscuity (59), tuning the quinary interplay, proteome self-organization, intracellular search (60, 61), and optimizing the dynamic motions (62–64). Our present focus is on the evolutionary role of the motions and properties of entire β- and α-elements, which has, so far, gained relatively sparse attention (2, 65, 66). Nevertheless, these secondary-structure elements constitute the basic units of insertions and deletions (indels) that, together with point mutations, are the most common events in protein evolution (36). Our simplified arguments are as follows. Large-scale analysis of existing protein structures shows that α-helical elements are overall more robust to mutations than β-strands, i.e., helices tolerate more sequence change without changing secondary structure (65). In view of the current data, this implies that α-helical elements more easily maintain their hydrogen-bonding integrity because they are generally freer to readjust in the folded structure upon shape perturbations. Such helix adjustments are, indeed, apparent upon superposition of S6 homologs and seem further to correlate with sequence divergence (). It is, thus, not surprising that proteins that encapsulate a bulky heme group are better suited to a malleable envelope of α-helices than tertiary restricted β-sheets. As exemplified by hemoglobin, helical topologies are also readily amenable to long-range allosteric crosstalk and conformational changes through coupled mutual helical motions (67, 68). Similar cofactor-dependent conformational heterogeneity is observed for the helical protein GAD65 (69). The extreme case of unified helical motions is possibly found in transmembrane proteins where the lipid confinement not only allows larger conformational changes than in an aqueous solvent, but also stabilizes the individual helical elements (70). By comparison, conformational changes within rigid β-sheets are intrinsically more problematic to achieve. A revealing example is the allosteric communication between the metal-binding sites in the homodimeric β-barrels of SOD1: To propagate the signal, this protein strains the sheet by twisting in a single β−β hydrogen bond into the core, introducing a "bridge" of dynamic frustration across the two subunits (6). Accordingly, proteins based on mixed α- and β-motifs can be said to combine the best of two worlds and represent the most common motifs in proteomes across all organisms (71). Besides increasing the repertoire of steric shapes, rigid β-sheets covered on one or both sides by adjustable α-helical elements are overall more tolerant to mutations than corresponding all-β arrangements (65, 72). From the observations in this study, it is indicated that mixed α- and β-topologies also have the evolutionary advantage of mixed dynamical properties. First, the seamless merge of sheets enabled by the sliding motion of α2 in Fig. 4 not only produces a new snug interface, but also seems to create a new "foldon" in the form of one-helix-docking-against-two strands (9, 21). In essence, foldons are proposed to make up the minimal cooperative units (folding nuclei) and to propagate cooperativity within domains by being combined with structural overlap (1). As such, the induced S6 interface in Fig. 4 can be seen as a cooperative binding motif in its own right. Although the affinity provided by this induced "interface foldon" is just in the millimolar regime as judged by solution NMR (), it is easy to imagine how it can be evolutionarily fixed by additional point mutations or altered sequence connectivity. Second, the ability of identical helical elements to hybridize dynamically upon sequence duplication (Figs. 4 and 5), taken together with the finding that S6 still folds cooperatively upon complete truncation of the β2-strand (23), provides experimental evidence that structural evolution by insertion and deletion works in practice, and is even more flexible than envisioned from static structures alone. In particular, it is interesting that the stability change accompanying the insertion and hybridization of α2 is small and unlikely to compromise in vivo foldability. From this structure-centric analysis, we conclude that the balance between global rigidity and controlled local motions is not only key to enzymatic efficiency and functional conformational changes (63, 73), but also can play a role in facilitation of evolution of the structures per se. Notably, the experimental observations behind this conjecture are easily put to test: The β-strands are rigid and unyielding because their hydrogen bonds are cooperatively slaved to the global unfolding transition, whereas the α-helical elements are free to dynamically swap and hybridize in the native basin.
Materials and Methods
Molecular biology was performed as described in ref. 2, and proteins were produced as in ref. 21. Stopped-flow analysis, NMR analysis, and X-ray crystallography were performed as in ref. 2. All measurements were performed at 25 °C in 50 mM 2-(N-morpholino)ethanesulfonic acid (Sigma-Aldrich) buffer at pH6.3 unless otherwise stated. Detailed materials and methods are in the .
Authors: Sichun Yang; Samuel S Cho; Yaakov Levy; Margaret S Cheung; Herbert Levine; Peter G Wolynes; José N Onuchic Journal: Proc Natl Acad Sci U S A Date: 2004-09-10 Impact factor: 11.205
Authors: Xin Mu; Seongil Choi; Lisa Lang; David Mowray; Nikolay V Dokholyan; Jens Danielsson; Mikael Oliveberg Journal: Proc Natl Acad Sci U S A Date: 2017-05-23 Impact factor: 11.205
Authors: M Lindahl; L A Svensson; A Liljas; S E Sedelnikova; I A Eliseikina; N P Fomenkova; N Nevskaya; S V Nikonov; M B Garber; T A Muranova Journal: EMBO J Date: 1994-03-15 Impact factor: 11.598
Authors: Lisa Lang; Per Zetterström; Thomas Brännström; Stefan L Marklund; Jens Danielsson; Mikael Oliveberg Journal: Proc Natl Acad Sci U S A Date: 2015-07-28 Impact factor: 11.205
Authors: C Ashley Barnes; Yang Shen; Jinfa Ying; Yasuharu Takagi; Dennis A Torchia; James R Sellers; Ad Bax Journal: J Am Chem Soc Date: 2019-05-23 Impact factor: 15.419