Literature DB >> 33229587

Robust folding of a de novo designed ideal protein even with most of the core mutated to valine.

Rie Koga1, Mami Yamamoto2, Takahiro Kosugi3,2,4, Naohiro Kobayashi5,6, Toshihiko Sugiki5, Toshimichi Fujiwara5, Nobuyasu Koga1,2,4.   

Abstract

Protein design provides a stringent test for our understanding of protein folding. We previously described principles for designing ideal protein structures stabilized by consistent local and nonlocal interactions, based on a set of rules relating local backbone structures to tertiary packing motifs. The principles have made possible the design of protein structures having various topologies with high thermal stability. Whereas nonlocal interactions such as tight hydrophobic core packing have traditionally been considered to be crucial for protein folding and stability, the rules proposed by our previous studies suggest the importance of local backbone structures to protein folding. In this study, we investigated the robustness of folding of de novo designed proteins to the reduction of the hydrophobic core, by extensive mutation of large hydrophobic residues (Leu, Ile) to smaller ones (Val) for one of the designs. Surprisingly, even after 10 Leu and Ile residues were mutated to Val, this mutant with the core mostly filled with Val was found to not be in a molten globule state and fold into the same backbone structure as the original design, with high stability. These results indicate the importance of local backbone structures to the folding ability and high thermal stability of designed proteins and suggest a method for engineering thermally stabilized natural proteins.
Copyright © 2020 the Author(s). Published by PNAS.

Entities:  

Keywords:  de novo protein design; local backbone structures; protein folding; thermal stability

Mesh:

Substances:

Year:  2020        PMID: 33229587      PMCID: PMC7739874          DOI: 10.1073/pnas.2002120117

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


The de novo design of protein structures, starting from pioneering work (1, 2), has been achieved in tandem with our understanding of how amino acid sequences determine folded structures (3–16). A breakthrough in protein design methodology was a finding of principles for encoding funnel-shaped energy landscapes into amino acid sequences (7, 10, 17, 18). Based on studies of protein folding, it had been suggested that naturally occurring proteins have evolved to have funnel-shaped energy landscapes toward their folded structures (19–23). However, complicated structures of naturally occurring proteins with nonideal features for folding—for example, kinked α-helices, bulged β-strands, long or strained loops, and buried polar groups—make it difficult to understand how the funnels are encoded in amino acid sequences. By focusing on protein structures without such nonideal features, we proposed principles for designing ideal protein structures stabilized by completely consistent local and nonlocal interactions (24), based on a set of rules relating local backbone structures to preferred tertiary motifs (7, 10). These design rules describe the relation of the lengths or torsion patterns of two secondary structure elements and the connecting loop to favorable packing geometries (). The design principles enable to encode strongly funneled energy landscapes into amino acid sequences, by the stabilization of folded structures (positive design) and by the destabilization of nonnative conformations (negative design) due to the restriction of folding conformational space by the rules (). In the design procedure, backbone structures for a target topology are generated based on a blueprint (), in which either the lengths or backbone torsion patterns of the secondary structures and loops are determined using the rules so that the tertiary motifs present in the target topology are favored, and then amino acid sequences stabilizing the generated backbone structures are designed. The designed amino acid sequences stabilize their folded structures both with nonlocal interactions such as hydrophobic core packing and with local interactions favoring the secondary structures and loops specified in the blueprint, which destabilize a myriad of nonnative topologies through local backbone strain captured by the rules, thereby resulting in funnel-shaped energy landscapes (). The principles have enabled the de novo design of ideal protein structures for various topologies with atomic-level accuracy (Fig. 1) (6, 7, 10, 13).
Fig. 1.

In silico energy landscapes and far-UV circular dichroism (CD) spectra for 10 de novo designed ideal proteins. (A–E) Five designs by Koga et al. in 2012 (7). (F–I) Four designs by Lin et al. in 2015 (10). (J) Top7 by Kuhlman et al. in 2003 (6). (Top) Design models. (Middle) Energy landscapes obtained from Rosetta ab initio structure prediction simulations (41). Red points represent the lowest energy structures obtained in independent Monte Carlo structure prediction trajectories starting from an extended chain for each sequence; the y axis is the Rosetta all-atom energy; the x axis is the Cα root-mean-square deviation (RMSD) to the design model. Green points represent the lowest energy structures obtained in trajectories starting from the design model. (Bottom) The far-UV CD spectra during thermal denaturation with the melting temperature Tm, which is obtained by fitting to the denaturation curves shown in .

In silico energy landscapes and far-UV circular dichroism (CD) spectra for 10 de novo designed ideal proteins. (A–E) Five designs by Koga et al. in 2012 (7). (F–I) Four designs by Lin et al. in 2015 (10). (J) Top7 by Kuhlman et al. in 2003 (6). (Top) Design models. (Middle) Energy landscapes obtained from Rosetta ab initio structure prediction simulations (41). Red points represent the lowest energy structures obtained in independent Monte Carlo structure prediction trajectories starting from an extended chain for each sequence; the y axis is the Rosetta all-atom energy; the x axis is the Cα root-mean-square deviation (RMSD) to the design model. Green points represent the lowest energy structures obtained in trajectories starting from the design model. (Bottom) The far-UV CD spectra during thermal denaturation with the melting temperature Tm, which is obtained by fitting to the denaturation curves shown in . Interestingly, the de novo designs exhibit prominent characteristics in terms of thermal stability when compared with naturally occurring proteins. The circular dichroism (CD) measurements up to 170 °C conducted in this study revealed the melting temperature (Tm), which was above 100 °C for most of the designs (Fig. 1) (6, 7, 10). Therefore, the designs have great potential for use as scaffolds to engineer proteins with specific functions of interest. Indeed, miniprotein structures (∼40 residues) designed de novo according to the rules were applied as scaffolds for creating protein binders specific for influenza hemagglutinin and botulinum neurotoxin, displaying high thermal stability (>70 °C) despite the small size (25). The rules in the principles described above emphasize the importance of local backbone structures not the details of amino acid side chains to protein folding, which is also supported by studies using simple calculations with the hydrophobic-polar lattice model or the snake-cube model (26, 27). On the other hand, it is known that hydrophobic interactions are the dominant driving force for folding (28, 29) and the cores of naturally occurring proteins are tightly packed with hydrophobic amino acid residues (30, 31) like a jigsaw puzzle. Indeed, in our design principles, protein cores were designed to be tightly packed and as “fat” as possible with larger hydrophobic residues so that energy landscapes were sculpted to be deeply funneled into a target topology by lowering its energy (). Which factor, the local backbone structures encoded by the rules or the tight core packing with fat hydrophobic residues, contributes more to the generation of funnels in the designs? Here, we studied the contribution of hydrophobic core packing to folding ability and thermal stability by investigating the robustness of folding against the reduction of packing, using the design with the highest thermal stability among our nine de novo designs (Fig. 1, except Top7), Rsmn2x2_5_6 (10). We started to study single-residue mutants from Leu or Ile to Val that prune one carbon atom from the aliphatic side chain, which lose the tight packing like a jigsaw puzzle and decrease the hydrophobicity, and then, we combined the mutations. Consequently, we found that a mutant with 10 residue substitutions of Leu or Ile with Val still has the folding ability and high thermal stability despite its reduced and loosened hydrophobic core packing. This result suggests the importance of the local backbone structures for the folding ability and stability of the de novo designs.

Results

The de novo designed Rossmann2x2 fold protein, Rsmn2x2_5_6 (10), in which two helices sandwich a β-sheet, showed superhigh thermal stability with the melting temperature (Tm) of 129.6 °C (Fig. 1). The design was computationally created using Rosetta by generating backbone structures based on a blueprint (), which was derived from the rules so that the Rossmann2x2 fold was selectively favored compared with other topologies, and then by designing sequences that stabilize the generated backbone structures with only hydrophobic residues in the core and only polar residues on the surface. The core of this protein consists of 20 valine residues, 7 leucine, 3 isoleucine, 2 alanine, 1 phenylalanine, and 1 tyrosine (Fig. 2). We substituted leucine and isoleucine with valine and measured the Tm of the mutants using CD.
Fig. 2.

Hydrophobic core of the de novo designed protein, Rsmn2x2_5_6. The core consists of 20 valine residues (green), 7 leucine (red), 3 isoleucine (salmon), 2 alanine (yellow), 1 phenylalanine (cyan), and 1 tyrosine (pale cyan). The structure on the Left is viewed from the side while the structure on the Right is viewed from the top following a 90° clockwise rotation of the structure on the Left. H1-4 represents the first to fourth helices. The first model of the NMR structure for the original design (PDB ID: 2N3Z) was used for the illustration.

Hydrophobic core of the de novo designed protein, Rsmn2x2_5_6. The core consists of 20 valine residues (green), 7 leucine (red), 3 isoleucine (salmon), 2 alanine (yellow), 1 phenylalanine (cyan), and 1 tyrosine (pale cyan). The structure on the Left is viewed from the side while the structure on the Right is viewed from the top following a 90° clockwise rotation of the structure on the Left. H1-4 represents the first to fourth helices. The first model of the NMR structure for the original design (PDB ID: 2N3Z) was used for the illustration.

Single-Residue Mutations from Leu or Ile to Val.

The 10 single-residue mutants, L4V, L13V, L28V, I39V, I42V, L52V, L59V, L60V, I63V, and L80V (Fig. 3), were generated (the residue numbers correspond to those of the NMR structure of Rsmn2x2_5_6; Protein Data Bank [PDB] ID: 2N3Z). The mutants were expressed in Escherichia coli, purified, and characterized by CD and size-exclusion chromatography combined with multiangle light scattering (SEC-MALS). All mutants were well expressed, soluble, and monomeric, and showed far-ultraviolet (UV) CD spectra characteristic of αβ-proteins as the original Rsmn2x2_5_6 design. Thermal denaturation experiments were carried out to identify the Tm of each mutant by using CD up to 170 °C under 1-MPa pressure, which made it possible to study the contribution of hydrophobic core packing to the stability in the superstable de novo design. The L80V mutant, represented by red crosses in Fig. 3, had the lowest Tm, 6.3 °C lower than the original protein (Orig), represented by black filled circles. (This mutant also showed lower cooperativity.) This was followed by the I63V, I42V, L13V, L59V, and I39V mutants, in ascending order of stability. Four mutants, L52V, L4V, L28V, and L60V, did not show a significant change in Tm (<1 °C) (Fig. 3 and ). (Some single-residue mutations substituting Leu on helix with Ala were also carried out. See .)
Fig. 3.

Stability of single-residue substitutions of Leu or Ile with Val within the core. (A) Mutated residues (Leu, Ile) are shown in the two views, as in Fig. 2. The residue numbers correspond to those in the NMR structure of Rsmn2x2_5_6 (PDB ID: 2N3Z). Colors indicate the types of secondary structure (SS) contacts shown in the table: Residues with both contact types are shown in orange (L80, which forms the largest number of distant SS contacts, is shown in red), those that only form distant contacts are shown in green except for L4, which is shown in cyan, and those that only form close contacts are shown in blue (see main text for details). (B) Thermal denaturation measured by CD at 222 nm. The data were fit to the Gibbs–Helmholtz equation (solid lines) to obtain the melting temperature T, the enthalpy change ΔH, and the heat capacity change ΔC. The color scheme is the same as that used in A. The original (Orig) is shown in black.

Stability of single-residue substitutions of Leu or Ile with Val within the core. (A) Mutated residues (Leu, Ile) are shown in the two views, as in Fig. 2. The residue numbers correspond to those in the NMR structure of Rsmn2x2_5_6 (PDB ID: 2N3Z). Colors indicate the types of secondary structure (SS) contacts shown in the table: Residues with both contact types are shown in orange (L80, which forms the largest number of distant SS contacts, is shown in red), those that only form distant contacts are shown in green except for L4, which is shown in cyan, and those that only form close contacts are shown in blue (see main text for details). (B) Thermal denaturation measured by CD at 222 nm. The data were fit to the Gibbs–Helmholtz equation (solid lines) to obtain the melting temperature T, the enthalpy change ΔH, and the heat capacity change ΔC. The color scheme is the same as that used in A. The original (Orig) is shown in black. To understand why each mutant made a different contribution to the stability, we attempted to predict the change in stability by computational modeling using Rosetta. We carried out Rosetta ΔΔG calculations, which were developed for evaluating the free energy difference between the wild type and its mutant (32) (see for details). In addition, we assessed the packing quality for each mutant using RosettaHoles (33). As expected, all 10 single mutants showed decreased stability and packing quality. However, the correlations of the decreases of the melting temperature ΔTm with the calculated ΔΔG values and the packing quality were not found (). These results are probably due to the insufficient accuracy of the energy function to evaluate an energy at the single-residue resolution (34) or due to the difficulty of predicting structural rearrangements induced by a single-residue mutation (35) (see strong correlations when the combined mutants using the single mutations are included for the comparison []). Here, we sought to understand the different contribution made by each mutated residue to the stability by considering the number of secondary structure elements that the residue makes contacts with and classifying those into the distant (not adjacent to the secondary structure element containing the mutated residue) and close (adjacent) ones (table in Fig. 3 and ; contacts are defined in ). Interestingly, we found that the hydrophobic residues that have multiple secondary structure contacts including both the contact types, “hub-residues,” L80, I63, I42, and L13 (red and orange in Fig. 3), contributed strongly to stability. L80 (red), in particular, is expected to contribute to stability the most because it forms the greatest number of distant secondary structure contacts, which agrees with the largest change of Tm upon the mutation. Residues that form only distant secondary structure contacts, such as L59 and I39 (green in Fig. 3), but not L4 (cyan), appeared to make relatively lower yet still significant contributions to the stability. In contrast, those residues that form only close secondary structure contacts (blue in Fig. 3) did not appear to contribute to stability, possibly indicating the importance of distant secondary structure contacts for stability. Interestingly, the L4V mutant did not change Tm significantly despite forming distant secondary structure contacts. L4 at the N terminus interacts with residues at the C-terminal helix (H4); therefore, the enthalpy loss upon the mutation may be compensated by the entropy gain due to the flexibility of both termini.

Combined Mutations.

We next constructed the combined variants using the single mutations conducted above. The Rossmann2x2 fold has two hydrophobic packing sites separated by a β-sheet: the helix (H) 2,3 side and the H1,4 side (Fig. 3). The H2,3 side has six residues that are Leu or Ile: I39, I42, L52, L59, L60, and I63, while the H1,4 side has four residues: L4, L13, L28, and L80. We generated two combined variants consisting of all of the single mutations on each side, which were subsequently expressed and purified. Remarkably, the variants were still soluble and monomeric, and exhibited CD spectra characteristic of αβ-proteins. The Tm of the H2,3 and H1,4 side combined variants were 115.3 and 123.2 °C, respectively (). The thermal denaturation of the H1,4 side was considerably less cooperative than that of the original (), which may be due to the decreased cooperativity of the L80V mutant described above.

The Mutant with Most of the Core Filled with Valine.

We finally constructed a mutant with total 10 substitutions of Leu or Ile with Val, resulting in 30 of the 34 hydrophobic core residues being valine (88% valine in the core, thus referred to as VAL88). The VAL88 protein was well expressed and still soluble (), although it starts to aggregate after being left at ∼500 μM concentration for 1 wk at room temperature. Furthermore, VAL88 in the soluble fraction was found to be monomeric by SEC-MALS () and showed a CD spectrum characteristic of αβ-proteins (Fig. 4) and high stability with Tm of 106.1 °C (). The 23.6 °C decrease in Tm compared with the original is almost equal to the sum of the decreases for the combined variants on each side (20.8 °C) (), suggesting that the two sides separated by the β-sheet contribute almost independently to stability. As well as the Tm, the cooperativity of the thermal denaturation of VAL88 also decreased compared with the original (). The experimentally estimated ΔΔG, the change in free energy of unfolding for VAL88 compared with the original, at the Tm of the original, was −5.5 kcal/mol (), which is close to the calculated hydrophobic free energy change (−5.8 kcal/mol) based on buried surface area during folding () (36, 37). This result is consistent with those in previous studies (36, 37) that demonstrated how much protein stability is lost when carbon atoms in hydrophobic residues are removed from the core, indicating that total reduction in stability caused by the 10 substitutions can be explained by the reduction in hydrophobic interactions.
Fig. 4.

Characterization of the designed protein with most of the core mutated to Val, VAL88. (A) The energy landscape obtained from Rosetta ab initio structure prediction simulations. As in Fig. 1, red points represent simulated structures starting from an extended chain. The Cα RMSD of simulated structures was calculated against the first model of the NMR structure for the original (PDB ID: 2N3Z). Green points represent simulated structures starting from the VAL88 structure generated from the first NMR model using Foldit (55). (B) The far-UV CD spectra at various temperatures. (C) Two-dimensional 1H–15N HSQC spectrum at 25 °C and 600 MHz. (D) NMR structure (PDB ID: 6LLQ). (E) Hydrophobic core side chains are shown in stick. The residue color and the views are the same as those in Fig. 2. Residues colored in green are valine. (F) Core packing in the H1,4 and H2,3 sides of the original and VAL88 NMR structures are shown in the same view as the Bottom figure in E. Hydrophobic residues are represented by space-filling with the same residue color as in Fig. 2. The first model of each NMR structure was used for illustration. (G) HDX rates for core residues. (Top) Changes in signal intensity ratio (signal intensity at the beginning is set to 1) of 1H–15N correlation NMR signals for six residues in the core are shown for the original (Orig) as solid lines and for VAL88 as dashed lines. (Bottom) The six residues shown by green sticks. Residue 52 is Leu in the original and Val in VAL88; the other five residues are Val in the both. The first model of the NMR structure for VAL88 was used for illustration in E and G.

Characterization of the designed protein with most of the core mutated to Val, VAL88. (A) The energy landscape obtained from Rosetta ab initio structure prediction simulations. As in Fig. 1, red points represent simulated structures starting from an extended chain. The Cα RMSD of simulated structures was calculated against the first model of the NMR structure for the original (PDB ID: 2N3Z). Green points represent simulated structures starting from the VAL88 structure generated from the first NMR model using Foldit (55). (B) The far-UV CD spectra at various temperatures. (C) Two-dimensional 1H15N HSQC spectrum at 25 °C and 600 MHz. (D) NMR structure (PDB ID: 6LLQ). (E) Hydrophobic core side chains are shown in stick. The residue color and the views are the same as those in Fig. 2. Residues colored in green are valine. (F) Core packing in the H1,4 and H2,3 sides of the original and VAL88 NMR structures are shown in the same view as the Bottom figure in E. Hydrophobic residues are represented by space-filling with the same residue color as in Fig. 2. The first model of each NMR structure was used for illustration. (G) HDX rates for core residues. (Top) Changes in signal intensity ratio (signal intensity at the beginning is set to 1) of 1H15N correlation NMR signals for six residues in the core are shown for the original (Orig) as solid lines and for VAL88 as dashed lines. (Bottom) The six residues shown by green sticks. Residue 52 is Leu in the original and Val in VAL88; the other five residues are Val in the both. The first model of the NMR structure for VAL88 was used for illustration in E and G. The two-dimensional (2D) 1H15N heteronuclear single-quantum coherence (HSQC) spectrum of VAL88 showed the expected number of well-dispersed sharp peaks (Fig. 4), indicating that the protein with most of the core mutated to valine is not in a molten globule state (38). The solution structure was then determined with high quality by NMR spectroscopy using MagRO-NMRViewJ (39, 40), and was found to have the same topology as the original (see Fig. 4, , and for details). The hydrophobic core consisting of many valine residues (green sticks in Fig. 4) has cavities (Right in Fig. 4) compared with the original (Left in Fig. 4). Based on hydrogen/deuterium exchange (HDX) experiments, core residues in VAL88 were found to be significantly more unprotected than the original (Fig. 4 and ). The packing quality evaluated by RosettaHoles (33) also suggested the loosened packing: The score of VAL88, 6.0, is higher than those of the original, −0.45, and the other de novo designs (lower values represent better packing) (). Despite the loosened packing, the structure maintained the same topology and backbone structure as the original with an averaged root-mean-square deviation of Cα atoms of 1.4 Å (Fig. 4 and ). To identify how the energy landscape of VAL88 is sculpted, we performed 10,000 independent Rosetta ab initio structure prediction simulations (41) starting from an extended chain. The energy landscape still remained funnel-shaped toward the same folded structure as the original, although the funnel got wider and the energy gap between the native and the denatured states got smaller than the original, likely because the hydrophobic interactions were weakened by the mutations to smaller hydrophobic residues (Fig. 4 and the energy landscape of the original, Rsmn2x2_5_6, in Fig. 1). This result agrees with the fact that VAL88 can fold into the same structure as the original protein, even though the cooperativity and melting temperature decreased. We also constructed a mutant with an additional substitution F21V (11 residues were substituted with Val in total). Notably, the 1H15N HSQC spectrum still showed well-dispersed sharp peaks, and the thermal denaturation showed the Tm of 101.5 °C, indicating that the protein is still folded with high stability, although the soluble monomeric fraction was quite low ().

The Mutant with Altered Loop Lengths.

Finally, we directly investigated the importance of local backbone structures by changing the lengths of the loops that connect a β-strand to an α-helix (). We created a mutant that violates the βα-rules () by changing the two-residue loops to three-residue ones with the insertion of an Ala into the middle, and the three-residue loops to two-resiude ones with the deletion of the middle residue. We found that most of the soluble fraction of the mutant was in an aggregated form and that the monomeric fraction was quite low by SEC-MALS (, Left). Furthermore, the 1H15N HSQC spectrum showed a small number of nondispersed peaks (, Right), indicating that the mutant can no longer fold. This result does not, however, imply that de novo designed structures are vulnerable to any loop changes. A wide range of loop types should be tested for the purpose of creating functional proteins.

Discussion

Various ideal protein structures stabilized by completely consistent local and nonlocal interactions have been designed using the design rules (6, 7, 10, 13). Among the consistent local and nonlocal interactions, we investigated the contribution of a nonlocal interaction, hydrophobic core packing, to the folding ability and stability of de novo designed proteins by reducing hydrophobic core packing through extensive mutation of large hydrophobic residues (Leu, Ile) to smaller ones (Val). The results obtained from single-residue mutations suggested that “hub-residues,” the residues having contacts with both distant and close secondary structures, make a large contribution to the stability. It would be interesting to test the generality of this idea for other proteins. Furthermore, by accumulating the single mutations, we demonstrated that a designed protein with most of the core mutated to valine can still fold into the same topology as the original and remains folded above 100 °C even with the stability reduced by 5.5 kcal/mol. This result indicates that hydrophobic tight core packing may not be quite important for designed proteins: The folding ability and extremely high stability may arise from the restriction of conformational space to be searched during folding by the local backbone structures. This can lead to an exceptionally stable protein even in the absence of precise core packing. On the other hand, the contribution of local interactions in naturally occurring proteins may not be significant compared with designed proteins. This is because naturally occurring protein structures are prevented from completely being optimized for folding and stability due to the requirements to be functional or the susceptibility to neutral evolution, although the design rules are also observed in naturally occurring protein structures (7). Indeed, to the best of our knowledge, there are no natural proteins that can still fold properly when most of the hydrophobic residues in the core are mutated into valine. In addition, proteins that contain a high ratio of valine in the core are not observed in naturally occurring globular proteins including Rossmann folds (). Finally, we should note studies in which most of hydrophobic core residues of naturally occurring proteins were substituted with either methionine or isoleucine (42, 43). However, these studies aimed to create alternative core packing with methionine or isoleucine having higher hydrophobicity and higher number of side-chain conformations than valine; VAL88 created in this study is uniquely characterized by the reduced and loosened tight core packing with valine. Unlike de novo designed proteins, most naturally occurring proteins are not highly stable and unfold at less than 100 °C. Thermostabilization of natural proteins has been a long-standing challenge for industrial applications. Many studies have attempted to increase the thermal stability of proteins by changing side chains (amino acid residues) without modifying backbone conformations (44). Remodeling backbone structures with ideal ones would be an alternative way to achieve thermal stabilization of naturally occurring proteins (45, 46).

Materials and Methods

Definition of Secondary Structure Contacts.

Secondary structure contacts that each mutated residue forms were defined as follows. First, using all 20 models of the NMR structure for the original design, Rsmn2x2_5_6 (PDB ID: 2N3Z), residue-pair contacts were calculated if any heavy atoms of two residues separated by at least five residues along the sequence are within 4.5 Å in more than half of the 20 NMR models. All residue-pair contacts are shown in . Subsequently, secondary structure elements that each mutated residue makes contact with were identified by looking up the calculated residue-pair contacts, and then classified into the distant (not adjacent to the secondary structure element containing the mutated residue) and close (adjacent) ones (). Here, the residue pairs between β-strands were excluded for the calculation, because nonlocal interactions between the strand residues are provided by backbone–backbone hydrogen bonding in addition to hydrophobic contacts between side chains; therefore, the mutations of hydrophobic residues may not largely contribute to stability. Indeed, the mutations of the strand residues forming contacts with the neighboring β-strands (L52V, L4V, and L28V) did not show a significant stability change ().

Calculation of Energy Landscapes.

Energy landscapes of the original (Rsmn2x2_5_6, Fig. 1), VAL88 (Fig. 4), and Top7 (Fig. 1) were obtained from Rosetta ab initio structure prediction simulations (41). We performed 10,000 independent Monte Carlo structure prediction simulations starting from an extended chain for each sequence using energy function talaris2014 on Research Center for Computational Science (RCCS). The command line used for executing Rosetta ab initio structure prediction simulations starting from an extended chain is the following: $ROSETTA/source/bin/AbinitioRelax.linuxgccrelease -database $ROSETTADB -in:file:native [input.pdb] -in:file:fasta [input.fasta] -in:file:frag3 [input.frag3] -in:file:frag9 [input.frag9] -abinitio:fastrelax -out:file:silent [output.silent] -nstruct XX -ex1 -ex2aro -use_filters false -abinitio::increase_cycles 10 -abinitio::rg_reweight 0.5 -abinitio::rsd_wt_helix 0.5 abinitio::rsd_wt_loop 0.5 -relax::fast -relax::default_repeats 15 -score:weights talaris2014, where [input.pdb] is a pdb file of a design model, [input.fasta] is the designed sequence, [input.frag3] and [input.frag9] are three-residue or nine-residue fragment files generated from the designed sequence, and XX is a number of folding simulation trajectories. The relax simulations starting from the design model are carried out using the command line below: $ROSETTA/source/bin/relax.linuxgccrelease -database $ROSETTADB -s [input.pdb] -in:file:native [input.pdb] -out:file:silent [output.silent] -nstruct XX -ex1 -ex2aro -use_filters false -relax::fast -relax::default_repeats 15 -score:weights talaris2014.

Plasmid Construction, Expression, and Purification of Mutants.

The gene encoding the originally designed sequence, Rsmn2x2_5_6, cloned into plasmid pET21b, was obtained from the previous study (10). All single-residue mutations were introduced by the site-directed mutagenesis, in which a pair of complementary mutagenic primers are used to amplify the whole plasmid using Phusion high-fidelity DNA polymerase (New England Biolabs) or PrimeSTAR mutagenesis basal kit (Takara Bio). The combined mutations were introduced by QuikChange multi site-directed mutagenesis kit (Agilent Technologies). Oligos were obtained from FASMAC or Eurofins Genomics. The mutated DNA sequences were confirmed by DNA sequencing by FASMAC. The mutated proteins except VAL88 were expressed in E. coli BL21 Star (DE3) cells (Invitrogen) as nonlabeled proteins, which were expressed using autoinduction media (47). The expressed proteins with a 6×His tag at the C terminus were purified through a nickel affinity column. The purified proteins were then dialyzed against PBS buffer, 137 mM NaCl, 2.7 mM KCl, 10 mM Na2HPO4, 1.8 mM KH2PO4, at pH 7.4; this buffer was used for all of the experiments except VAL88. The expression, solubility, and purity of the designed proteins were assessed by sodium dodecyl sulfate–polyacrylamide gel electrophoresis (SDS/PAGE) and mass spectrometry (Thermo Scientific Orbitrap Elite).

CD.

All CD data were collected on a JASCO J-1500 KS CD spectrometer under 1-MPa pressure, which makes the thermal denaturation possible up to 170 °C. Thermal denaturation was performed from 30 to 170 °C with an increase of 1 °C per min using ∼20 μM protein samples in PBS buffer (pH 7.4) in a 1-mm path length cuvette. During the denaturation, the ellipticity at 222 nm was monitored and far-UV CD spectra were measured from 260 to 200 nm at various temperatures of 30, 40, 60, 80, 100, 110, 120, 130, 140, 160, and 170 °C. The thermal denaturation curves were fit to the Gibbs–Helmholtz equation: to obtain the melting temperature Tm, in which the number of folded proteins is equal to the number of unfolded proteins, the enthalpy change ΔHm and the heat capacity change ΔC. Since the heat capacity is a function of pressure in general, we investigated the dependency by performing thermal denaturation under the two different pressures, the atmospheric pressure, 0.1 MPa, and 1 MPa for heating up to 170 °C, using one of our de novo designs, Fd_5A_3. The heat capacities obtained by fitting to the Gibbs–Helmholtz equation were 0.79 kcal/mol/K under 0.1-MPa pressure and 0.68 kcal/mol/K under 1-MPa pressure; these values agree well.

SEC-MALS.

SEC-MALS experiments were performed using a miniDAWN TREOS static light scattering detector (Wyatt Technology Corporation) combined with a HPLC system (1260 Infinity LC; Agilent Technologies). The volume 100 μL of 300 to 500 μM protein samples in PBS buffer (pH 7.4) after Ni purification except VAL88 protein samples for NMR structure determination was injected into a Superdex 75 increase 10/300 GL column (GE Healthcare) or a Shodex KW-802.5 (Showa Denko K.K.) equilibrated with PBS buffer at a flow rate of 0.5 mL/min. As for VAL88 proteins purified by gel filtration chromatography, the volume 50 μL of 500 μM samples in PBS buffer (pH 6.0) was injected into a Superdex 75 increase 10/300 GL column. The protein concentrations were calculated from the absorbance at 280 nm detected by the HPLC system. Static light scattering data were collected at three different angles, 43.6°, 90.0°, and 136.4°, at 659 nm. These data were analyzed by the ASTRA software (version 6.1.2; Wyatt Technology Corporation) with a change in the refractive index with concentration, a dn/dc value, 0.185 mL/g.

Two-Dimensional 1H–15N HSQC Measurement.

Two-dimensional 1H15N HSQC spectrum was measured to verify whether the core of VAL88 proteins is well packed, i.e., not in a molten globule state. The VAL88 was expressed in E. coli BL21 Star (DE3) cell as uniformly (U-)15N-labeled proteins using MJ9 minimal media (48), which contain 15N ammonium sulfate as a sole nitrogen source and 12C glucose as a sole carbon source, respectively. The expressed proteins with a 6×His tag at the C terminus were purified through a nickel affinity column and then purified by gel filtration chromatography on an ÄKTA Pure 25 FPLC (GE Healthcare) using a Superdex 75 increase 10/300 GL column (GE Healthcare). The expression, solubility, and purity of VAL88 proteins were assessed by SDS/PAGE and mass spectrometry (Thermo Scientific Orbitrap Elite). The HSQC spectrum was collected for 500 μM protein sample in 90% H2O/10% D2O buffer containing 100 mM NaCl, 1.2 mM Na2HPO4, and 7.4 mM KH2PO4 at pH 6.0, at 25 °C on a JEOL JNM-ECA 600-MHz spectrometer.

Determination of Solution Structure by NMR.

Sample preparation.

For the NMR structure determination, the uniformly isotope labeled [U-15N,U-13C]-VAL88 was expressed using the same method as described above except 13C glucose as a sole carbon source. The U-15N,U-13C-enriched purified protein samples dissolved in 95% H2O/5% D2O buffer containing 100 mM NaCl, 1.2 mM Na2HPO4, and 7.4 mM KH2PO4 at pH 6.0 were transferred into Shigemi micro NMR tubes (protein concentration, ∼500 μM). For residual dipolar coupling (RDC) experiments, normal NMR tubes were used (protein concentration, ∼200 μM).

NMR measurements.

NMR measurements were performed on Bruker AVANCE III NMR spectrometers equipped with QCI cryo-Probe (1H/13C/15N/31P) at 303 K. The spectrometers with 600- and 800-MHz magnets were used for the signal assignments and nuclear Overhauser effect (NOE)-related measurements, while a 950-MHz one was used for RDC experiments. For the signal assignments, 2D 1H15N HSQC (echo/anti-echo), 1H13C constant-time HSQC for aliphatic and aromatic signals, 3D BEST-HNCO, 3D CBCA(CO)NH, 3D HNCACB, 3D H(CCCO)NH, and 3D CC(CO)NH spectra were measured. For structure determination, 3D 15N-edited nuclear Overhauser effect spectroscopy (NOESY), 3D 13C-edited NOESY for aliphatic and aromatic signals (mixing time, 100 ms) were performed. For RDC experiments, 2D IPAP 1H15N HSQC using water-gate pulses for water suppression was measured with or without 7 mg/mL Pf1 phage (ASLA Biotech Ltd.). For confirming the positions of 1H15N signals in the 2D HSQC, 3D HNCO in the presence of Pf1 phage at the same buffer condition were used. The signals for α- and β-states split by 1H15N 1J-coupling were separately obtained for the protein in the isotropic and weakly aligned states. The RDC values were estimated by simple subtraction of the shifted values between isotropic and weakly aligned states, and then divided by the static magnetic field to get the value in hertz.

NMR data process.

All of the acquired NMR free induction decay data were processed with NMRPipe (49). For the NMR analysis, an integrated package of NMR tools named MagRO-NMRViewJ, version 2.01.12 [the upgraded version of Kujira (39)], on NMRView (50) was used for automated peak identification and noise filtration using convolutional neural networks [CNN-filter (40)]. The filtered peak lists were applied to calculations for automated signal assignments by FLYA (51) and then the signal assignments were used for prediction of dihedral angles by TALOS+ (52), and automated NOE assignments and structure calculation by CYANA (53).

HDX NMR Experiments.

Uniformly 15N-labeled original and VAL88 proteins were lyophilized after purification. As reference experiments, the lyophilized samples were dissolved in a 95% H2O/5% D2O buffer solution containing 100 mM NaCl, 1.2 mM Na2HPO4 and 7.4 mM KH2PO4 at pH 6.0, and 2D 1H15N HSQC NMR spectra were measured by Bruker AVANCE III 800-MHz NMR spectrometer equipped with a TXI cryogenic probe at 303 K. In HDX experiments, the lyophilized sample was dissolved in a prechilled 100% D2O buffer solution, of which composition was same as that of the reference experiments. Upon addition of the 100% D2O buffer, the sample was immediately transferred into a Shigemi tube (5-mm diameter) and placed in the Bruker NMR magnet of 800 MHz, and 2D 1H15N HSQC NMR spectral data were collected. The time from the dissolution of the lyophilized sample to the beginning of NMR data collection was 20 min, and the eight of the 1H15N HSQC NMR spectra (measurement time of one NMR spectrum was 20 min) were measured continuously. Those NMR spectral data were processed by using the program NMRPipe (49), and H/D exchanging time-dependent changes of signal intensity of those spectra were analyzed by using the program NMRFAM-SPARKY (54).
  53 in total

Review 1.  Dominant forces in protein folding.

Authors:  K A Dill
Journal:  Biochemistry       Date:  1990-08-07       Impact factor: 3.162

Review 2.  From Levinthal to pathways to funnels.

Authors:  K A Dill; H S Chan
Journal:  Nat Struct Biol       Date:  1997-01

3.  Combined automated NOE assignment and structure calculation with CYANA.

Authors:  Peter Güntert; Lena Buchner
Journal:  J Biomol NMR       Date:  2015-03-24       Impact factor: 2.835

4.  Loop size optimization induces a strong thermal stabilization of the thioredoxin fold.

Authors:  Alessia Ruggiero; Giovanni Smaldone; Luciana Esposito; Nicole Balasco; Luigi Vitagliano
Journal:  FEBS J       Date:  2019-02-16       Impact factor: 5.542

5.  Structural studies of mutants of T4 lysozyme that alter hydrophobic stabilization.

Authors:  M Matsumura; J A Wozniak; D P Sun; B W Matthews
Journal:  J Biol Chem       Date:  1989-09-25       Impact factor: 5.157

6.  High thermodynamic stability of parametrically designed helical bundles.

Authors:  Po-Ssu Huang; Gustav Oberdorfer; Chunfu Xu; David Baker; Xue Y Pei; Brent L Nannenga; Joseph M Rogers; Frank DiMaio; Tamir Gonen; Ben Luisi
Journal:  Science       Date:  2014-10-24       Impact factor: 47.728

7.  NMRPipe: a multidimensional spectral processing system based on UNIX pipes.

Authors:  F Delaglio; S Grzesiek; G W Vuister; G Zhu; J Pfeifer; A Bax
Journal:  J Biomol NMR       Date:  1995-11       Impact factor: 2.835

Review 8.  Theoretical studies of protein folding.

Authors:  N Go
Journal:  Annu Rev Biophys Bioeng       Date:  1983

9.  NMRFAM-SPARKY: enhanced software for biomolecular NMR spectroscopy.

Authors:  Woonghee Lee; Marco Tonelli; John L Markley
Journal:  Bioinformatics       Date:  2014-12-12       Impact factor: 6.937

10.  Principles for designing ideal protein structures.

Authors:  Nobuyasu Koga; Rie Tatsumi-Koga; Gaohua Liu; Rong Xiao; Thomas B Acton; Gaetano T Montelione; David Baker
Journal:  Nature       Date:  2012-11-08       Impact factor: 49.962

View more
  3 in total

1.  A Method for Assessing the Robustness of Protein Structures by Randomizing Packing Interactions.

Authors:  Shilpa Yadahalli; Lakshmi P Jayanthi; Shachi Gosavi
Journal:  Front Mol Biosci       Date:  2022-06-27

Review 2.  The Protein Folding Problem: The Role of Theory.

Authors:  Roy Nassar; Gregory L Dignon; Rostam M Razban; Ken A Dill
Journal:  J Mol Biol       Date:  2021-07-03       Impact factor: 6.151

Review 3.  Recent Progress Using De Novo Design to Study Protein Structure, Design and Binding Interactions.

Authors:  Juan Ferrando; Lee A Solomon
Journal:  Life (Basel)       Date:  2021-03-10
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.