Proteinogenic amino acid residues that promote β-sheet secondary structure are hydrophobic (e.g., Ile or Val) or only moderately polar (e.g., Thr). The design of peptides intended to display β-sheet secondary structure in water typically requires one set of residues to ensure conformational stability and an orthogonal set, with charged side chains, to ensure aqueous solubility and discourage self-association. Here we describe new amino acids that manifest substantial β-sheet propensity, by virtue of β-branching, and also bear an ionizable group in the side chain.
Proteinogenic amino acid residues that promote β-sheet secondary structure are hydrophobic (e.g., Ile or Val) or only moderately polar (e.g., Thr). The design of peptides intended to display β-sheet secondary structure in water typically requires one set of residues to ensure conformational stability and an orthogonal set, with charged side chains, to ensure aqueous solubility and discourage self-association. Here we describe new amino acids that manifest substantial β-sheet propensity, by virtue of β-branching, and also bear an ionizable group in the side chain.
β-Sheet is a
very common motif within folded proteins. Small,
autonomously folding peptides that display β-sheet secondary
structure in the absence of a specific tertiary context are useful
tools for exploring intrinsic relationships between sequence and β-sheet
stability[1] and for establishing characteristic
spectroscopic signatures of this motif.[2] In addition, autonomously folding systems have been valuable for
probing noncovalent interactions involving post-translationally introduced
units.[3] Engineered β-sheets have
been used to explore preferred modes of intermolecular β-strand
associations, phenomena that underlie amyloid formation;[4] such β-sheet designs may ultimately lead
to diagnostic or therapeutic tools for amyloid diseases.[5]Design of β-sheet-forming peptides
is challenging because
the natural residues that promote this secondary structure are hydrophobic
(Ile, Val, Phe, Trp, Tyr) or only moderately polar (Thr).[6] Therefore, sequences intended to adopt β-sheet
conformations but not self-associate in aqueous solution must be rich
both in β-promoting residues and in solubility-promoting residues,
the latter usually bearing charged side chains. The necessity of using
distinct sets of residues for conformational propensity and solubility
represents a significant design constraint. Here we explore the hypothesis
that β-sheet propensity and peripheral charge can be combined
in a single α-amino acid residue.Side chain branching
adjacent to the backbone (“β-branching”),
as in Ile, Val and Thr, is correlated with high β-sheet propensity;[7] therefore, we have synthesized and evaluated
new amino acids that contain both a β-branch point and an ionizable
group in the side chain (Figure 1).
Figure 1
Sequences of hairpin peptides.
Results and Discussion
Initial Design: Ether Derivative of Thr
Our initial
design hypothesis focused on derivatives of threonine in which the
side chain hydroxyl is used to form an ether linkage to a unit bearing
an ionizable group. Scheme 1 shows the synthesis
of protected amino acid 3 from aziridine 1, which was generated from l-threonine via well-precedented
methods.[8a] Vederas et al. have shown that
nucleophiles can open closely related Thr-derived aziridines in the
presence of BF3 with high stereoselectivity,[8] and this approach was useful in our case. The
residue derived from 3, which we designate TO+, was readily incorporated into synthetic peptides via Fmoc-based
solid-phase synthesis. TO+ can be viewed as an analogue
of Lys that contains a β-branch point in the side chain.
Scheme 1
Synthesis of Amino Acids
We used 12-mer peptide I as a benchmark for
evaluating
the β-sheet propensity of TO+ and other new residues
described below (Figure 1). The conformational behavior of I was previously
characterized via NMR.[9] In aqueous solution
this peptide adopts an antiparallel two-stranded β-sheet conformation
(“β-hairpin”) with a reverse turn at the dPro-Gly segment. d-Proline was employed to promote a type
II′ reverse turn, which favors β-sheet interactions between
flanking strands composed of l-residues.[10] NMR data indicate ∼68% population of the β-hairpin
conformation at 4 °C in aqueous buffer (pH 3.8).[9b]Sequences of hairpin peptides.Peptide II is the analogue of I in which
Lys-9 has been replaced with TO+. 2D NMR analysis suggested
significant population of the β-hairpin conformation at 4 °C
in aqueous buffer (pH 3.8).[11] The NOEs
observed for II are consistent with adoption of the expected
β-hairpin conformation. For additional insight, we compared
chemical shifts for α protons (δCαH)
of residues within I and II. δCαH values are sensitive to secondary structure, with
residues that participate in α-helix displaying upfield shifts
and residues that participate in β-sheet showing downfield shifts
relative to random coil values.[12]Figure 2a shows ΔδCαH data, the deviation from the random coil chemical shift at each
residue, for peptides I and II. The lPro-6 diastereomers of I and II were
used to provide the random coil δCαH values
because we have previously shown that replacing dPro with lPro abolishes β-hairpin folding in I and
comparable designs.[9] For both I and II, ΔδCαH ≥
0.2 ppm for the segments Tyr-2 to Val-5 and Orn-8 to Ile-10, which
is consistent with the expected locations of the two β-strand
segments. The consistent negative ΔδCαH values for Leu-11 presumably arise from the proximity of the Tyr
side chain to Leu-11 CαH in the β-hairpin conformation.
The near-zero values at dPro-6 and Gly-7 are consistent with
the expected turn, and the near-zero values at Arg-1 and Gln-12 are
consistent with “fraying” at the termini.
Figure 2
Comparison
of ΔδCαH values for the
(A) parent, TS+, and TO+ peptides and (B) parent
and TS– peptides. All peptides were referenced relative
to unfolded controls of the same sequence. The ΔδCαH values at or adjacent to the substitution cannot be
directly compared as the substitution changes the dynamic range. ΔδCαH values at hydrogen bonding positions in the core of
the peptide have been shown to most accurately reflect the population
of the β-hairpin.[2b]
Comparison
of ΔδCαH values for the
(A) parent, TS+, and TO+ peptides and (B) parent
and TS– peptides. Allpeptides were referenced relative
to unfolded controls of the same sequence. The ΔδCαH values at or adjacent to the substitution cannot be
directly compared as the substitution changes the dynamic range. ΔδCαH values at hydrogen bonding positions in the core of
the peptide have been shown to most accurately reflect the population
of the β-hairpin.[2b]The ΔδCαH comparison
between I and II suggests that the new TO+ residue may have a slightly lower β-sheet
propensity than does Lys. This surprising conclusion emerges because
the absolute ΔδCαH values for II are smaller than those for I at most strand residues
common to the two peptides.
Redesign: Thioether Derivatives of Thr
The unexpected
behavior of TO+ led us to re-evaluate our design hypothesis,
which focused exclusively on β-branching in the side chain.
The side chain oxygen of Thr can form an H-bond with a nearby backbone
N–H, which leads to Thr backbone torsion angles (ϕ and
ψ) more compatible with the α-helical than β-sheet
secondary structure.[7a,7b] An ether is a better H-bond acceptor
than a comparable alcohol.[13a] If the H-bond
acceptor ability of the etheroxygen in TO+ works against
β-sheet propensity, then we hypothesized that the desired conformational
properties should be achieved by replacing this oxygen atom with sulfur,
to generate TS+. Thioethers are much poorer H-bond acceptors
than are ethers.[13] A protected α-amino
acid that could be used to incorporate TS+ residues was
prepared as shown in Scheme 1; the key step
was use of trityl thiol to open the Thr-derived aziridine.[14] Protected amino acid 5a allowed
solid-phase synthesis of peptide III, which contains
TS+ in place of Lys-9 of I.2D NMR analysis
of III revealed interstrand NOEs consistent with significant
population of the expected β-hairpin conformation in aqueous
buffer at 4 °C. Resonance overlap hindered identification of
NOEs involving the Tyr-2 side chain at this temperature, but at 10
°C such NOEs could be detected (Figure 3). AllNOEs are consistent with the expected β-sheet structure.
Figure 3
Cross-strand
NOEs observed for peptide III at 10 °C
in pH 3.8 aqueous buffer.
Cross-strand
NOEs observed for peptide III at 10 °C
in pH 3.8 aqueous buffer.ΔδCαH data for III (Figure 2a) suggest that the extent of β-hairpin
formation
is higher for this peptide than for II, which contains
the TO+ residue, because the absolute value of ΔδCαH is larger at each strand residue (positions 2–5
and 8–11) for III than for II. Moreover,
the ΔδCαH values are larger at most strand
positions for III relative to parent peptide I, which suggests that the TS+ residue stabilizes the β-hairpin
conformation relative to Lys.We explored the scope of this
design strategy by evaluating the
residue TS–, an analogue of TS+ that
bears an acidic rather than a basic side chain along with a thioether-based
β-branch point. Peptide IV, prepared using 5b, is an analogue of I in which Glu-4 is replaced
by TS–. The NOEs observed at 4 °C are consistent
with the formation of the expected β hairpin. ΔδCαH data for IV (Figure 2b) suggest that this peptide forms a more stable β-hairpin
than that formed by I. Circular dichroism spectra of
peptides I, III, and IV (phosphate
buffer, pH = 7.0) are consistent with the formation of a β hairpin
and the stability trends observed by NMR.[11]ΔδCαH-based population analysis
was
undertaken in order to estimate the thermodynamic impact of replacing
Lys-9 with either TO+ or TS+ or replacing Glu-4
with TS–. Residues 3, 5, 8, and 10 occupy hydrogen
bonded positions within the β-hairpin conformation adopted by I–IV, and we previously found that these
positions are optimal “indicators” for population analysis
in I and related peptides.[2b,9b] Such peptides
are presumed to equilibrate rapidly between β-hairpin and unfolded
states on the NMR time scale. The ΔδCαH value measured at each indicator residue represents a populated-weighted
average of the contributions from the fully unfolded state and the
fully folded (i.e., β-hairpin) state. For each sequence, we
use the lPro-6 diastereomer to estimate ΔδCαH for each indicator residue in the fully unfolded state,
and we use a macrocyclized analogue with dPro–Gly
turns at both ends to estimate δCαH in the
fully folded state.[11] (The reference peptides
for III are shown in Figure 4.)
Table 1 shows the β-hairpin population
deduced in this way at each of the four indicator positions in peptides I–IV. The variation among the four values
for each molecule is a measure of the intrinsic uncertainty associated
with this quantification strategy, which involves independent analysis
at four distinct sites within each peptide. Despite this uncertainty,
the data support the qualitative conclusion that replacing Lys-9 of I with TO+ (II) has little effect
on β-hairpin population, while replacing Lys-9 with TS+ (III) or Glu-4 with TS– (IV) leads to enhanced β-hairpin population.
Figure 4
Reference peptides for
thermodynamic analysis. Differences from
the peptide of interest are highlighted in blue.
Table 1
Population and Thermodynamic Analysis
for the Reporter Positions in Hairpins I–IVa
peptide
Val 3 (%)
Val 5 (%)
Orn 8 (%)
Ile 10 (%)
ΔΔGfold (kcal/mol)
Ib
70
76
65
59
II (TO+)c
62
77
63
51
+0.1 ± 0.2
III (TS+)c
85
93
76
71
–0.5 ± 0.4
IV (TS–)d
93
96
84
77
–0.6 ± 0.3
The ΔΔG was calculated for each reporter position in the hairpin
and then
averaged.
Data from Syud
et al. (ref (9a)).
Replacement for Lys-9. For II relative to III ΔΔG = −0.8 ± 0.5 kcal/mol.
Replacement for Glu-4.
Reference peptides for
thermodynamic analysis. Differences from
the peptide of interest are highlighted in blue.The ΔΔG was calculated for each reporter position in the hairpin
and then
averaged.Data from Syud
et al. (ref (9a)).Replacement for Lys-9. For II relative to III ΔΔG = −0.8 ± 0.5 kcal/mol.Replacement for Glu-4.Table 1 shows ΔΔGfold calculated for peptides II–IV relative to parent I, and for III relative to II, based on the residue-specific
population.
This thermodynamic analysis shows that replacing Lys with TO+, the basic Thr derivative with an ether linkage at the branch point,
fails to enhance β-hairpin stability, which invalidates our
original design hypothesis. In contrast, the analogous basic residue
containing a thioether linkage at the branch point, TS+, stabilizes the β-hairpin conformation. Although the ΔΔGfold values for II vs I and for III vs I are not cleanly distinguished
given the uncertainties in Table 1, direct
comparison of III vs II shows unambiguously
that TS+ enhances β-hairpin stability relative to
TO+. The magnitude of the β-sheet stabilization provided
by TS+ or TS– relative to proteinogenic
residues with unbranched side chains, Lys and Glu, respectively, is
significant given that the natively folded state of a typical globular
protein, containing a few hundred residues, is only ∼5 kcal/mol
more stable than the unfolded state.[15]It should be noted that differences between ΔδCαH values (bar heights) from Figure 2a and
% folded values from Table 1 are
not directly comparable. Each ΔδCαH value
in Figure 2a is determined by the difference
between the chemical shift (δCαH) for a given
residue in one of three peptides (I, II or III) and the analogous chemical shift from a reference peptide
that has l-proline in place of d-proline. The l-proline peptide represents the fully unfolded state. The %
folded values in Table 1 are generated from
an algebraic expression based on three chemical shifts,
the two mentioned above and the δCαH value
for the analogous cyclic peptide, which represents the fully folded
state. Since this last value can vary among macrocyclic peptides with
different sequences, quantitative correlations between ΔδCαH values (Figure 2a) and % folded
values (Table 1) may not be evident at specific
residues, particularly at or adjacent to substitution positions. In
addition, another factor may prevent such quantitative correlations:
the relationship between changes in ΔδCαH and changes in % folding is not linear. For these reasons, and
to account for local factors that can influence δCαH values at particular residues, we draw conclusions based on measurements
at multiple sequence positions.
Circular Dichroism:
Effects of pH and Temperature
Circular
dichroism (CD) provides information on peptide folding that is intrinsically
of lower structural resolution than the insights available from 2D
NMR; however, CD can be very useful for qualitative comparisons. CD
data in the far-UV region (190–250 nm) arise largely from the
backbone amide groups and therefore report on secondary structure.
The far-UV signatures of peptides I, III, and IV obtained in the buffer used for NMR studies,
100 mM acetate buffer, pH 3.8, all manifest a minimum at 215 nm (Figure 5), which is characteristic of β-sheet secondary
structure and therefore consistent with NMR data for these peptides.
Figure 5
CD data
for peptides (A) I, (B) III,
and (C) IV in phosphate buffer (pH 7.0) or acetate buffer
(pH 3.8). CD data for peptides (D) I, (E) III, and (F) IV in phosphate buffer at 4, 20, and 37 °C.
CD data
for peptides (A) I, (B) III,
and (C) IV in phosphate buffer (pH 7.0) or acetate buffer
(pH 3.8). CD data for peptides (D) I, (E) III, and (F) IV in phosphate buffer at 4, 20, and 37 °C.For each peptide, the far-UV CD
spectrum obtained in the NMR buffer
is not significantly different from the spectrum obtained in 100 mM
phosphate buffer, pH 7.0 (Figure 5a–c),
which indicates that β-hairpin folding is not pH-dependent over
this range. This observation is important because the Glu or TS– side chain carboxyl group is likely to be largely
protonated at pH 3.8, but fully deprotonated at pH 7.0. Varying temperature
from 4 to 20 to 37 °C does not significantly alter the far UV
CD of I, III or IV (Figure 5d–f), which suggests that conclusions drawn
from NMR analysis at low temperature are relevant to room or physiological
temperatures.
Evaluation of the New Residues in a Different
β-Hairpin
Context
We turned to a different β-hairpin system to
explore the generality of the behavior observed in derivatives of
peptide I for new residues TO+, TS+ and TS– (Figure 6). The
design of peptide V was based on the sequence of a β-hairpin
that occurs at the N-terminus of ubiquitin; a central dPro–Gly
segment was used to promote autonomous β-hairpin folding. Previously
reported NMR data suggest that the expected β-hairpin conformation
is significantly populated in aqueous solution.[10] We prepared analogues of V in which Lys-15
is replaced with TO+ (VI) or TS+ (VII).
Figure 6
Sequences of hairpin peptides.
Sequences of hairpin peptides.The ΔδCαH data indicate that
replacing
Lys-15 with TO+ does not lead to a significant change in
β-hairpin population, because the ΔδCαH values for VI are within error of those for V at strand positions Gln-2 to Ser-7 (Figure 7). In contrast, replacing Lys-15 with TS+ results
in consistently higher ΔδCαH values among
the N-terminal strand residues (Figure 7).
These conclusions are similar to those derived from comparisons among I-III, which also differ at a single residue
(Lys vs TO+ vs TS+). To evaluate TS– in this β-hairpin system, we evaluated VIII,
the derivative of V in which Ile-3 has been replaced
by Glu. ΔδCαH analysis indicates that
this change causes a significant decline in Δδ-hairpin
population,[11] which may reflect two factors,
the loss of β-branching and the loss of a hydrophobic side chain.
Replacing Glu-3 of VIII with TS–, to
generate IX, leads to an increase in β-hairpin
population, as judged by ΔδCαH data;[11] however, these data indicate that the TS– residue in IX is not as effective as
the Ile residue of V in terms of stabilizing the β-hairpin
conformation. Despite the apparently diminished β-sheet propensity
of the TS– residue relative to the more hydrophobic
Ile residue, TS– represents a useful design tool
because one can match the charge provided by Glu while enhancing the
folding tendency.
Figure 7
Comparison of ΔδCαH values
for the
parent, TS+, and TO+ peptides. All peptides
were referenced relative to unfolded controls of the parent. The ΔδCαH values at or adjacent to the substitution cannot be
directly compared as the substitution changes the dynamic range. The
ΔδCαH values at hydrogen bonding positions
in the core of the peptide have been shown to most accurately reflect
the population of the β-hairpin.[2b]
Comparison of ΔδCαH values
for the
parent, TS+, and TO+ peptides. Allpeptides
were referenced relative to unfolded controls of the parent. The ΔδCαH values at or adjacent to the substitution cannot be
directly compared as the substitution changes the dynamic range. The
ΔδCαH values at hydrogen bonding positions
in the core of the peptide have been shown to most accurately reflect
the population of the β-hairpin.[2b]
Hydrophilicity Assessements
Our design strategy is
based on the assumption that new amino acid residues with a side chain
that contains both a β-branch point and an ionizable group will
be more hydrophilic than the proteinogenic β-branched residues
(Thr, Ile and Val). To test this hypothesis, we evaluated the hydrophilicities
of TS+ and TS– along with selected proteinogenic
residues, Lys, Glu, Gly, Thr and Ile, using a previously described
system.[16] This method employs the N-4-nitrobenzoyl derivatives of the amino acids. Distribution
coefficients are determined (at equilibrium) between equal volumes
of octanol and aqueous buffer (100 mM phosphate, pH 7.0). The parameter
of comparison, Π, is normalized: the logarithm of the distribution
coefficient of glycine, log(DGlycine),
is subtracted from the logarithm of distribution coefficient of the
amino acid under consideration, log(Damino acid) to calculate Π for that amino acid. Table 2 shows Π values measured for TS+ and TS– and the selected proteinogenic residue.
Table 2
Normalized Octanol/Water Distribution
Coefficients of Selected 4-Nitrobenzoyl Amino Acid Derivatives
amino acid
Π
Lys
–2.44 ± 0.02
TS+
–1.47 ± 0.04
Glu
–1.85 ± 0.4
TS–
–1.75 ± 0.2
Gly
0
Thr
0.07 ± 0.01
Ile
1.59 ± 0.08
As we predicted, TS+ and TS– cluster
with Lys and Glu, all significantly preferring aqueous buffer relative
to octanol. It is interesting to note that TS– and
Glu are very similar on this scale, while TS+ is somewhat
less hydrophilic than Lys. In contrast, Ile significantly prefers
octanol to aqueous buffer, while Thr is similar to Gly. The data clearly
show that TS+ and TS– are significantly
more hydrophilic that Thr or Ile.
Conclusions
We
have identified a new family of unnatural α-amino acid
residues featuring two properties, intrinsic conformational propensity
and side chain charge, that have proven to be valuable for design
of peptides that fold autonomously to β-sheet secondary structure
in aqueous solution. These two properties are not paired in any proteinogenic
amino acid residue, which has made it challenging to design smallpeptides that adopt β-sheet conformations but do not aggregate.
High β-sheet propensity is generally associated with β-branching
in a residue’s side chain. Our initial attempt to build charge
and side chain branching into a new residue, involving an ether linkage
at the branch point, was not successful. Thioether-based branch points,
on the other hand, lead to the desired properties. We have illustrated
this approach with two new residues, TS+ and TS–, which feature basic and acidic side chains, respectively. The versatility
of the synthetic route will enable preparation of many related thioether-containing
α-amino acids.
Authors: Karen L Maxwell; David Wildes; Arash Zarrine-Afsar; Miguel A De Los Rios; Andrew G Brown; Claire T Friel; Linda Hedberg; Jia-Cherng Horng; Diane Bona; Erik J Miller; Alexis Vallée-Bélisle; Ewan R G Main; Francesco Bemporad; Linlin Qiu; Kaare Teilum; Ngoc-Diep Vu; Aled M Edwards; Ingo Ruczinski; Flemming M Poulsen; Birthe B Kragelund; Stephen W Michnick; Fabrizio Chiti; Yawen Bai; Stephen J Hagen; Luis Serrano; Mikael Oliveberg; Daniel P Raleigh; Pernilla Wittung-Stafshede; Sheena E Radford; Sophie E Jackson; Tobin R Sosnick; Susan Marqusee; Alan R Davidson; Kevin W Plaxco Journal: Protein Sci Date: 2005-02-02 Impact factor: 6.725
Authors: Kristen E Fiore; Martijn J Patist; Sam Giannakoulias; Cheng-Hsin Huang; Hitesh Verma; Bhavesh Khatri; Richard P Cheng; Jayanta Chatterjee; E James Petersson Journal: RSC Chem Biol Date: 2022-04-05
Authors: Abbas H K Al Temimi; Jona Merx; Christian J van Noortwijk; Giordano Proietti; Romano Buijs; Paul B White; Floris P J T Rutjes; Thomas J Boltje; Jasmin Mecinović Journal: Sci Rep Date: 2020-12-09 Impact factor: 4.379
Authors: Vesna Stanojlovic; Anna Müller; Ali Moazzam; Arthur Hinterholzer; Katarzyna Ożga; Łukasz Berlicki; Mario Schubert; Chiara Cabrele Journal: Chembiochem Date: 2021-12-16 Impact factor: 3.461