Robert W Newberry1, Ronald T Raines1,2. 1. Department of Chemistry, University of Wisconsin-Madison , Madison, Wisconsin 53706, United States. 2. Department of Biochemistry, University of Wisconsin-Madison , Madison, Wisconsin 53706, United States.
Abstract
The carbonyl group holds a prominent position in chemistry and biology not only because it allows diverse transformations but also because it supports key intermolecular interactions, including hydrogen bonding. More recently, carbonyl groups have been found to interact with a variety of nucleophiles, including other carbonyl groups, in what we have termed an n→π* interaction. In an n→π* interaction, a nucleophile donates lone-pair (n) electron density into the empty π* orbital of a nearby carbonyl group. Mixing of these orbitals releases energy, resulting in an attractive interaction. Hints of such interactions were evident in small-molecule crystal structures as early as the 1970s, but not until 2001 was the role of such interactions articulated clearly. These non-covalent interactions were first discovered during investigations into the thermostability of the proline-rich protein collagen, which achieves a robust structure despite a relatively low potential for hydrogen bonding. It was found that by modulating the distance between two carbonyl groups in the peptide backbone, one could alter the conformational preferences of a peptide bond to proline. Specifically, only the trans conformation of a peptide bond to proline allows for an attractive interaction with an adjacent carbonyl group, so when one increases the proximity of the two carbonyl groups, one enhances their interaction and promotes the trans conformation of the peptide bond, which increases the thermostability of collagen. More recently, attention has been paid to the nature of these interactions. Some have argued that rather than resulting from electron donation, carbonyl interactions are a particular example of dipolar interactions that are well-approximated by classical mechanics. However, experimental evidence has demonstrated otherwise. Numerous examples now exist where an increase in the dipole moment of a carbonyl group decreases the strength of its interactions with other carbonyl groups, demonstrating unequivocally that a dipolar mechanism is insufficient to describe these interactions. Rather, these interactions have important quantum-mechanical character that can be evaluated through careful experimental analysis and judicious use of computation. Although individual n→π* interactions are relatively weak (∼0.3-0.7 kcal/mol), the ubiquity of carbonyl groups across chemistry and biology gives the n→π* interaction broad impact. In particular, the n→π* interaction is likely to play an important role in dictating protein structure. Indeed, bioinformatics analysis suggests that approximately one-third of residues in folded proteins satisfy the geometric requirements to engage in an n→π* interaction, which is likely to be of particular importance for the α-helix. Other carbonyl-dense polymeric materials like polyesters and peptoids are also influenced by n→π* interactions, as are a variety of small molecules, some with particular medicinal importance. Research will continue to identify molecules whose conformation and activity are affected by the n→π* interaction and will clarify their specific contributions to the structures of biomacromolecules.
The carbonyl group holds a prominent position in chemistry and biology not only because it allows diverse transformations but also because it supports key intermolecular interactions, including hydrogen bonding. More recently, carbonyl groups have been found to interact with a variety of nucleophiles, including other carbonyl groups, in what we have termed an n→π* interaction. In an n→π* interaction, a nucleophile donates lone-pair (n) electron density into the empty π* orbital of a nearby carbonyl group. Mixing of these orbitals releases energy, resulting in an attractive interaction. Hints of such interactions were evident in small-molecule crystal structures as early as the 1970s, but not until 2001 was the role of such interactions articulated clearly. These non-covalent interactions were first discovered during investigations into the thermostability of the proline-rich protein collagen, which achieves a robust structure despite a relatively low potential for hydrogen bonding. It was found that by modulating the distance between two carbonyl groups in the peptide backbone, one could alter the conformational preferences of a peptide bond to proline. Specifically, only the trans conformation of a peptide bond to proline allows for an attractive interaction with an adjacent carbonyl group, so when one increases the proximity of the two carbonyl groups, one enhances their interaction and promotes the trans conformation of the peptide bond, which increases the thermostability of collagen. More recently, attention has been paid to the nature of these interactions. Some have argued that rather than resulting from electron donation, carbonyl interactions are a particular example of dipolar interactions that are well-approximated by classical mechanics. However, experimental evidence has demonstrated otherwise. Numerous examples now exist where an increase in the dipole moment of a carbonyl group decreases the strength of its interactions with other carbonyl groups, demonstrating unequivocally that a dipolar mechanism is insufficient to describe these interactions. Rather, these interactions have important quantum-mechanical character that can be evaluated through careful experimental analysis and judicious use of computation. Although individual n→π* interactions are relatively weak (∼0.3-0.7 kcal/mol), the ubiquity of carbonyl groups across chemistry and biology gives the n→π* interaction broad impact. In particular, the n→π* interaction is likely to play an important role in dictating protein structure. Indeed, bioinformatics analysis suggests that approximately one-third of residues in folded proteins satisfy the geometric requirements to engage in an n→π* interaction, which is likely to be of particular importance for the α-helix. Other carbonyl-dense polymeric materials like polyesters and peptoids are also influenced by n→π* interactions, as are a variety of small molecules, some with particular medicinal importance. Research will continue to identify molecules whose conformation and activity are affected by the n→π* interaction and will clarify their specific contributions to the structures of biomacromolecules.
The
carbonyl group has received great attention, due in part to
its varied reactivity and ubiquity across chemistry and biology. In
addition to the enormous catalogue of chemical transformations supported
by carbonyl groups, their intermolecular interactions play a paramount
role in the organization of biological systems. For example, carbonyl
groups participate in the hydrogen bonding that governs the structure
of proteins[1,2] and nucleic acids.[3] More recently, chemists have recognized that carbonyl groups can
form attractive interactions with one another. These thoughts had
their origin in the seminal analyses of small-molecule crystal structures
by Bürgi and Dunitz (Figure ).[4−6] Later, Allen identified intimate contacts between
carbonyl groups explicitly.[7] Only recently,
however, were perturbations of a carbonyl–carbonyl interaction
shown to affect the stability of a protein.[8] Now substantial effort is being devoted to understanding the nature
of these interactions and revealing their contributions to diverse
chemical and biological phenomena.[9,10]
Figure 1
Recent photographs
of Bürgi and Dunitz, whose work in the
1970s laid the foundation for current investigations of n→π* interactions.
Recent photographs
of Bürgi and Dunitz, whose work in the
1970s laid the foundation for current investigations of n→π* interactions.
Nature of Carbonyl Interactions
The charge
distribution of the carbonyl group creates the potential
for a variety of attractive interactions to exist between two carbonyl
groups.[11] In particular, one can envisage
(1) a Coulombic interaction between point charges on the carbon of
one moiety and the oxygen of another, (2) a dipolar interaction between
the permanent electric dipoles of the two groups, or (3) a donor–acceptor
interaction in which electron density from electron-rich orbitals
of one carbonyl group is donated into electron-deficient orbitals
of another. Purely Coulombic and dipolar contributions to carbonyl
interactions are likely to be well-approximated by the molecular force
fields used for interrogating biological phenomena. In contrast, contributions
from electron donation are not represented even in sophisticated force
fields, which could lead to divergence from the behavior of real biomolecules.The interactions of carbonyl groups have been explored using a
proline model system (Figure ). Not only does the pyrrolidine ring preorganize the i – 1 and i carbonyl groups for
interaction, but these molecules also provide a convenient readout
for perturbations. Specifically, peptide bonds to proline populate
both the cis and trans conformations, but because an attractive carbonyl
interaction exists only in the trans conformation, changes in the
conformer populations can report on changes in the strength of a carbonyl
interaction. For example, perturbations that enhance the strength
of carbonyl interactions cause an increase in the population of the
trans conformer.
Figure 2
Characterization of carbonyl interactions by torsion balance
analysis.
Experimental Ktrans/cis values of proline
derivatives were measured by NMR spectroscopy in D2O at
25 °C. Adapted from ref (13). Copyright 2013 American Chemical Society.
Characterization of carbonyl interactions by torsion balance
analysis.
Experimental Ktrans/cis values of proline
derivatives were measured by NMR spectroscopy in D2O at
25 °C. Adapted from ref (13). Copyright 2013 American Chemical Society.To probe the nature of carbonyl interactions, thioamides
were incorporated
strategically into either ester 1 or amide 3.[12,13] Thioamides bear less partial negative charge
on sulfur than do oxoamides on oxygen, so replacing the N-acetylproline peptide bond with a thioamide should attenuate the
Coulombic interactions.[14] Nevertheless,
an increase in the population of the trans conformation was observed
upon thioamide substitution of the i – 1 carbonyl
group, indicating a stronger carbonyl interaction in 2 relative to 1 and in 4 relative to 3 (Figure ). Carbonyl interactions are therefore not well described by simple
electrostatics. Amides have higher dipole moments than do esters.[14] Accordingly, if a dipolar interaction were dominant,
amide 3 would have a stronger carbonyl interaction than
ester 1. However, the opposite was observed experimentally.
Moreover, the thioamide has a still larger dipole moment than do oxoamides,[14] and yet 5 exhibited a weaker carbonyl
interaction than did 3. These data demonstrate that carbonyl
interactions cannot be described as being purely dipolar.The
data on compounds 1–6 are
consistent with electron donation. For example, 3 shows
a weaker carbonyl interaction than 1 because amides are
less electrophilic than esters. Moreover, the divergent consequences
of thioamide substitutions (compare 4 and 5 relative to 3) demonstrate that each carbonyl group
has a unique role in these interactions, namely, one as a donor and
the other as an acceptor. These interactions are reminiscent of the
approach of a nucleophile to a carbonyl group during an acyl transfer
reaction.[4−6] The angle of this approach, which maximizes overlap
of the lone pair of the nucleophile with the π* orbital of the
acceptor carbonyl group (Figure ), is known as the Bürgi–Dunitz trajectory.[4−6]
Figure 3
Overlap
of the n and π* orbitals (blue)
of N-acetylproline dimethyl amide. The overlap integral
is 0.1212. Adapted from ref (13). Copyright 2013 American Chemical Society.
Overlap
of the n and π* orbitals (blue)
of N-acetylproline dimethyl amide. The overlap integral
is 0.1212. Adapted from ref (13). Copyright 2013 American Chemical Society.Several additional lines of evidence support the
quantum-mechanical
nature of carbonyl interactions. In analyses of both small-molecule[15] and protein[16] crystal
structures, the angle between the two carbonyl dipoles of interacting
pairs varies but the approach of the nucleophile occurs strictly along
the Bürgi–Dunitz trajectory, highlighting the importance
of the orbitals of these groups in dictating the interaction. Moreover,
the n→π* interaction involves population
of the π* antibonding orbital and should thereby weaken the
carbonyl π bond, which has multiple consequences. First, a red
shift in the acceptor carbonyl stretching frequency, corresponding
to a weaker carbonyl bond, has been noted repeatedly.[8,17−20] Second, weakening of the π bond should reduce the planarity
of the carbonyl group, thus engendering pyramidalization (Figure ) that can be observed
in high-resolution crystal structures. Indeed, such signatures of
the n→π* interaction have been reported
in a wide variety of systems,[12,13,15,21−25] including polymers[26] and
proteins,[27] giving strong credence to the
notion of carbonyl interactions as fundamentally electronic in nature.
This notion has also been supported by changes in electronic spectra
observed upon modulation of the n→π*
interaction with thioamides.[28]
Figure 4
Geometric parameters
characterizing an n→π*
interaction in a peptide or protein and the ensuing pyramidalization.
The O···C distance is d ≤ 3.22
Å, which is the sum of the van der Waals radii of oxygen and
carbon; the O···C=O angle is θ = 109°
± 10°, which is near the Bürgi–Dunitz trajectory.
The distance Δ and angle Θ report on the extent of pyramidalization.[12]
Geometric parameters
characterizing an n→π*
interaction in a peptide or protein and the ensuing pyramidalization.
The O···C distance is d ≤ 3.22
Å, which is the sum of the van der Waals radii of oxygen and
carbon; the O···C=O angle is θ = 109°
± 10°, which is near the Bürgi–Dunitz trajectory.
The distance Δ and angle Θ report on the extent of pyramidalization.[12]The n→π* interaction relies
on orbital
mixing. Accordingly, two fundamental quantities govern the energy
of an n→π* interaction: (1) the degree
of orbital overlap and (2) the energy difference between the donor
and acceptor orbitals. Greater orbital overlap, generally corresponding
to shorter donor–acceptor distances, increases the strength
of an n→π* interaction.[12] However, as the donor–acceptor distance decreases,
the filled donor orbital will also experience Pauli repulsion from
filled orbitals of the acceptor. Hence, the acceptor group must be
highly polarized, which allows preferential interaction of the electron-pair
donor with the unfilled antibonding orbital of the acceptor over the
filled bonding orbital (Figure ). Indeed, carbonyl groups are effective n→π* acceptors, but isosteric alkenes and fluoroalkenes
lack sufficient orbital polarization and thus do not engage in substantial n→π* interactions.[29−33] As to the effect of the donor–acceptor energy
gap, it is known from second-order perturbation theory that the energy
released upon the mixing of a filled orbital with an empty one is
inversely proportional to the energy gap between the donor and acceptor
orbitals. Pairs of thioamides (e.g., in 6) form especially
strong n→π* interactions because the
donor lone-pair orbital is higher in energy than that of the corresponding
amide, whereas the acceptor antibonding orbital is lower in energy,
thereby creating a smaller energy gap.[13] Alkenes, on the other hand, have π* orbitals of particularly
high energy, again making them poor n→π*
acceptors.[33]
Figure 5
Overlap of the n donor orbital with (A) the π*
orbital and (B) the π orbital of N-acetylproline
methyl ester. Adapted from ref (33). Copyright 2010 American Chemical Society.
Overlap of the n donor orbital with (A) the π*
orbital and (B) the π orbital of N-acetylproline
methyl ester. Adapted from ref (33). Copyright 2010 American Chemical Society.n→π* interactions
have a quantum-mechanical
nature. Accordingly, computational methods, especially natural bond
orbital (NBO) analysis,[34] have provided
much insight. NBO protocols partition electron density from diffuse
molecular orbitals into localized Lewis-type orbitals, from which
the energy of mixing can be computed. Extensive calculations have
revealed the energies of the n→π* interaction
for a wide variety of amide–amide geometries[16] and have placed a lower bound on the energy of a typical n→π* interaction between amides at approximately
0.27 kcal/mol,[13] though numerous examples
of stronger interactions exist (vide infra). These calculations and
others[35,36] highlight the modest energy of common n→π* interactions. Nevertheless, because of
the ubiquity of carbonyl groups, these interactions can make substantial
contributions in toto.
Contributions to Protein
Structure
Carbonyl groups constitute half of the non-hydrogen
atoms in the
backbone of peptides and proteins. This prevalence suggests that the
impact of the n→π* interaction could
be substantial. Early on, molecular modeling suggested that many conformations
of the peptide backbone allow for the close contact of adjacent carbonyl
groups,[37] which was confirmed subsequently
through detailed computational and bioinformatics analysis.[9] In a later study,[16] the energy of the n→π* interaction
was calculated for the entire conformational space of the peptide
backbone, which showed clear areas of Ramachandran space with significant
(>0.5 kcal/mol) n→π* interactions.
In
addition, a survey of high-resolution crystal structures from the
Protein Data Bank found that a large fraction (∼34%) of residues
were oriented properly for an n→π* interaction (O···C′ distance within the sum of the van
der Waals radii and O···C′=O angle approximately along the Bürgi–Dunitz trajectory).
Importantly, those residues found to be oriented for an n→π* interaction had backbone dihedral angles consistent
with those predicted computationally to engage in n→π* interactions of significant energy.[16] A later analysis of protein crystal structures determined
at subangstrom resolution found that carbonyl groups that accept n→π* interactions exhibit greater pyramidalization
than do other carbonyl groups, demonstrating that these interactions
have measurable consequences for protein structure.[27] Even considering only the approximate lower bound to the
energy of a typical n→π* interaction
(0.27 kcal/mol),[13] the fact that a third
of residues in folded proteins engage in an n→π*
interaction means that their contributions could be nearly 10 kcal/mol
for a 100-residue protein. To put this contribution in context, the
conformational stability of typical globular proteins is ΔG ≈ 5–15 kcal/mol relative to their unfolded
states.[38]
Collagen
The pyrrolidine ring of
proline preorganizes adjacent carbonyl groups in a peptide for the
formation of n→π* interactions. Accordingly,
these interactions play a large role in the conformation of proline-rich
peptides and proteins. Indeed, the discovery that n→π* interactions contribute to protein structure was
made with collagen-mimetic peptides.[8] Collagen
is the predominant protein in animals and consists of three intertwined
polyproline II-type (PPII) helices. Its unique structure is enabled
by a distinctive amino acid sequence: an Xaa-Yaa-Gly repeat, in which
Xaa is often (2S)-proline (Pro) and Yaa is often
(2S,4R)-4-hydroxyproline (Hyp).[39] The presence of a hydroxy group at a 4R-configured stereogenic center of the Yaa residue is important
for the thermostability of collagen because that electron-withdrawing
substituent elicits a gauche effect that enforces the exo pucker of
the pyrrolidine ring (Figure A).[8,40,41] An electron-withdrawing substituent at a 4S-configured
center of the Yaa residue, which enforces the endo pucker, decreases
the thermostability. The different pyrrolidine conformations modulate
the attraction between adjacent carbonyl groups and thus the trans/cis
ratio of prolyl peptide bonds (Figure B). Specifically, prolines with 4R-configured electron-withdrawing substituents, either hydroxy or
fluoro, have a higher preference for the trans conformation of the
prolyl peptide bond, indicating a stronger carbonyl interaction. Conversely,
proline residues with 4S-configured electron-withdrawing
substituents have a weaker preference. Analogous results were obtained
for prolines with 4-azido substituents.[17] Crystallographic and computational analyses,[12] microwave spectroscopy,[42] and
experiments with methanoprolines[43,44] have since
established that in the absence of complicating hydrogen bonds, the
endo pucker of the pyrrolidine ring generally increases the distance
between the donor oxygen and the acceptor carbon, leading to weaker n→π* interactions.
These results explain the destabilization of collagen-mimetic peptides
by proline residues with electron-withdrawing substituents at 4S-configured centers in the Yaa position. In a collagen
triple helix, all of the peptide bonds are in the trans conformation.
Hence, the strength of the n→π* interaction
that enforces the trans conformation correlates with the thermostability
of collagen. This insight has been applied in the design of collagens
with a wide variety of physical and chemical properties,[45−49] especially by Wennemers and co-workers.[50−53]
Figure 6
(A) Cγ-exo and Cγ-endo puckers
of the pyrrolidine ring. (B) Experimental Ktrans/cis values of proline derivatives measured with NMR spectroscopy in
D2O at 25 °C.[8]
(A) Cγ-exo and Cγ-endo puckers
of the pyrrolidine ring. (B) Experimental Ktrans/cis values of proline derivatives measured with NMR spectroscopy in
D2O at 25 °C.[8]
Polyproline II-Type Helices
The importance
of the n→π* interaction for controlling
the conformational preferences of peptide bonds to proline is magnified
in polyproline peptides, which, because of the lack of backbone hydrogen
bonding, rely heavily on the n→π* interaction.
Polyproline adopts two distinct helical conformations: the polyproline
I-type (PPI) helix with exclusively cis peptide bonds and the aforementioned
PPII helix with exclusively trans peptide bonds. Polymers of proline
with 4R-configured hydroxy, fluoro, or azido substituents
have a stronger preference for the PPII conformation than do polymers
of unsubstituted proline.[54,55] Conversely, polymers
of the diastereomeric (i.e., 4S-configured) prolines
show a weaker preference for the PPII conformation than does polyproline.
Moreover, the presence of a strong n→π*
interaction, enforced by 4R-configured electron-withdrawing
substituents, also increases the barrier to interconversion of PPI
and PPII helices.[56] The n→π* interaction has been implicated further in the PPII
structures of other sequences,[57−59] demonstrating its ability to
control peptide conformation. Recently, Wennemers and co-workers determined
the first high-resolution crystal structure of an oligoproline.[60] In the crystalline state, the oligoproline adopted
a PPII helix with unequivocal hallmarks of n→π*
interactions: short donor–acceptor distances and significant
pyramidalization of carbonyl groups. Moreover, the lack of water in
the crystal precludes the importance of hydration for stabilizing
a PPII helix, underscoring the stability conferred by n→π* interactions. Notably, the PPII conformation has
been observed in the unfolded states of some peptides and proteins.[61] Thus, the n→π*
interaction, which operates between adjacent residues (i → i + 1), could direct the peptide chain
toward folding into the PPII conformation prior to the formation of
a native hydrogen-bonding pattern that enlists residues more distant
in sequence.
α-Helices
Most of the residues
engaged in n→π* interactions in proteins
are located within α-helices. Initial analysis of protein crystal
structures demonstrated that over 70% of residues in α-helices
are aligned to participate in an n→π*
interaction.[16] Strong evidence for the
influence of n→π* interactions has come
from analyses of high-resolution crystal structures, which showed
pyramidalization of residues in α/β peptides that adopt
helical conformations similar to that of an α-helix.[23] Although α-amino acids can position adjacent
amide carbonyl groups within close proximity, β-amino acids
(which contain an extra methylene group in their backbone) cannot.
In these α/β helices, only the carbonyl groups from α-amino
acids exhibited pyramidalization toward their putative n→π* donors. These results provide compelling evidence
that n→π* interactions are not only
extant in α-helices but also alter their structure. Likewise, n→π* interactions appear to contribute to the
stability of 310 helices,[16] as
α-aminoisobutyric acid residues, which strongly enforce the
310 conformation, also induce strong n→π* interactions.[62]n→π* interactions enable carbonyl groups
in the backbone of helices to utilize both of their
lone pairs simultaneously.[63] This concurrence
creates interplay between these two interactions. The geometry of
a hydrogen bond to an n→π* donor affects
the ensuing n→π* interaction by controlling
the demixing of the carbonyl lone pairs. When hydrogen-bond donors
approach along the axis of the carbonyl bond, they encourage demixing
of the carbonyl lone pairs into s- and p-type orbitals (Figure A,B).[64] The s-like orbital
engages in an i → i + 4 hydrogen
bond (Figure C), while
the p-like orbital engages in an n→π* interaction (Figure D). When hydrogen-bond donors approach at ∼120°
with respect to the carbonyl-bond axis, they encourage mixing of the s- and p-like orbitals, which then adopt
the “rabbit ears” geometry that is all too familiar[65] to chemists.
Figure 7
(A) s-Type lone pair
of a carbonyl oxygen. (B) p-Type lone pair of a carbonyl
oxygen. (C) Hydrogen bond
to the s-type carbonyl lone pair in an α-helix.
(D) n→π* interaction with the p-type carbonyl lone pair in an α-helix. Panels A–D
were adapted with permission from ref (16). Copyright 2010 Nature Publishing Group. (E)
Structure of an α-helix showing i → i + 4 hydrogen bonds[1] and i → i + 1 n→π*
interactions, which have d ≈ 3.0 Å and
θ ≈ 103°.[16]
(A) s-Type lone pair
of a carbonyl oxygen. (B) p-Type lone pair of a carbonyl
oxygen. (C) Hydrogen bond
to the s-type carbonyl lone pair in an α-helix.
(D) n→π* interaction with the p-type carbonyl lone pair in an α-helix. Panels A–D
were adapted with permission from ref (16). Copyright 2010 Nature Publishing Group. (E)
Structure of an α-helix showing i → i + 4 hydrogen bonds[1] and i → i + 1 n→π*
interactions, which have d ≈ 3.0 Å and
θ ≈ 103°.[16]In an α-helix, a hydrogen bond and an n→π*
interaction avail electron density from a single oxygen atom. Consistent
with expectations, an n→π* interaction
has been shown to antagonize hydrogen bonding in a peptidic system.[66] Thus, the canonical hydrogen bonds in an α-helix[1] are undermined by concurrent n→π* interactions. Moreover, like the hydrogen bonds,[67] the n→π* interactions
in an α-helix could be cooperative.[37] An n→π* interaction increases both
the length of the C=O bond in the carbonyl acceptor and the
charge on its oxygen,[40] effects that make
this C=O bond a better donor for another n→π* interaction.
Side
Chains
Like those in the backbone,
carbonyl groups in amino acid side chains are capable of forming n→π* interactions. For example, aspartate residues
interact with their own backbone carbonyl groups.[68] These interactions were first identified by their relatively
short oxygen–oxygen distances, which seemed counterintuitive.
Upon examining the geometries of these interactions, a side-chain
oxygen was often found to approach the backbone carbonyl group along
the Bürgi–Dunitz trajectory. Moreover, the arrangement
of the carbonyl dipoles in these cases is likely to be destabilizing,
suggesting that the n→π* interaction
stabilizes what would otherwise be an unfavorable self-contact.Similar contacts are made between the carbonyl groups of asparagine
residues.[64] Moreover, hydrogen bonds to
a carbonyl oxygen affect its ability to serve as an n→π* donor. Self-contacting n→π*
interactions are much more common when a side-chain carbonyl group
accepts a hydrogen bond along the carbonyl-bond axis. When the hydrogen-bond
donor approaches at an angle of ∼120° with respect to
the carbonyl-bond axis, the prevalence and calculated energy of self-contacting n→π* interactions diminish, consistent with
experimental evidence that these two interactions compete with one
another.[66] These data provide independent
support for the demixing of the carbonyl lone pairs into s- and p-type orbitals (Figure A,B) upon hydrogen-bond formation along the
carbonyl-bond axis.
Contributions to Other Polymers
Poly(lactic acid)
Although their
roles in protein structure have received the most attention, n→π* interactions contribute to the conformations
of other polymers as well. Consider poly(lactic acid) (PLA), a biodegradable
polyester (Figure A).[26] Fiber diffraction has shown that
the backbone dihedral angles in PLA resemble those of the PPII helix
of peptides, which takes advantage of numerous n→π*
interactions (vide supra). Computation placed the average energy of
an n→π* interaction in PLA at 0.44 kcal/mol,
and analysis of small-molecule crystal structures demonstrated characteristic
pyramidalization of the acceptor carbonyl group that results from
accepting an n→π* interaction. Like
polyproline, PLA has no potential for hydrogen bonding, so the observation
of n→π* interactions in this polymer
demonstrates not only that the n→π*
interaction can operate independently of hydrogen bonding but also
that it is sufficient to dictate molecular conformation even in the
absence of a preorganizing ring, implicating n→π*
interactions further in organizing early protein-folding intermediates.
Figure 8
n→π* interactions in polymers: (A)
poly(lactic acid);[26] (B) a peptoid.[70]
n→π* interactions in polymers: (A)
poly(lactic acid);[26] (B) a peptoid.[70]
Peptoids
n→π*
interactions are especially important for controlling the conformation
of polymers of N-substituted glycine residues, which
are also known as “peptoids” (Figure B).[69] Analogously
to polyproline and PLA, a peptoid lacks hydrogen bonds within its
backbone, so its conformation must be controlled by other forces such
as the n→π* interaction. In addition
to forming typical n→π* interactions,
peptoids can also form n→π* interactions
with aryl rings on side chains. Whereas backbone amide–amide n→π* interactions favor the trans conformation
of the tertiary amide, the amide–aryl n→π*
interaction between the backbone and side chain favors the cis conformation.[70,71] These two interactions can therefore be exploited to tune peptoid
structure. For example, decoration of side-chain phenyl rings with
electron-withdrawing fluoro or nitro groups increases the electrophilicity
of the aromatic π* orbitals and thereby encourages the cis conformation
of the tertiary amide; adding electron-donating hydroxy groups reverses
this preference.[72] Similarly, Huisgen azide–alkyne
1,3-dipolar cycloaddition can be used to construct an electron-deficient
triazolium ring, which is a potent n→π*
acceptor that enforces a cis conformation upon the tertiary amide
of a peptoid.[73] This tack is complementary
to tuning the ability of a carbonyl group to be an n→π* donor, which can be either enhanced with a thioamide
(vide supra)[12−14,28,74−76] or selenoamide[77] or attenuated
with electron-withdrawing groups.[74,78]
Contributions to Small Molecules
Amino
Acids
In principle, many compounds
(Figure ) are likely
to engage in n→π* interactions, especially
since the lone-pair donor need not be a carbonyl group. For example, n→π* interactions between the hydroxy group
of (2S,4S)-4-hydroxyproline and
its carbonyl group in the gas phase have been observed using microwave
spectroscopy.[42] A comparable interaction
has been observed in crystal structures of a wide variety of substituted
proline residues, including an especially strong one in an S-oxide for N-acetyl-4-thiaproline methyl
ester (7).[22,79] Indeed, many functional
groups can interact with carbonyl groups in an n→π*
manner, including halide ions,[15] thiols/thioethers/disulfides,[80] and a variety of nitrogen heterocycles.[81,82] Similar types of electronic interactions involving carbonyl groups
have been studied computationally and include complexes of SO2 with carbon dioxide or formaldehyde.[83]
Figure 9
A
selection of molecules that manifest n→π*
interactions.
A
selection of molecules that manifest n→π*
interactions.n→π*
interactions have also been
detected in amino acids other than proline. In particular, gas-phase
microwave spectroscopy identified conformations consistent with the
presence of an n→π* interaction between
the amino nitrogen and carboxylic acid of β-alanine (8).[500] The population of this conformer
was similar to those for hydrogen-bonded conformations, suggesting
that the energy of the n→π* interaction
might be similar to that of a typical hydrogen bond. Similar results
were obtained in an analysis of γ-aminobutyric acid (GABA, 9), an important neurotransmitter.[84] Here the n→π* interaction was observed
in the conformation predicted as the global minimum.
Drugs
Other medicinal implications
of n→π* interactions have become apparent.
For example, an n→π* interaction in
aspirin (10) was revealed through both crystallographic[24] and spectroscopic[85] investigations. Donation of electron density from the anionic carboxylate
into the ester carbonyl group is predicted to shield and disperse
negative charge on the molecule and thereby to improve its entry into
cells. An n→π* interaction was also
observed in N-acyl homoserine lactones (AHLs, 11), which mediate quorum sensing in Gram-negative bacteria.[25,86] Crystallography and computation established the presence of an n→π* interaction in the free AHL, whereas examination
of protein crystal structures with bound AHLs demonstrated that AHL
receptors break that n→π* interaction
upon binding. Attenuation of the n→π*
interaction, which can be accomplished by appending electron-withdrawing
substituents to the acyl group,[74,78] could preorganize the
ligand for receptor binding and thereby increase potency.
Synthetic Intermediates
n→π*
interactions also affect carbonyl reactivity. In
an imidazolidine-based model system (12), a carbonyl
group was able to be an acceptor of one or two n→π*
interactions with identical donor carbonyl groups.[87] As only one n→π* interaction
was observed, n→π* donation apparently
reduces the electrophilicity of the acceptor carbonyl group. The consequences
for the reactivity of carbonyl groups that accept n→π* interactions are profound. For example, Houk and
co-workers have identified carbonyl interactions as determinants of
stereoselectivity, such as in the dihydroxylation of cis-bicyclo[3.0.0]octenes[88] and the kinetic resolution of azlactones by
benzotetramisole catalysis.[89] An analogous
stereoelectronic interaction was also proposed to explain the observed
preference of an anhydroarabinonucleoside (13) for phosphorylation
at the 3′ oxygen rather than the less sterically encumbered
5′ oxygen.[21] This reaction is a
key step in a proposed prebiotic route toward nucleotide synthesis,
and regioselectivity in phosphorylation is essential for generating
cyclic phosphates of cytidine for polymerization. A similar interaction
was observed in cycloadditions of 3-hydroxyflavones.[90] The n→π* interaction induced
by proline was also shown to be the cause of the sluggish native chemical
ligation with proline thioesters.[91] The
pyrrolidine ring preorganizes the prolyl peptide bond to form an n→π* interaction with the thioester, decreasing
its electrophilicity.[87]
Fluorophores
Finally, n→π*
interactions can modulate fluorescence. In natural
fluorescent proteins, such as green fluorescent protein (GFP), an n→π* interaction forms between a backbone oxygen
and the imidazolidine chromophore (14).[92] The presence of this n→π*
interaction is consistent with the red shift in the vibrational frequency
of the imidazolidine carbonyl group in the protein-bound chromophore
relative to small-molecule mimics in solution. Moreover, analyses
of protein crystal structures with premature chromophores suggest
that this n→π* interaction preorganizes
the chromophore for cyclization and precludes bond rotations that
would lower the quantum yield. In small-molecule fluorogenic probes,
such as 2′,7′-dichlorofluorescein diisobutyrate (15), the reactivity can be tuned by n→π*
interactions.[93] The two Cl···C=O n→π* interactions deter solvent water from
gaining access to the π* orbitals of the proximal carbonyl groups
(cf. 12), though an esterase can still do so.
Conclusions and Outlook
The n→π*
interaction is an emergent
interaction that contributes to biomolecular structure and function.
Its discovery has not only refined our understanding but also inspired
new thoughts. For example, characterization of the n→π* interaction expedited
the identification of an unappreciated type of hydrogen bond within
the backbones of peptides and proteins.[94] These so-called “C5” hydrogen bonds confer stability
to β-strands just as n→π* interactions
stabilize α-helices—by allowing the protein backbone
to exploit both carbonyl lone pairs. Like n→π*
interactions, C5 hydrogen bonds are distributed broadly and contribute
to protein structure.We anticipate that n→π*
interactions
will be found in an ever-expanding array of molecules, particularly
those with a high density of carbonyl groups, such as proteins. Revealing
their impact will provoke clever experiments, and interpretations
will be guided by computational methodology. In view of the current
limitations in protein structure prediction and design, a thorough
understanding of these ubiquitous interactions is likely to enhance
countless efforts with peptides and proteins.
Authors: Gail J Bartlett; Robert W Newberry; Brett VanVeller; Ronald T Raines; Derek N Woolfson Journal: J Am Chem Soc Date: 2013-12-03 Impact factor: 15.419
Authors: Amit Choudhary; Kimberli J Kamer; Matthew W Powner; John D Sutherland; Ronald T Raines Journal: ACS Chem Biol Date: 2010-07-16 Impact factor: 5.100
Authors: Michele L DeRider; Steven J Wilkens; Michael J Waddell; Lynn E Bretscher; Frank Weinhold; Ronald T Raines; John L Markley Journal: J Am Chem Soc Date: 2002-03-20 Impact factor: 15.419
Authors: Diego B Diaz; Solomon D Appavoo; Anastasia F Bogdanchikova; Yury Lebedev; Timothy J McTiernan; Gabriel Dos Passos Gomes; Andrei K Yudin Journal: Nat Chem Date: 2021-02-15 Impact factor: 24.427
Authors: Henry R Kilgore; Chase R Olsson; Kyan A D'Angelo; Mohammad Movassaghi; Ronald T Raines Journal: J Am Chem Soc Date: 2020-08-21 Impact factor: 15.419
Authors: Nicole A Wenzell; Himal K Ganguly; Anil K Pandey; Megh R Bhatt; Glenn P A Yap; Neal J Zondlo Journal: Chembiochem Date: 2019-03-07 Impact factor: 3.164
Authors: Sandipan Roy Chowdhury; Steven Kennedy; Kai Zhu; Rama Mishra; Patrick Chuong; Alyssa-Uyen Nguyen; Stefan G Kathman; Alexander V Statsyuk Journal: Bioorg Med Chem Lett Date: 2018-11-09 Impact factor: 2.823
Authors: Oliver R Maguire; Bethany Taylor; Eleanor M Higgins; Matthew Rees; Steven L Cobb; Nigel S Simpkins; Christopher J Hayes; AnnMarie C O'Donoghue Journal: Chem Sci Date: 2020-07-02 Impact factor: 9.825