The modulation of protein-protein interactions (PPIs) by means of creating or stabilizing secondary structure conformations is a rapidly growing area of research. Recent success in the inhibition of difficult PPIs by secondary structure mimetics also points to potential limitations, because often, specific cases require tertiary structure mimetics. To streamline protein structure-based inhibitor design, we have previously described the examination of protein complexes in the Protein Data Bank where α-helices or β-strands form critical contacts. Here, we examined coiled coils and helix bundles that mediate complex formation to create a platform for the discovery of potential tertiary structure mimetics. Though there has been extensive analysis of coiled coil motifs, the interactions between pre-formed coiled coils and globular proteins have not been systematically analyzed. This article identifies critical features of these helical interfaces with respect to coiled coil and other helical PPIs. We expect the analysis to prove useful for the rational design of modulators of this fundamental class of protein assemblies.
The modulation of protein-protein interactions (PPIs) by means of creating or stabilizing secondary structure conformations is a rapidly growing area of research. Recent success in the inhibition of difficult PPIs by secondary structure mimetics also points to potential limitations, because often, specific cases require tertiary structure mimetics. To streamline protein structure-based inhibitor design, we have previously described the examination of protein complexes in the Protein Data Bank where α-helices or β-strands form critical contacts. Here, we examined coiled coils and helix bundles that mediate complex formation to create a platform for the discovery of potential tertiary structure mimetics. Though there has been extensive analysis of coiled coil motifs, the interactions between pre-formed coiled coils and globular proteins have not been systematically analyzed. This article identifies critical features of these helical interfaces with respect to coiled coil and other helical PPIs. We expect the analysis to prove useful for the rational design of modulators of this fundamental class of protein assemblies.
Mimicry of interfacial
protein segments has led to new classes
of rationally designed inhibitors of protein–protein interactions
(PPIs).[1−8] The identification and analysis of protein complexes mediated by
protein secondary structures provide a platform for these explorations.[3,4,9] We have recently examined the
full set of protein complexes in the Protein Data Bank mediated by
α-helices[10−13] and β-strands.[14] Our work, along
with efforts by Kritzer et al.[15] to define
loop motifs at protein interfaces, aims both to describe the interactions present in the Protein Data Bank and to prescribe effective starting points for the design of PPI
inhibitors.[4,9]Individual secondary structures are
critical elements of protein
interfaces; however, many PPIs feature more complex modes of binding,
suggesting a potential role for synthetic tertiary structure mimetics[16,17] or miniproteins[18,19] as attractive candidates for
the design of new classes of PPI inhibitors. Miniproteins consisting
of helical bundles, β-sheet barrels, and loops, along with synthetic
antibodies,[20−22] are now routinely used to enrich ligands for protein
targets, especially for extracellular receptors. In an effort to expand
our atomic analysis of protein structural data beyond interactions
that can be mediated by a single secondary structure element alone,
we have developed new methodology to create a database of helical dimers at protein–protein
interfaces (DippDB).We chose to begin our survey of protein
tertiary interactions by
focusing on helix dimers because the dimer is the simplest all-helical
tertiary structure stoichiometry. Coiled coils and helical bundles
are well understood and have been extensively studied in diverse biochemical
and biophysical contexts.[23−27] Dimeric coiled coils or similarly structured motifs such as bundles
play essential roles in mediating biological processes, iconically
driving the multimerization and stabilization of proteins involved
in transcription factor complexes and vesicular trafficking, among
other critical functions.[25,28] Several computational
approaches have been implemented to predict coiled coil-mediated interactions
by their pairwise and multimeric residue correlations.[29−32] Seminal studies have produced a comprehensive dataset of the coiled
coil interactome.[33−35] However, computational and experimental methods for
the analysis of coiled coils described thus far are largely devoted
to characterization of forces that lead to coiled
coil formation. To complement these studies, we sought to analyze
interactions of helical dimers with globular proteins as a step toward
the rational design of coiled coil mimetics as PPI inhibitors.[16,17] Though canonical coiled coils possess supercoiling and particular
packing properties, we did not impose these requirements, stipulating
only that the helices be proximal and well-oriented. Since our motivation
for developing this dataset is to identify interactions that may not
be inhibited by secondary structure mimics, we also required that
critical binding residues be located on both helices.
These criteria retain structures of high structural similarity to
a coiled coil but eliminate canonical all-alpha tertiary structure
motifs like the helix-loop-helix and helix-turn-helix DNA binding
domains, whose interhelical angles are far from parallel or antiparallel.Examination of the helix dimer dataset suggests that coiled coil
interfaces can be divided into three broad categories (Figure ) according to their interaction
stoichiometry. Case 1 features a helical dimer from one protein interacting
with a single partner protein. In Case 2, a helical dimer from one
protein interacts with two different protein partners. In Case 3,
a single helical dimer motif forms at the interface
between partner proteins. We anticipate that helical dimers in Case
3 would favor different interacting residues from examples in Cases
1 and 2, because in Case 3 high-affinity interactions must form between
two individual helices rather than a helix dimer and a globular protein.
This taxonomy reflects the different properties demanded of potential
designed inhibitors: Case 1 features interactions on predominantly
one dimer face; Case 2 generally interacts with two faces; Case 3
dimer interfaces may be interrupted by a single helix.
Figure 1
Models depicting three
families of coiled coil-like structures
at protein–protein interfaces. (a) In the first family (Case
1), a coiled coil entirely from chain A forms an interaction with
protein B. (b) In the second family (Case 2), a coiled coil, which
may come from one or two proteins, interacts with two different proteins
partners. (c) In the third family (Case 3), a coiled coil forms across
a protein interface.
Models depicting three
families of coiled coil-like structures
at protein–protein interfaces. (a) In the first family (Case
1), a coiled coil entirely from chain A forms an interaction with
protein B. (b) In the second family (Case 2), a coiled coil, which
may come from one or two proteins, interacts with two different proteins
partners. (c) In the third family (Case 3), a coiled coil forms across
a protein interface.Schematic for identification of protein interfaces in the Protein
Data Bank (PDB) where a helix dimer contributes significantly to complex
formation. Interfacial helices from the previously described HippDB
dataset were culled to produce a set of structures for detailed analysis
via stringent distance, orientation, and energetic criteria. On the
basis of our evaluations, we classified the interactions among three
classes (Cases 1–3) of helical tertiary structure-mediated
PPIs. The energetic contribution of each interface helix dimer and
individual residues was approximated using Rosetta. The complete dataset
is hosted at www.nyu.edu/projects/arora/dippdb/cc.php.We examined the biophysical properties
of each class in relation
to each other, to typical interface helices in general, and to canonical coiled coils. Coiled coils are defined as two
or more α-helices that wind around each other to form supercoils.[23] Classical coiled coils are characterized by
a heptad repeat, (abcdefg), where buried a and d positions
form the interface between partner helices. We sought to identify
all helix dimers that are in contact with a globular protein irrespective
of whether such helices would meet the strict definition. This study
has revealed the existence of a set of biologically relevant complexes
as potential targets for inhibitor design. We also analyzed the biophysical
properties of dimer interfaces, such as the composition of hot spot
residues and the degree to which helical dimers differ from coiled
coils and protein helices in general. This analysis shows that hot
spot residues are concentrated over compact areas, potentially allowing
the mimicry of these dimers by small or medium-sized molecules.
Results
and Discussion
We started from the previously developed HippDB
dataset.[10−12] HippDB was developed to comprise all interface helices
of four or
more residues that contain two or more hot spot residues, where hot
spot residues are defined via computational alanine scanning, performed
with Rosetta, as those that result in a loss of binding energy (ΔΔG) of at least 1.0 Rosetta energy unit (REU, which scales
approximately as 1 kcal/mol).[36,37]In this work,
we expanded the HippDB dataset to include over 37 000
high-affinity interfaces and imposed stringent geometric and energetic
criteria to obtain a set of over 1000 high-affinity helical dimers.
We made modifications to our prior methodology to tailor it to the
coiled coil motif (Figure ). In HippDB, we were interested in identifying minimal inhibitory
motifs to aid the design of synthetic inhibitors. This goal requires
identification of helical segments of a protein present at an interface
without the sequences not in contact. In the context of helical dimers,
the challenge is to capture the defining geometric parameters as accurately
as possible so any energetically irrelevant residues far from the
interface can be discarded separately. We altered our analysis such
that any continuous stretch of helical residues counts as a helical
element as long as part of the helix is present at the interface.
This modified method identifies as a single element of interest helix
dimers in which one helix makes contact with a partner protein at
multiple separate points along its length. Our earlier work would
have split this helix into multiple helical elements.
Figure 2
Schematic for identification of protein interfaces in the Protein
Data Bank (PDB) where a helix dimer contributes significantly to complex
formation. Interfacial helices from the previously described HippDB
dataset were culled to produce a set of structures for detailed analysis
via stringent distance, orientation, and energetic criteria. On the
basis of our evaluations, we classified the interactions among three
classes (Cases 1–3) of helical tertiary structure-mediated
PPIs. The energetic contribution of each interface helix dimer and
individual residues was approximated using Rosetta. The complete dataset
is hosted at www.nyu.edu/projects/arora/dippdb/cc.php.
We imposed
both geometric and energetic criteria to obtain a dataset
of interfaces where coiled coil-like structures play an important
role. We stipulated that each helix must contribute at least 6.0 Rosetta
energy units (REU) of ΔΔG in its interaction
with its partner, that the angle between their helical axes must be
within 30° of parallel or antiparallel, and that the two helices
are within 17 Å of each other. These conditions ensured coiled
coil-like geometry[24] and a substantial
energetic contribution, equivalent to at least three strong hot spot
residues, from each helix. The 6 REU threshold ensures that the selected
coiled coil interfaces contain a sufficient number of hot spot residues
to merit their mimicry by a potential synthetic inhibitor. Lower energetic
thresholds yield additional compelling complexes, but far too many
to investigate individually. We also imposed conditions on the percentage
of the interaction’s overall ΔΔG contributed by each helix. When both helices come from the same
protein chain, we required each helix to contribute at least 20% of
the chain’s ΔΔG. This requirement
excluded any helical dimers that did not make a substantial total
contribution to the binding interface, and ensured that the complexes
in the database will truly require a dimer and are not amenable to
disruption by mimic of a single helix.Finally, we aimed to
ensure that the dataset was not dominated
by pairs of chains from multimeric helix bundles, as our interest
lies in complexes where at least one partner is a globular protein.
We observed that in structures of conventional helix bundles, almost
every residue of each chain is present at the protein–protein
interface. To distinguish dimer–protein interfaces from helix
bundles, we required that at least one partner in every complex must
have at minimum 30% of its residues distant from the interface.The complete dataset is hosted at www.nyu.edu/projects/arora/ppidb/dippdb/cc.php. Dimers may be queried using PDB identification, total ΔΔG and ΔSASA, interhelical distances, and angles (Table ).
Table 1
Selected Fields Recorded in DippDBa
field name
description
Title
title of the original
PDB entry
Keywords
keywords
included in the original PDB structure file, which
often reflect function, localization, or other biology of interest
%ΔΔGb
percent of the total interaction ΔΔG contributed by one helix, as estimated by Rosetta
ΔSASAb
change in
solvent-accessible surface area identified by NACCESS
on one helix
Hot spot residuesb
list of the hot spot residues (single letter
code, residue
number, and ΔΔG) on the helix
Inter-helical angle
angle between the helix
axes, approximated by the N- to C-terminal
CA–CA vectors
%Residues at interfaceb
percentage of all the residues on
the chain of one dimer helix
that are found at the protein–protein interface
Additional fields are available.
Each field may be searched and sorted, and data may be downloaded
in CSV, PDF, or XLS format. Complexes of weaker affinity (ΔΔG cutoff of 4 REU) are included for comparison in the website
dataset.
Data provided for
both participating
helices.
Additional fields are available.
Each field may be searched and sorted, and data may be downloaded
in CSV, PDF, or XLS format. Complexes of weaker affinity (ΔΔG cutoff of 4 REU) are included for comparison in the website
dataset.Data provided for
both participating
helices.
Hot Spot Residues across
Helix Dimer Interfaces Are Concentrated
over Relatively Compact Areas
Archetypical coiled coil motifs
consist of multiple heptad repeats. A cursory analysis would
suggest that the hot spot residues may be distributed evenly over
many heptads, requiring the design of large molecules or biologics
for inhibition, as has been true for the design of coiled coil assembly
inhibitors.[38] We were surprised to find
that a plurality of complexes in DippDB possessed hot spot residues
over a relatively small region of the interface (Figure ). The average length of Case
1, 2, and 3 helical dimers is 19, 27, and 22 residues, respectively,
while their average hot spot spans are 13, 17, and 16.5 residues.
The critical residues are limited to a single heptad in 15% of Case
1 dimers and to two heptads in two-thirds of examples. The trend of
compact hot segments[3] in helical dimers
is observed in each Case with two-thirds of critical contact residues
averaging fewer than three heptads. Dimers that span three heptads
or less (roughly 30% of the dataset) average a hot spot per 4.7 residues.
This signature is strongly suggestive of an interface amenable to
inhibition by designed peptidomimetics. However, as peptidic coiled
coils of these length scales are not generally stable,[39] we have undertaken an experimental effort to
develop cross-linked helix dimers (CHDs) as minimal inhibitors of
coiled coils and other helical PPIs.[17]
Figure 3
Histogram
of the distance between the first and final hot spot
residue in Case 1 (black), Case 2 (light gray), and Case 3 (dark gray)
helical dimers.
Histogram
of the distance between the first and final hot spot
residue in Case 1 (black), Case 2 (light gray), and Case 3 (dark gray)
helical dimers.
Amino Acid Composition
of Helix Dimers Differs from Coiled Coils
Further examination
of DippDB interfaces suggests that helical
dimers at protein–protein interfaces have comparable amino
acid composition to other high-affinity interface helices[10,13] but significantly differ from canonical coiled coils. In general,
classical coiled coil motifs possess a distribution of amino acids
similar to the α-helix but with considerable additional bias
toward aliphatic residues, owing to their obligate interior packing
interactions.[23,26,30] We wished to understand whether helical dimers more closely reflected
the amino acid distribution on high-affinity helices or classical
coiled coils.In examining the distribution of amino acids in
these structures, we found it largely consistent with the distribution
on helices in general (Supporting Information, Figure S1).[10,13] Though there is considerable
selective pressure for coiled coil motifs to possess high proportions
of aliphatic amino acids, especially leucine and isoleucine, for optimal
knobs-into-holes packing arrangements, these residue types are not
overwhelmingly enriched in the α-helices that are part of DippDB.We also examined the distribution of hot spot residues on each
family of helix dimers. A summary of these data is shown in Figure , and details for
all three cases by residue types are included in the Supporting Information, Figure S2. Aliphatic and charged residues
are moderately enriched as hot spot residues, though generally consistent
with the helical baseline. In contrast, polar residues were greatly
depleted, and more so than the general helical case; conversely, aromatic
residues were more enriched in helical dimers.
Figure 4
Frequency of hot spot
residues, normalized to natural abundance,
in helix dimers and interfacial single helices, respectively. The
plot shows distribution of aliphatic residues (Leu, Ile, Val), polar
residues (Gln, Asn, Ser), aromatic residues (Phe, Trp, Tyr), and charged
residues (Arg, Asp, Glu, Lys) in the three contexts.
Frequency of hot spot
residues, normalized to natural abundance,
in helix dimers and interfacial single helices, respectively. The
plot shows distribution of aliphatic residues (Leu, Ile, Val), polar
residues (Gln, Asn, Ser), aromatic residues (Phe, Trp, Tyr), and charged
residues (Arg, Asp, Glu, Lys) in the three contexts.Separate from our analysis of amino acid composition
in general,
we examined the inter-helix contacts made by Case 1/2 and Case 3 helical
dimers and conventional coiled coils (Figure a). Helix dimers feature a larger proportion
of interstrand contacts between polar residues than typical coiled
coils. In the classical coiled coil motif, core polar mutations may
be tolerated via changes in stoichiometry, local distortions in geometry,
or an increased inner void volume.[30] The
presence of polar contacts at the interior of helical dimers emphasizes
the importance of their specific interactions. The leucine zipper
coiled coil (Figure b) contains four paired aliphatic knob-into-hole packing interactions,
which dominate the interaction. In the particular Case 1 dimer example
depicted in Figure c, aliphatic packing interactions are limited and energetically insignificant;
one helix only possesses one “aliphatic” residue (a
threonine) facing its partner over five entire turns; in contrast,
the nonpolar residues from the other helix pack into non-canonical
holes. The Case 3 dimer example (Figure d) possesses energetically important nonpolar
residues, but they are not organized into the classic interior groove
of a coiled coil, and the phenylalanines are more than twice as energetically
important as the leucines by ΔΔG. Visual
inspection of these examples and others inspired us to quantify the
degree to which the helical dimer forms non-canonical packing interactions.
Figure 5
(a) Amino
acid composition of inter-helical contact residues in
helix dimers as opposed to true coiled coils. (b) The packing of a
classic leucine zipper, GCN4, features an aliphatic a/d groove with each residue packing into a complementary
hole. (c) This Case 1 helical dimer from 1,2-hydroquinol dehydrogenase
homodimer illustrates highly non-canonical packing interactions. (d)
The orphan nuclear receptor Nur77 contains an aliphatic core but lacks
any heptad repeat structure and knob/hole packing orientation. PDB
codes: 1LLM, 1TMX, and 3V3E.
(a) Amino
acid composition of inter-helical contact residues in
helix dimers as opposed to true coiled coils. (b) The packing of a
classic leucine zipper, GCN4, features an aliphatic a/d groove with each residue packing into a complementary
hole. (c) This Case 1 helical dimer from 1,2-hydroquinol dehydrogenase
homodimer illustrates highly non-canonical packing interactions. (d)
The orphan nuclear receptor Nur77 contains an aliphatic core but lacks
any heptad repeat structure and knob/hole packing orientation. PDB
codes: 1LLM, 1TMX, and 3V3E.
Quantifying the Non-canonicity of the Helical
Dimer Motif
We found the examples of non-canonical inner
grooves compelling
and performed a more comprehensive analysis to determine how non-canonical
these motifs are as compared to the classical coiled coil. We employed
Woolfson’s SOCKET analysis to explore the dataset (Supporting Information, Figure S4).[30] The SOCKET algorithm identifies knobs-into-holes
packing of coiled coils to distinguish them from helix dimers. Of
the 262 Case 1 dimers, only 24 dimers were identified as being coiled
coils by SOCKET (9.2%), of which 22 were antiparallel and 2 were parallel.
An additional 17 dimers contained only one complementary
knob-into-hole packing interaction. In contrast, canonical coiled
coils might have four such interactions per heptad (two per partner).
Moreover, more than half of those 24 coiled coils identified were
fewer than three heptads in length. This analysis implies that while
helical dimers may occasionally exhibit packing characteristic of
coiled coil motifs, they are too short to be stable on their own.[39] We also analyzed Case 2 and Case 3 dimers with
SOCKET. 27/261 Case 2 dimers contained significant coiled coil structure
(10.3%) and 21 contained one complementary interaction. Of Case 3
dimers, 133/919 were identified as coiled coils by SOCKET (14.4%)
and an additional 50 contained a single knob-in-hole interaction.To follow up on the disparities in inner groove composition demonstrated
in Figure , we specifically
studied the frequency of hydrophilic inner grooves. We found hydrophilic
inner groove residues quite common: each case averaged at least three,
and 40% of complexes overall had at least four such residues. We profile
two complexes whose interfacial ΔΔG results
almost exclusively (>90%) from hydrophilic contacts in the Supporting Information, Figure S5. Such features
are uncommon in canonical coiled coils. Polar residues may be found
at the interior of “inside-out” coiled coil motifs found
in membrane proteins,[40,41] though there is debate regarding
the extent of polarity on the inside.[42−44] Because the vast majority
of the proteins in DippDB are cytosolic, including these unusual examples,
we hypothesized that a similar environment may make this inside-out
geometry possible: the helical dimer is surrounded by hydrophobic
residues presented by the protein in which it is found and by its
binding partner, serving an analogous role to membrane lipids. The
networks of buried hydrogen bonds that may form at the interior of
a coiled coil are well studied, although the partially polar interiors
observed here present an extreme case.[45]We obtained the set of hydrophilic contact residues in Case
1 helical
dimers. A total of 79.4% of hydrophilic contact residues were flanked
by at least one nonpolar residue. Buried polar residues averaged under
30% relative SASA (Supporting Information, Figure S6). Typical SASA burial for these residue types is markedly
lower. This degree of burial is highly destabilizing if hydrogen bonds
are left buried but unsatisfied as a result, but it concomitantly
increases the value of satisfied hydrogen bonds, contributing to the
strength of these interfaces.[46,47] Overall, though it
is possible to find recognizably canonical packing at the interior
of helical dimers, the majority do not succumb to the same generalizations
as the coiled coil motif.
Helical Dimer Affinity Depends on Complementary
Packing Interactions
As discussed above, the knobs-into-holes
packing of coiled coils
distinguishes them from helical dimers.[30,48] Because the
two helices of a Case 3 helical dimers come from different chains,
we were able to compare knob–hole packing interactions to the
ΔΔG of the knob residue by identifying
the nearest three-residue hole on the partner helix (i, i+1, i+4 or i, i+3, i+4) to each knob. We restricted
this analysis to the inner groove of each helix and furthermore recomputed
the knob ΔΔG values on complexes only
including the dimer helices so as to omit any interactions with other
components of the protein. Though the packing is not conventional
enough to identify the dimers as coiled coils by SOCKET, we anticipated
that we would still be able to identify some trends. In contrast to
the classical knobs-into-holes aliphatic model, residues of all types
could form inner-groove contacts of considerable ΔΔG. Instead, we found the key feature, tightly correlated
to ΔΔG, was that knob residues made contact
with chemically complementary holes. Although aliphatic contact residues
of low to moderate ΔΔG do frequently
pack into polar holes and vice versa, every aliphatic and aromatic
residue type had higher average ΔΔG when
packing in a mostly aliphatic or aromatic hole than in a mostly polar
one (Supporting Information, Figure S7).
Helix-Turn-Helix Motifs Commonly Secure the Orientation of Antiparallel
Helical Dimers
In the creation of this dataset, we made no
stipulation about the relative position of these helix bundles in
the protein(s) in question. We conjectured that helix-turn-helix motifs,
or helical hairpins, which are common sources of ideal antiparallel
coiled coils,[49,50] might be particularly prominent
interface elements. Of the 262 Case 1 helical dimers, 81 are separated
by two to eight nonhelical residues. This substantial proportion of
helix-turn-helix motifs is encouraging, as it suggests that the tertiary
structure present at the interface is largely governed by local forces
that may be mimicked by a designed inhibitor. Overall, there are 133
parallel and 129 antiparallel dimers; thus, development of scaffolds
appropriate for both motifs is critical. In contrast,
of the 115 Case 2 helical dimers in which both dimer helices come
from the same chain, only 25 exhibit helix-turn-helix motifs. The
possible interplay between interface tertiary structure geometry and
the stoichiometry of formed complexes merits further study, as it
suggests a difference in folding cooperativity.[51]
Parallel Helical Dimers Are Connected by
Diverse Motifs
Even though the N and C termini of parallel
dimers on the same chain
are distant, the majority of parallel dimers are connected by a small
number of motifs. Of the 133 parallel Case 1 dimers, 55 are connected
by a two to four residue loop, a strand whose length varies with that
of the dimer, a single residue turn, a short 310 or α-helix,
followed by another short loop (Figure ). The second most prevalent motif includes 16 examples
of a strand bracketed by two 4–5-residue loops. The two helices
are linked by a loop-helix-loop in only four instances. The 55 parallel
Case 2 dimers are more heterogeneous. Five possess the loop-strand-turn-helix-loop
linker, five are connected by a loop-helix-loop, eight have a loop-strand-loop,
but 11 contain a loop-helix-loop-helix-loop-helix-loop. (Specific
PDBs for each motif are listed in the Supporting Information.) To our knowledge, this is the first effort to
establish common protein folds connecting helices of defined orientation
outside of canonical coiled coils.[52] Redesigned
and optimized derivatives of these motifs—particularly the
loop-strand-turn-helix-loop motif, as it is by far the most common
and the most structurally interesting—may serve as miniprotein
scaffolds.
Figure 7
Common antiparallel (a) and parallel (b) helical dimer motifs feature
linkers of a simple turn, a loop-strand-helix-loop, a loop-strand-loop,
and a loop-helix-loop. PDB codes: 1I4Y, 2UYG, 2CBY, and 1OAH.
Common antiparallel (a) and parallel (b) helical dimer motifs feature
linkers of a simple turn, a loop-strand-helix-loop, a loop-strand-loop,
and a loop-helix-loop. PDB codes: 1I4Y, 2UYG, 2CBY, and 1OAH.
Helical Dimers Typically Have Positive Net Charge
Given
the importance of salt bridge interactions holding coiled coils together,
we investigated the distribution of charge across helix monomers in
each Case. In Case 1, the average helix dimer has helices with net
charges of 1.19 and −1.05, and 61% are net neutral or positive
in total. In contrast, though Case 2 features dimers with net charges
per helix of 0.89 and −0.33, and thus a higher average charge,
66% are net neutral or positive. Case 3 dimers are 70% net neutral
or positive and have net charges for each helix of 0.88 and −0.08.
These percentages suggest that the surface bound by such helical dimers
may also be frequently negative or neutral. We calculated the surface
charge on proteins bound by Case 1 dimers to be 70% net neutral or
negative, with an average net charge of −0.75. Case 2 dimers
bind surfaces that are 80% net neutral or negative with an average
net charge of −0.68. Case 3 dimers bind surfaces that are 68%
neutral or negative with an average charge of −0.61. The net
positive charge in dimers is consistent with the higher number of
positively charged protein helices in general.[55](a) Interfaces mediated by Case 1 and Case 2 helical dimers possess
diverse functions and are dominated by enzyme complexes. (b) T. brucei’s UDP-galactose 4′-epimerase features
a mutation in the active site relative to the human enzyme, potentially
permitting specific targeting. Galactose metabolism is essential to
the parasite’s ability to cause African sleeping sickness,
a neglected tropical disease.[53] The enzyme
is active in dimeric and tetrameric forms; the target monomer is shown
as gray surface. (c) PCSK9propeptide (surface) binds with high affinity
to PCSK9 (cartoon) and inhibits its proteolytic activity. This enzyme
is linked to atherosclerosis and cardiovascular disease.[54] In both figures, the nearby active site from
the dimer chain is highlighted in yellow sticks. PDB codes: 1GY8 and 2QTW.
Helix Dimer Interfaces Mediate Fundamental
Biological Processes
The interfaces collated in DippDB have
the traits of prime targets
for drug design. We categorized the functions of PPIs as defined in
the PDB (Figure a)
and observed that they are implicated in biological processes from
enzymatic function to transcription to the immune response. We identified
a total of 523 interactions (Cases 1 and 2) that would require a helix
dimer mimetic or miniprotein for inhibition. Case 3 dimers, in contrast,
feature a pair of single helices interacting with each other across
an interface; thus, mimics of a single helix can disrupt these interactions.
Intracellular PPIs dominate the dataset; thus, inhibition of these
complexes will require development of synthetic analogues that can
permeate the cell membrane. Our study reveals new classes of previously
unidentified targets for helix dimer mimetics. Some of these newly
identified targets will potentially aid efforts in drug discovery.
In particular, it is interesting to note that the largest category,
various enzymes, accounts for 63% of DIPP interactions. This category
contains many hydrolases, oxidoreductases, and transferases, among
other enzymes. Although enzyme function has typically been controlled
using substrate or transition state analogues, helix dimers offer
a potentially attractive alternative scaffold. Figure b,c highlights two examples of interactions
involving UDP-galactose 4′-epimerase[53] and PCSK9[54] where helical dimers at dimeric
interfaces in enzyme biological assemblies of considerable medical
relevance bind near enzyme active sites.
Figure 8
(a) Interfaces mediated by Case 1 and Case 2 helical dimers possess
diverse functions and are dominated by enzyme complexes. (b) T. brucei’s UDP-galactose 4′-epimerase features
a mutation in the active site relative to the human enzyme, potentially
permitting specific targeting. Galactose metabolism is essential to
the parasite’s ability to cause African sleeping sickness,
a neglected tropical disease.[53] The enzyme
is active in dimeric and tetrameric forms; the target monomer is shown
as gray surface. (c) PCSK9 propeptide (surface) binds with high affinity
to PCSK9 (cartoon) and inhibits its proteolytic activity. This enzyme
is linked to atherosclerosis and cardiovascular disease.[54] In both figures, the nearby active site from
the dimer chain is highlighted in yellow sticks. PDB codes: 1GY8 and 2QTW.
Several targets in
DippDB are critically relevant to cancer phenotypes.[56−58] In particular, we explored classic cases leading to “Hallmarks
of Cancer” and discovered a set of possible targets mediated
by helical dimers.[59,60] For example, three complexes—MHF
histone tetramer/FANCM helicase (PDB code 4E45),[61] Mre11
nuclease/Rad50 ABC ATPase (PDB code 3QF7),[62] and the
N- and C-terminal domains of the Mms21 subunit of Smc5 (PDB code 3HTK)[63]—have a role in DNA repair. The interaction between
EPO and its receptor (PDB code 1EER) is implicated in the hypoxic response;[64] the complex between murineIfnar1 and interferon-beta
(PDB code 3WCY) is implicated in the regulation of apoptosis,[65,66] and the inhibition of the catalytic subunit p110β by the SH2
domain of the regulatory subunit p85β of phosphoinositide 3-kinase
(PDB code 2Y3A) is implicated in angiogenesis[67] and
invasive cell growth.[68] In each case, mimicry
of both helices is predicted to be essential to optimize
binding affinity. Beyond these targets, helical dimer interfaces include
bacterial transcription and metabolism, quorum sensing, cell signaling,
and more. A list of targets implicated in transcription is included
in the Supporting Information, Table S1. We examined the 354 PDB structures that contain at least one Case
1 or Case 2 helical dimer and explored the Gene Ontology (GO) terms
annotating each complex.[69] For example, two structures (PDB codes 2P5T and 1GVN) were toxins annotated with “cell
killing”, four possess “nucleic acid binding transcription
factor activity”, and seven are implicated in organismal development;
two are involved in immune responses.We anticipate that these
and more targets will become tractable
for modulation by designed tertiary structure mimetics; in Figure , we depict several
interactions of particular pharmacological interest whose tertiary
structure binding sites have not been drugged. These complexes, or
the pathways they modulate, are known to be of therapeutic interest.[70−72] We illustrate them here to highlight the importance of helix dimer
domains in coaxing the formation of these complexes and the potential
of dimer mimics as inhibitors.
Figure 9
Examples of protein–protein interactions
mediated by helix
dimers. (a) LSD1-mediated nucleosome demethylation requires complexation
with CoREST; existing inhibitors of LSD1 bind its substrate pocket,
and predicted binding sites have thus far omitted the helical dimer
interface.[73,74] (b) Sam68 RNA-binding protein
is implicated in pro-oncogenic activity and modulates alternative
splicing of CD44 and Bcl-xL. (c) The Kaposi’s sarcoma protein
vFLIP forms an A2B2 heterotetramer with NF-kappa-B
essential modulator (NEMO) coiled coil; thus far, inhibition of this
complex’s function has only been achieved through geldanamycin
inhibition of Hsp90.[75] PDB codes: 2IW5, 2XA6, and 3CL3.
Examples of protein–protein interactions
mediated by helix
dimers. (a) LSD1-mediated nucleosome demethylation requires complexation
with CoREST; existing inhibitors of LSD1 bind its substrate pocket,
and predicted binding sites have thus far omitted the helical dimer
interface.[73,74] (b) Sam68RNA-binding protein
is implicated in pro-oncogenic activity and modulates alternative
splicing of CD44 and Bcl-xL. (c) The Kaposi’s sarcoma protein
vFLIP forms an A2B2 heterotetramer with NF-kappa-B
essential modulator (NEMO) coiled coil; thus far, inhibition of this
complex’s function has only been achieved through geldanamycin
inhibition of Hsp90.[75] PDB codes: 2IW5, 2XA6, and 3CL3.
Conclusion
The aim of any systematic
study of protein structures is both to
uncover general principles governing protein geometry and to develop
new insight into how to practically modulate protein function. Topologically
defined segments often mediate protein–protein interactions,[11−15] and mimicry of these regions has emerged as a successful strategy
for inhibitor design. Several examples of PPI inhibitors derived from
mimicry of interfacial α-helical and β-strand domains
have been described.[7,76−90] Emerging examples of tertiary mimetics as PPI inhibitors illustrate
the broad potential of moving beyond secondary structure mimicry.[16,91] We were motivated to analyze protein complexes featuring helical
dimers to create a list of potential targets where mimics of a single
helix may not be sufficient.Modification of the protocol used
to develop a database of interface
helices (HippDB)[10] provided a set of protein–protein
interactions where the critical binding residues reside on two helices
oriented in parallel or antiparallel configurations. The dataset includes
some helical dimers that would be classified as true coiled coils
because of their heptad repeats and supercoiling as well as helical
dimers that find other means of making contacts with each other. We
find that helical dimer complexes exhibit uncommon structural features.
Some helical dimers violate the typical expectations for coiled coil
interiors and are held together largely by salt bridges and hydrogen
bonds, while others violate expectations for typical protein interfaces,
which contain mostly large, aliphatic hot spot residues.The
length of dimers in contact with the protein partners spans
one to three heptads, suggesting that medium-sized molecules or miniproteins
will be able to disrupt these complexes. The online database provides
a list of all entries in the dataset along with their PDB identifiers
and the energetic contributions of the hot spot residues. We anticipate
that this analysis will enable discovery of new classes of protein–protein
interaction inhibitors as potential therapeutics.
Authors: Bobo Dang; Haifan Wu; Vikram Khipple Mulligan; Marco Mravic; Yibing Wu; Thomas Lemmin; Alexander Ford; Daniel-Adriano Silva; David Baker; William F DeGrado Journal: Proc Natl Acad Sci U S A Date: 2017-09-25 Impact factor: 11.205