Giuseppe Sicoli1, Thomas Kress2, Hervé Vezin1, Karin Ledolter3, Dennis Kurzbach4. 1. Laboratoire Avancé de Spectroscopie pour les Interactions, la Réactivité et l'Environnement (LASIRE), UMR CNRS 8516, Université de Lille, Avenue Paul Langevin - C4, F-59655 Villeneuve d'Ascq, France. 2. Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K. 3. Department for Structural and Computational Biology, Max F. Perutz Laboratories, University Vienna, Campus Vienna BioCenter 5, 1030 Vienna, Austria. 4. Faculty of Chemistry, Institute of Biological Chemistry, University of Vienna, Währinger Str. 38, 1090 Vienna, Austria.
Abstract
We report a conformational switch between two distinct intrinsically disordered subensembles within the active site of a transcription factor. This switch highlights an evolutionary benefit conferred by the high plasticity of intrinsically disordered domains, namely, their potential to dynamically sample a heterogeneous conformational space housing multiple states with tailored properties. We focus on proto-oncogenic basic-helix-loop-helix (bHLH)-type transcription factors, as these play key roles in cell regulation and function. Despite intense research efforts, the understanding of structure-function relations of these transcription factors remains incomplete as they feature intrinsically disordered DNA-interaction domains that are difficult to characterize, theoretically as well as experimentally. Here we characterize the structural dynamics of the intrinsically disordered region DNA-binding site of the vital MYC-associated transcription factor X (MAX). Integrating nuclear magnetic resonance (NMR) measurements, molecular dynamics (MD) simulations, and electron paramagnetic resonance (EPR) measurements, we show that, in the absence of DNA, the binding site of the free MAX2 homodimer samples two intrinsically disordered conformational subensembles. These feature distinct structural properties: one subensemble consists of a set of highly flexible and spatially extended conformers, while the second features a set of "hinged" conformations. In this latter ensemble, the disordered N-terminal tails of MAX2 fold back along the dimer, forming transient long-range contacts with the HLH-region and thereby exposing the DNA binding site to the solvent. The features of these divergent substates suggest two mechanisms by which protein conformational dynamics in MAX2 might modulate DNA-complex formation: by enhanced initial recruitment of free DNA ligands, as a result of the wider conformational space sampled by the extended ensemble, and by direct exposure of the binding site and the corresponding strong electrostatic attractions presented while in the hinged conformations.
We report a conformational switch between two distinct intrinsically disordered subensembles within the active site of a transcription factor. This switch highlights an evolutionary benefit conferred by the high plasticity of intrinsically disordered domains, namely, their potential to dynamically sample a heterogeneous conformational space housing multiple states with tailored properties. We focus on proto-oncogenic basic-helix-loop-helix (bHLH)-type transcription factors, as these play key roles in cell regulation and function. Despite intense research efforts, the understanding of structure-function relations of these transcription factors remains incomplete as they feature intrinsically disordered DNA-interaction domains that are difficult to characterize, theoretically as well as experimentally. Here we characterize the structural dynamics of the intrinsically disordered region DNA-binding site of the vital MYC-associated transcription factor X (MAX). Integrating nuclear magnetic resonance (NMR) measurements, molecular dynamics (MD) simulations, and electron paramagnetic resonance (EPR) measurements, we show that, in the absence of DNA, the binding site of the free MAX2 homodimer samples two intrinsically disordered conformational subensembles. These feature distinct structural properties: one subensemble consists of a set of highly flexible and spatially extended conformers, while the second features a set of "hinged" conformations. In this latter ensemble, the disordered N-terminal tails of MAX2 fold back along the dimer, forming transient long-range contacts with the HLH-region and thereby exposing the DNA binding site to the solvent. The features of these divergent substates suggest two mechanisms by which protein conformational dynamics in MAX2 might modulate DNA-complex formation: by enhanced initial recruitment of free DNA ligands, as a result of the wider conformational space sampled by the extended ensemble, and by direct exposure of the binding site and the corresponding strong electrostatic attractions presented while in the hinged conformations.
Cellular regulation relies upon
a large battery of transcription factors (TFs)[1] and their DNA interactions. Despite widespread medicinal, pharmacological,
and biological interest, many essential features of DNA-TF recognition
remain poorly understood notwithstanding long-standing research efforts.
In this regard, the particular importance of intrinsic disorder, i.e.,
the occurrence of domains void of any stable secondary or tertiary
structure, has become increasingly evident.[2−5] Uversky, Dunker, and co-workers
found that intrinsically disordered regions (IDRs) promote TF efficiency
through enhanced conformational plasticity, endowing these domains
with the potential to sample a multitude of conformations with distinct
properties and thus also with multifunctionality.[6] Similarly, Dyson and Wright have shown that IDR-related
binding events are often facilitated by transient complexes that dynamically
evolve to form high-affinity complexes.[7]However, an atomic level description of intrinsic disorder
in the
active sites of TFs is still lacking, which obscures important insights.
A deeper understanding of the role of intrinsic disorder in transcriptional
regulation is highly desirable not only for the elucidation of fundamental
structure–activity relationships but also for an understanding
of the involvement of proto-oncogenic TFs in tumorigenesis[8]—two points that warrant deeper investigations.Hence, we here aim at shedding light on intrinsically disordered
active sites of TFs by providing an atomistic description of the conformational
dynamics of the DNA-binding IDR of the MYC Associated factor X (MAX), an
essential agent associated with a plethora of vital
processes, including cell proliferation and apoptosis.[9,10]In the cell nucleus MAX occurs as a coiled-coil homodimer,
here
denoted as MAX2, that consists of three domains (Figure a): (i) the basic intrinsically disordered N-terminal domain (NTD)[11−13] that houses the DNA binding site, which constitutes the focus of
this study, and (ii) a helix–loop–helix
(HLH) segment that connects the NTD to (iii) a leucine
zipper (LZ) that serves as an anchor between the two subunits that
form the homodimer.
Figure 1
(a) Sketch of MAX2’s functional elements.
The
DNA-binding NTD spans residues 1–16, followed by a helix–loop–helix
segment and a leucine zipper. (b) Evolution of distances r(5–5) between the Cα atoms of the two N-terminal
R5 residues of each MAX subunit during an MD trajectory. (c) Structural
representation of hinged (left) and extended (right) subensembles
found during the MD trajectory of panel b. Examples for the R5–R5
distances are indicated. The two different subensembles account for
the large distance fluctuations in panel b. Note that the hinged subensemble
appears more defined than the extended one. The different MAX domains
are indicated on the right. (d) RMSD of the NTD only relative to the
starting structure. On the left, the narrow dispersion of the hinged
ensemble is indicated in green (ΔRMSD < 0.3 nm). On the right
the larger RMSD fluctuations of the extended ensemble are highlighted
in orange (ΔRMSD > 1.3 nm). The RMSD analysis confirms that
the extended subensemble is less constrained than the hinged one.
(a) Sketch of MAX2’s functional elements.
The
DNA-binding NTD spans residues 1–16, followed by a helix–loop–helix
segment and a leucine zipper. (b) Evolution of distances r(5–5) between the Cα atoms of the two N-terminal
R5 residues of each MAX subunit during an MD trajectory. (c) Structural
representation of hinged (left) and extended (right) subensembles
found during the MD trajectory of panel b. Examples for the R5–R5
distances are indicated. The two different subensembles account for
the large distance fluctuations in panel b. Note that the hinged subensemble
appears more defined than the extended one. The different MAX domains
are indicated on the right. (d) RMSD of the NTD only relative to the
starting structure. On the left, the narrow dispersion of the hinged
ensemble is indicated in green (ΔRMSD < 0.3 nm). On the right
the larger RMSD fluctuations of the extended ensemble are highlighted
in orange (ΔRMSD > 1.3 nm). The RMSD analysis confirms that
the extended subensemble is less constrained than the hinged one.Despite the importance of the disordered NTD for
the DNA recognition
process, and in the overall cellular interaction network, a description
of its structural dynamics has not been available. To date, only two
structural studies, by Sauvé et al.[14] and Sammak et al.,[15] have examined its
conformation, finding weak propensities for helical elements. To help
provide the desired deeper understanding of disordered TFs and MAX2’s IDR in particular, we here provide detailed structural
dynamics at residue resolution. To this end, we integrate data from
nuclear magnetic resonance (NMR) measurements, molecular dynamics
(MD) simulations, and electron paramagnetic resonance (EPR) spectroscopy.
Similar approaches have already been successful in earlier work describing
large-scale structural tuning of folded DNA-TF complexes[16,17] and are here extended to the description of IDRs.Investigations
of HLH-type transcription factors by NMR have recently
witnessed a renaissance as new methodological developments enable
the needed high-resolution depictions of these challenging substrates.
Brutscher and co-workers showed how a fragment-based analysis by NMR
can reveal the conformational dynamics of HLH domains.[18] Blackledge and co-workers have combined titration
of denaturation agents with NMR to reveal residual structures in intrinsic
disorder,[19] while Somlyay et al. have shown
how to determine interactions of HLH domains using 19F
NMR.[20] In addition, Macek et al. have used
NMR to show that the MAX–Myc interaction can be interrupted
by phosphorylation of Myc.[21]Combinations
of paramagnetic relaxation enhancement (PRE) NMR and
MD have been successfully applied by Dobson et al. to describe structural
ensembles of intrinsically disordered proteins (IDPs).[22] Similarly, Salvi et al.,[23] Shaw et al.[24] and Ha-Duong et
al.[25] used MD and NMR to determine relaxation
parameters and ensemble properties of IDPs. Combinations of EPR and
MD, have likewise gained attention as ever longer MD trajectories
have become available. Oganesyan et al.[26] thus studied myoglobin conformational dynamics, Saxena et al. applied
this combination to Cu(II)-binding proteins,[27] and Trommer and co-workers found formerly unknown degrees of conformational
plasticity of the maltose binding protein.[28]Capitalizing on these developments, we here reveal a conformational
switch between two distinct, intrinsically disordered subensembles[29] which are sampled by the NTD of free MAX2.This switch highlights the evolutionary-selected benefit
of having
intrinsically disordered domains, with high structural plasticity,
which allow for dynamic sampling of conformational spaces with multiple
distinct states, each endowed with tailored properties.In particular,
we describe a transition between (i) a subensemble
that comprises conformationally extended states of
the N-terminal tails of the MAX2 subunits which explore
a wide range of sample space and (ii) a second subensemble
of hinged conformations, which expose the DNA-binding site to the
solvent when the disordered N-terminal tails (either one or both simultaneously)
laterally fold back toward the HLH-domain.To deepen our understanding
of intrinsic disorder in MAX2 activity, we first combined
all-atoms MD simulations with contact
maps from paramagnetic NMR and nanoscale distance measurements from
EPR to characterize the structural dynamics of DNA-free homodimers.
Second, we compared our findings with similar data on MAX2–DNA complexes.MD Simulations Reveal Bimodal
Sampling of Disordered NTD
States. Starting from the NMR-derived solution structure[12] of MAX2, we began our investigations
with all-atom MD simulations using explicit solvent (25 mM NaCl solution
at pH 7.4 and a temperature of 310 K). These simulations provided
the initial evidence for the bimodal conformational space of the intrinsically
disordered NTD.In addition to extended conformations, which
are expected for an
IDR, we also found conformations in which the N-terminal tails of
either one—or often even both—subunits fold back toward
the HLH domain. These hinged conformations sample a well-defined conformational
space, yet do not adopt classical secondary structure elements.The bimodal sampling of extended versus hinged conformations is
readily seen, for example, by tracking the observed end-to-end distances
between the two N-termini during the MD runs. Figure b shows the distance between the Cα atoms of residues R5 of the two MAX2 subunits (denoted r(5–5) in Figure b) for a 200 ns MD trajectory. Residue R5 was chosen
for representation here to match experimental data presented in the
following sections.In the simulations, the NTD switches between
two ensembles of conformations,
one that features shorter distances r(5–5)
< 3 nm and one with longer ones r(5–5)
> 4 nm.An analysis of the simulated protein conformations
underlying the
two identified distance populations reveals that the ensemble of short
distances is exclusively composed of doubly hinged conformations (as
those shown, for example, in Figure c, left) in which both subunits fold sideways back
toward the HLH domains. A cluster analysis of the sampled distances
corroborates this observation (see Figure S1.) The fluctuations of conformational RMSDs from the starting structure
of this subensemble were found to be quite small (ΔRMSD <
0.3 nm; see Figure d) indicating that these conformations represent a defined feature
of MAX2’s conformational space, clearly distinguishable
from random-coil type structures.In contrast, the ensemble
leading to longer r(5–5)
distances was traced back to extended conformations (Figures c (right) and S1) where the NTD stretches out into the solvent
sampling a wide range of heterogeneous states. The more dynamic nature
of the extended subensemble is reflected in larger fluctuations of
the RMSDs from the starting structure (ΔRMSD > 1.3 nm; see Figure d, right).Importantly, intermediate r(5–5) distances
between 3 and 4 nm were only rarely observed, which indicates that
the two ensembles are indeed distinct.This conformational switch
was seen in three independent MD runs
(vide infra and Figure S2).The energetic basis of the hinged conformation can be rationalized
by examining the expected electrostatic interactions. A charge analysis
of MAX2 can be found in Figure S3: the NTD is seen to be heavily positively charged, while the LZ
zipper is strongly negatively charged. The large dipole moment[30] spanned between the two axial termini would
be expected to strongly attract the NTD toward the C-terminal part
of the dimer, resulting in the observed back folding.Paramagnetic Relaxation Enhancement NMR Corroborates a
Bimodal Conformational Space. Guided by the MD results, we
probed the conformational space of the NTD using PRE[31] NMR experiments. The experimental data are consistent with
the MD data, which confirmed the simulated structural ensembles. Additionally,
residual conformational freedom within the hinged subensemble could
be revealed.In PRE NMR, signals of amino acids that transiently
approach a
paramagnetic spin-label (SL) are broadened because of accelerated
nuclear relaxation, resulting in lowered signal amplitudes. The PRE
effect can be quantified by the signal suppression ratio V = SPRE/SREF, where SPRE is typically a 1H–15N
NMR cross peak amplitude for the ith protein residue
observed in the presence of the SL, and SREF is the corresponding amplitude in
a reference spectrum obtained with a chemically reduced diamagnetic
label. This signal suppression ratio follows a steep r–6 proportionality, where r is
the distance between the spin label and an observed amino acid, and
thus depends also on the structural dynamics of the protein. The complete
suppression of a signal (V = 0) always requires that r < 2.5 nm.[32] Proximity measures
are thus accessible between the labeling site and adjacent residues.[33]For these experiments, we used a spin-labeled
MAX mutant containing
an R5Ccysteine point mutation in the disordered NTD, which was then
paramagnetically labeled with MTSL (S-(1-oxyl-2,2,5,5-tetramethyl-2,5-dihydro-1H-pyrrol-3-yl)methylmethanesulfonothioate)
(cf. ref (16)).Figure a shows
signal suppression ratios V obtained for each amino
acid of MAX2, arising from spin-labels incorporated at
position R5C in both MAX2 subunits. (Both MAX2-subunits are identical; hence, values at each residue index report
the combined signals from two amino acids.) We identified three significant
PRE sites: (1) Site “S”, spanning residues
1–20, which shows ratios V < 0.1 because
of proximity to the SL site in the primary amino acid sequence. (2)
Site “L1”, between residues 35–45,
which shows ratios V > 0.3, and (3) site “L2”, between residues 45–65, with ratios V > 0.5.
Figure 2
(a) Residue dependence of experimental PRE signal suppression
ratios V. Three distinct PRE sites are observed,
indicated as S
(short-range), L1 (long-range 1), and L2 (long-range 2). The spin
label site R5C is indicated by the red dot. Reduced PRE values ≪1
indicate transient contacts between residue R5C and the lateral loop
of the HLH (site L1) and LZ segments (site L2). (b) Calculated PRE
values from the two different conformational ensembles (hinged and
extended) sampled in an MD trajectory. The experimentally observed
values can be reproduced within the precision of the approach that
is outlined in the main text. The extended subensemble cannot account
for long-range PREs, only for PREs at site S. The
hinged ensemble could again be divided into two substates: one where
residue R5 approaches the HLH domain and another one where it approaches
the LZ domain. The former leads to reproduction of the effect at site
L1 and the latter at site L2. The bottom panel displays a superposition
of the theoretical PRE values from the different subensembles. The
match between measured and calculated PREs shows that the MD simulation
indeed captures the conformational properties of MAX2.
(c) The simulated conformational ensembles used to calculate PREs
for sites S, L1, and L2. The green dot indicates the position of residue
R5, which was the SDSL site in the experiment.
(a) Residue dependence of experimental PRE signal suppression
ratios V. Three distinct PRE sites are observed,
indicated as S
(short-range), L1 (long-range 1), and L2 (long-range 2). The spin
label site R5C is indicated by the red dot. Reduced PRE values ≪1
indicate transient contacts between residue R5C and the lateral loop
of the HLH (site L1) and LZ segments (site L2). (b) Calculated PRE
values from the two different conformational ensembles (hinged and
extended) sampled in an MD trajectory. The experimentally observed
values can be reproduced within the precision of the approach that
is outlined in the main text. The extended subensemble cannot account
for long-range PREs, only for PREs at site S. The
hinged ensemble could again be divided into two substates: one where
residue R5 approaches the HLH domain and another one where it approaches
the LZ domain. The former leads to reproduction of the effect at site
L1 and the latter at site L2. The bottom panel displays a superposition
of the theoretical PRE values from the different subensembles. The
match between measured and calculated PREs shows that the MD simulation
indeed captures the conformational properties of MAX2.
(c) The simulated conformational ensembles used to calculate PREs
for sites S, L1, and L2. The green dot indicates the position of residue
R5, which was the SDSL site in the experiment.The PREs observed for sites L1 and L2 necessarily stem from close approaches of residue R5C to these
sites, i.e. from long-range contacts. The hinged conformational subensemble
observed in the MD runs provides an intuitive explanation for these
effects. To verify this hypothesis, we calculated theoretical PRE
ratios using SL–1HN amide proton distances
obtained from the structures sampled in the MD simulations. PRE rates
and the resulting signal suppression ratios were determined following
the approach of Wagner and co-workers (detailed explanations can be
found in refs (17 and 32)). In brief,
a PRE rate was calculated for each 1HN amide
proton in each simulated conformation. This was done by using a spherical
correlation function and a single effective correlation time τe that combines the effective electron relaxation time and
the motion of the SL-1HN connection vector.
The PRE rate Γ2,i for the 1HN atom of residue i is given bywithωL denotes the Larmor frequency
of the proton. S denotes the electron spin number,
and g is the electron’s g-factor. Importantly, r is the distance between the unpaired electron and the
amide proton of residue i as extracted from the simulated
conformations.This was followed by a calculation of the arithmetic
mean ⟨Γ2,i⟩ over the rates obtained
for the different conformations.
The resulting average was then used to calculate the ratio V for residue i using the approach detailed in refs: (17 and 32)Here, R2, denotes the intrinsic relaxation
rate (absence of spin label)
of the 1HN nucleus, and t denotes
the time that the proton magnetization is transverse during a TROSY
experiment.With this strategy, we could predict the experimentally
observed
PRE effects. Figure b displays the results of the MD-based calculations. For each site
a subensemble of simulated structures could be found that allowed
reproducing the experimental data within the precision expected for
this approach. (Spherical correlation functions oversimplify IDR dynamics
as pointed out by Ferrage and co-workers.)[34]Figure c visualizes
the ensembles of simulated conformations which were used to predict
PREs for sites S, L1, and L2, respectively. Interestingly, the hinged subensemble
contained two distinguishable sets of structures: One where the N-terminal
tail housing residue R5C approaches the HLH domain (site L1) and a second one where it approaches the LZ domain (site L2). The different structures underlying these two “subsub-ensembles”
are depicted in the center and bottom panels of Figure c, which shows how residue R5C laterally
approaches the homodimer causing the experimentally observed long-range
PREs.Note that MTSL labeling can bias the conformational ensembles
sampled
by IDRs. However, the MD simulations here were run with the nonlabeled
form of MAX2 and hence reflect the native, nonbiased ensemble.
The agreement between simulation and experiment therefore suggests
that biases due to the MTSL label are minimal.The approach
presented here thus provides a qualitative match between
simulated and experimental data; a quantification of the ensemble
populations would yet require much longer MD trajectories as well
as knowledge of the exact forms of the spectral density function underlying
the PRE effect.Nanoscale Distance Measurements by EPR. To further
characterize the conformational space of the NTD, we employed double
electron–electron resonance (DEER, also referred to as PELDOR)
measurements.[35] DEER relies on pairwise
site-directed spin labeling (SDSL) and the detection of the dipolar
spin–spin coupling between the two unpaired electrons. Data
are recorded in a flash-frozen solution, to provide a representative
snapshot of the conformational ensemble present at the glass transition
temperature of the sample. This method yields a distribution P(r) of distances r between
the two SLs. These distances correspond to the structures found in
the vitrified substrate—i.e., for the case at hand, to the
conformations of MAX2’s NTD—as each features
a particular spacing of the two labeled R5C residues.We found P(r) to span a range
of 1.5–4.5 nm (Figure a) indicating a wide array of underlying conformations. Figure b shows the underlying
DEER form factor and fit. From the MD simulations, we expect the hinged
subensemble to underlie short distances in P(r) and the extended subensemble to underlie longer distances.
Figure 3
(a) Experimentally
determined SL–SL distance distribution P(r) (black, error in red) obtained for
the MTSL tagged R5C-R5C mutant. Superimposed are calculated distance
distributions from precomputed rotamer libraries (blue for a hinged
state and in red for an extended conformation). The Cα distances are indicated. The calculated distributions fit well within
the experimentally determined distance range. (b) DEER form-factor
(black) underlying the distance distribution in panel a and fit (red)
as obtained through the Tikhonov regularization approach. (c and d)
Simulated structures underlying the calculated distance distributions
panel a. (e and f) Distance histograms of Cζ–Cζ distances for residues R5 obtained from two 200 ns
long MD trajectory (panel e corresponds to Figure ). Within the experimental precision (see
main text), the simulated distances match the experimentally determined
distances. (g and h) Evolution of Cα–Cα distances for residues R5 for the MD trajectory underlying
the histograms in panels e and f.
(a) Experimentally
determined SL–SL distance distribution P(r) (black, error in red) obtained for
the MTSL tagged R5C-R5C mutant. Superimposed are calculated distance
distributions from precomputed rotamer libraries (blue for a hinged
state and in red for an extended conformation). The Cα distances are indicated. The calculated distributions fit well within
the experimentally determined distance range. (b) DEER form-factor
(black) underlying the distance distribution in panel a and fit (red)
as obtained through the Tikhonov regularization approach. (c and d)
Simulated structures underlying the calculated distance distributions
panel a. (e and f) Distance histograms of Cζ–Cζ distances for residues R5 obtained from two 200 ns
long MD trajectory (panel e corresponds to Figure ). Within the experimental precision (see
main text), the simulated distances match the experimentally determined
distances. (g and h) Evolution of Cα–Cα distances for residues R5 for the MD trajectory underlying
the histograms in panels e and f.To compare the measured EPR distribution with the MD results, we
extracted Cζ–Cζ distances
for residues R5 (denoted r(Cζ–Cζ) in Figure e,f) from each snapshot of two 200 ns long trajectories. (Cζ is the terminal carbon atom of the R5 arginine side
chain and was chosen here to approximate the position of the unpaired
electron in an MTSL labeled cysteine, which is the relevant reference
point in our DEER experiments.) The resulting distance populations,
as displayed through the histograms in Figure c,d, match the experimental distance distribution P(r) well, except for longer distances
>4.5 nm, which would not be observable experimentally, but were
sampled
in the MD simulations (the DEER experiment was recorded with a maximal
mixing time of 2.5 μs (cf. Figure b) such that distances above ca. 4.5 nm cannot
be reliably extracted from the form-factor[36]).Panels e and f in Figure (similar to Figure c) represent the simulated distances as trajectories
of r(Cα- Cα) distances
for
the two MD runs (see the Supporting Information for more replicas). Both show a comparable conformational switch
between the two subensembles, as evidenced by the transitions between
the two distance regimes of 2–3.5 nm for the hinged and 4–6
nm for the extended subensemble.To exclude systematic errors
in the analysis which might arise
because the R5 side chain is two bonds shorter than the R5C-MTSL side
chain, we calculated distance distributions using the MMM software
package for a hinged and an extended conformation found in the MD
simulations. In brief, using MMM we attached MTSL in silico to positions R5 and predicted a distance distribution based upon
precomputed rotamer libraries.[37]Figure a shows the experimentally
derived span of distances, superposed with those predicted for the
extended and hinged structures. A good match could be achieved only
when conformations from both subensembles were considered. Panels
g and h of Figure show the structures used for the MMM analysis and demonstrate how
the hinged subensemble leads to shorter distances. When both MAX2 subunits are hinged, distances between ca. 2 and 3 nm can
be accounted for, while the extended ensemble can only account for
distances >3 nm, corroborating our initial expectations.While the experimental data and simulations agree qualitatively,
the DEER distance distributions do not feature two distinct distance
distributions as one might expect from the MD trajectories. Two points
contribute to this: (i) the MTSLspin label attached
to position R5C features intrinsically high conformational plasticity,
which can cause broadening of the measured distance distributions
(as suggested by the MMM analysis, cf. Figure a) which would not manifest itself in simulations
of the unlabeled protein and (ii) the Tikhonov regularization
employed to extract the distance distributions from the dipolar evolution
function may introduce an additional broadening.[36]To investigate this further, we have run a complementary
MD simulation
with an MTSL-labeled R5C mutant (see Figure S10). This run showed partial overlap of the electron–electron
distance distributions for the hinged and extended conformations,
respectively, but with clearly distinguishable maxima. This indicates
that a combination of effects arising from both the conformational
flexibility of the SL and the data regularization procedure might
contribute to broadening observed in the experimentally derived distance
distribution.CW EPR: Population Quantification. To quantify
the relative populations of the extended and hinged subensembles,
we fit room-temperature continuous-wave (CW) EPR spectra to simulated
overlapping spectra, to represent a superposition of two spin-labeled
protein conformations with different internal rotational correlation
times (τc) of the SL (for details see the Supporting Information Script S1). Here, τc can be considered to be a measure of the mobility of the
SL, and thus of the conformational plasticity of its local environment.
The best fit to the data was obtained using a superposition in which
ca. 85% of the ensemble is found in a more mobile state (τc = 0.14 ns), and 15% is in a more rigid state (τc = 1.18 ns.) Hence, assuming that the spin label in the hinged
conformation experiences reduced rotational motion, and that it is
thus represented by the slow simulation component, the hinged states
make up 15% of the conformational space of MAX2’s
NTD.Additional supporting CW and DEER EPR data, including data
for
two further reference mutants, and simulations involving possible
third subensembles can be found in Figures S4–S9.Note that the populations found by CW EPR do not match those
of
the DEER distance distribution. While the former show higher populations
of extended states, the shorter distances, corresponding to the hinged
ensemble apparently dominate in the latter. However, the distance
populations in the DEER experiments cannot be quantitatively interpreted,
because short distance peaks with smaller distance distributions will
appear overpopulated by the employed Tikhonov regularization-based
data analysis, while longer distances with broader distance distributions
will in turn be underrepresented. This is due to the summation of
the damped cosine functions, i.e. form factors that represent the
various conformations in the extended ensemble. The resulting sum
converges to decay functions that resemble the DEER background and
tend to not contribute to the experimental background corrected form
factor.[36] Moreover, the ensemble populations
in the DEER experiments might be different from those in the CW EPR
experiments because of the freezing of the sample at its glass-transition
temperature. There an altered distribution might predominate, with
increased populations of hinged conformations relative to ambient
conditions.Hence, within the precision of our approach the
CW EPR and DEER
data are both in agreement with the MD data.Implications
for DNA Binding. The combined experimental
and MD data presented here for MAX2’s DNA-binding
domain supports a model in which, in the absence of DNA, MAX2’s NTD samples two distinct subensembles with different properties,
each of which likely plays a role in its regulatory activities. Intrinsic
disorder in the DNA-binding NTD could assist in modulating binding
to target DNAs, whereas the enhanced conformational plasticity enables
a potent structural switch providing higher binding efficiencies as
compared to a hypothetical permanently rigid binding site. Indeed,
the importance of conformational switches for TF activity has been
pointed out recently.[38] Further observations
of conformational subensembles in IDRs have been reported by Choi
et al.[39] and Na et al.[40]For the case at hand, a mechanism appears intuitive
in which extended
NTD conformations recruit DNA to MAX2 by scanning a large
sample space, sometimes also termed “fly casting”.[41] In contrast, the hinged conformations expose
the final DNA binding site to foster complex stabilization. Indeed,
a heavily positively charged RKRRDH sequence (aa 13–18) at
the core of the DNA binding site is exposed in the hinged subensemble,
which might electrostatically attract DNA molecules from their initial
encounter site toward their final position within the DNA–TF
complex (Figure a).
Because of the dimeric nature of MAX2, it is also possible
that both subunits could act independently. While one NTD recruits
the ligand, for example, the other may open the path to the binding
site. It is well-known that, once bound,[11] the NTD adopts a stable helical form. The detailed folding mechanism
is, however, still a matter of debate. Both conformational selection
and induced fit scenarios have been proposed.[14,42] The conformational sampling reported here can however be reconciled
with both models.
Figure 4
(a) Depiction of the solvent-exposed surfaces (yellow)
of the positively
charged amino acids in the NTD of a hinged conformation. The RKRRDH
stretch is indicated. (b) DEER-derived distance distribution P(r) for the DNA-free R5C mutant (red)
and a DNA-bound R5C mutant (blue) as published earlier (see main text).
The errors are shown as superimposed shading. (c) Form factors and
fits underlying the data in panel a (DEER signals before background
correction are reported in Supporting Information Figure S6). Evidently, the sampled range of distances does
not change significantly upon DNA binding. The arrows serve as a guide
to the eye.
(a) Depiction of the solvent-exposed surfaces (yellow)
of the positively
charged amino acids in the NTD of a hinged conformation. The RKRRDH
stretch is indicated. (b) DEER-derived distance distribution P(r) for the DNA-free R5C mutant (red)
and a DNA-bound R5C mutant (blue) as published earlier (see main text).
The errors are shown as superimposed shading. (c) Form factors and
fits underlying the data in panel a (DEER signals before background
correction are reported in Supporting Information Figure S6). Evidently, the sampled range of distances does
not change significantly upon DNA binding. The arrows serve as a guide
to the eye.Finally, in our earlier work we
could show that MAX2’s NTD undergoes substantial
internal motions even when in
the stable DNA-bound helical state. A comparison of the data[16] for MAX2 bound to a DNA ligand with
the results obtained here on DNA-free MAX2 shows that the
range of end-to-end distances sampled by MAX2’s
NTD is similarly broad (1.5–4.5 nm) in both the DNA-free and
DNA-bound states. (Figure b,c shows DEER-derived distance distributions and form factors
before (red) and after (blue) binding of DNA to MAX2 for
the R5C-R5C double mutant.) This shows that the sampled conformational
space is not reduced by DNA binding. Instead, comparable degrees of
freedom on the nanometer scale appear to be conserved. Thus, penalties
due to losses in conformational entropy upon binding might be mitigated.In conclusion, experimental and computational characterization
of intrinsic disorder in transcription factors such as MAX2 have recently advanced to a stage where detailed models of their
structural dynamics can be developed, enabling the long-desired high-resolution
description of their modes of action. It remains to be seen if this
understanding can stimulate new developments, such as in rational
drug design and molecular targeting.
Authors: Samuel A Lambert; Arttu Jolma; Laura F Campitelli; Pratyush K Das; Yimeng Yin; Mihai Albu; Xiaoting Chen; Jussi Taipale; Timothy R Hughes; Matthew T Weirauch Journal: Cell Date: 2018-02-08 Impact factor: 41.582
Authors: William B Tu; Sara Helander; Robert Pilstål; K Ashley Hickman; Corey Lourenco; Igor Jurisica; Brian Raught; Björn Wallner; Maria Sunnerhagen; Linda Z Penn Journal: Biochim Biophys Acta Date: 2014-06-14
Authors: Ucheor B Choi; Rashek Kazi; Natalie Stenzoski; Lonnie P Wollmuth; Vladimir N Uversky; Mark E Bowen Journal: J Biol Chem Date: 2013-06-19 Impact factor: 5.157
Authors: Susan Sammak; Najoua Hamdani; Fabrice Gorrec; Mark D Allen; Stefan M V Freund; Mark Bycroft; Giovanna Zinzalla Journal: Biochemistry Date: 2019-07-11 Impact factor: 3.162
Authors: Mohammad A A Fakhree; Ine Segers Nolten; Christian Blum; Mireille M A E Claessens Journal: J Phys Chem Lett Date: 2018-02-27 Impact factor: 6.475