Wen-Ting Chu1, Jin Wang1,2. 1. State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, Jilin 130022, China. 2. Department of Chemistry & Physics, State University of New York at Stony Brook, Stony Brook, New York 11794, United States.
Abstract
Large-scale conformational changes of proteins, including the open-closed transitions, are crucial for a variety of protein functions. These open-closed transitions are often associated with ligand binding. However, the understandings of the underlying mechanisms of the conformational changes within proteins during the open-closed transitions are still challenging at present. In this study, we quantified the intrinsic underlying conformational energy landscapes of five different proteins with large-scale open-closed transitions. This is realized by exploring the underlying density of states and the intrinsic conformational energy landscape topography measure Λ. Λ is a dimensionless ratio of conformational energy gap δE versus conformational energy roughness δE and configurational entropy S or size of the intrinsic conformational energy landscape. By quantifying the Λ of intrinsic open-closed conformational (Λoc) and intrinsic global folding (Λglobal) energy landscapes, we show that both intrinsic open-closed conformation energy and entropy landscapes are funneled toward the closed state. Furthermore, our results indicate the strong correlations between Λ and thermodynamics (conformational state transition temperature against trapping temperature) as well as between Λ and kinetics (open-closed kinetic time) of these proteins. This shows that the intrinsic conformational landscape topography determines both the conformational thermodynamic stability and kinetic speed of the conformational dynamics. Our investigations provide important insights for understanding the fundamental mechanisms of the protein conformational dynamics in a physical and global way.
Large-scale conformational changes of proteins, including the open-closed transitions, are crucial for a variety of protein functions. These open-closed transitions are often associated with ligand binding. However, the understandings of the underlying mechanisms of the conformational changes within proteins during the open-closed transitions are still challenging at present. In this study, we quantified the intrinsic underlying conformational energy landscapes of five different proteins with large-scale open-closed transitions. This is realized by exploring the underlying density of states and the intrinsic conformational energy landscape topography measure Λ. Λ is a dimensionless ratio of conformational energy gap δE versus conformational energy roughness δE and configurational entropy S or size of the intrinsic conformational energy landscape. By quantifying the Λ of intrinsic open-closed conformational (Λoc) and intrinsic global folding (Λglobal) energy landscapes, we show that both intrinsic open-closed conformation energy and entropy landscapes are funneled toward the closed state. Furthermore, our results indicate the strong correlations between Λ and thermodynamics (conformational state transition temperature against trapping temperature) as well as between Λ and kinetics (open-closed kinetic time) of these proteins. This shows that the intrinsic conformational landscape topography determines both the conformational thermodynamic stability and kinetic speed of the conformational dynamics. Our investigations provide important insights for understanding the fundamental mechanisms of the protein conformational dynamics in a physical and global way.
Large-scale structural
rearrangements in biomolecules have been
observed in experiments, during the processes of ligand binding, catalysis,
signal transduction, and regulation.[1−5] These structural changes are often found to be critical to the functions
of biomolecules. The intrinsic flexibility of the protein during binding
is critical for uncovering the underlying mechanisms in kinetics and
thermodynamics of the conformational changes.By X-ray crystallography,
NMR, EM, and other biophysical techniques,
ligand-induced conformational changes from apo state
(usually open) to ligand-bound state (usually closed) have been observed
at the atomic level. In the “Database of Macromolecular Movements”,[3] the conformational changes can be classified
into five main types, referenced as predominantly shear, predominantly
hinge, not hinge or shear, involving partial refolding of the structure,
or unclassified.[2,3,6] For
ligand binding, the most often observed types of motions of the proteins
belong to shear and hinge ones. However, understanding the underlying
mechanisms of open–closed transitions, in particular the relationship
between local flexibility and global flexibility of the proteins,
is still challenging.For biomolecules, large-scale conformational
change within the
molecule is a rate-limiting step. The time scales of protein motion
(binding/unbinding, folding, etc.) span from femtosecond to beyond
seconds.[7,8] Though molecular dynamics (MD) simulation
provides a good way to investigate the protein systems and give atomic
structural information, it is rather time-consuming for conventional
MD simulations to deal with the protein systems with large-scale conformational
changes and effectively sample enough of the conformational states.
The structure-based models based on the energy landscape theory have
become a powerful tool for the studies of the mechanisms of protein
folding and binding processes, which are often associated with large
conformational changes.[9−16] Recently, two-basin structure-based models with two reference states,
and even multibasin structure-based models with multiple reference
states, have been developed and successfully applied to many typical
allosteric systems, for example, adenylate kinase (ADK, two-basin),[17−21] DNA Y-family polymerase IV (DPO4, two-basin),[22] glutamine-binding protein (GBP, two-basin),[23] calmodulin (CaM, two-basin),[24−26] maltose-binding
protein (MBP, multibasin),[27] and protein
kinase A (PKA, multibasin),[28] etc. By using
these methods, simulation results can provide valuable information
about energy, entropy, free energy, binding constants, and other physical
quantities from effective sampling of the conformational states.Energy landscape theory has guided our understanding of biomolecules
as well as their kinetic and thermodynamic processes. In the previous
studies, by quantifying the underlying density of states (DOS), we
have shown that the dimensionless ratio Λ between the energy
gap δE, energy roughness δE, and configurational entropy S of the system [] quantifies the topography
of the underlying
energy landscape and measures the degree of its funneledness. It has
been demonstrated that Λ, as the energy landscape topography
measure, determines the thermodynamic stability and kinetic rate of
the folding process of the proteins.[29] For
the protein-binding process, we have suggested that the topography
of binding energy landscape Λ (intrinsic specificity) also dictates
with the thermodynamic and kinetic specificity.[30−33] In addition, for the coupled
binding–folding process, the entire energy landscapes have
been proposed to be the combined one of the binding and folding energy
landscapes if the binding and folding are weakly coupled in the recognition
process.[34] Upon quantification of the topography
of the individual effective binding and folding landscapes, as well
as the whole global binding–folding energy landscapes, landscape
topography Λ is shown again to govern both thermodynamic feasibility
and kinetic binding–folding rate.[35] Therefore, the Λ will also be a valuable quantity to study
the large-scale conformational change within proteins between open
and closed states.Here in this study, for the uncovering of
the underlying relationship
between the thermodynamics and kinetics and the topography of intrinsic
energy landscape Λ of protein conformational changes (between
open and closed states), topography of the intrinsic local open–closed
conformation and intrinsic global folding energy landscapes will be
quantified for several different proteins. Five different proteins
with reference structures of open and closed conformations were selected
for this study: lysine/arginine/ornithine-binding protein (LAOBP),[36] adenylate kinase (ADK),[37,38] DNA Y-family polymerase IV (DPO4),[39] lipase
1 (LIP1),[40,41] and phosphonate-binding protein (PhnD).[42] As shown in Figure S1, these proteins are with different sizes and different motions between
open and closed states. We will quantify the intrinsic global energy
landscape as well as the intrinsic local open–closed conformation
energy landscape from the underlying DOS extracted from the conformational
dynamics simulation, by applying the two-basin structure-based model.
The topography of the conformational landscape for each individual
protein can be represented by the dimensionless ratio Λ between
the energy gap between the native (closed) state and the average non-native
states (δE = |En – ⟨Enon–native⟩|),
the roughness of the conformation energy landscape or the width of
the energy distribution of the non-native states (δE), and the size of the funnel measured by the configurational entropy
of non-native states of the conformation energy landscape (S). We focus on the topography of both intrinsic local open–closed
conformation and intrinsic global folding landscapes (Λoc and Λglobal), as well as the relationships
between them and the thermodynamic and kinetic properties of the proteins.
This study will be essential for uncovering the fundamental mechanisms
of large-scale conformational changes between open and closed states
within proteins, revealing the physical effects and significance of
landscape topography measure Λ.
Results and Discussions
Intrinsic
Energy and Entropy Landscapes Are Funneled toward
the Basin at Closed State
The intrinsic conformation energy
landscape can be quantified by the density of states (DOS), a statistical
energy distribution in microcanonical ensemble. With the help of the
WHAM algorithm,[43−45] the intrinsic conformation energy landscape is quantified
by transforming the canonical ensemble representation to microcanonical
representation. In general, the intrinsic energy landscape probes
the underlying interactions and usually has a very weak dependence
on the temperature. First, the energy landscape is illustrated by
the energy spectrum directly through the zero dimensional projection
of energy to itself, as shown in Figure S2A,B. In this study, both the intrinsic local conformation (open–closed)
and intrinsic global folding (folded and unfolded) energy landscapes
are shown in terms of the energy spectrum for the 5 different proteins.
They both have only one native state basin (as illustrated in Figure S2C,D). The minimum of intrinsic energy
landscape spectrum of the non-native ensemble is higher than that
of the native ensemble. Our results show that LIP1 has the largest
energy gap δE in intrinsic local open–closed
conformational and intrinsic global folding energy landscapes; DPO4
has the lowest energy gap δE in intrinsic local
open–closed conformation energy landscapes while ADK has the
lowest energy gap δE in intrinsic global folding
energy landscapes (listed in Table S1). Figure S2C,D also demonstrates that the size
of the intrinsic energy landscape measured by entropy through DOS
decreases as energy goes down. This clearly shows a funnel toward
native states. Meanwhile, many other valuable quantities, such as
average energy, heat capacity, and free energy, can be obtained from
DOS.We further show the two-dimensional DOS by projecting it
onto the fraction of native contacts Q and energy E, illustrated in Figure . It is observed that the number of states decreases
as either Q increases or E decreases.
This indicates that the size of the intrinsic energy landscape measured
by the number of states shrinks as the energy decreases and/or the
state gets closer to the native with the lowest energy, as a funnel
toward native state. It is obvious that closed states and folded states
have the lowest DOS and stay at the bottom of the energy funnel. In
addition, the transitions between open and closed states, as well
as between folded and unfolded states, can be visualized in the heat
capacity curves. As shown in Figure S3,
all the 5 proteins have a higher folding–unfolding transition
temperature (Ttransglobal) and a lower open–closed transition
temperature (Ttransoc). With the aim of further showing the details
of these transitions, the fraction of native closed contacts Qclosed and the root-mean-square deviation (RMSD)
with respect to the native open structure RMSDopen are
used to distinguish the open, closed, and unfolded states. On the
free energy landscapes, it is clear that closed states (Qclosed > 0.8) locate at the basin with the lowest free
energy at lower temperature (Figure S5,
temperature lower than Ttransoc), open states (Qclosed < 0.2 and RMSDopen < 1.0) at intermediate
temperature (Figure S6, temperature between Ttransoc and Ttransglobal), and unfolded states (RMSDopen > 5.0) at high temperature (temperature higher than Ttransglobal). The details of free energy analyses are included in the Supporting Information. However, both average
intrinsic energy and DOS landscapes have only one basin that points
to the closed state (Figure and Figure S4). This shows a clear
funneled intrinsic energy landscape toward the native closed state.
In general, free energy depends on the energy, entropy or DOS, and
temperature (G = H – TS). As a result, at low temperature, the effect of energy
contributes to the closed basin on free energy landscape; at intermediate
and high temperature, the effect of entropy leads to the open and
unfolded basins on free energy landscape.
Figure 1
Logarithm of open–closed
(A) and global (B) DOS of the 5
different proteins as a function of Q and energy.
Here, Q is calculated as the fraction of native closed
contacts for panel A and native global contacts for panel B.
Figure 2
Average energy of intrinsic local conformation
energy landscape
as a function of Qclosed and RMSDopen of LIP1 (A), DPO4 (B), LAOBP (C), ADK (D), and PhnD (E).
Closed states are located at the right bottom part; open states are
located at the left bottom part, and unfolded states are located at
the left upper part.
Logarithm of open–closed
(A) and global (B) DOS of the 5
different proteins as a function of Q and energy.
Here, Q is calculated as the fraction of native closed
contacts for panel A and native global contacts for panel B.Average energy of intrinsic local conformation
energy landscape
as a function of Qclosed and RMSDopen of LIP1 (A), DPO4 (B), LAOBP (C), ADK (D), and PhnD (E).
Closed states are located at the right bottom part; open states are
located at the left bottom part, and unfolded states are located at
the left upper part.
Intrinsic Conformation Energy Landscape Topography Λ Determines
the Thermodynamics of Protein Conformational Changes
According
to the energy landscape theory, we can quantify the topography of
the intrinsic energy landscape by a dimensionless quantity Λ,
which can be calculated by .[30,34,46−50] We analyzed the data of Λ, as well as the important
thermodynamic
characteristics of protein conformational changes such as the glassy
trapping temperature Tg and the conformation
state transition temperature Ttrans from
the DOS of the intrinsic open–closed conformation and intrinsic
global folding energy landscapes (details are referred to in the Supporting Information).All the related
data of the intrinsic energy landscapes and the thermodynamic characteristics
are listed in Table S1. Different superscripts
are used for different intrinsic energy landscapes, oc for intrinsic
open–closed conformation energy landscape, global for intrinsic
global folding energy landscape. Of all the 5 proteins in our studies,
ADK has the highest Λoc, indicating the most funneled
intrinsic open–closed conformation energy landscape and the
strongest open–closed conformation stability against trapping.
DPO4 has the lowest Λoc, which may be the one most
easily trapped into non-native states. As shown in Figure , there are significant differences
between the topography of the intrinsic open–closed conformation
energy landscapes of ADK and DPO4. Though they have similar entropy Soc, the energy gap δEoc of ADK landscape is much greater than that of DPO4
landscape, and the roughness Δoc of ADK landscape is much less than that of DPO4 landscape.
In general, the intrinsic open–closed conformation energy landscape
can be considered as a part of intrinsic global folding energy landscape,
locating at the bottom of the intrinsic global folding funnel (see Figure ). However, the significant
difference between ADK and DPO4 in the intrinsic open–closed
conformation energy landscapes does not exist in their intrinsic global
folding energy landscapes. The Λglobal of DPO4 is
slightly higher than that of DPO4, with much greater energy gap δEglobal and entropy Sglobal.
Figure 3
Intrinsic local open–closed conformation energy
landscape
as well as the intrinsic global folding energy landscape of ADK (blue
and purple funnels, top part panel) and DPO4 (yellow and red funnels,
bottom part panel). A simplified funnel is represented for each energy
landscape. The intrinsic global and open–closed energy landscapes
are located in the same coordinate system with different graph scales
in the lower left corner, respectively. The depth of the funnel in
the z axis corresponds to the energy gap δE. The opening width of the funnel is described with
the entropy S. For each intrinsic energy landscape,
we use the number of the local (small) basins to roughly show the
roughness δE. The script for generating this
funneled energy landscape was used from https://oaslab.com/drawing_funnels.html.
Intrinsic local open–closed conformation energy
landscape
as well as the intrinsic global folding energy landscape of ADK (blue
and purple funnels, top part panel) and DPO4 (yellow and red funnels,
bottom part panel). A simplified funnel is represented for each energy
landscape. The intrinsic global and open–closed energy landscapes
are located in the same coordinate system with different graph scales
in the lower left corner, respectively. The depth of the funnel in
the z axis corresponds to the energy gap δE. The opening width of the funnel is described with
the entropy S. For each intrinsic energy landscape,
we use the number of the local (small) basins to roughly show the
roughness δE. The script for generating this
funneled energy landscape was used from https://oaslab.com/drawing_funnels.html.To further analyze the 5 proteins
individually, it should be noted
that ADK has the lowest residue number as compared to the others but
relatively high number of open–closed contacts (high NCoc/N, see the details in the Supporting Information). The open–closed contact number
of DPO4 is the lowest one of all. In the “Database of Macromolecular
Movements”, 4 of the 5 proteins in our studies (LIP1, LAOBP,
ADK, and PhnD) can be classified as “hinge motions”,
except for the DPO4. DPO4 transfers from the “stable”
open state (with native open contacts) via the “unstable”
intermediate state (without native open or closed contacts) to the
“stable” closed state (with closed states),[22] which may be related to the relatively low Λoc of DPO4. According to the open and closed structures,[51] ADK has two hinges (one hinge for LID domain
and one hinge for NMP domain) while the other 3 proteins (LIP1, LAOBP,
and PhnD) only have one (see Figure S17). From the open to the closed state, ADK will go through a “relatively
stable” intermediate state with one domain closed, which may
be linked with the highest Λoc of all the 5 proteins.In simplified analytical models,[30,34] the thermodynamic
stability against trapping of the global binding–folding can
be determined by the whole intrinsic global binding–folding
landscape topographic measure with Tb/Tg = Λ + (Λ2 –
1)1/2. In the previous studies,[29,35] the intrinsic energy landscape topographic measure Λ has been
shown to be correlated with the thermodynamic folding/binding stability
against trapping temperature (Tf/Tg or Tb/Tg). Likewise, Λglobal correlates
with Ttransglobal/Tgglobal in our studies (as shown
in Figure , Ttransglobal and Tgglobal are the folding transition temperature and glassy trapping
temperature of global DOS). High Λglobal and Ttransglobal/Tgglobal values suggest a funneled intrinsic global folding energy
landscape and strong folding stability. Intriguingly, our results
demonstrate the strong correlation between open–closed conformation
energy topography measure Λoc and conformation transition
temperature against trapping Ttransoc/Tgoc (Ttransoc and Tgoc are the open–closed transition temperature and glassy trapping
temperature of open–closed DOS).
Figure 4
Correlations between
the intrinsic energy landscape topographic
measure (Λoc and Λglobal) and thermodynamic
characteristics. (A) Correlation between the intrinsic open–closed
landscape topography measure (Λoc) and open–closed
transition temperature against open–closed glassy trapping
temperature (Ttransoc/Tgoc). (B) Correlation between the intrinsic
global landscape topography measure (Λglobal) and
global transition temperature against global glassy trapping temperature
(Ttransglobal/Tgglobal). The red solid line is the linear fitting
results; the blue solid line is the analytical mean field theory prediction
of the relationship between Ttrans/Tg and Λ: Ttrans/Tg = Λ + (Λ2 –
1)1/2.
Correlations between
the intrinsic energy landscape topographic
measure (Λoc and Λglobal) and thermodynamic
characteristics. (A) Correlation between the intrinsic open–closed
landscape topography measure (Λoc) and open–closed
transition temperature against open–closed glassy trapping
temperature (Ttransoc/Tgoc). (B) Correlation between the intrinsic
global landscape topography measure (Λglobal) and
global transition temperature against global glassy trapping temperature
(Ttransglobal/Tgglobal). The red solid line is the linear fitting
results; the blue solid line is the analytical mean field theory prediction
of the relationship between Ttrans/Tg and Λ: Ttrans/Tg = Λ + (Λ2 –
1)1/2.The intrinsic energy
landscape topographic measure Λ is a
valuable quantity in that it can reflect the effect of energy gap δE, roughness δE, and entropy S, as well as the effect of the transition temperature Ttrans and the glassy trapping temperature Tg, combined as a whole rather than individually.
However, it is interesting to find out that the ratio of open–closed
conformation and global folding Λ (Λoc/Λglobal) can be significantly correlated with the ratio of these
characteristics (shown in Figure S7). As
mentioned above, Λoc and Λglobal are correlated with Ttransoc/Tgoc and Ttransglobal/Tgglobal, respectively. Therefore, Λoc/Λglobal is shown to have positive correlation with Ttransoc/Ttransglobal and (Tgoc/Tgglobal)−1 (Λoc/Λglobal has the reciprocal correlation
with Tgoc/Tgglobal, Figure S7A,B). Likewise, according to the equation , Λoc/Λglobal is suggested to have a positive correlation
with δEoc/δEglobal and [((Soc)1/2Δoc)/((Sglobal)1/2Δglobal)]−1 (Λoc/Λglobal has the reciprocal correlation
with ((Soc)1/2Δoc)/((Sglobal)1/2Δglobal), Figure S7C,F). Though Δoc/Δglobal and (Soc/Sglobal)1/2 have the opposite trends with Λoc/Λglobal, their product (((Soc)1/2Δoc)/((Sglobal)1/2Δglobal)) has
the reciprocal correlation with Λoc/Λglobal. The Λoc/Λglobal shows the coupling
effect between intrinsic open–closed and intrinsic folded–unfolded
landscapes, but it has the same order with Λoc (among
the 5 proteins, Λoc changes much greater than Λglobal). The results indicate that, of the 5 proteins in this
study, ADK has relatively more of a funneled intrinsic open–closed
conformation landscape and less of a funneled intrinsic folded–unfolded
landscape (the difference between Ttransoc and Ttransglobal is the
lowest), whereas DPO4 has relatively less of a funneled intrinsic
open–closed conformation landscape and more of a funneled intrinsic
folded–unfolded landscape (the difference between Ttransoc and Ttransglobal is the highest).
Average Local Frustration Reflects the Global
Roughness
The localized frustration is connected with local
conformational
flexibility and large-scale conformational changes. The “frustratometer”
introduced by Ferreiro et al.[52,53] can provide the local
frustration per residue. By using the online tool, the “frustratometer”
(http://www.frustratometer.tk/), we analyzed the local configurational frustration of the open
and closed forms of all the 5 proteins (see Figure S13). However, per residue frustration index can not be used
to compare with the characteristics of these proteins. We quantified
the whole frustration by calculating the average highly local frustration
(mean value of the entire protein) difference between open and closed
forms . As shown in Figure S14, the results suggest that correlates with the roughness Δoc (R2 =
0.72), (R2 = 0.70),
and (R2 = 0.66).
This reveals that the average local frustration (quantified from frustratometer)
can reflect the global roughness (roughness of the open–closed
conformational landscape). In addition, the shows the “real” (intrinsic)
roughness without the size effect.[29,30] Here LIP1
is the system with the highest roughness and the lowest of all; ADK is the one
with the lowest
roughness and the highest of all. Consequently,
the results indicate
that the correlations between and roughness of open–closed
conformational
landscape are independent with the protein size.
Intrinsic Conformation
Energy Landscape Topography Λ Determines
the Kinetics of Protein Conformational Changes
In the previous
studies,[29,35] the intrinsic energy landscape topographic
measure Λ has been shown to be correlated to the kinetic folding/binding
time τ, during the processes of protein folding–binding–folding.
Here in this study, we focus on the relationship between the topography
measure of intrinsic local open–closed conformation energy
landscape Λoc and the rate of conformation switching
from open to closed state. After 200 kinetic runs starting from the
open configurations, the mean value of first passage time (⟨τχ⟩) has been collected for each protein (see Table S1). Figure S12 illustrates the distribution of τχ. Both
⟨τχ⟩ and standard deviation of
τχ of DPO4 are much higher than those of other
proteins. As a result, DPO4 can be classified as GROUP I. The population
of high τχ of DPO4 is extremely high. These
all can be linked to the lowest Λoc value with relatively
low bias toward closed state and rougher conformation landscape. LIP1
and PhnD have similar medium ⟨τχ⟩
and standard deviation of τχ values, which
can be collected in GROUP II. In GROUP III, ADK and LAOBP all have
relatively low ⟨τχ⟩ and standard
deviation of τχ values. In addition, the distributions
of τχ in this group are toward the low τχ values. As shown in Figure , the protein with the lowest Λoc (DPO4) has the largest open–closed conformation transition
time. ln ⟨τχ⟩ can be fitted to
Λoc with the relationship of y = a + be (green
dashed line) or y = a + b/(x + c) (blue solid
line) with similar fitting R2. Except
for the protein with extremely high Λoc (ADK), other
proteins with Λoc lower than 2.0 seem to have a linear
correlation between Λoc and ln ⟨τχ⟩, indicating that low intrinsic landscape topography
measure Λoc may have a connection with low kinetic
rate of switching from open and closed state, whereas high Λoc does not correspond to significantly high kinetic rate.
These results may have something to do with the protein structure
as well as simulation model.[35] In addition,
logarithm of the standard deviation of τχ () has
the same behavior as ln ⟨τχ⟩.
We also calculated the second-order moments
of the open–closed transition time (⟨τχ2⟩/⟨τχ⟩2), which has a linear correlation
with roughness Δoc and configuration entropy S of the intrinsic open–closed
energy landscape. Above all, we demonstrate that a less biased, rougher,
and greater sized conformation landscape will correspond to a lower
conformational transition speed and lead to more significant fluctuations
in conformation switching kinetics.
Figure 5
Correlations between the properties of
intrinsic local open–closed
conformation energy landscape topography and the kinetics of open–closed
transitions. (A) Logarithm of mean first passage time from open to
closed states (ln ⟨τχ⟩) correlates
with the intrinsic open–closed landscape topography measure
(Λoc). (B) Logarithm of the standard deviation of
τχ () correlates
with Λoc.
(C) Second-order moments of τχ (⟨τχ2⟩
/⟨τχ⟩ 2) correlate
with roughness Δoc of intrinsic open–closed conformation energy landscape. (D)
⟨τχ2⟩/⟨τχ⟩2 correlates with configuration entropy S of intrinsic
open–closed conformation energy landscape. In panels A and
B, the green dashed line and blue solid line represent the e exponential
and reciprocal fitting results, respectively. In panels A–D,
the red solid line shows the linear fitting results.
Correlations between the properties of
intrinsic local open–closed
conformation energy landscape topography and the kinetics of open–closed
transitions. (A) Logarithm of mean first passage time from open to
closed states (ln ⟨τχ⟩) correlates
with the intrinsic open–closed landscape topography measure
(Λoc). (B) Logarithm of the standard deviation of
τχ () correlates
with Λoc.
(C) Second-order moments of τχ (⟨τχ2⟩
/⟨τχ⟩ 2) correlate
with roughness Δoc of intrinsic open–closed conformation energy landscape. (D)
⟨τχ2⟩/⟨τχ⟩2 correlates with configuration entropy S of intrinsic
open–closed conformation energy landscape. In panels A and
B, the green dashed line and blue solid line represent the e exponential
and reciprocal fitting results, respectively. In panels A–D,
the red solid line shows the linear fitting results.There are few experimental results of the open–closed
transition
kinetics (kopen and kclosed) available. The open–closed kinetic rates
of ADK have been reported by Hanson et al. via high-resolution single-molecule
FRET (kopen = 120 ± 40 s–1, kclosed = 220 ± 70 s–1).[51] However, these kinetic rates are
obtained at experimental temperature, not the τχ in our kinetic simulations. Therefore, it is not appropriate to
compare the kinetic rates directly. In DPO4, the open–closed
conformational change is the rate-limited step of ligand binding.
It has been reported that the mean binding time of DNA to DPO4 is
about 1.3 s.[54] Thus, the open–closed
kinetic rate of DPO4 at experimental temperature is much lower than
that of ADK. These results may have something to do with the topography
of the open–closed conformational energy landscape.Furthermore,
we found that the ln ⟨τχ⟩ correlates
highly with protein size N (see
the Supporting Information). This relationship
may have something to do with the parameters of the model. Similar
results have been reported in previous studies.[35] This correlation may decrease or disappear if we continuously
change the parameters of one protein system. However, the correlation
between energy landscape topography and the crucial characteristics
of proteins is independent with the parameters of the simulation model.
Methods
The two-basin coarse-grained structure-based model
(SBM)[12,55,56] for each selected
protein was
constructed, on the basis of the open and closed reference structures
of each protein in the Protein Data Bank (all the PDB IDs used in
the simulations are listed in Table S1).
The simulation details and analyses are introduced in the Supporting Information.
Conclusion
Intrinsic
movements within a protein, such as large-scale domain–domain
open–closed conformation transitions, are often essential for
biomolecular functions. To uncover the underlying relationship between
the intrinsic conformation energy landscapes and thermodynamics as
well as kinetic rates of protein conformational changes, we quantify
different kinds of intrinsic open–closed conformation energy
landscapes and intrinsic global folding energy landscapes for 5 individual
proteins with large-scale conformational changes. By applying the
WHAM algorithm to transform the canonical ensemble to microcanonical
ensemble, density of states, average energy, free energy, as well
as other thermodynamic characteristics have been calculated for each
protein. A dimensionless quantity, Λ, as a ratio of the energy
gap or bias toward native state δE versus energy
landscape roughness δE as well as configurational
entropy S () is shown to quantify
the topography of
intrinsic conformation energy landscape.For each protein, intrinsic
conformation energy landscape was projected
to a fraction of closed native contacts (Qclosed) and RMSD with respect to the native open structure RMSDopen. The results suggest that there is only one basin on both intrinsic
average energy and entropy (density of states) landscapes, pointing
to the native closed state. In thermodynamics, the intrinsic conformation
energy landscape topography measure Λ shows a strong linear
correlation with the transition temperature against the glassy trapping
temperature (Ttrans/Tg). In detail, intrinsic open–closed conformation
landscape topography measure Λoc correlates with
conformation transition versus trapping temperature Ttransoc/Tgoc, and intrinsic global folding landscape topography measure Λglobal correlates with folding transition versus trapping temperature Ttransglobal/Tgglobal. In addition, the ratio of open–closed conformation
and global folding Λ (Λoc/Λglobal) is found to be significantly correlated with the ratio of other
thermodynamic characteristics, such as Ttransoc/Ttransglobal, δEoc/δEglobal, Δoc/Δglobal, (Soc/Sglobal)1/2, and ((Soc)1/2Δoc)/((Sglobal)1/2Δglobal). In addition, the average local
frustration (quantified with ) is found to correlate
with the global
roughness (roughness of the open–closed conformational landscape),
and this relationship is independent with the protein size. In kinetics,
open–closed conformation transition time τχ can be linked with the topography measure Λoc as
well as the structural model. Our investigations show that the intrinsic
energy landscape topography can determine both the thermodynamics
and the kinetics of conformation switching dynamics of proteins.
Authors: Jin Wang; Ronaldo J Oliveira; Xiakun Chu; Paul C Whitford; Jorge Chahine; Wei Han; Erkang Wang; José N Onuchic; Vitor B P Leite Journal: Proc Natl Acad Sci U S A Date: 2012-09-10 Impact factor: 11.205