Jun Jiang1, Zaizhi Lai2, Jin Wang3, Shaul Mukamel4. 1. Department of Chemical Physics, University of Science and Technology of China , No. 96, JinZhai Road Baohe District, Hefei 230026, China ; Chemistry Department, University of California Irvine , 433A Rowland Hall, Irvine, California 92697, United States. 2. Department of Chemistry and Physics, University of New York at Stony Brook , Stony Brook, New York 11794, United States. 3. Department of Chemistry and Physics, University of New York at Stony Brook , Stony Brook, New York 11794, United States ; State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences , No. 5625, Ren Min Street, Changchun, Jilin 130021, China. 4. Chemistry Department, University of California Irvine , 433A Rowland Hall, Irvine, California 92697, United States.
Abstract
The function of protein relies on their folding to assume the proper structure. Probing the structural variations during the folding process is crucial for understanding the underlying mechanism. We present a combined quantum mechanics/molecular dynamics simulation study that demonstrates how coherent resonant nonlinear ultraviolet spectra can be used to follow the fast folding dynamics of a mini-protein, Trp-cage. Two dimensional ultraviolet signals of the backbone transitions carry rich information of both local (secondary) and global (tertiary) structures. The complexity of signals decreases as the conformational entropy decreases in the course of the folding process. We show that the approximate entropy of the signals provides a quantitative marker of protein folding status, accessible by both theoretical calculations and experiments.
The function of protein relies on their folding to assume the proper structure. Probing the structural variations during the folding process is crucial for understanding the underlying mechanism. We present a combined quantum mechanics/molecular dynamics simulation study that demonstrates how coherent resonant nonlinear ultraviolet spectra can be used to follow the fast folding dynamics of a mini-protein, Trp-cage. Two dimensional ultraviolet signals of the backbone transitions carry rich information of both local (secondary) and global (tertiary) structures. The complexity of signals decreases as the conformational entropy decreases in the course of the folding process. We show that the approximate entropy of the signals provides a quantitative marker of protein folding status, accessible by both theoretical calculations and experiments.
Protein folding is an important
biological process whereby polypeptides form a unique functional 3D
structure. Experimental probing of protein folding pathways along
the free-energy landscape[1] is critical
for the understanding of its mechanism, and the manipulation of protein
structure and function.[2] Thanks to significant
advances in computation power, the atomic level description of protein
folding has become feasible through molecular dynamics (MD) simulations.[3,4] The experimental monitoring of the folding process at the atomic
level is difficult due to the lack of suitable fast tools. Nuclear
magnetic resonance (NMR) spectra[5] give
high-quality data but require extensive data processing. Circular
dichroism (CD) spectroscopy[6] probes protein
secondary structures with low resolution and cannot reveal tertiary
or global structures, making it difficult to view different folding
stages. More importantly, the absence of a good indicator of global
protein geometry makes it hard to correlate the simulated structure
dynamics with experimentally probed folding status.Coherent
ultrafast 2D electronic spectroscopy has emerged recently
as a new structure refinement tool.[7−9] The photon echo signal
records the response of molecules to sequences of ultrafast (femtosecond)
coherent laser pulses, whose 2D correlation plots reflect the variation
of the nonlinear response functions with the controlled time intervals.
They reveal electronic couplings in proteins and interactions among
residues, which characterize time evolving structures. For instance,
we have demonstrated in theory the utilization of 2D infrared (IR)[10] and 2D Raman[11] to
capture the local change of certain structural elements during protein
folding. Recent advances in laser sources[12−14] make it possible
to extend this technique into the ultraviolet (UV) regime. UV photoresponse
of peptide involves electronic transitions of each peptide bond along
the protein backbone. In comparing to IR and Raman spectroscopic tools,
which mostly reflect local details, two-dimensional ultraviolet (2DUV)
spectroscopy of proteins, carrying abundant information about inter-
and intramolecular interactions, is particularly useful in identifying
both the local details and global geometries of protein structures.[15−17]In this work, we perform quantum mechanics/molecular dynamics
(QM/MM)
simulations of 2DUV spectroscopy of the folding process of the Trp-cage
peptide. Trp-cage is one of the most widely studied peptides[18] and is a convenient model system for theoretical
investigations of its fast nanosecond folding dynamics. The 2DUV technique
offers a good indicator of protein global geometry and provides an
powerful tool to probe the protein folding process.The Trp-cage
contains 20 residues with the sequence “NLYIQWLKDGG
PSSGRPPPS”. The extended conformation was built
up as the initial structure, based on which 50 trajectories with different
initial conditions were simulated at 315 K for 200 ns (details in
the Supporting Information). These trajectories
cover 10 μs simulations of protein folding, enough for building
the free-energy landscape. The free energy was calculated as F = −log(P), where P was the population obtained from all the 10 μs MD simulated
data, as shown in the inset graph of Figure 1A. 100 state points along the folding pathway on the energy landscape
denoted S1, S2, ..., S100, were selected by the clustering method.
To model the geometric fluctuations, we chose 100 subconformations
(MD snapshots) around each state point. The variations of RMSD (root
mean square deviation) and Rg (Radius
of gyration) along the folding path are shown in Figure 1B,C, respectively. Because Trp-cage folding is also a packing
process from a strand to a compact cage, the evolution of tertiary
structure can be characterized by the protein packing density.[19] We computed the packing density as the average
of the number of residue’s Cα atoms within a 9 Å
radius of the Cα atom of a given residue. The evolution of the
inverse of packing density displayed in Figure 1C is consistent with the Rg. The packing
density is closely related to the protein conformation entropy, and
its evolution suggests that Trp-cage conformation entropy decreases
along the packing (folding) process. Obviously, both the Rg and packing density (conformation entropy) are good
markers of the protein folding status. Unfortunately, these are not
readily accessible experimentally.
Figure 1
(A) From the unfolded strand to folded
cage structure of a Trp-cage
protein (PDB code: 1L2Y). The backbone trace is shown as a ribbon, and the side chains are
depicted with wires. The RMSD (B), Rg (C),
and inverse of packing density (D) along the free-energy landscape
of Trp-cage folding process (from S1 to S100). The inset in panel
B shows the free-energy landscape.
(A) From the unfolded strand to folded
cage structure of a Trp-cage
protein (PDB code: 1L2Y). The backbone trace is shown as a ribbon, and the side chains are
depicted with wires. The RMSD (B), Rg (C),
and inverse of packing density (D) along the free-energy landscape
of Trp-cage folding process (from S1 to S100). The inset in panel
B shows the free-energy landscape.Quantum mechanics
(QM) calculations were employed to compute ultraviolet
signals based on MD snapshots (details in the Supporting Information). Calculations of the excited states
by CASSCF/SCRF (the complete-active space self-consistent-field) method
has provided parameters for the transition energies of isolated peptide
bond unit (see Figure S1 in the Supporting Information), the resonant couplings, and electric and magnetic dipole moments.[20] The electronic structures and electrostatic
potential distributions of the peptide bonds and amino side chains
in proteins were computed at the density functional theory (DFT) level
B3LYP/6-311++G**. The Frenkel exciton Hamiltonian is the most practical
model to describe the photoresponse of molecular aggregates[21]where me is the e electronic transition on the mth chromophore. B̂† is the creation Pauli operator that
promotes the chromophore m into the excited state e, and B̂ is the corresponding annihilation operator. The excited-state
energy ε can be calculated as
the summation of the excited energy of the isolated chromophore (ε) and the inter- and intra- molecular
electrostatic corrections (i.e., Coulomb interactions between electric
transition dipoles and surrounding electrostatic fields)Here k runs over all atomic
or molecular sites surrounding the excited chromophore, ε is
the dielectric constant, r and r are the positions
coordinates, and ρ and ρ (ρ) represent the
ground (excited) state charge density.Meanwhile, the resonant
coupling between singly excited states me and nf is given by <0|†|0> = J where |0> is the ground state.where ρ(r) and
ρ(r) are the transition charge densities.Here
ε and J are calculated
using our exciton Hamiltonian with electrostatic fluctuations (EHEF)
algorithm[21] at a high ab initio level.
EHEF also enables us to combine the QM and MM outputs to construct
the effective exciton Hamiltonian of any MD snapshots (details in
the Supporting Information). The Hamiltonian
is applied to the response function framework based on which UV spectra
are computed using the SPECTRON code.[22] 2DUV calculations were performed for the nonchiral (xxxx, xyyx,
and xyxy) and chirality-induced (xxxy) pulse polarization configurations
(details in the Supporting Information).
Chiral 2D signals record interferences among transitions at different
parts of the whole protein and thus provide richer spectral features
compared with their nonchiral counterparts. The signals are displayed
on a nonlinear scale that interpolates between logarithmic for small
values and linear for large values, thus revealing both the strong
and weak features (details in the Supporting Information).The structures of five states, S1 (initial unfolded peptide),
S25,
S50, S75, and S100 (final folded peptide), are depicted in Figure 2A. Computed CD signals of S1, S25, S50, S75, and
S100 (averaged over 100 MD snapshots for each state) are displayed
in Figure 2B. Simulations of S100 with the
final folded structure are in a good agreement with the experimental
CD[23] of the folded Trp-cage peptide. CD
is the standard 1D spectroscopic technique widely used for identifying
protein secondary structures. CD signals reflect the variation of
secondary structural elements from state S1 to S100. The negative
feature at ∼56 000 cm–1(∼180
nm) and positive signals at ∼43 000 cm–1(∼230 nm) marked ‘RC’ are typical of a random
coil. These are seen in S1 and reversed in S75 and S100 following
the decrease in random coil shown in Figure 1D. The helix structures increase from S1 to S100, so CDs from 53 000
to 58 000 cm–1 (∼190 to 170 nm) marked
‘H’ change from negative to positive. However, because
of the large inhomogeneous broadening due to geometric fluctuations,
CD signatures of different structural elements are not well-resolved.
Figure 2
Structure
(A) and CD spectra (B) of five states (from top to bottom:S1,
S25, S50, S75, and S100) along the Trp-cage folding process. Spectra
are averaged over 100 MD snapshots for each state. We labeled CD signals
of the random coil and Helix as RC and H, respectively. Black dotted
CD curve is from experiment Exp1[23] for
the folded Trp-cage protein.
The 2D photoecho signal is strongly affected by the couplings between
electronic transitions and structural variations. 2DUV xxxx (nonchiral)
and xxxy (chiral) spectra of our chosen five states are displayed
in the right and middle columns of Figure 3, respectively. The xxxx spectra of the five states are very similar,
dominated by the negative diagonal ∼52 000 cm–1 peak accompanied by two positive side bands. In contrast, the xxxy
signals vary significantly as we move from S1 to S100. For instance,
the unfolded states S1 and S25 have negative diagonal peaks at from
48 000 to 56 000 cm–1, which are typical
for random-coil and strand-structural elements.[24] A helical structure normally produces positive diagonal
signals in that region.[24] Therefore, moving
from S50 to S75 and S100, the increased helical structure reduces
the negative signals and induces additional positive peaks at the
diagonal part.
Figure 3
From left to right: 2DUV
xxxx, xxxy, and xyyx-xyxy spectra of five
states (from top to bottom:S1, S25, S50, S75, S100) along the Trp-cage
folding process. Spectra are averaged over 100 MD snapshots for each
state. The scale bar is plotted at the right top edge, and signal
peaks are marked by white square dots.
The xxxy chiral signals also reflect the tertiary
structure. As
expected from the decrease in conformational entropy in the folded
structure, the xxxy spectral pattern becomes more compact and simple,
lowering the signal complexity. The number of peaks (marked with white
dots in Figure 2) in the xxxx and xxxy spectra
of the five states are plotted in Figure 4A.
The number of xxxx peaks remains fixed during the folding. The xxxy
peak number decreases from S1 to S50 and remains flat from S50 to
S100, implying a trend similar to the variation of Rg and the inverse of packing density. Our previous study[17] showed that the approximate entropy (ApEn)[25] provides a good measure for the complexity of
2D signals. In the inset of Figure 4B, we display
a scanning line perpendicular to the diagonal of the 2D contour map,
starting from the bottom left (lower energy) to the upper right (higher
energy) corner. The projections of the xxxy signals of state S1 and
S100 along this line are depicted in Figure 4B, showing that S1 signals have richer structures (such as more peaks)
than S100. Figure 4C depicts the variation
of ApEn values of xxxy spectra during the folding process. The xxxy
ApEn decreases considerably as we move from S1 to S50 and does not
vary from S50 to S100, consistent with the evolution of Rg and the inverse of packing density (conformational entropy)
shown in Figure 1C,D. Spectra were obtained
by averaging over 100 MD snapshots for each state point. As 2DUV signals
are becoming feasible,[12−15] protein folding can be measured by the ApEn value of 2DUV chiral
signals.
Figure 4
Evolution of the number
of 2DUV peaks (A) and ApEn (B) during the
Trp-cage folding process. Spectra are averaged over 100 MD snapshots
for each state. (C) 2DUV xxxy signal evolution curves of states S1
and S100 along the scanning line given in the inset of panel C. Purple
circles highlight the multiple-peak patterns. Spectra are averaged
over 100 MD snapshots for each state.
Chiral signals are harder to measure due to their very
weak intensities.
The technique of difference spectroscopy between two nonchiral spectra
with different polarizations can cancel the single exciton contributions,
while the correlations of transitions are retained and better resolved.
The computed 2DUV xyyx-xyxy difference spectra of our five states
are displayed in the right column of Figure 3. The signal complexity is reduced as we move from S1 to S100. This
may also be seen from the number of 2DUV xyyx-xyxy spectral peaks
(marked with white dots in Figure 3), and the
ApEn values are shown in Figure 4A,C. The change
of the complexities of difference spectra thus also provides a quantitative
measure of the decrease in protein conformational entropy during the
folding process. To check the convergence of the signal evolutions,
we have also computed 2DUV xxxy and xyyx-xyxy signals directly on
every single MD snapshot and found nearly the same evolution behaviors
for their ApEn values (see Figure S2 in the Supporting
Information) as those from averaged inhomogeneous spectra.Structure
(A) and CD spectra (B) of five states (from top to bottom:S1,
S25, S50, S75, and S100) along the Trp-cage folding process. Spectra
are averaged over 100 MD snapshots for each state. We labeled CD signals
of the random coil and Helix as RC and H, respectively. Black dotted
CD curve is from experiment Exp1[23] for
the folded Trp-cage protein.From left to right: 2DUV
xxxx, xxxy, and xyyx-xyxy spectra of five
states (from top to bottom:S1, S25, S50, S75, S100) along the Trp-cage
folding process. Spectra are averaged over 100 MD snapshots for each
state. The scale bar is plotted at the right top edge, and signal
peaks are marked by white square dots.Evolution of the number
of 2DUV peaks (A) and ApEn (B) during the
Trp-cage folding process. Spectra are averaged over 100 MD snapshots
for each state. (C) 2DUV xxxy signal evolution curves of states S1
and S100 along the scanning line given in the inset of panel C. Purple
circles highlight the multiple-peak patterns. Spectra are averaged
over 100 MD snapshots for each state.In summary, we have
employed a QM/MM protocol to simulate the 2DUV
spectra of the folding of a peptide. We demonstrated that 2DUV signals
are sensitive to the change of peptide secondary and tertiary structure
and especially useful in probing the global structural changes in
comparing with IR and Raman tools. The complexity of 2DUV spectra
of peptide backbone as measured by their ApEn is a good marker for
the conformational entropy and provides a quantitative index of folding
status with the same accuracy as the calculated RMSD and Rg values. The RMSD and Rg can
only be extracted from the known atomic structures at every time point,
which require huge computation resources for most proteins and are
not accessible to any existing experimental techniques. 2DUV, in contrast,
can offer a fast experimental measurement and theoretical verification
of the protein folding state and may thus provide new insights into
the protein evolution mechanism and function.
Authors: Darius Abramavicius; Jun Jiang; Benjamin M Bulheller; Jonathan D Hirst; Shaul Mukamel Journal: J Am Chem Soc Date: 2010-06-09 Impact factor: 15.419
Authors: Shaul Mukamel; Darius Abramavicius; Lijun Yang; Wei Zhuang; Igor V Schweigert; Dmitri V Voronine Journal: Acc Chem Res Date: 2009-04-21 Impact factor: 22.384