Edyta Małolepsza1, John E Straub. 1. Department of Chemistry, Boston University , 590 Commonwealth Avenue, Boston, Massachusetts 02215, United States.
Abstract
New sets of parameters (maps) for calculating amide I vibrational spectra for proteins through a vibrational exciton model are proposed. The maps are calculated as a function of electric field and van der Waals forces on the atoms of peptide bonds, taking into account the full interaction between peptide bonds and the surrounding environment. The maps are designed to be employed using data obtained from standard all-atom molecular simulations without any additional constraints on the system. Six proteins representing a wide range of sizes and secondary structure complexity were chosen as a test set. Spectra calculated for these proteins reproduce experimental data both qualitatively and quantitatively. The proposed maps lead to spectra that capture the weak second peak observed in proteins containing β-sheets, allowing for clear distinction between α-helical and β-sheet proteins. While the parametrization is specific to the CHARMM force field, the methodology presented can be readily applied to any empirical force field.
New sets of parameters (maps) for calculating amide I vibrational spectra for proteins through a vibrational exciton model are proposed. The maps are calculated as a function of electric field and van der Waals forces on the atoms of peptide bonds, taking into account the full interaction between peptide bonds and the surrounding environment. The maps are designed to be employed using data obtained from standard all-atom molecular simulations without any additional constraints on the system. Six proteins representing a wide range of sizes and secondary structure complexity were chosen as a test set. Spectra calculated for these proteins reproduce experimental data both qualitatively and quantitatively. The proposed maps lead to spectra that capture the weak second peak observed in proteins containing β-sheets, allowing for clear distinction between α-helical and β-sheet proteins. While the parametrization is specific to the CHARMM force field, the methodology presented can be readily applied to any empirical force field.
The
amide I vibration arises predominantly from the stretching
vibration of the carbonyl group in the peptide bond with small contributions
from CN and HN bond vibrations and CCN angle bending. This vibration
is notable due to its substantial oscillator strength and dependence
on backbone secondary structure, while being relatively insensitive
to the geometries of the amino acid’s side chain. This vibration
is often used to probe and examine the secondary structure of proteins,
both experimentally and in computational studies.[1−8]A number of observations have been made for relating amide
I vibrational
shifts to the local conformation of the peptide backbone. These “fingerprinting”
approaches are based on relative simple rules[7] for assigning the secondary structure based on an amide I vibration
spectrum. (i) α-helix has a peak typically located near 1655
cm–1, which shifts toward lower frequencies with
increasing helix length; this peak does not appear for peptides shorter
than six amino acids. (ii) β-sheet exhibits a strong peak near
1630 cm–1 and a weak peak near 1685 cm–1. (iii) Parallel β-sheets have a main peak at slightly higher
wavenumbers than antiparallel β-sheets, but the difference does
not typically allow one to distinguish between them in many experimental
studies. Detailed discussions of the amide I vibration can be found
elsewhere.[4,7,9]A number
of refined methods for computing amide I vibrational spectra
of globular proteins have been developed. These methods typically
make use of structural data for the protein derived from molecular
dynamics simulation. The models are often based on the vibrational
exciton approximation,[5,6,10−13] where the local amide I vibrations have frequencies dependent on
the environment (or isotopic labeling) and are coupled through dipole–dipole
interactions with other amide I vibrations. These techniques have
proven highly successful in the interpretation of spectra of globular
proteins in aqueous environments. Discussion of other methods used
to compute amide I spectra can be found elsewhere.[14,15]In recent years, there has been increasing interest in the
interpretation
of amide I spectra of peptide systems other than globular proteins,
including (1) smaller peptides[11,13,16−20] and (2) peptides and proteins in more heterogeneous environments.[10,21−25] For peptides in reverse micelle environments, direct interaction
with the surfactant and nonpolar phase, as well as interaction with
aqueous solvent, must be addressed.[26] For
peptides in membrane environments, there may be widely varying local
environments, including the lipid tail region of a bilayer, for transmembrane
peptides, or the headgroup region, which may include direct interactions
with mono- and multivalent ions. Methods for the calculation of amide
I spectra in these highly heterogeneous environments must account
for the substantial changes to the local amide I frequencies and their
coupling induced by the more heterogeneous solvation surroundings.This work presents an extension of the vibrational exciton based
models for the calculation of amide I vibrational spectra of peptides
and proteins designed to be applicable to peptides and proteins in
heterogeneous environments. The method is parametrized based on simulations
of a small molecule, N-methylacetamide, in a variety
of solvents, including water, DMSO, and chloroform, and applied to
the calculation of spectra for a set of proteins using the CHARMM22
force field.[27] A comparison of the resulting
spectra with experimental data suggests that the method is robust
in capturing essential features of spectra in a variety of systems.
Methods
Theory of Linear Spectroscopy
Ab
initio calculations
of IR spectra including the amide I mode contribution are computationally
demanding for large proteins making it necessary to find methodologies
that can simplify such calculations from data generated in standard
molecular dynamics (MD) simulations of a protein. One such method
employs the vibrational exciton model.[12,16] The protein
is represented by a set of oscillators, also called chromophores,
one for each peptide bond. It is assumed that the amide I modes do
not couple with other types of vibrations in the system, only to the
corresponding amide I mode of another peptide bond. The Hamiltonian
of the system is expressed in terms of creation and annihilation operators, b† and b, asThe
diagonal element, ω = (E – E)/ℏ,
is a frequency of transition between the vibrational ground and first
excited states of the i-th oscillator. This fundamental
frequency corresponds to the frequency of an oscillator unperturbed
by interactions with other oscillators. Off-diagonal elements, β, represent coupling between oscillators i and j, and μ is the fluctuating transition dipole moment of oscillator i interacting with applied electric field E⃗(t).The linear spectra were calculated as
the Fourier transform of
the correlation function of the transition dipole moments with the
phenomenologically added term e– to account for the vibrational lifetime,
according to the method presented by Jansen and Knoester[28]where T1 is the
lifetime of the singly excited states, U(t, 0) represents time evolution operator, described in more details
in Supporting Information.
Standard Model
Couplings between the i-th and j-th peptide bonds can be calculated in
the transition dipole coupling approximation.[29] The electrostatic interaction between two peptide bonds is modeled
by the interaction of the transition dipole moments, μ⃗ and μ⃗, associated with these two peptide
bonds, asThis approximation is assumed to be reasonable
if the separation between peptide bonds, r, is greater than their dimension. It should not
be applied to model interaction between the nearest neighboring peptide
bonds. In works by Jansen et al.,[11,30] Gorbunov et
al.,[6] and Torii and Tasumi,[31] couplings between the nearest neighboring peptide
bonds were calculated ab initio and represented as functions of Ramachandran
angles, ψ and ϕ.There are several methods for determining
values of the fundamental frequencies and two methods are briefly
presented below. One approach utilizes only geometrical properties
of the protein independent of the environment.(i) The fundamental
frequency is represented as a sum of a constant,
ωconst, and frequency shift, δω, reflecting
when the oxygen or nitrogen atoms from the peptide bond form a hydrogen
bond. The value of ωconst is usually chosen from
between 1655 and 1710 cm–1.[5,12,13,16,32] δω depends on the length of the hydrogen
bond and is typically taken to be −20 and −10 cm–1 for oxygen or nitrogen atoms, respectively. An isotopic
shift can also be included in the model as a constant value added
to ωconst.[22]The
second approach employs the electric field or electrostatic
potential at the peptide bond atoms and therefore introduces a dependence
on the surrounding environment.(ii) The fundamental frequency
for each peptide bond is calculated
as a function of the electrostatic potential[33−36] or electric field[19,25,36] or all of them[13] experienced by atoms of the peptide bond, which may be
approximated in terms of the partial charges of the surrounding solvent
molecules and the electrostatic potentialor the electric fieldwhere i runs over atoms of
the peptide bond, C, O, N, H, α = {x,y,z}, and j runs over
point charges surrounding the i-th peptide bond.
Coefficients c and c are determined by
fitting the frequency to the electrostatic potential or electric field
on a test system such as N-methylacetamide (NMA)
to reproduce either ab initio calculations for NMA–water clusters,
containing up to four water molecules,[36] or the experimentally measured transition frequencies for NMA. The
intercept, ω0, is often chosen to be the gas-phase
amide I frequency of NMA equal to 1717 cm–1. Therefore,
the value of the shift in the fundamental frequency, ω –
ω0, can be understood as the first order perturbation
correction to the frequency in vacuum due to interactions with solvent
molecules. This model reminds one of the Stark effect in which each
component of the electric field on each atom may have a different
proportionality coefficient. ω0 can also be treated
as an additional parameter in the fitting procedure.
Proposed Extended
Model
Work presented in this paper
is focused on improving the second method presented above through
the use of data obtained from molecular dynamics simulations. There
are five aspects to be considered in the improvement of the fits.(i) If the amide I frequency of NMA in vacuum is chosen as the reference
(i.e., unperturbed) frequency, to first order the corrections to the
frequency arise from interactions of NMA with solvent molecules. There
are two types of such nonbonded interactions, electrostatic and van
der Waals, and both are included in the frequency shift.(ii)
In existing methods, the electrostatic potential and electric
field employed in the fits are typically derived from partial charges
on selected solvent molecules surrounding the NMA molecule. Such a
field or potential does not fully represent an actual field or potential
on the NMA molecule during molecular dynamics simulations, where periodic
boundary conditions are employed and electrostatic interactions are
calculated using the Ewald summation. In this work, all fits presented
were obtained based on the total electrostatic and van der Waals forces
exerted on the atoms of peptide bonds during simulations.(iii)
The proposed method uses data obtained from standard all-atom
molecular simulations carried out with the NAMD[37] package where the SHAKE[38] algorithm
was applied to fix the length of bonds to hydrogen atoms. However,
it can be applied to results from any other program. This stands in
contrast to other approaches in which additional restrictions are
imposed in the calculation of the spectrum, such as fixing the length
of all other bonds using the LINKS[39] algorithm
(which is not implemented at this time in CHARMM or in NAMD). (Heavy
atom bonds vibrate with a period of at least 20 fs that is equal to
the shortest period of angle vibrations for C–O–H. Therefore,
constraining all bonds can take full advantage of the LINKS algorithm
only when those angles are also constrained.)(iv) NMA provides
a model of an isolated peptide bond. Results
based solely on MD simulations of NMA in water may not be representative
of more heterogeneous environments experienced by a peptide bond in
a protein. Reliable fits should also incorporate the hydrophobic surroundings
of an amino acid buried inside a protein, interactions with lipophilic
regions of a membrane, or contact with monovalent ions. This can be
accomplished by fitting data from MD simulations of NMA in water and
in solvents representing a range of dielectric constants including
DMSO and chloroform.[19](v) Finally,
the maps developed in this work are designed to highlight
the difference between the amide I frequency of an α-helix and
of a β-sheet. Such fits are especially useful in studying aggregation
processes where proteins commonly undergo changes in secondary structure.[40−43]In the next sections, new maps are presented together with
the
methodology employed in their calculation. The amide I spectra of
a test set of proteins are presented to validate the quality of the
proposed maps.
Model Parametrization for
NMA
NMA Simulations
All fits presented below were calculated
based on the results of MD simulations performed using the CHARMM22
force field and the NAMD package. A single molecule of NMA was placed
in the center of a pre-equilibrated cubic box (70 × 70 ×
70 Å3) of solvent and equilibrated at 300 K. Three
solvents were considered, water, DMSO, and chloroform. Water was represented
by the TIP3P[44] model, parameters for DMSO
were taken from the work by Strader and Feller,[45] and parameters for chloroform were derived from the work
of Cieplak et al.[46] Periodic boundary conditions
and particle mesh Ewald summation were employed, nonbonded interactions
were switched to zero over the window 10 to 12 Å, the time step
was set to 1 fs, geometries were saved every 100 fs, and all bonds
with hydrogen atoms were constrained using the SHAKE algorithm.
Force Calculations
All forces on the C, O, N, and H
atoms of NMA were computed under the same conditions used during the
production MD runs, including the same cutoffs, PME parameters, and
box size. All force vectors were aligned in such a way that the XY-plane was defined by the C, O, and N atoms of a peptide
bond with the CO bond oriented along the Y-axis.
Finally, forces were converted from kcal/mol/Å to atomic units
by a factor 0.0008432983.
Fit Formula
New maps for the fundamental
frequency
proposed in this work include not only the electric field vectors
on the atoms of the peptide bonds, but also van der Waals forces.
Initially, the fitting function contained 25 parameters, 12 for components
of electric fields, E⃗, on C, O, N, and H atoms of NMA, 12 parameters corresponding
to vectors from van der Waals forces, F⃗, and one parameter for the interceptwhere frequency
is in units of wave numbers.
Fitting Procedure
Frequency functions
were fitted to
six experimental values including positions of the peak maximum, ωexp, and full width at half-maximum, Γexp,
of the amide I frequency of NMA in water, DMSO, and chloroform. Several
forms of the penalty function were employed in the fitting process
(for details see Supporting Information).Some field or force
components make greater contributions to the
fundamental frequency than others, and their analysis can help in
identifying the most important parameters. Figure 1 shows histograms of the field and force components on the
NMA peptide bond atoms derived from MD simulations in three solvents.
We observed that on average only 9 out of 24 components made a greater
than 0.001 au contribution. The next fit included only those nine
dominant components. As the amide I frequency consists mostly of vibrations
of the carbonyl group in peptide bonds, it is not surprising that
force components in the direction of the CO bond (which defines the Y-axis) are the most dominant contributions. There is only
one other component, E⃗H, with a peak further from zero than 0.002 au. Out of 12 parameters
for van der Waals forces, only the y-components on
oxygen and nitrogen atoms appear to be significant.
Figure 1
Histograms
of field and force components on NMA peptide bond atoms
in water (left), DMSO (center), and chloroform (right). Top panel
refers to electric field components, while bottom shows van der Waals
forces. Dotted lines show positions of 0, ± 0.002 au. Only components
with absolute mean value greater than 0.001 au are shown. If the absolute
mean value is in the range 0.001 and 0.002 au, its histogram is shown
as a dashed line.
Apart from
analyzing the field and force components on NMA peptide
bond atoms, one can test the components together with corresponding
fit coefficients to determine which coefficient provides the largest
contribution to the fundamental frequency shift. (A sample of results
for some 25 parameter fits is shown in Figure 7 in Supporting Information.) In each case, a different set of
fit parameters was found to be important in determining the frequency
shift including components that were neglected based solely on the
field and force histograms. This indicates that an analysis of fit
parameters alone or forces alone might be misleading. Each set was
chosen as input for another fitting procedure and this process was
repeated until the most significant fit parameters were determined.Histograms
of field and force components on NMA peptide bond atoms
in water (left), DMSO (center), and chloroform (right). Top panel
refers to electric field components, while bottom shows van der Waals
forces. Dotted lines show positions of 0, ± 0.002 au. Only components
with absolute mean value greater than 0.001 au are shown. If the absolute
mean value is in the range 0.001 and 0.002 au, its histogram is shown
as a dashed line.
The Best
Maps
The number of variables in the fit formula led to the
generation
of many maps by a multivariable fitting procedure. The choice of the
best maps was based on reproducing amide I spectra of two proteins,
an alanine-based peptide with sequence (AAAAK)4AAAAY (referred
to as AKA5) and trpzip2 (1LE1). AKA5 consists
of only 25 residues and forms one well-defined α-helix, while
even smaller trpzip2 is an example of β-hairpin protein. AKA5 and trpzip2 make a complementary set to study the assignment
of secondary structure based on amide I vibrational spectra. Technical
details of spectra calculations are presented in the next section.The two best fits for the amide I frequency are presented in Table 2. Both fits consist of nine components.
Table 2
Coefficients of the Two Best Fits
for the Electric Field and van der Waals Force Components on Atoms
of the Peptide Bond for Use in Equation 3
ω0
Cy
Oy
Nx
Ny
Hx
Hy
Oyvdw
Nxvdw
Nyvdw
#1
1674.55
–103.6
1320.4
–632.7
3932.5
–1390.8
–1740.4
–7905.1
–858.3
#2
1674.64
–374.3
1787.4
–286.1
3643.2
–1598.1
–1412.3
–705.9
–8233.2
From the analysis of fields and forces on the atoms of NMA presented
above, only two components of van der Waals forces were found to be
significant, the y-component on the oxygen and nitrogen
atoms, Ovdw and Nvdw, respectively. Only these two
components were expected to make a significant contribution to the
frequency shift. Surprisingly, for both fits the coefficient corresponding
to the x-component of the van der Waals force on
the nitrogen atom, Nvdw, is the largest of all. However, analysis
of the frequency shift caused by this component in the case of the
NMA spectrum (presented in Figure 2) shows
that despite the value of this coefficient, its contribution is less
than 5 cm–1 for the spectrum of NMA in DMSO.
Figure 2
Histograms
of the main contributions to the NMA amide I frequency
shift in water, DMSO, and chloroform for the two best fits. Histograms
were smoothed using running averaging over 10 frames. Only contributions
greater than 1 cm–1 are shown. Colors as in Figure
1.
Histograms
of the main contributions to the NMAamide I frequency
shift in water, DMSO, and chloroform for the two best fits. Histograms
were smoothed using running averaging over 10 frames. Only contributions
greater than 1 cm–1 are shown. Colors as in Figure
1.The main contributions to the
frequency for all fits come from
three components of the electric field, O, N and H. The contribution from the carbon atom is found to be negligible
even though the amide I vibration consists primarily of the CO vibration.
Interestingly, for map #1 the van der Waals forces contribute more
than 1 cm–1 only in the case of spectra in DMSO.
For map #2 the van der Waals forces contribute more than 1 cm–1 for spectra in water and DMSO. For NMA in chloroform,
van der Waals forces make negligible contributions for both maps.Figure 3 presents the comparison of experimental
and computational amide I spectra for NMA in three considered solvents.
Calculated spectra agree well with experimental spectra for both proposed
maps.
Figure 3
Amide I spectra of NMA in water, DMSO, and chloroform. Black lines
represent experimental data, while cyan lines and magenta circles
show spectra calculated using the #1 and #2 proposed maps, respectively.
Experimental data (digitized and interpolated) were derived for NMA
in water and DMSO from Hamm et al.[16] and
for NMA in chloroform from Wang et al.[19]
Amide I spectra of NMA in water, DMSO, and chloroform. Black lines
represent experimental data, while cyan lines and magenta circles
show spectra calculated using the #1 and #2 proposed maps, respectively.
Experimental data (digitized and interpolated) were derived for NMA
in water and DMSO from Hamm et al.[16] and
for NMA in chloroform from Wang et al.[19]
Validation of the New Maps
and Spectra for Proteins
To validate the new maps, amide
I frequency spectra of four additional
proteins were calculated and compared with experimental results. For
benchmark calculations, a series of proteins were chosen including
ubiquitin (1UBQ), ribonuclease (7RSA), myoglobin (1WLA), and concavalin
(1JBC). The set of six proteins include a wide range of structures
and sizes as shown in Figure 4 from the 12-residue
β-hairpin trpzip2, through the 153-residue myoglobin (formed
almost exclusively of α-helices), to the 237-residue concavalin
(including two large β-sheet surfaces).
Figure 4
Ribbon structures of
the six proteins in the test set showing β-strands
(yellow), α-helices (cyan), and 310-helices (pink).
The structures were visualized with VMD.[47]
MD Simulations
Each peptide was placed in a pre-equilibrated
water box, neutralized, energy minimized, heated, and equilibrated
at 300 K. For AKA5, trpzip2, ubiquitin, ribonuclease, and
myoglobin, a cubic box of 70 × 70 × 70 Å3 was used, while for concavalin an orthorhombic box of size 70 ×
70 × 85 Å3 was employed. The simulations were
run under the same conditions as described for NMA.Ribbon structures of
the six proteins in the test set showing β-strands
(yellow), α-helices (cyan), and 310-helices (pink).
The structures were visualized with VMD.[47]All fields and forces were calculated
according to the protocol used for NMA with one exception. Off-diagonal
couplings between the nearest neighbor peptide bonds are taken from
Jansen’s maps.[11,30] As such, partial charges of all
backbone atoms of the neighboring residues should not contribute to
the fundamental frequencies. Therefore, those atoms were excluded
from the field and force calculations. Partial charges on all remaining
atoms of the protein were included in the calculations.
Construction
of the Hamiltonian Matrix and Transition Dipole
Vectors
Transition dipoles were constructed according to
the work by Torii and Tasumi.[31] Each vector’s
length was set to 2.73 D with the vector displaced 10° from the
CO bond in the CON plane. Its origin was located at r⃗C + 0.665 n̂CO + 0.258 n̂CN, where r⃗C is the position of the carbon atom of the peptide bond, while n̂CO and n̂CN are normalized vectors in the directions of the CO and CN
bonds, respectively. The magnitudes of all vectors are in angstroms.The fundamental frequency for each peptide bond (the diagonal values
of the Hamiltonian matrix) was calculated as a sum of three terms:
(1) the unperturbed oscillator frequency calculated from the frequency
fit, (2) the shift caused by the closest neighboring peptide bond
on the N-site (for the i-th oscillator that shift
is a function of two Ramachandran angles ϕ and ψ), and (3) the corresponding shift caused on the C-side of the peptide
bond (function of angles ϕ and
ψ). Both types of shifts were
taken from tables calculated by Jansen and co-workers.[11,30] The only exceptions were made for the fundamental frequencies of
the amide groups in the side chains and for the peptide bond associated
with proline, where, due to the lack of the hydrogen atom in the peptide
bond, the frequency fit cannot be used. Both cases have been treated
explicitly in the recent papers[19,48] using approaches that
impose constraints on all bonds, and as such could not be applied
in the present work. Hence, the amide groups located on the side chains
were not taken into account, and values of 1620 or 1670 cm–1 were used for proline. The chosen value makes no significant difference
to the final line shape.The off-diagonal couplings for the
nearest neighbors as functions
of the Ramachandran angles between adjacent peptide bonds were taken
from maps provided by Jansen.[11,30] The remaining off-diagonal
couplings were calculated in the transition dipole coupling approximation.[29]
Spectra Calculations
Spectra were
calculated according
to eq 2 with the moving window approach skipping
10 frames between windows. MD simulation data were collected every
100 fs and this time step was chosen to be the integration step in
the eq 2. To validate the 100 fs time step employed,
line shapes were also computed using a time step of 10 fs for NMA
in three solvents and the two smallest and most mobile proteins, see Supporting Information in Figure 8. The window
size was set to 256 frames. The homogeneous dephasing time T1 was set to 1.0 ps for proteins, and to 0.45
ps for NMA. Frequencies and couplings were converted from cm–1 to ps–1 using the factor 2πc before integration, where c is the speed of light.
Calculations were performed using in-house Python scripts, written
using the MDAnalysis package,[49] and can
be shared upon request.
Results
Amide I spectra calculated
using maps #1 and
#2 for proteins in the test set are presented in Figure 5 together with experimental data. Spectra were normalized
so that the maximum value was 1.
Figure 5
Comparison of the experimental and calculated
spectra for the amide
I vibration for the six test proteins. Black lines indicate experimental
data, while blue and red lines show spectra calculated using the #1
and #2 proposed maps, respectively. Experimental data (digitized and
interpolated) were derived for trpzip2 from the work of Smith and
Tokmakoff,[17] for AKA5 from Barber-Armstrong
et al.,[50] for ubiquitin, ribonuclease,
and myoglobin from Ganim et al.,[18] and
for concavalin from Karjalainen et al.[51]
Comparison of the experimental and calculated
spectra for the amide
I vibration for the six test proteins. Black lines indicate experimental
data, while blue and red lines show spectra calculated using the #1
and #2 proposed maps, respectively. Experimental data (digitized and
interpolated) were derived for trpzip2 from the work of Smith and
Tokmakoff,[17] for AKA5 from Barber-Armstrong
et al.,[50] for ubiquitin, ribonuclease,
and myoglobin from Ganim et al.,[18] and
for concavalin from Karjalainen et al.[51]The two smallest proteins in the
benchmark set, trpzip2 and AKA5, are of special interest.
As shown in Figure 4, trpzip2 consists of only
one β-hairpin, while AKA5 forms one α-helix.
Because of that substantial difference
in the secondary structures, these two peptides serve as excellent
benchmarks for methods proposed for the determination of amide I spectra.
Successful capture of the discrepancies between the amide I spectra
of α-helical and β-type proteins is a validation of the
usefulness of the fits in distinguishing between these two types of
secondary structure.Positions of the experimental main peaks
for both proteins are
separated by only 5 cm–1 with 1636 cm–1 for trpzip2[17] and 1631 cm–1 for AKA5.[50] This separation
is too small for reliable assignment of the secondary structure based
on positions of the main peaks. However, the experimental spectrum
of trpzip2 also exhibits a weak second peak near 1673 cm–1.[17] Consequently, the secondary structure
can be determined by the presence of the weak second peak in the spectrum.
For this reason, all presented fits somewhat overestimate the weak
peak in the spectrum of trpzip2.In the case of larger proteins
from the test set with more complex
tertiary structure (ubiquitin, ribonuclease, myoglobin, and concavalin)
good agreement between the experimental and the computed spectra is
observed. The positions of the main peaks are reproduced as well as
the line widths.The two proposed maps presented here do not
utilize the same set
of parameters for the contribution of van der Waals forces, but the
most important aspects of the spectra are captured in both fits. There
are three main contributions to the frequency shift. Two come from
the y-component of the electric field on the oxygen
and nitrogen atoms and with opposing sign from the y-component of the electric field on the hydrogen atom (shown in Figure 6 with black, blue, and cyan for both maps, respectively).
Similar behavior was observed for the case of the NMA molecule, especially
in water and DMSO.
Figure 6
Histograms
of the contributions to the frequency shift for six
proteins in the test set for the maps #1 and #2. Panels show contributions
from the electric field and van der Waals force components, respectively.
Histograms were smoothed using running averaging over 10 frames. Only
contributions greater than 1 cm–1 are shown. Colors
as in Figure 1.
van der Waals forces on the oxygen atom appear
only in map #2,
while map #1 utilizes the x-component on the nitrogen
atom. However, the y-component of the van der Waals
forces on the nitrogen atom makes the dominant contribution of all
van der Waals force components. For both maps, this component plays
a significant role in spectra calculations for each protein in the
benchmark set (except the smallest protein trpzip2, where only electric
field components contribute to the fundamental frequency).Histograms
of the contributions to the frequency shift for six
proteins in the test set for the maps #1 and #2. Panels show contributions
from the electric field and van der Waals force components, respectively.
Histograms were smoothed using running averaging over 10 frames. Only
contributions greater than 1 cm–1 are shown. Colors
as in Figure 1.Figures 9 and 10 in the Supporting Information present comparison of
correlations between electric and van der
Waals contributions to the fundamental frequency for both maps. For
small proteins, trpzip2 and AKA5, we observe minor correlation
between components, such as between the y-component
of the electric field on the hydrogen atom, and between the y-component of the van der Waals field on the nitrogen atom
for trpzip2, or between x-component of the electric
field on the hydrogen atom and x-component of the
van der Waals field on the nitrogen atom. However, for larger proteins
correlations become negligible and there is no example of correlation
that is present in any of the studied proteins.van der Waals
forces make smaller contributions to the fundamental
frequency shift than the electric field forces. However, this contribution
is not negligible except in the case of trpzip2. Therefore, while
van der Waals forces are relatively less important in the calculation
of spectra, maps without those components are less accurate.
Conclusions
This paper presents a new approach to the
evaluation of fundamental
vibrational frequencies and spectra using input from standard classical
molecular dynamics simulations. We have presented a revised methodology
for the calculation of infrared spectra of amide I vibrations for
proteins in solution. The spectra are calculated using the vibrational
exciton model. The couplings between oscillators corresponding to
peptide bonds are calculated in the transition dipole coupling approximation.
The fundamental frequencies are obtained using the total electric
field and van der Waals forces on atoms of all peptide bonds. Addition
of the van der Waals forces is a novel approach, as in earlier works
only the electric field was utilized. Use of van der Waals interactions
together with the electrostatic contributions improves the accuracy
and robustness of the method.To validate the new methodology
over a wide range of secondary
and tertiary structures of proteins, a set of six test proteins with
varying content of secondary and tertiary structure was tested. In
each case, the calculated spectra satisfactorily reproduced the experimental
results, which validates the quality of the proposed maps for amide
I frequency calculations and the methodology for the calculation of
spectra. Reproduction of the weak second peak in the spectra for β-sheet
rich proteins is a sensitive measure of the accuracy of any proposed
approach for the calculations of amide I spectra of proteins, and
the proposed methodology successfully addresses that challenge and
sets a standard for other proposed methods.The amide I vibration
is a strong indicator of protein structure.
Even linear spectra may be used to gain insight into the predominance
of α-helices and β-strands. The development of new approaches
to the computation of amide I spectra continues to be a highly active
field of research [see refs (5), (6), (8), (11), (13), (15−20), (25), (30−36), (51)−[54]]. The goal of this work was to develop a robust
method for computing amide I spectra for proteins in heterogeneous
environments. The method presented may be combined with the widely
used CHARMM22 force field to allow computation of IR spectra as a
function of protein conformation and solvent environment. The methodology
has an advantage over existing methods in that it does not require
additional constraints of bonds in the system under study or special
“filtering” of forces. Standard results of molecular
dynamics simulations can be used to compute the spectra in a straightforward
manner. The application of this methodology to the calculation of
spectra using molecular dynamics other than CHARMM22 should be straightforward.
Table 1
Experimental Values of Peak Maximum
and Full Width at Half Maximum in cm–1 for Amide
I Spectra of NMA in Water, DMSO, and Chloroform
Authors: Wendy Barber-Armstrong; Teraya Donaldson; Himali Wijesooriya; R A Gangani D Silva; Sean M Decatur Journal: J Am Chem Soc Date: 2004-03-03 Impact factor: 15.419
Authors: Thomas la Cour Jansen; Arend G Dijkstra; Tim M Watson; Jonathan D Hirst; Jasper Knoester Journal: J Chem Phys Date: 2006-07-28 Impact factor: 3.488
Authors: B R Brooks; C L Brooks; A D Mackerell; L Nilsson; R J Petrella; B Roux; Y Won; G Archontis; C Bartels; S Boresch; A Caflisch; L Caves; Q Cui; A R Dinner; M Feig; S Fischer; J Gao; M Hodoscek; W Im; K Kuczera; T Lazaridis; J Ma; V Ovchinnikov; E Paci; R W Pastor; C B Post; J Z Pu; M Schaefer; B Tidor; R M Venable; H L Woodcock; X Wu; W Yang; D M York; M Karplus Journal: J Comput Chem Date: 2009-07-30 Impact factor: 3.376
Authors: Carlos R Baiz; Bartosz Błasiak; Jens Bredenbeck; Minhaeng Cho; Jun-Ho Choi; Steven A Corcelli; Arend G Dijkstra; Chi-Jui Feng; Sean Garrett-Roe; Nien-Hui Ge; Magnus W D Hanson-Heine; Jonathan D Hirst; Thomas L C Jansen; Kijeong Kwac; Kevin J Kubarych; Casey H Londergan; Hiroaki Maekawa; Mike Reppert; Shinji Saito; Santanu Roy; James L Skinner; Gerhard Stock; John E Straub; Megan C Thielges; Keisuke Tominaga; Andrei Tokmakoff; Hajime Torii; Lu Wang; Lauren J Webb; Martin T Zanni Journal: Chem Rev Date: 2020-06-29 Impact factor: 60.622
Authors: Anna Victoria Martinez; Edyta Małolepsza; Laura Domínguez; Qing Lu; John E Straub Journal: J Phys Chem B Date: 2014-11-06 Impact factor: 2.991
Authors: Ana V Cunha; Evgeniia Salamatova; Robbert Bloem; Steven J Roeters; Sander Woutersen; Maxim S Pshenichnikov; Thomas L C Jansen Journal: J Phys Chem Lett Date: 2017-05-19 Impact factor: 6.475
Authors: Eric A Muller; Thomas P Gray; Zhou Zhou; Xinbin Cheng; Omar Khatib; Hans A Bechtel; Markus B Raschke Journal: Proc Natl Acad Sci U S A Date: 2020-03-13 Impact factor: 11.205