Literature DB >> 24654732

Empirical maps for the calculation of amide I vibrational spectra of proteins from classical molecular dynamics simulations.

Abstract

New sets of parameters (maps) for calculating amide I vibrational spectra for proteins through a vibrational exciton model are proposed. The maps are calculated as a function of electric field and van der Waals forces on the atoms of peptide bonds, taking into account the full interaction between peptide bonds and the surrounding environment. The maps are designed to be employed using data obtained from standard all-atom molecular simulations without any additional constraints on the system. Six proteins representing a wide range of sizes and secondary structure complexity were chosen as a test set. Spectra calculated for these proteins reproduce experimental data both qualitatively and quantitatively. The proposed maps lead to spectra that capture the weak second peak observed in proteins containing β-sheets, allowing for clear distinction between α-helical and β-sheet proteins. While the parametrization is specific to the CHARMM force field, the methodology presented can be readily applied to any empirical force field.

Entities: Chemical Disease Gene

Mesh：

Substances：
Amides
Proteins

Year: 2014 PMID： 24654732 PMCID： PMC4317051 DOI： 10.1021/jp412827s

Source DB: PubMed Journal: J Phys Chem B ISSN： 1520-5207 Impact factor: 2.991

Introduction

The amide I vibration arises predominantly from the stretching vibration of the carbonyl group in the peptide bond with small contributions from CN and HN bond vibrations and CCN angle bending. This vibration is notable due to its substantial oscillator strength and dependence on backbone secondary structure, while being relatively insensitive to the geometries of the amino acid’s side chain. This vibration is often used to probe and examine the secondary structure of proteins, both experimentally and in computational studies.[1−8] A number of observations have been made for relating amide I vibrational shifts to the local conformation of the peptide backbone. These “fingerprinting” approaches are based on relative simple rules[7] for assigning the secondary structure based on an amide I vibration spectrum. (i) α-helix has a peak typically located near 1655 cm–1, which shifts toward lower frequencies with increasing helix length; this peak does not appear for peptides shorter than six amino acids. (ii) β-sheet exhibits a strong peak near 1630 cm–1 and a weak peak near 1685 cm–1. (iii) Parallel β-sheets have a main peak at slightly higher wavenumbers than antiparallel β-sheets, but the difference does not typically allow one to distinguish between them in many experimental studies. Detailed discussions of the amide I vibration can be found elsewhere.[4,7,9] A number of refined methods for computing amide I vibrational spectra of globular proteins have been developed. These methods typically make use of structural data for the protein derived from molecular dynamics simulation. The models are often based on the vibrational exciton approximation,[5,6,10−13] where the local amide I vibrations have frequencies dependent on the environment (or isotopic labeling) and are coupled through dipole–dipole interactions with other amide I vibrations. These techniques have proven highly successful in the interpretation of spectra of globular proteins in aqueous environments. Discussion of other methods used to compute amide I spectra can be found elsewhere.[14,15] In recent years, there has been increasing interest in the interpretation of amide I spectra of peptide systems other than globular proteins, including (1) smaller peptides[11,13,16−20] and (2) peptides and proteins in more heterogeneous environments.[10,21−25] For peptides in reverse micelle environments, direct interaction with the surfactant and nonpolar phase, as well as interaction with aqueous solvent, must be addressed.[26] For peptides in membrane environments, there may be widely varying local environments, including the lipid tail region of a bilayer, for transmembrane peptides, or the headgroup region, which may include direct interactions with mono- and multivalent ions. Methods for the calculation of amide I spectra in these highly heterogeneous environments must account for the substantial changes to the local amide I frequencies and their coupling induced by the more heterogeneous solvation surroundings. This work presents an extension of the vibrational exciton based models for the calculation of amide I vibrational spectra of peptides and proteins designed to be applicable to peptides and proteins in heterogeneous environments. The method is parametrized based on simulations of a small molecule, N-methylacetamide, in a variety of solvents, including water, DMSO, and chloroform, and applied to the calculation of spectra for a set of proteins using the CHARMM22 force field.[27] A comparison of the resulting spectra with experimental data suggests that the method is robust in capturing essential features of spectra in a variety of systems.

Methods

Theory of Linear Spectroscopy

Ab initio calculations of IR spectra including the amide I mode contribution are computationally demanding for large proteins making it necessary to find methodologies that can simplify such calculations from data generated in standard molecular dynamics (MD) simulations of a protein. One such method employs the vibrational exciton model.[12,16] The protein is represented by a set of oscillators, also called chromophores, one for each peptide bond. It is assumed that the amide I modes do not couple with other types of vibrations in the system, only to the corresponding amide I mode of another peptide bond. The Hamiltonian of the system is expressed in terms of creation and annihilation operators, b† and b, as The diagonal element, ω = (E – E)/ℏ, is a frequency of transition between the vibrational ground and first excited states of the i-th oscillator. This fundamental frequency corresponds to the frequency of an oscillator unperturbed by interactions with other oscillators. Off-diagonal elements, β, represent coupling between oscillators i and j, and μ is the fluctuating transition dipole moment of oscillator i interacting with applied electric field E⃗(t). The linear spectra were calculated as the Fourier transform of the correlation function of the transition dipole moments with the phenomenologically added term e– to account for the vibrational lifetime, according to the method presented by Jansen and Knoester[28]where T1 is the lifetime of the singly excited states, U(t, 0) represents time evolution operator, described in more details in Supporting Information.

Standard Model

Couplings between the i-th and j-th peptide bonds can be calculated in the transition dipole coupling approximation.[29] The electrostatic interaction between two peptide bonds is modeled by the interaction of the transition dipole moments, μ⃗ and μ⃗, associated with these two peptide bonds, asThis approximation is assumed to be reasonable if the separation between peptide bonds, r, is greater than their dimension. It should not be applied to model interaction between the nearest neighboring peptide bonds. In works by Jansen et al.,[11,30] Gorbunov et al.,[6] and Torii and Tasumi,[31] couplings between the nearest neighboring peptide bonds were calculated ab initio and represented as functions of Ramachandran angles, ψ and ϕ. There are several methods for determining values of the fundamental frequencies and two methods are briefly presented below. One approach utilizes only geometrical properties of the protein independent of the environment. (i) The fundamental frequency is represented as a sum of a constant, ωconst, and frequency shift, δω, reflecting when the oxygen or nitrogen atoms from the peptide bond form a hydrogen bond. The value of ωconst is usually chosen from between 1655 and 1710 cm–1.[5,12,13,16,32] δω depends on the length of the hydrogen bond and is typically taken to be −20 and −10 cm–1 for oxygen or nitrogen atoms, respectively. An isotopic shift can also be included in the model as a constant value added to ωconst.[22] The second approach employs the electric field or electrostatic potential at the peptide bond atoms and therefore introduces a dependence on the surrounding environment. (ii) The fundamental frequency for each peptide bond is calculated as a function of the electrostatic potential[33−36] or electric field[19,25,36] or all of them[13] experienced by atoms of the peptide bond, which may be approximated in terms of the partial charges of the surrounding solvent molecules and the electrostatic potentialor the electric fieldwhere i runs over atoms of the peptide bond, C, O, N, H, α = {x,y,z}, and j runs over point charges surrounding the i-th peptide bond. Coefficients c and c are determined by fitting the frequency to the electrostatic potential or electric field on a test system such as N-methylacetamide (NMA) to reproduce either ab initio calculations for NMA–water clusters, containing up to four water molecules,[36] or the experimentally measured transition frequencies for NMA. The intercept, ω0, is often chosen to be the gas-phase amide I frequency of NMA equal to 1717 cm–1. Therefore, the value of the shift in the fundamental frequency, ω – ω0, can be understood as the first order perturbation correction to the frequency in vacuum due to interactions with solvent molecules. This model reminds one of the Stark effect in which each component of the electric field on each atom may have a different proportionality coefficient. ω0 can also be treated as an additional parameter in the fitting procedure.

Proposed Extended Model

Work presented in this paper is focused on improving the second method presented above through the use of data obtained from molecular dynamics simulations. There are five aspects to be considered in the improvement of the fits. (i) If the amide I frequency of NMA in vacuum is chosen as the reference (i.e., unperturbed) frequency, to first order the corrections to the frequency arise from interactions of NMA with solvent molecules. There are two types of such nonbonded interactions, electrostatic and van der Waals, and both are included in the frequency shift. (ii) In existing methods, the electrostatic potential and electric field employed in the fits are typically derived from partial charges on selected solvent molecules surrounding the NMA molecule. Such a field or potential does not fully represent an actual field or potential on the NMA molecule during molecular dynamics simulations, where periodic boundary conditions are employed and electrostatic interactions are calculated using the Ewald summation. In this work, all fits presented were obtained based on the total electrostatic and van der Waals forces exerted on the atoms of peptide bonds during simulations. (iii) The proposed method uses data obtained from standard all-atom molecular simulations carried out with the NAMD[37] package where the SHAKE[38] algorithm was applied to fix the length of bonds to hydrogen atoms. However, it can be applied to results from any other program. This stands in contrast to other approaches in which additional restrictions are imposed in the calculation of the spectrum, such as fixing the length of all other bonds using the LINKS[39] algorithm (which is not implemented at this time in CHARMM or in NAMD). (Heavy atom bonds vibrate with a period of at least 20 fs that is equal to the shortest period of angle vibrations for C–O–H. Therefore, constraining all bonds can take full advantage of the LINKS algorithm only when those angles are also constrained.) (iv) NMA provides a model of an isolated peptide bond. Results based solely on MD simulations of NMA in water may not be representative of more heterogeneous environments experienced by a peptide bond in a protein. Reliable fits should also incorporate the hydrophobic surroundings of an amino acid buried inside a protein, interactions with lipophilic regions of a membrane, or contact with monovalent ions. This can be accomplished by fitting data from MD simulations of NMA in water and in solvents representing a range of dielectric constants including DMSO and chloroform.[19] (v) Finally, the maps developed in this work are designed to highlight the difference between the amide I frequency of an α-helix and of a β-sheet. Such fits are especially useful in studying aggregation processes where proteins commonly undergo changes in secondary structure.[40−43] In the next sections, new maps are presented together with the methodology employed in their calculation. The amide I spectra of a test set of proteins are presented to validate the quality of the proposed maps.

Model Parametrization for NMA

NMA Simulations

All fits presented below were calculated based on the results of MD simulations performed using the CHARMM22 force field and the NAMD package. A single molecule of NMA was placed in the center of a pre-equilibrated cubic box (70 × 70 × 70 Å3) of solvent and equilibrated at 300 K. Three solvents were considered, water, DMSO, and chloroform. Water was represented by the TIP3P[44] model, parameters for DMSO were taken from the work by Strader and Feller,[45] and parameters for chloroform were derived from the work of Cieplak et al.[46] Periodic boundary conditions and particle mesh Ewald summation were employed, nonbonded interactions were switched to zero over the window 10 to 12 Å, the time step was set to 1 fs, geometries were saved every 100 fs, and all bonds with hydrogen atoms were constrained using the SHAKE algorithm.

Force Calculations

All forces on the C, O, N, and H atoms of NMA were computed under the same conditions used during the production MD runs, including the same cutoffs, PME parameters, and box size. All force vectors were aligned in such a way that the XY-plane was defined by the C, O, and N atoms of a peptide bond with the CO bond oriented along the Y-axis. Finally, forces were converted from kcal/mol/Å to atomic units by a factor 0.0008432983.

Fit Formula

New maps for the fundamental frequency proposed in this work include not only the electric field vectors on the atoms of the peptide bonds, but also van der Waals forces. Initially, the fitting function contained 25 parameters, 12 for components of electric fields, E⃗, on C, O, N, and H atoms of NMA, 12 parameters corresponding to vectors from van der Waals forces, F⃗, and one parameter for the interceptwhere frequency is in units of wave numbers.

Fitting Procedure

Frequency functions were fitted to six experimental values including positions of the peak maximum, ωexp, and full width at half-maximum, Γexp, of the amide I frequency of NMA in water, DMSO, and chloroform. Several forms of the penalty function were employed in the fitting process (for details see Supporting Information). Some field or force components make greater contributions to the fundamental frequency than others, and their analysis can help in identifying the most important parameters. Figure 1 shows histograms of the field and force components on the NMA peptide bond atoms derived from MD simulations in three solvents. We observed that on average only 9 out of 24 components made a greater than 0.001 au contribution. The next fit included only those nine dominant components. As the amide I frequency consists mostly of vibrations of the carbonyl group in peptide bonds, it is not surprising that force components in the direction of the CO bond (which defines the Y-axis) are the most dominant contributions. There is only one other component, E⃗H, with a peak further from zero than 0.002 au. Out of 12 parameters for van der Waals forces, only the y-components on oxygen and nitrogen atoms appear to be significant.

Figure 1

Histograms of field and force components on NMA peptide bond atoms in water (left), DMSO (center), and chloroform (right). Top panel refers to electric field components, while bottom shows van der Waals forces. Dotted lines show positions of 0, ± 0.002 au. Only components with absolute mean value greater than 0.001 au are shown. If the absolute mean value is in the range 0.001 and 0.002 au, its histogram is shown as a dashed line.

Apart from analyzing the field and force components on NMA peptide bond atoms, one can test the components together with corresponding fit coefficients to determine which coefficient provides the largest contribution to the fundamental frequency shift. (A sample of results for some 25 parameter fits is shown in Figure 7 in Supporting Information.) In each case, a different set of fit parameters was found to be important in determining the frequency shift including components that were neglected based solely on the field and force histograms. This indicates that an analysis of fit parameters alone or forces alone might be misleading. Each set was chosen as input for another fitting procedure and this process was repeated until the most significant fit parameters were determined. Histograms of field and force components on NMA peptide bond atoms in water (left), DMSO (center), and chloroform (right). Top panel refers to electric field components, while bottom shows van der Waals forces. Dotted lines show positions of 0, ± 0.002 au. Only components with absolute mean value greater than 0.001 au are shown. If the absolute mean value is in the range 0.001 and 0.002 au, its histogram is shown as a dashed line.

The Best Maps

The number of variables in the fit formula led to the generation of many maps by a multivariable fitting procedure. The choice of the best maps was based on reproducing amide I spectra of two proteins, an alanine-based peptide with sequence (AAAAK)4AAAAY (referred to as AKA5) and trpzip2 (1LE1). AKA5 consists of only 25 residues and forms one well-defined α-helix, while even smaller trpzip2 is an example of β-hairpin protein. AKA5 and trpzip2 make a complementary set to study the assignment of secondary structure based on amide I vibrational spectra. Technical details of spectra calculations are presented in the next section. The two best fits for the amide I frequency are presented in Table 2. Both fits consist of nine components.

Table 2

Coefficients of the Two Best Fits for the Electric Field and van der Waals Force Components on Atoms of the Peptide Bond for Use in Equation 3

	ω₀	C_y	O_y	N_x	N_y	H_x	H_y	O_y^vdw	N_x^vdw	N_y^vdw
#1	1674.55	–103.6	1320.4	–632.7	3932.5	–1390.8	–1740.4		–7905.1	–858.3
#2	1674.64	–374.3	1787.4	–286.1	3643.2	–1598.1	–1412.3	–705.9	–8233.2

From the analysis of fields and forces on the atoms of NMA presented above, only two components of van der Waals forces were found to be significant, the y-component on the oxygen and nitrogen atoms, Ovdw and Nvdw, respectively. Only these two components were expected to make a significant contribution to the frequency shift. Surprisingly, for both fits the coefficient corresponding to the x-component of the van der Waals force on the nitrogen atom, Nvdw, is the largest of all. However, analysis of the frequency shift caused by this component in the case of the NMA spectrum (presented in Figure 2) shows that despite the value of this coefficient, its contribution is less than 5 cm–1 for the spectrum of NMA in DMSO.

Figure 2

Histograms of the main contributions to the NMA amide I frequency shift in water, DMSO, and chloroform for the two best fits. Histograms were smoothed using running averaging over 10 frames. Only contributions greater than 1 cm–1 are shown. Colors as in Figure 1. The main contributions to the frequency for all fits come from three components of the electric field, O, N and H. The contribution from the carbon atom is found to be negligible even though the amide I vibration consists primarily of the CO vibration. Interestingly, for map #1 the van der Waals forces contribute more than 1 cm–1 only in the case of spectra in DMSO. For map #2 the van der Waals forces contribute more than 1 cm–1 for spectra in water and DMSO. For NMA in chloroform, van der Waals forces make negligible contributions for both maps. Figure 3 presents the comparison of experimental and computational amide I spectra for NMA in three considered solvents. Calculated spectra agree well with experimental spectra for both proposed maps.

Figure 3

Amide I spectra of NMA in water, DMSO, and chloroform. Black lines represent experimental data, while cyan lines and magenta circles show spectra calculated using the #1 and #2 proposed maps, respectively. Experimental data (digitized and interpolated) were derived for NMA in water and DMSO from Hamm et al.[16] and for NMA in chloroform from Wang et al.[19]

Validation of the New Maps and Spectra for Proteins

To validate the new maps, amide I frequency spectra of four additional proteins were calculated and compared with experimental results. For benchmark calculations, a series of proteins were chosen including ubiquitin (1UBQ), ribonuclease (7RSA), myoglobin (1WLA), and concavalin (1JBC). The set of six proteins include a wide range of structures and sizes as shown in Figure 4 from the 12-residue β-hairpin trpzip2, through the 153-residue myoglobin (formed almost exclusively of α-helices), to the 237-residue concavalin (including two large β-sheet surfaces).

Figure 4

Ribbon structures of the six proteins in the test set showing β-strands (yellow), α-helices (cyan), and 310-helices (pink). The structures were visualized with VMD.[47]

MD Simulations

Each peptide was placed in a pre-equilibrated water box, neutralized, energy minimized, heated, and equilibrated at 300 K. For AKA5, trpzip2, ubiquitin, ribonuclease, and myoglobin, a cubic box of 70 × 70 × 70 Å3 was used, while for concavalin an orthorhombic box of size 70 × 70 × 85 Å3 was employed. The simulations were run under the same conditions as described for NMA. Ribbon structures of the six proteins in the test set showing β-strands (yellow), α-helices (cyan), and 310-helices (pink). The structures were visualized with VMD.[47] All fields and forces were calculated according to the protocol used for NMA with one exception. Off-diagonal couplings between the nearest neighbor peptide bonds are taken from Jansen’s maps.[11,30] As such, partial charges of all backbone atoms of the neighboring residues should not contribute to the fundamental frequencies. Therefore, those atoms were excluded from the field and force calculations. Partial charges on all remaining atoms of the protein were included in the calculations.

Construction of the Hamiltonian Matrix and Transition Dipole Vectors

Transition dipoles were constructed according to the work by Torii and Tasumi.[31] Each vector’s length was set to 2.73 D with the vector displaced 10° from the CO bond in the CON plane. Its origin was located at r⃗C + 0.665 n̂CO + 0.258 n̂CN, where r⃗C is the position of the carbon atom of the peptide bond, while n̂CO and n̂CN are normalized vectors in the directions of the CO and CN bonds, respectively. The magnitudes of all vectors are in angstroms. The fundamental frequency for each peptide bond (the diagonal values of the Hamiltonian matrix) was calculated as a sum of three terms: (1) the unperturbed oscillator frequency calculated from the frequency fit, (2) the shift caused by the closest neighboring peptide bond on the N-site (for the i-th oscillator that shift is a function of two Ramachandran angles ϕ and ψ), and (3) the corresponding shift caused on the C-side of the peptide bond (function of angles ϕ and ψ). Both types of shifts were taken from tables calculated by Jansen and co-workers.[11,30] The only exceptions were made for the fundamental frequencies of the amide groups in the side chains and for the peptide bond associated with proline, where, due to the lack of the hydrogen atom in the peptide bond, the frequency fit cannot be used. Both cases have been treated explicitly in the recent papers[19,48] using approaches that impose constraints on all bonds, and as such could not be applied in the present work. Hence, the amide groups located on the side chains were not taken into account, and values of 1620 or 1670 cm–1 were used for proline. The chosen value makes no significant difference to the final line shape. The off-diagonal couplings for the nearest neighbors as functions of the Ramachandran angles between adjacent peptide bonds were taken from maps provided by Jansen.[11,30] The remaining off-diagonal couplings were calculated in the transition dipole coupling approximation.[29]

Spectra Calculations

Spectra were calculated according to eq 2 with the moving window approach skipping 10 frames between windows. MD simulation data were collected every 100 fs and this time step was chosen to be the integration step in the eq 2. To validate the 100 fs time step employed, line shapes were also computed using a time step of 10 fs for NMA in three solvents and the two smallest and most mobile proteins, see Supporting Information in Figure 8. The window size was set to 256 frames. The homogeneous dephasing time T1 was set to 1.0 ps for proteins, and to 0.45 ps for NMA. Frequencies and couplings were converted from cm–1 to ps–1 using the factor 2πc before integration, where c is the speed of light. Calculations were performed using in-house Python scripts, written using the MDAnalysis package,[49] and can be shared upon request.

Results

Amide I spectra calculated using maps #1 and #2 for proteins in the test set are presented in Figure 5 together with experimental data. Spectra were normalized so that the maximum value was 1.

Figure 5

Comparison of the experimental and calculated spectra for the amide I vibration for the six test proteins. Black lines indicate experimental data, while blue and red lines show spectra calculated using the #1 and #2 proposed maps, respectively. Experimental data (digitized and interpolated) were derived for trpzip2 from the work of Smith and Tokmakoff,[17] for AKA5 from Barber-Armstrong et al.,[50] for ubiquitin, ribonuclease, and myoglobin from Ganim et al.,[18] and for concavalin from Karjalainen et al.[51] The two smallest proteins in the benchmark set, trpzip2 and AKA5, are of special interest. As shown in Figure 4, trpzip2 consists of only one β-hairpin, while AKA5 forms one α-helix. Because of that substantial difference in the secondary structures, these two peptides serve as excellent benchmarks for methods proposed for the determination of amide I spectra. Successful capture of the discrepancies between the amide I spectra of α-helical and β-type proteins is a validation of the usefulness of the fits in distinguishing between these two types of secondary structure. Positions of the experimental main peaks for both proteins are separated by only 5 cm–1 with 1636 cm–1 for trpzip2[17] and 1631 cm–1 for AKA5.[50] This separation is too small for reliable assignment of the secondary structure based on positions of the main peaks. However, the experimental spectrum of trpzip2 also exhibits a weak second peak near 1673 cm–1.[17] Consequently, the secondary structure can be determined by the presence of the weak second peak in the spectrum. For this reason, all presented fits somewhat overestimate the weak peak in the spectrum of trpzip2. In the case of larger proteins from the test set with more complex tertiary structure (ubiquitin, ribonuclease, myoglobin, and concavalin) good agreement between the experimental and the computed spectra is observed. The positions of the main peaks are reproduced as well as the line widths. The two proposed maps presented here do not utilize the same set of parameters for the contribution of van der Waals forces, but the most important aspects of the spectra are captured in both fits. There are three main contributions to the frequency shift. Two come from the y-component of the electric field on the oxygen and nitrogen atoms and with opposing sign from the y-component of the electric field on the hydrogen atom (shown in Figure 6 with black, blue, and cyan for both maps, respectively). Similar behavior was observed for the case of the NMA molecule, especially in water and DMSO.

Figure 6

Histograms of the contributions to the frequency shift for six proteins in the test set for the maps #1 and #2. Panels show contributions from the electric field and van der Waals force components, respectively. Histograms were smoothed using running averaging over 10 frames. Only contributions greater than 1 cm–1 are shown. Colors as in Figure 1.

van der Waals forces on the oxygen atom appear only in map #2, while map #1 utilizes the x-component on the nitrogen atom. However, the y-component of the van der Waals forces on the nitrogen atom makes the dominant contribution of all van der Waals force components. For both maps, this component plays a significant role in spectra calculations for each protein in the benchmark set (except the smallest protein trpzip2, where only electric field components contribute to the fundamental frequency). Histograms of the contributions to the frequency shift for six proteins in the test set for the maps #1 and #2. Panels show contributions from the electric field and van der Waals force components, respectively. Histograms were smoothed using running averaging over 10 frames. Only contributions greater than 1 cm–1 are shown. Colors as in Figure 1. Figures 9 and 10 in the Supporting Information present comparison of correlations between electric and van der Waals contributions to the fundamental frequency for both maps. For small proteins, trpzip2 and AKA5, we observe minor correlation between components, such as between the y-component of the electric field on the hydrogen atom, and between the y-component of the van der Waals field on the nitrogen atom for trpzip2, or between x-component of the electric field on the hydrogen atom and x-component of the van der Waals field on the nitrogen atom. However, for larger proteins correlations become negligible and there is no example of correlation that is present in any of the studied proteins. van der Waals forces make smaller contributions to the fundamental frequency shift than the electric field forces. However, this contribution is not negligible except in the case of trpzip2. Therefore, while van der Waals forces are relatively less important in the calculation of spectra, maps without those components are less accurate.

Conclusions

This paper presents a new approach to the evaluation of fundamental vibrational frequencies and spectra using input from standard classical molecular dynamics simulations. We have presented a revised methodology for the calculation of infrared spectra of amide I vibrations for proteins in solution. The spectra are calculated using the vibrational exciton model. The couplings between oscillators corresponding to peptide bonds are calculated in the transition dipole coupling approximation. The fundamental frequencies are obtained using the total electric field and van der Waals forces on atoms of all peptide bonds. Addition of the van der Waals forces is a novel approach, as in earlier works only the electric field was utilized. Use of van der Waals interactions together with the electrostatic contributions improves the accuracy and robustness of the method. To validate the new methodology over a wide range of secondary and tertiary structures of proteins, a set of six test proteins with varying content of secondary and tertiary structure was tested. In each case, the calculated spectra satisfactorily reproduced the experimental results, which validates the quality of the proposed maps for amide I frequency calculations and the methodology for the calculation of spectra. Reproduction of the weak second peak in the spectra for β-sheet rich proteins is a sensitive measure of the accuracy of any proposed approach for the calculations of amide I spectra of proteins, and the proposed methodology successfully addresses that challenge and sets a standard for other proposed methods. The amide I vibration is a strong indicator of protein structure. Even linear spectra may be used to gain insight into the predominance of α-helices and β-strands. The development of new approaches to the computation of amide I spectra continues to be a highly active field of research [see refs (5), (6), (8), (11), (13), (15−20), (25), (30−36), (51)−[54]]. The goal of this work was to develop a robust method for computing amide I spectra for proteins in heterogeneous environments. The method presented may be combined with the widely used CHARMM22 force field to allow computation of IR spectra as a function of protein conformation and solvent environment. The methodology has an advantage over existing methods in that it does not require additional constraints of bonds in the system under study or special “filtering” of forces. Standard results of molecular dynamics simulations can be used to compute the spectra in a straightforward manner. The application of this methodology to the calculation of spectra using molecular dynamics other than CHARMM22 should be straightforward.

Table 1

Experimental Values of Peak Maximum and Full Width at Half Maximum in cm–1 for Amide I Spectra of NMA in Water, DMSO, and Chloroform

	ω_exp	Γ_exp
water[16]	1624.0	30.5
DMSO[16]	1659.5	25.0
chloroform[19]	1665.0	26.0

38 in total

1. Signatures of beta-sheet secondary structures in linear and two-dimensional infrared spectroscopy.

Authors: Christopher M Cheatum; Andrei Tokmakoff; Jasper Knoester
Journal: J Chem Phys Date: 2004-05-01 Impact factor: 3.488

2. Empirical relationships between isotope-edited IR spectra and helix geometry in model peptides.

Authors: Wendy Barber-Armstrong; Teraya Donaldson; Himali Wijesooriya; R A Gangani D Silva; Sean M Decatur
Journal: J Am Chem Soc Date: 2004-03-03 Impact factor: 15.419

3. Simulated IR, isotropic and anisotropic Raman, and vibrational circular dichroism amide I band profiles of stacked β-sheets.

Authors: Reinhard Schweitzer-Stenner
Journal: J Phys Chem B Date: 2012-03-30 Impact factor: 2.991

4. Optimization of model parameters for describing the amide I spectrum of a large set of proteins.

Authors: Eeva-Liisa Karjalainen; Tore Ersmark; Andreas Barth
Journal: J Phys Chem B Date: 2012-04-13 Impact factor: 2.991

5. Modeling the amide I bands of small peptides.

Authors: Thomas la Cour Jansen; Arend G Dijkstra; Tim M Watson; Jonathan D Hirst; Jasper Knoester
Journal: J Chem Phys Date: 2006-07-28 Impact factor: 3.488

Review 6. CHARMM: the biomolecular simulation program.

Authors: B R Brooks; C L Brooks; A D Mackerell; L Nilsson; R J Petrella; B Roux; Y Won; G Archontis; C Bartels; S Boresch; A Caflisch; L Caves; Q Cui; A R Dinner; M Feig; S Fischer; J Gao; M Hodoscek; W Im; K Kuczera; T Lazaridis; J Ma; V Ovchinnikov; E Paci; R W Pastor; C B Post; J Z Pu; M Schaefer; B Tidor; R M Venable; H L Woodcock; X Wu; W Yang; D M York; M Karplus
Journal: J Comput Chem Date: 2009-07-30 Impact factor: 3.376

Review 7. What vibrations tell us about proteins.

Authors: Andreas Barth; Christian Zscherp
Journal: Q Rev Biophys Date: 2002-11 Impact factor: 5.318

8. Ab initio-based exciton model of amide I vibrations in peptides: definition, conformational dependence, and transferability.

Authors: Roman D Gorbunov; Daniil S Kosov; Gerhard Stock
Journal: J Chem Phys Date: 2005-06-08 Impact factor: 3.488

9. MDAnalysis: a toolkit for the analysis of molecular dynamics simulations.

Authors: Naveen Michaud-Agrawal; Elizabeth J Denning; Thomas B Woolf; Oliver Beckstein
Journal: J Comput Chem Date: 2011-04-15 Impact factor: 3.376

Review 10. Empirical amide I vibrational frequency map: application to 2D-IR line shapes for isotope-edited membrane peptide bundles.

Authors: Y-S Lin; J M Shorb; P Mukherjee; M T Zanni; J L Skinner
Journal: J Phys Chem B Date: 2009-01-22 Impact factor: 2.991

8 in total

Review 1. Vibrational Spectroscopic Map, Vibrational Spectroscopy, and Intermolecular Interaction.

Authors: Carlos R Baiz; Bartosz Błasiak; Jens Bredenbeck; Minhaeng Cho; Jun-Ho Choi; Steven A Corcelli; Arend G Dijkstra; Chi-Jui Feng; Sean Garrett-Roe; Nien-Hui Ge; Magnus W D Hanson-Heine; Jonathan D Hirst; Thomas L C Jansen; Kijeong Kwac; Kevin J Kubarych; Casey H Londergan; Hiroaki Maekawa; Mike Reppert; Shinji Saito; Santanu Roy; James L Skinner; Gerhard Stock; John E Straub; Megan C Thielges; Keisuke Tominaga; Andrei Tokmakoff; Hajime Torii; Lu Wang; Lauren J Webb; Martin T Zanni
Journal: Chem Rev Date: 2020-06-29 Impact factor: 60.622

2. Assessment of amide I spectroscopic maps for a gas-phase peptide using IR-UV double-resonance spectroscopy and density functional theory calculations.

Authors: J K Carr; A V Zabuga; S Roy; T R Rizzo; J L Skinner
Journal: J Chem Phys Date: 2014-06-14 Impact factor: 3.488

3. Solvent-Independent Anharmonicity for Carbonyl Oscillators.

Authors: Samuel H Schneider; Huong T Kratochvil; Martin T Zanni; Steven G Boxer
Journal: J Phys Chem B Date: 2017-03-08 Impact factor: 2.991

4. Origin of thiocyanate spectral shifts in water and organic solvents.

Authors: Ruoqi Zhao; Joseph C Shirley; Euihyun Lee; Adam Grofe; Hui Li; Carlos R Baiz; Jiali Gao
Journal: J Chem Phys Date: 2022-03-14 Impact factor: 3.488

5. Exploring the role of hydration and confinement in the aggregation of amyloidogenic peptides Aβ(16-22) and Sup35(7-13) in AOT reverse micelles.

Authors: Anna Victoria Martinez; Edyta Małolepsza; Eva Rivera; Qing Lu; John E Straub
Journal: J Chem Phys Date: 2014-12-14 Impact factor: 3.488