Literature DB >> 30449106

Frequency Range Selection Method for Vibrational Spectra.

T Q Teodoro1,2, M A J Koenis3, S E Galembeck2, V P Nicu4, W J Buma3, L Visscher1.   

Abstract

Theoretical calculations of vibrational properties are widely used to explain and predict experimental spectra. However, with standard quantum chemical methods all molecular motions are considered, which is rather time-consuming for large molecules. Because typically only a specific spectral region is of experimental interest, we propose here an efficient method that allows calculation of only a selected frequency interval. After a computationally cheap low-level estimate of the molecular motions, the computational time is proportional to the number of normal modes needed to describe this frequency range. Results for a medium-sized molecule show a reduction in computational time of up to 1 order of magnitude with negligible loss in accuracy. We also show that still larger computational savings are possible by using an additional intensity-selection procedure.

Entities:  

Year:  2018        PMID: 30449106      PMCID: PMC6287222          DOI: 10.1021/acs.jpclett.8b02963

Source DB:  PubMed          Journal:  J Phys Chem Lett        ISSN: 1948-7185            Impact factor:   6.475


The vibrational structure of a molecule can be probed by means of several spectroscopy techniques, such as infrared (IR), Raman, vibrational circular dichroism (VCD), and Raman optical activity (ROA).[1,2] At the core of the methods used to model these properties lies the harmonic approximation to molecular vibrations,[3] which implies the diagonalization of a mass-weighted Hessian(with M being the atomic mass matrix and the tilde indicating that the Hessian is in mass-weighted coordinates) to obtain as eigenvectors the normal modes, U, and as eigenvalues the squares of the corresponding fundamental vibrational frequencies, ν. The computationally intensive task for constructing H (the nonmass-weighted Hessian) in eq is to calculate the second derivatives of the molecular energy E with respect to changes in the nuclear coordinates, Rκ and Rλ (the Greek indices are used in this equation only to represent a general basis for the atomic coordinates): While evaluation of these derivatives is most easily done in Cartesian nuclear coordinates, this approach yields a 3Natoms × 3Natoms dimensional Hessian that is time-consuming to construct in full. Several methods have therefore been proposed to reduce the number of molecular motions that need to be considered explicitly.[4−8] All these methods can be viewed as formally transforming the mass-weighted Hessian in Cartesian coordinates, H̃CC (where the superscript CC denotes the use of Cartesian coordinates for each of the Hessian indices), to a K-dimensional (K < 3Natoms) representation H̃AA (with the superscript AA referring to use of approximate coordinates for both indices):where QCA is a rectangular (3Natoms × K) transformation matrix. The key step is thereby the elimination of motions of molecular fragments that are expected to be less important for interpreting the experimental spectrum (thus K is equal to number of motions of interest). However, this reduction of the number of coordinates usually relies on the assumption that the normal modes of interest are localized in specific regions of the molecule. Another approach is the Davidson procedure developed by Reiher and Neugebauer.[9] The method is designed to iteratively refine the normal modes and associated eigenvalues (frequencies) of a set of specific vibrations that are provided as an initial guess. This so-called mode-tracking algorithm is particularly suited to determine local vibrations.[10] For instance, it reproduced in only 20–30% of the computational time the vibrations associated with the thiophenolate anion adsorbed on a Ag(111) surface with the same accuracy as a full vibrational analysis of the whole system.[11] Here we propose an even simpler “black box” approach in which the iterative refinement stage of the block-Davidson method of ref (9) is omitted entirely. We just compute a full spectrum with a fast approximate method then select and re-evaluate only the normal modes in the spectral interval of experimental interest. This one-step procedure is sufficient if the set of eigenvectors provided by an approximate method spans the space of eigenvectors of the higher-level method in the same spectral interval, which is the case in many practical applications. To sketch the ease of implementation, we will use an index j for approximate modes, while labeling Cartesian coordinates with the index p. We start by defining the gradient gC of the energy at the target level of theory with respect to nuclear displacements in Cartesian coordinates (dRC) as This quantity can be computed analytically in most electronic structure implementations. We then need a second derivative of the energy with respect to the selected set of approximate coordinates. By determining the first-order perturbed wave function with respect to these displacements these matrix elements can in principle also be obtained analytically, but it is easier to use numerical differentiation with an n-point stencil. For n = 2, we need 2K displacements along the normal modes of interest calculated at a lower level of theory to obtain a Hessian with one Cartesian and one approximate index, H̃CA:The displacements ΔR̃A from the equilibrium geometry (R0) at the target level are taken along the mass-weighted vectors ŨCA = M–1/2UCA of the approximate normal modes (UCA) with a step size Δ. This half-transformed matrix is then further transformed to a square matrixand it is diagonalized to yield the vibrational frequencies ν and eigenvectors UAR (with AR indicating that this term was obtained on the basis of the approximate modes in the range selection interval). Back transformation of UAR provides the normal modes UCR expressed in Cartesian coordinates at the target level of theory (with R still referring to the range selection interval):This representation of the normal modes at the target level of theory is then used to obtain the vibrational spectrum in the selected frequency range. Taking IR as an example, the intensities are straightfowardly obtained in a similar fashion. The P tensor of the electric dipole moment derivatives with respect to nuclear displacements (also known as the atomic polar tensor,[12] APT) is thereby constructed via numeric differentiation by calculating electric dipole moment vectors (μC) along the approximate modesand then transforming the resulting tensorTherefore, only energy gradients, gC, and electric dipole moments, μC, need to be calculated at the target level to provide the IR spectrum of the selected range. While second derivatives can be calculated at all points on the potential energy surface, including the minima obtained with the approximate method, the harmonic approximation works best if the equilibrium geometry in eq is obtained at the target level of theory. In addition, as is the case in optical properties such as VCD, one may need to compute a number of conformations to obtain proper spectra,[13] which is preferably also done at the higher level of theory to get the best results. Because geometry optimization is, for large systems, much faster than calculation of the Hessian, these steps will not present a computational bottleneck. The frequency range selection method described above was implemented in a Python-based script for obtaining IR absorption spectra of a pharmaceutical compound, dydrogesterone (Figure ). Calculations were carried out with the 2017 version of the Amsterdam Density Functional (ADF) modeling suite of programs.[14,15] The target (higher) level was a density functional theory (DFT) method, the Becke–Perdew exchange–correlation potential (BP86),[16−18] in combination with Slater type orbital (STO) nonrelativistic valence triple-ζ basis sets extended with one polarization function (TZP).[19] The electronic gas-phase ground-state structure obtained at the target level was used in all calculations. The approximate level chosen to provide the normal mode coordinates is a parametrized version of DFT, the third-order density functional-based tight binding method (DFTB3).[20] Parameters for this approximation were taken from the Slater–Koster files for organic and biomolecules, 3ob-freq-1-2.[21] Results were extracted by using the Python Library for Automating Molecular Simulation (PLAMS).[22] Additionally, an experimental spectrum was obtained with dydrogesterone purchased from Sigma-Aldrich with a European Pharmacopoeia Reference standard. A sample of 0.41 M in deuterated chloroform was prepared and injected in a sealed cell with 3 mm thick CaF2 windows and a 56 μm sample path length. Subsequently, the Fourier-transform vibrational absorption spectrum was obtained using a Bruker Vertex 70 spectrometer. A baseline correction was performed by using the absorption spectrum of pure deuterated chloroform. All spectra presented in the following were obtained by convoluting calculated stick spectra with a Lorentzian with half width at half-maximum of 4 cm–1.
Figure 1

Structure of dydrogesterone.

Structure of dydrogesterone. As shown in Figure , the DFT calculation reproduces the experiment very well. However, a large number of the simulated normal modes are not needed for this comparison as they are not present in the experimental spectrum, viz., CaF2 absorbs below ∼950 cm–1. Out of the total 153 normal modes computed for this system (3 × 51 atoms), more than a third (57) belong to this range. Additionally, the bands above 2800 cm–1 in the experimental spectrum, which are associated with C–H stretches, are hardly distinguishable from each other, making the comparison with theory difficult. In the latter range there are 27 calculated bands. By also excluding the 6 rigid motions (translations and rotations of the system), only 66 bands distributed in the region from 950 to 1800 cm–1 are actually of interest, which correspond to only 43% of all computed modes.
Figure 2

Comparison of the experimental (green) IR spectrum of dydrogesterone with spectra calculated at the BP86/TZP (black) and DFTB3/3ob-freq-1-2 (blue) levels. Intensities in the DFTB spectrum were scaled by a factor of 0.3 for better visualization. The experimental band around 2300 cm–1 is not due to dydrogesterone but arises from an experimental artifact due to the solvent.

Comparison of the experimental (green) IR spectrum of dydrogesterone with spectra calculated at the BP86/TZP (black) and DFTB3/3ob-freq-1-2 (blue) levels. Intensities in the DFTB spectrum were scaled by a factor of 0.3 for better visualization. The experimental band around 2300 cm–1 is not due to dydrogesterone but arises from an experimental artifact due to the solvent. When using the DFTB3/3ob-freq-1-2 method, computational time was reduced by 2 orders of magnitude with respect to the BP86/TZP calculation. However, as shown in Figure , the calculated IR absorption spectrum is quite different than the spectrum calculated at the BP86/TZP level and the experimental spectrum. In the following, the DFTB3/3ob-freq-1-2 normal mode coordinates are used in the range selection method for the 950–1800 cm–1 interval with BP86/TZP being the target level. The resulting spectrum is shown in Figure . The overlap with the full DFT vibrational analysis is nearly perfect even though only 67 modes (44% of the total) were calculated at the same level of theory. In a comparison with the (two-point stencil) numerical evaluation of the full Hessian in a Cartesian basis, the range selection scheme would thus represent a ∼56% reduction in computational time. It has to be noted, however, that analytical Hessian calculations are usually a few times faster than numerical ones with the ADF implementations. With the same computational configuration, the range selection method was about 50% slower than the analytical calculation of the full spectrum.
Figure 3

Comparison of IR spectra of dydrogesterone in the 950–1800 cm–1 frequency interval calculated at the BP86/TZP level (black), by means of the frequency range selection scheme (red), and with the additional intensity-selection procedure. Only the 25 most intense bands predicted by DFTB3/3ob-freq-1-2 are calculated at the BP86/TZP level (purple).

Comparison of IR spectra of dydrogesterone in the 950–1800 cm–1 frequency interval calculated at the BP86/TZP level (black), by means of the frequency range selection scheme (red), and with the additional intensity-selection procedure. Only the 25 most intense bands predicted by DFTB3/3ob-freq-1-2 are calculated at the BP86/TZP level (purple). Therefore, for the present method to be advantageous with respect to the analytical calculation of the full Hessian, a further reduction in the number of modes to be considered is necessary. First, one may additionally screen the modes according to the intensities calculated at the approximate level.[23−25] As Figure shows, although the DFTB bands are not overall reproducing the DFT ones, the ranges where one finds the most intense peaks are fairly similar in both calculations. Thus, we analyzed the outcome of selecting only the most intense bands calculated with the DFTB3/3ob-freq-1-2 model (within the 950–1800 cm–1 frequency range). Results for the intensity selection of 25 bands (16% of the total) are shown in the bottom of Figure . Most features of the full DFT spectrum are still reproduced, although this calculation was 40% faster than the analytical evaluation of the full spectrum. Second, one can simply consider a smaller range. In many cases, e.g. in peptides and other large (bio)molecules, one is often interested in only the bands associated with C=O stretching movements. In the dydrogesterone example, there are four very intense bands in the experimental spectrum from ∼1580 to ∼1700 cm–1 which are associated with C=O and C=C stretches. To account for this region and deviations in the DFTB frequencies, the range selection scheme was applied in the 1500–1800 cm–1 interval. Results are shown in Figure . The overlap with the BP86/TZP spectrum in the selected range is still very good. One is thus able to obtain virtually the same information with only four normal modes being considered at a high level of theory.
Figure 4

Comparison of IR spectra of dydrogesterone calculated at the BP86/TZP (black) and DFTB3/3ob-freq-1-2 (blue) levels and by means of the frequency range selection scheme (red) from 1500 to 1800 cm–1 using normal mode coordinates calculated at the DFTB level. Intensities in the DFTB spectrum were scaled by a factor of 0.3 for better visualization.

Comparison of IR spectra of dydrogesterone calculated at the BP86/TZP (black) and DFTB3/3ob-freq-1-2 (blue) levels and by means of the frequency range selection scheme (red) from 1500 to 1800 cm–1 using normal mode coordinates calculated at the DFTB level. Intensities in the DFTB spectrum were scaled by a factor of 0.3 for better visualization. In the example above, a 10× reduction in computational time (in comparison with the analytical BP86/TZP calculation of the full spectrum) was achieved. Even larger speed-ups are possible through an analytical implementation.[26] On the other hand, the numerical differentiation exemplified here can be easily parallelized (as already done in the mode-tracking procedure[10]) given that the gradient calculations at each displaced geometry are independent from each other. We also note in passing that although anharmonic effects are not being addressed in the present analysis, these can be approximately considered by solving one-dimensional Schrödinger equations along each normal mode.[27] The present method is also expected to perform well for other vibrational properties and larger systems. The example shown next regards the comparison of VCD spectra calculated for a molecule containing a Bucky-ball, C105H48O18. In this example, the atomic axial tensor (AAT), which appears in the magnetic term in the rotational strength equation,[2] cannot be calculated in the same fashion as the APTs (eq ), but it can be obtained cheaply at a high level through implementation of the response equations derived by Coriani et al.[28] To illustrate the potential of this approach, the axial tensor and the APT obtained (analytically) with BP86/TZP are applied at all levels shown in the following. As Figure shows, the frequency range selection method (applied in two different regions) yields again spectra that are very comparable with the one from the full analytical BP86/TZP calculation. It is noticeable that even though there are many more near-degenerate modes in certain regions (see the 1100–1400 cm–1 selection) than in the dydrogesterone case, the similarities are still striking. It is also interesting how the present method can cope with very poor modes and frequencies. The comparison in the bottom of Figure shows that even by applying the same APTs and AATs at both (DFT and DFTB) levels, the inaccuracies in the DFTB Hessian yield a completely different spectrum.
Figure 5

Comparison of VCD spectra of C105H48O18 calculated at the BP86/TZP (black) and DFTB3/3ob-freq-1-2 (blue) levels (only in the calculation of the Hessian, as the APT and AAT applied to obtain both spectra were calculated with the DFT level), and by means of the frequency range selection scheme: 1100–1400 (red) and 1700–1800 (orange) cm–1 using normal mode coordinates calculated at the DFTB level.

Comparison of VCD spectra of C105H48O18 calculated at the BP86/TZP (black) and DFTB3/3ob-freq-1-2 (blue) levels (only in the calculation of the Hessian, as the APT and AAT applied to obtain both spectra were calculated with the DFT level), and by means of the frequency range selection scheme: 1100–1400 (red) and 1700–1800 (orange) cm–1 using normal mode coordinates calculated at the DFTB level. One note of caution about the present method can be learned from Figure . Even though one of the selections included only the (20) modes predicted by DFTB to be in the 1700–1800 cm–1 range, one notices that the negative band around 1600 cm–1 ends up being described as well. This means that the DFTB frequency of one or some modes was overestimated by more than ∼100 cm–1 with respect to the full BP86/TZP analysis. This appears to be an outlier as overall results are consistent, but it shows that if the lower level yields large deviations in the frequencies, the range selection could be compromised. Hence, the range selection with DFTB and other fast approximate theories, such as semiempirical methods based on the neglect of differential diatomic overlap integral approximation,[29,30] should take into account the expected error margin in frequencies for such methods. In conclusion, the range selection method outlined here is shown to be a powerful tool for obtaining accurate vibrational spectra of a selected frequency interval. As the computational effort is limited to the number of normal modes within the region, this method will be particularly interesting for large molecules (e.g., proteins) for which only a handful of modes out of thousands, such as amide carbonyl stretches[31] or low-lying modes,[32] are of interest.
  18 in total

1.  Optimized Slater-type basis sets for the elements 1-118.

Authors:  E Van Lenthe; E J Baerends
Journal:  J Comput Chem       Date:  2003-07-15       Impact factor: 3.376

2.  Quantum Chemical Free Energies: Structure Optimization and Vibrational Frequencies in Normal Modes.

Authors:  GiovanniMaria Piccini; Joachim Sauer
Journal:  J Chem Theory Comput       Date:  2013-10-18       Impact factor: 6.006

3.  Selective calculation of high-intensity vibrations in molecular resonance Raman spectra.

Authors:  Karin Kiewisch; Johannes Neugebauer; Markus Reiher
Journal:  J Chem Phys       Date:  2008-11-28       Impact factor: 3.488

4.  Localizing normal modes in large molecules.

Authors:  Christoph R Jacob; Markus Reiher
Journal:  J Chem Phys       Date:  2009-02-28       Impact factor: 3.488

5.  Density-functional approximation for the correlation energy of the inhomogeneous electron gas.

Authors: 
Journal:  Phys Rev B Condens Matter       Date:  1986-06-15

6.  Efficient calculation of electronic absorption spectra by means of intensity-selected time-dependent density functional tight binding.

Authors:  Robert Rüger; Erik van Lenthe; You Lu; Johannes Frenzel; Thomas Heine; Lucas Visscher
Journal:  J Chem Theory Comput       Date:  2015-01-13       Impact factor: 6.006

7.  Efficient Calculation of QM/MM Frequencies with the Mobile Block Hessian.

Authors:  An Ghysels; H Lee Woodcock; Joseph D Larkin; Benjamin T Miller; Yihan Shao; Jing Kong; Dimitri Van Neck; Veronique Van Speybroeck; Michel Waroquier; Bernard R Brooks
Journal:  J Chem Theory Comput       Date:  2011-01-06       Impact factor: 6.006

8.  DFTB3: Extension of the self-consistent-charge density-functional tight-binding method (SCC-DFTB).

Authors:  Michael Gaus; Qiang Cui; Marcus Elstner
Journal:  J Chem Theory Comput       Date:  2012-04-10       Impact factor: 6.006

9.  Intensity tracking for theoretical infrared spectroscopy of large molecules.

Authors:  Sandra Luber; Johannes Neugebauer; Markus Reiher
Journal:  J Chem Phys       Date:  2009-02-14       Impact factor: 3.488

10.  Parametrization and Benchmark of DFTB3 for Organic Molecules.

Authors:  Michael Gaus; Albrecht Goez; Marcus Elstner
Journal:  J Chem Theory Comput       Date:  2012-11-26       Impact factor: 6.006

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.