Laurens D M Peters1, Jörg Kussmann1, Christian Ochsenfeld1,2. 1. Chair of Theoretical Chemistry, Department of Chemistry, University of Munich (LMU), Butenandtstr. 7, D-81377 München, Germany. 2. Max Planck Institute for Solid State Research, Heisenbergstr. 1, D-70569 Stuttgart, Germany.
Abstract
Starting from our recently published implementation of nonadiabatic molecular dynamics (NAMD) on graphics processing units (GPUs), we explore further approaches to accelerate ab initio NAMD calculations at the time-dependent density functional theory (TDDFT) level of theory. We employ (1) the simplified TDDFT schemes of Grimme et al. and (2) the Hammes-Schiffer-Tully approach to obtain nonadiabatic couplings from finite-difference calculations. The resulting scheme delivers an accurate physical picture while virtually eliminating the two computationally most demanding steps of the algorithm. Combined with our GPU-based integral routines for SCF, TDDFT, and TDDFT derivative calculations, NAMD simulations of systems of a few hundreds of atoms at a reasonable time scale become accessible on a single compute node. To demonstrate this and to present a first, illustrative example, we perform TDDFT/MM-NAMD simulations of the rhodopsin protein.
Starting from our recently published implementation of nonadiabatic molecular dynamics (NAMD) on graphics processing units (GPUs), we explore further approaches to accelerate ab initio NAMD calculations at the time-dependent density functional theory (TDDFT) level of theory. We employ (1) the simplified TDDFT schemes of Grimme et al. and (2) the Hammes-Schiffer-Tully approach to obtain nonadiabatic couplings from finite-difference calculations. The resulting scheme delivers an accurate physical picture while virtually eliminating the two computationally most demanding steps of the algorithm. Combined with our GPU-based integral routines for SCF, TDDFT, and TDDFT derivative calculations, NAMD simulations of systems of a few hundreds of atoms at a reasonable time scale become accessible on a single compute node. To demonstrate this and to present a first, illustrative example, we perform TDDFT/MM-NAMD simulations of the rhodopsin protein.
Nonadiabatic molecular dynamics
(NAMD) simulations using trajectory surface hopping (TSH)[1−4] have become a powerful tool to describe the dynamics of molecular
systems involving multiple electronic states. Their field of application
ranges from the description of rather small molecular machines[5−9] over medium-sized photoswitches[10,11] to the dynamics
of entire photoactive proteins.[12,13] They can be used with
a variety of excited-state methods, e.g., the complete
active space self-consistent field (CASSCF) method,[14] the algebraic–diagrammatic construction (ADC(2)),[15] several coupled cluster methods (e.g., CC2),[16] as well as time-dependent density
functional theory (TDDFT).[17,18] Also, triplet states[19] can be included.However, the greatest
challenge remains the large computational
cost of NAMD simulations, which is a result of the expensive excited-state
methods mentioned above and the fact that TSH requires not only one
but a series of trajectories to determine observables as ensemble
averages. This problem can be tackled by using semiempirical methods,[5,6] employing exciton models[20] or using graphics
processing units (GPUs). The latter have been applied to ground-state
properties,[21−28] excited-state calculations,[9,20,29−31]ab initio multiple spawning (AIMS),[32−35] and NAMD[36,37] simulations.Based on our
recent work,[9] we explore
in our present work the use of simplified TDDFT schemes[38,39] and the Hammes-Schiffer–Tully (HST)[2] model in addition to GPU-based integral routines. They tackle the
two major bottlenecks of NAMD: The calculation of the state energies
and the nonadiabatic couplings between the states. After a brief summary
of the corresponding theory and its validation for the investigated
problems, we show timings and use our approach to simulate the photoinduced
rotation of the retinal chromophore in the rhodopsin protein at the
TDDFT/MM level of theory. Details on the methods (thresholds, convergence
criteria, etc.) implemented in our FermiONs++ program package[25,26] and the computational setup can be found in the Supporting Information.In TSH,[1,2] a
system is allowed to switch the potential
energy surface (PES) within one trajectory. The occurrence of such
a surface hop depends on the hopping probability: if it exceeds a
randomly drawn number between 0 and 1, the trajectory continues on
a different PES with a rescaled nuclear velocity. The average of multiple
trajectories with a different series of random numbers describes the
behavior of the system. The hopping probability itself is calculated
from the change of the state energies and the nonadiabatic couplings
(Q), which can also be obtained as the product of the
nonadiabatic coupling vectors (τ) and the nuclear
velocity (Ṙ)where Φ is the wave function of the electronic state I.
While τ can be calculated using response theory, Q cannot be determined analytically.Energies of all
considered states, gradients, and Q are thus the main
ingredients of a TSH algorithm. The first can
be obtained from TDDFT by solving the TDDFT or random phase approximation
(RPA) equations[17,40,41]with ω being
the excitation energy of state I. A and B are the orbital rotation Hessians, and X and Y are the transition densities for excitation and de-excitation,
respectively. Neglecting B in eq is known as the Tamm–Dancoff approximation
(TDA),[42] leading toThe calculation of
excitation energies with eqs and 3 is time-consuming,
mainly because of the evaluation of the two-electron integrals in A and B. To accelerate these calculations, we
apply the simplified RPA and TDA methods by Grimme and co-workers.[38,39,43] Here, Coulomb and exchange kernels
are approximated (J′ for Coulomb and K′ for exchange) using the Mataga–Nishimoto–Ohno–Klopman[44−46] damped Coulomb operators together with the transition/charge density
monopoles q obtained from a Löwdin population
analysis[47]p, q, ...
are arbitrary molecular orbitals. r is the interatomic
distance, η is the mean of the chemical hardness of the atoms N and M. α and β are global
fit parameters, while cx is the amount
of exact exchange. This leads to the following approximate orbital
rotation Hessiansi, j, ...
denote occupied molecular orbitals, and a, b, ... denote virtual molecular orbitals. s is 2 or 0 for singlet–singlet
or singlet–triplet excitations, and ϵ is the orbital energy of p.The excitation
energies (ω′) are then obtained by
diagonalizing A′ in the case of sTDA or (A′ − B′)1/2(A′ + B′)(A′
− B′)1/2 in the case of sRPA.
To avoid the diagonalization of the entire matrix, the number of included
configuration-state functions (CSFs) is truncated using the thresholds
ϑpCSF, ϑsCSF, and ϑCSF.[38] Only primary (with an energy below
ϑpCSF) and secondary CSFs (with an energy between
ϑpCSF and ϑCSF and a significant
coupling to the primary CSFs > ϑsCSF) are considered.
The sTDA scheme greatly reduces the cost of excited-state calculations,
giving access to absorption spectra of large molecular systems.[38] The sRPA scheme yields better transition densities,[39] leading to better transition dipole moments
and even enabling the calculation of higher order dynamical response
properties.[48]Excited-state gradients
(ωx) and τ’s at
the TDDFT/TDA level of theory can be derived from eqs and 3 using
linear response theory.[49−59] These equations cannot easily be transferred to sTDA/sRPA, as the
derivatives of J′ and K′ with
respect to the nuclear coordinates have, so far, not been implemented.
Instead, we are using the calculated ω′, X′, and Y′ in the
RPA/TDA algorithms for ωx and τ featuring exact evaluations of the two-electron integrals, i.e., without the Mataga–Nishimoto–Ohno–Klopman
damped Coulomb operators. As a consequence of this, we expect a slight
disagreement between the calculated energies and the excited-state
properties (ωx and τ’s).
To validate this approach, we compare optimized structures of biphenyl
(I) using the S1 potential energy surface
at the TDDFT and sTDDFT levels of theory in Table and show ωx’s and τ’s of optimized ground-state
structures of protonated formaldimine (II) and the Schiff
base of the retinal chromophore (III) in Figure . A validation based on NAMD
is discussed below. Additionally, selected
natural transition orbitals[60] and a screening
of the thresholds (ϑpCSF, ϑCSF,
and ϑsCSF) are shown in the Supporting Information.
Table 1
Comparison of Optimized
Structuresa of Biphenyl (I) Calculated at
RPA, TDA, sRPA, and sTDA (PBE0/def2-SVP) Levels of Theory Listing
the Central C–C Distance (c) and the Dihedral
γ
RPA
TDA
sRPA
sTDA
c [Å]
1.42
1.44
1.44
1.44
|γ| [deg]
0.02
0.03
0.02
0.01
Using the S1 potential
energy surface.
Figure 1
(a–d) RPA and
sRPA excited-state gradients of the second
excited state (a + b, green) and nonadiabatic coupling vectors between
the ground and the second excited state (c + d, red) of II at the PBE0/def2-SVP level of theory. (e–h) RPA and sTDA
excited-state gradients of the first excited state (e + f, green)
and nonadiabatic coupling vectors between the ground and the first
excited state (g + h, red) of III at the ωB97/def2-SVP
level of theory. All calculations have been performed at optimized
ground-state geometries.
Using the S1 potential
energy surface.(a–d) RPA and
n class="Chemical">sRPA excited-state gradients of the second
excited state (a + b, green) and nonadiabatic coupling vectors between
the ground and the second excited state (c + d, red) of II at the PBE0/def2-SVP level of theory. (e–h) RPA and sTDA
excited-state gradients of the first excited state (e + f, green)
and nonadiabatic coupling vectors between the ground and the first
excited state (g + h, red) of III at the ωB97/def2-SVP
level of theory. All calculations have been performed at optimized
ground-state geometries.
All four optimized S1 structures of I in Table are nearly planar
and feature a similar central C–C distance. The ωx’s and τ’s based on
sTDDFT results in Figure are also in good agreement with the RPA and TDA properties.
The only differences are the weaker nonadiabatic couplings of II and III and the fact that ω1x of III has a larger contribution in the six-membered ring and
a smaller contribution in the conjugated system. Both are the result
of the slightly different excitation energies (II: 9.76
eV (RPA), 9.87 eV (sRPA); III: 2.74 eV (RPA), 3.17 eV
(sTDA)) and transition densities. In the case of III,
three other observations can be made (see Figures S8–10 in
the Supporting Information): (1) sRPA performs
worse than sTDA, (2) PBE0[61−64]/def2-SVP[65,66] is, in contrast to
ωB97[67]/def2-SVP, not able to capture
the charge transfer character of the excitation, and (3) the natural
transition orbitals at the RPA and sTDA levels of theory are nearly
identical, indicating that the major source of error is the excitation
energy. Table and Figure suggest that simplified
TDDFT might also be used for NAMD simulations, which require, however,
time-consuming calculations of the Q’s between
the states. To circumvent the analytical calculation of Q via τ, we apply the numerical HST model[2]Assuming that the excited-state wave functions can be obtained
from the ground-state Kohn–Sham orbitals (ϕ) and the transition densities, O0 and O take the following formwithR and L are (X + Y) and (X – Y), respectively, in the case
of RPA and X in the case
of TDA. The HST model is nowadays widely used inNAMD simulations,[68−70] because it reduces the computational time (as shown below) and leads
to more stable trajectories in the vicinity of conical intersections.[71−73]A validation of the presented numerical scheme for Q used in the HST model is shown in the Supporting Information, where we have calculated numerical and analytical τ’s for formaldehyde. Additionally, we have performed
NAMD simulations of II, using the HST model and analytically
calculated τ’s as well as RPA, TDA, sRPA,
and sTDA. After excitation to the S2 state, the molecule
shows a fast conversion to the S1 state, which goes along
with the elongation of the C–N bond. Further relaxation to
the S0 state is achieved via a rotation around this bond.
In Figure , the increase
(S2 → S1) and decrease (S1 → S0) of the S1 occupation is shown.
The change of occupation for all states as well as energies and Q’s of selected trajectories are listed in the Supporting Information.
Figure 2
Change of S1 state occupations of protonated formaldimine
(II) calculated as an average of all NAMD simulations
at (a) the RPA (PBE0/def2-SVP) level of theory using analytical and
numerical nonadiabatic couplings and (b) RPA, TDA, sRPA, and sTDA
(PBE0/def2-SVP) results using numerical nonadiabatic couplings.
Change of S1 state occupations of protonated formaldimine
(II) calculated as an average of all n class="Chemical">NAMD simulations
at (a) the RPA (PBE0/def2-SVP) level of theory using analytical and
numerical nonadiabatic couplings and (b) RPA, TDA, sRPA, and sTDA
(PBE0/def2-SVP) results using numerical nonadiabatic couplings.
In all sets of trajectories, we observe a similar
behavior of II, indicating that the HST model and the
simplified TDDFT
are valid approximations for this example. This is reflected in the
S1 occupations, which are very similar for all cases. The
only differences are slightly different S2–S1 nonadiabatic couplings when applying the HST model and a
slower decay of the S1 occupation in the case of the simplified
TDDFT methods, which is, however, also visible in the case of TDA.
The first observation is in good agreement with previous findings[71−73] comparing analytical and numerical Q’s in the
vicinity of conical intersections. The latter trend is also reflected
in the Q’s and τ’s.Here, we want to stress that one of the well-known limitations
of TDDFT is its poor description of conical intersections involving
the ground state,[74,75] which can be improved by applying, e.g., the TDA.[76,77] Alternatively, the
spin-restricted ensemble-referenced KS (REKS) method[78] can be employed. Obviously, these limitations are not overcome
by applying the simplified TDDFT approach. However, for this particular
system, good agreement between TDDFT and CASSCF simulations has been
observed,[70] so that we are confident that
our chosen setup allows for a good qualitative description of the
investigated processes. Please note that the simulations using sTDA
or sRPA are not strictly energy conserving as discussed above. Therefore,
the standard deviation of the total energy is approximately 5 times
higher than in standard TDDFT simulations. However, rescaling the
nuclear velocities to enforce energy conservation has no effect on
the average properties of the system, which illustrates the validity
of the present approach.Table presents
the impact of the approximations on the performance of an NAMD simulation
of III. It shows that HST and sRPA virtually eliminate
the computational cost of the calculations of ωx’s and Q’s. In combination with
the GPU-based integral evaluations, the total speed-up in the case
of Nroots = 2 is ∼4 with respect
to the CPU-based RPA implementation using analytical τ’s. An NAMD simulation of III involving 5000
steps (e.g., 1 ps simulation using 0.2 fs time steps)
can thus be conducted within ∼5 instead of ∼21 days.
The acceleration becomes even larger when more excited states and
nonadiabatic couplings are considered (e.g., a factor
of more than 20 for III in case of Nroots = 7). This and the fact that the performance of
GPUs is better for large molecular systems (see, ref (9)) makes the presented approach
interesting for the investigation of systems involving hundreds of
atoms and plenty of electronic states.
Table 2
Computation
Times of Ground-State
Energy (E0) and Gradient (E0x), Excited-State
Energies (ω) and Gradient (ω1x), and Nonadiabatic Couplings (Q) Calculations of the
Schiff Base of the Retinal Chromophore (III) at the RPA
and sRPA (PBE0/def2-SVP)
Levels of Theory, Using a Different Number of Roots (Nroots) and Nonadiabatic Couplings () as Well as Analytical and Numerical Q’sa
Nroots
Q
RPA/sRPA
t(E0) + t(E0x)
t(ωI)
t(ω1x)
t(Q)
t (total)
2
analytical*
RPA*
34 s
28 s
101 s
201 s
∼6 min
2
analytical
RPA
26 s
19 s
62 s
128 s
∼4 min
2
numerical
RPA
26 s
19 s
62 s
<1 s
∼2 min
2
numerical
sRPA
26 s
<1 s
62 s
<1 s
∼1.5 min
3
analytical
RPA
26 s
36 s
62 s
309 s
∼7 min
3
numerical
RPA
26 s
36 s
62 s
<1 s
∼2 min
3
numerical
sRPA
26 s
<1 s
62 s
<1 s
∼1.5 min
7
analytical
RPA
26 s
51 s
62 s
1822 s
∼32.5 min
7
numerical
RPA
26 s
51 s
62 s
<1 s
∼2.5 min
7
numerical
sRPA
26 s
<1 s
62 s
<1 s
∼1.5 min
Asterisks mark
calculations that
have been performed entirely on CPUs. All calculations were conducted
on two Intel Xeon CPU E5 2640 v4 @ 2.20 GHz (20 threads) CPUs and
four AMD FirePro 3D W8100 GPUs.
Asterisks mark
calculations that
have been performed entirely on CPUs. All calculations were conducted
on two Intel Xeon CPU E5 2640 v4 @ 2.20 GHz (20 threads) CPUs and
four AMD FirePro 3D W8100 GPUs.As a first application of our proposed scheme, we have calculated
105 NAMD simulations of the rhodopsin protein (IV) at
the sTDA/MM (ωB97/def2-SVP) level of theory. The chromophore
of IV (Figure a) undergoes a cis–trans isomerization when exposed to light (see Figure c). For a review of calculations on this
system, the reader is referred to ref (79). Similar systems have also been investigated
by Martínez et al.[34,35] The change of the dihedral
γ1 (defined in Figure b) and the state occupations are shown in Figure .
Figure 3
(a) Structure of the
chromophore of rhodopsin (IV).
(b) Important dihedrals (γ1 and γ2) in and (c) the bicycle pedal isomerization mechanism of IV. The part of the molecule shown in (b) and (c) is located by the
rectangle in (a).
Figure 4
NAMD simulations of IV at the sTDA (PBE0/def2-SVP)
level of theory using the HST model: (a) Change of the dihedral γ1 indicating trajectories with no rotation (black), half rotation
(red), and full rotation (blue). (b) Change of the state occupations
of IV calculated as an average of all trajectories.
(a) Structure of the
chromophore of rhodopsin (IV).
(b) Important dihedrals (γ1 and γ2) in and (c) the bicycle pedal isomerization mechanism of IV. The part of the molecule shown in (b) and (c) is located by the
rectangle in (a).Nn class="Disease">AMD simulations of IV at the sTDA (PBE0/def2-SVP)
level of theory using the HST model: (a) Change of the dihedral γ1 indicating trajectories with no rotation (black), half rotation
(red), and full rotation (blue). (b) Change of the state occupations
of IV calculated as an average of all trajectories.
Out of 105 calculated trajectories, 9 feature a cis–trans isomerization (see Figure . A movie of the
isomerization
is available at https://www.cup.uni-muenchen.de/pc/ochsenfeld/download/). Most of them reach γ1 = −90° at ∼280
fs, which coincides with the crossing point of the state occupations
(see Figure ). This
hop time (t) is significantly higher, and the yield
(y) of 9% significantly lower than the experimental
(t = 147.7 ± 1.0 fs; y = 0.63
± 0.01) results reported in ref (13). Our analysis of III (see Figure ) indicates that
the weaker gradients and nonadiabatic couplings of sTDA or the discussed
problems of TDDFT with conical intersections (see also refs (79−81)) may be
the reason for this. However, our approach describes the direction
and mechanism (see Figure c) of the isomerization correctly. In contrast to previous
work,[12,13,79] our method
requires, besides α, β, and the QM/MM ansatz, no further
parametrizations and/or reductions of the system. One trajectory of IV takes ∼5–7 days on two Intel Xeon CPU E5
2640 v4 @ 2.20 GHz (20 threads) CPUs and four AMD FirePro 3D W8100
GPUs using our FermiONs++ program package.[25,26]We have introduced a combination of GPU-based integral routines,
simplified TDDFT schemes, and numerical nonadiabatic couplings for
efficient NAMD simulations. For all investigated systems ranging from
small organic molecules (II) to proteins (IV), excited-state properties and dynamics are described qualitatively
correctly with a significantly reduced computational cost. The latter
is due to the vanishing computational times for TDDFT energies and
nonadiabatic couplings calculations. The present approach may be used
to qualitatively explore relaxation pathways and predict trends (e.g., effects of mutations or different isotopes) within
these reactions.
Authors: Felipe Cordova; L Joubert Doriol; Andrei Ipatov; Mark E Casida; Claudia Filippi; Alberto Vela Journal: J Chem Phys Date: 2007-10-28 Impact factor: 3.488
Authors: Felix Plasser; Rachel Crespo-Otero; Marek Pederzoli; Jiri Pittner; Hans Lischka; Mario Barbatti Journal: J Chem Theory Comput Date: 2014-03-13 Impact factor: 6.006