Literature DB >> 35654439

Optimization of the r²SCAN-3c Composite Electronic-Structure Method for Use with Slater-Type Orbital Basis Sets.

Thomas Gasevic¹, Julius B Stückrath¹, Stefan Grimme¹, Markus Bursch².

Abstract

The "Swiss army knife" composite density functional electronic-structure method r2SCAN-3c (J. Chem. Phys. 2021, 154, 064103) is extended and optimized for the use with Slater-type orbital basis sets. The meta generalized-gradient approximation (meta-GGA) functional r2SCAN by Furness et al. is combined with a tailor-made polarized triple-ζ Slater-type atomic orbital (STO) basis set (mTZ2P), the semiclassical London dispersion correction (D4), and a geometrical counterpoise (gCP) correction. Relativistic effects are treated explicitly with the scalar-relativistic zeroth-order regular approximation (SR-ZORA). The performance of the new implementation is assessed on eight geometry and 74 energy benchmark sets, including the extensive GMTKN55 database as well as recent sets such as ROST61 and IONPI19. In geometry optimizations, the STO-based r2SCAN-3c is either on par with or more accurate than the hybrid density functional approximation M06-2X-D3(0)/TZP. In energy calculations, the overall accuracy is similar to the original implementation of r2SCAN-3c with Gaussian-type atomic orbitals (GTO), but basic properties, intermolecular noncovalent interactions, and barrier heights are better described with the STO approach, resulting in a lower weighted mean absolute deviation (WTMAD-2(STO) = 7.15 vs 7.50 kcal mol-1 with the original method) for the GMTKN55 database. The STO-optimized r2SCAN-3c outperforms many conventional hybrid/QZ approaches in most common applications at a fraction of their cost. The reliable, robust, and accurate r2SCAN-3c implementation with STOs is a promising alternative to the original implementation with GTOs and can be generally used for a broad field of quantum chemical problems.

Entities: Chemical

Year: 2022 PMID： 35654439 PMCID： PMC9255700 DOI： 10.1021/acs.jpca.2c02951

Source DB: PubMed Journal: J Phys Chem A ISSN： 1089-5639 Impact factor: 2.944

Introduction

In recent years, Kohn–Sham density functional theory (DFT)[1] has become one of the most popular methods in quantum chemistry, mainly due to its outstanding accuracy to computational cost ratio.[2] It can be employed for a large number of problems, including molecular structures and various chemical properties, as well as reactions that facilitate research and commercial projects.[3−6] Despite its high efficiency, the limits of conventional DFT are quickly reached for calculations of large systems that contain more than 300 atoms. The emerging need for fast yet accurate low-cost methods paves the way for composite schemes. These typically include small optimized basis sets to reduce the computational cost and compensate for the resulting errors with tailored corrections. A prominent class of such composite schemes is the “3c” method family. The first 3c method was the Hartree–Fock theory-based HF-3c[7] method that contains three name-giving corrections to improve its accuracy. The same concept was later applied to DFT from which the PBEh-3c/HSE-3c[8−10] hybrid and B97-3c[11] GGA functionals resulted. The latest addition to the “3c” family is r2SCAN-3c,[12] which utilizes a well-balanced triple-ζ Gaussian-type atomic orbital (GTO) basis set, the D4 London dispersion correction,[13,14] and a geometrical counterpoise (gCP)[15] correction for remaining inter- and intramolecular basis set superposition errors (BSSE). The underlying meta-generalized-gradient approximation (meta-GGA)-type density functional r2SCAN[16,17] is the regularized and restored form of the strongly constrained and appropriately normed (SCAN)[18] functional. r2SCAN yields improved accuracy and a much reduced sensitivity to the employed numerical integration grid. Overall, the original GTO-based r2SCAN-3c was shown to yield excellent results for the calculation of thermochemical properties as well as conformational energies for systems with main-group elements and transition metals, partly reaching the accuracy of hybrid functionals, applying basis sets of quadruple-ζ (QZ) quality.[12] Up to this point, the “3c” composite schemes were limited to GTO basis sets, while an assessment with Slater functions is missing. They satisfy Kato’s cusp condition[19] at the nucleus and possess a correct long-range behavior. In this work, we present an optimized Slater-type atomic orbital (STO) variant of the composite r2SCAN-3c DFT method with a customized all-electron STO basis set which was implemented in the Amsterdam Density Functional (ADF) program[20,21] of the Amsterdam Modeling Suite (AMS).[22] The performance of r2SCAN-3c is compared for both implementations (GTO vs STO) and extensively assessed on a comprehensive database consisting of 621 data points for geometrical quantities and 4405 data points for energies. This data collection includes extensive benchmark sets such as the GMTKN55[23] database as well as additional benchmark sets for noncovalent interactions (e.g., IONPI19[24]), conformational energies, and organometallic reactions (e.g., MOR41[25] and ROST61[26]). Comparisons between GTOs and STOs have already been made in a different context,[27] but this study is probably one of the most extensive ones considering the variety, chemical relevance, and amount of evaluated data points.

Theoretical Methods

The composite electronic-structure method r2SCAN-3c consists of five different components, some of which are interdependent. An overview is shown in Figure , and modifications of each component will be discussed in the following. The main building block is the underlying density functional approximation (DFA) r2SCAN. In comparison to its predecessors SCAN and rSCAN,[28] it is more accurate and less sensitive to the numerical integration grid. Consequently, much finer integration grids compared to those used in other conventional DFT methods are not required anymore, which leads to faster and more robust computations. In the STO approach, the basic DFA remains unchanged and is implemented via the Libxc library.[29]

Figure 1

Components of r2SCAN-3c applying Gaussian-type atomic orbitals (GTO) and Slater-type atomic orbitals (STO). Changes in the STO approach are marked in blue. The D4 and gCP corrections were adjusted for STOs as indicated by an asterisk.

Basis Set Modification

The new modified all-electron triple-ζ basis set mTZ2P includes a combination of the default STO atomic orbital basis sets DZP, TZP, and TZ2P in ADF, which are contracted for the zeroth-order regular approximation (ZORA).[30−32] It is constructed in analogy to the original GTO basis set mTZVPP to which it is compared in Table . The contraction schemes do not match for every element due to the different composition of the STO basis set. For example, one d- and one f-polarization function each are used for oxygen instead of two d-functions. In general, the STO basis sets include more basis functions for heavy elements compared to the original GTO basis set as the latter per default applies small-core effective core potentials (ECP) to represent the core electrons. Alterations to the underlying DZP and TZP basis sets by removing or exchanging basis functions (e.g., replacing f- by d-polarization functions) were not successful, typically increasing the obtained errors.

Table 1

Comparison of the New mTZ2P STO Basis Set with the Original mTZVPP GTO Seta

	contraction		underlying
element	mTZVPP	mTZ2P	STO basis
H	[2s1p]	[2s1p]	DZPb
He	[2s1p]	[2s1p]	DZP
N	[5s3p2d]	[5s3p2d]	TZ2P
O	[5s3p2d]	[5s3p1d1f]	TZ2Pc
F	[5s3p2d]	[5s3p2d]	TZ2P
Ne	[5s3p2d]	[5s3p1d1f]	TZ2P
Si–S	[5s4p2d]	[7s5p1d1f]	TZ2P
Cl	[5s4p2d]	[7s5p1d1f]	TZ2P
Ar	[5s4p2d]	[7s5p1d1f]	TZ2P
Kr	[6s5p4d]	[8s7p4d1f]	TZ2P

Elements that are not listed are described by the standard TZP basis in the STO set.

2p exponent is changed from 1.25 to 1.70.

3d exponent is changed from 2.00 to 2.15.

Elements that are not listed are described by the standard TZP basis in the STO set. 2p exponent is changed from 1.25 to 1.70. 3d exponent is changed from 2.00 to 2.15. Nevertheless, it was found that the respective 3d- and 2p-polarization functions of oxygen and hydrogen were initially too diffuse (too small exponents) when they are used in combination with the D4 and gCP corrections. The exponents were manually optimized using the WATER27[33] (water clusters), S22,[34] S66[35,36] (noncovalent interactions of small molecules), and HB300SPX[37] (hydrogen bonds) benchmark sets. First, the 3d exponent of oxygen was changed from 2.00 to 2.15 and subsequently the 2p exponent of hydrogen from 1.25 to 1.70. The effect on the WATER27 set is illustrated in Figure . Here, the optimization of the 3d exponent of oxygen already improves the results noticeably but the influence of tuning the 2p exponent of hydrogen is more substantial, drastically decreasing the tentative underestimation of the intermolecular water–water interactions.

Figure 2

Deviations calculated with r2SCAN-3c(STO) for the WATER27 interaction energy benchmark set applying the original exponents for oxygen and hydrogen (initial mTZ2P), customized exponents for oxygen (mTZ2P custom O), and customized exponents for oxygen and hydrogen (final mTZ2P).

London Dispersion Correction (D4)

Since semilocal density functional approximations do not account for long-range electron correlation effects, they lack the description of London dispersion interactions.[38,39] In r2SCAN-3c, they are included by the atomic-charge dependent London dispersion correction D4, which is calculated according towhere A, B, and C denote atoms, s is the scaling parameter, C( is the dispersion coefficient, R is the interatomic distance, R is the geometrically averaged distances, θ is the angle in atomic triangles, and fdamp( is the Becke–Johnson damping function fBJ(:with the functional specific parameters a1 and a2. As described in Basis Set Modification, the STO basis set mTZ2P is generally more diffuse than the GTO basis set mTZVPP and has a different long-range behavior due to the shape of the Slater-type functions. Accordingly, the manifestation of basis set superposition error (BSSE) is different for both basis sets which has a direct influence on the D4 and gCP corrections that have to be adjusted accordingly. The comparison between the GTO and STO variants of r2SCAN-3c in Figure shows that, by applying the same D4 and gCP corrections (original parameters of the GTO variant), the interaction energies differ by up to 1.66 kcal mol–1 in the L7 benchmark set for noncovalent interactions of large complexes.[40,41] Here, a slight mismatch of the attractive D4 correction and the repulsive gCP is observed for the STO basis set. To partly correct this issue, the s9 scaling parameter of the three-body dispersion has been set to 1.53 (vs 2.00 in the GTO approach), as it yields the lowest mean absolute deviation for the S30L benchmark set[42] (association energies of large NCI complexes). The remaining parameters s6, s8, a1, and a2, as well as the parameters in the charge-scaling functions β and γ (see eq 2 of ref (14)) are kept unchanged. An overview of the utilized D4 parameters is listed in the Supporting Information. The effect of adjusting the s9 parameter for the L7 set is depicted in Figure . In general, the BSSE at the TZ basis set level can be partly absorbed in the D4 parametrization.[43]

Figure 3

Deviations from reference values calculated with r2SCAN-3c for the L7 benchmark set applying different s9 D4 parameters.

Geometrical Counterpoise Correction (gCP)

Calculations applying finite basis sets are contaminated by inter- as well as intramolecular BSSE. These errors can be corrected with a geometrical counterpoise scheme according towhere A and B denote atoms, σ is a global scaling parameter, fdampgCP is a damping function as described in the work on the PBEh-3c[8] method, α and β are global fit parameters, and S is an s-type Slater overlap integral evaluated with scaled standard valence-average exponents. Emiss is originally the atomic energy difference between a large, almost complete basis set and the target basis set, and Nvirt is the number of virtual orbitals in the target basis set. In r2SCAN-3c, both Emiss and Nvirt are used as additional free fit parameters or are set to unity. In order to correct the remaining BSSE as well as the absorbing part of the (small) basis set incompleteness error (BSIE) in the STO-based r2SCAN-3c, the gCP correction was manually adjusted by optimizing the global scaling parameter σ after the basis set and the D4 correction had been modified. The remaining parameters are not altered. For the manual optimization of σ, mainly the S22, S66, and NCIBLIND10[44] benchmark sets were analyzed to reduce overall deviations, but additional sets were also cross-checked. Figure shows the impact of the gCP correction as well as the different global scaling factors for the WATER27 and ACONF12[12] benchmark sets. Although individual test sets, such as WATER27, benefit from a large scaling factor close to one, the majority is better described with a smaller value (cf. Figure b), which indicates that the STO basis set mTZ2P is less prone to remaining basis set errors than the GTO basis set mTZVPP. A good balance for all tested benchmark sets is represented by the value σ = 0.879, which is applied instead of the originally used σ = 1.000.

Figure 4

Deviations calculated with r2SCAN-3c(STO) for the WATER27 (a) and ACONF12 (b) benchmark sets applying different settings for the global scaling σ of the gCP correction. MADs are given in kcal mol–1.

Relativistic Effects

In standard quantum chemical problems, the nonrelativistic Schrödinger equation is approximately solved to obtain the final wave function. However, relativistic effects can affect the molecular geometry as well as properties, especially when heavy atoms (typically with Z > 36) are present.[45−48] They can be included implicitly by relativistic effective core-potentials (ECPs) that replace the core electrons. This approach has the advantage of a lower computational cost and is sufficient for most chemical problems which mainly depend on valence electrons like thermochemisty.[49] Accordingly, the original GTO-based r2SCAN-3c is based on a modified Ahlrichs basis set that is constructed for default use with ECPs. Relativistic effects can also be incorporated explicitly with the more time-consuming four-component Dirac equation that can be approximated with the zeroth-order regular approximation (ZORA).[30−32] It is based on an expansion of the full relativistic Hamiltonian with respect to a potential-dependent perturbation parameter and contains relativistic corrections at the zeroth order.[32,50] Overall, the accuracy for structures and electronic energies is typically similar to ECPs and ZORA.[51−53] Nevertheless, ZORA is less approximate, and the application of an all-electron (AE) basis set gives the flexibility to also apply explicit spin–orbit relativistic Hamiltonians, which may be crucial for very heavy elements. Further, explicit description of the core electrons can be crucial for a correct description of properties such as NMR chemical shielding tensors. Since scalar-relativistic effects are typically dominant for most applications and ZORA is the default in the ADF program package, it is also applied in the STO-based r2SCAN-3c.

Grid Study

Any conventional DFT calculation is typically depending on a sufficiently fine numerical integration grid.[54] And even though r2SCAN is already numerically more robust than its preceding functionals SCAN[18] and rSCAN,[28] the choice of a reasonable grid size is still relevant to obtain reliable results. In ADF, the grid can be controlled by the NumericalQuality setting, which simultaneously sets the quality of the BECKE integration grid and the quality of the density fitting, termed ZLMFIT. To determine a suitable default, different settings were assessed on a test set that includes the ACONF,[55] ACONF12,[12] L7, MOR41,[25] S30L, and S22 benchmark sets. The performance of each BECKE/ZLMFIT combination as well as a timing comparison is depicted in Figure . It was found that the influence of ZLMFIT on the accuracy is negligible and that the results mainly depend on the BECKE grid, which is almost converged with the good setting. This goes along with an increased computational cost compared to the normal setting but is necessary in order to make the method robust. For the same reason, we decided to also set the ZLMFIT to good in all calculations. In general, the good setting for the BECKE grid leads to a slightly lower number of points on a Lebedev grid compared to the originally used m4 angular grid in combination with a radial grid size of 10 in the GTO-based TURBOMOLE[56,57] (TM) code (e.g., 460046 (ADF) vs 476590 (TM) for n-dodecane).

Figure 5

Mean absolute deviation (MAD) for the ACONF, ACONF12, L7, MOR41, S30L, and S22 benchmark sets calculated with r2SCAN-3c(STO): Different BECKE and ZLMFIT settings (a) as well as timing quotients relative to the BECKE setting basic (b) and relative to the ZLMFIT setting basic (c).

Computational Details

Single-point calculations with r2SCAN-3c(STO) were performed with a development version of the Amsterdam Density Functional program ADF from the Amsterdam Modeling Suite AMS 2021.201 program package.[20,21] Consideration of molecular symmetry was turned off and the NumericalQuality was set to good. Benchmark sets that require computations of single atoms (AHB21,[59] ALKBDE10,[60] BH76,[61−63] BH76RC,[63] CHB6,[59] DIPCS10,[23] G21EA,[63,64] G21IP,[63,64] HEAVYSB11,[23] PA26,[23,60,63,65] RG18,[23] SIE4x4,[23] W4-11[66]) were calculated with the IntegerAufbau option to obtain integer instead of fractional orbital occupations. Scalar-relativistic effects were treated with the zeroth-order regular approximation (ZORA). For the D4 London dispersion correction and the geometrical counterpoise (gCP) scheme, the standalone programs dftd4 3.3.0(13,14) and mctc-gcp(15) were used. Further single-point calculations were conducted with PBE,[67] TPSS,[68] SCAN,[18] r2SCAN,[16,17] PBE0,[69,70] and B3LYP[71,72] in combination with the TZP, TZ2P, or QZ4P basis sets[73] for a timing comparison. Additionally, geometry optimizations and single-point energies were calculated with BP86[74,75]-D4/TZP and M06-2X[76]-D3(0)[77,78]/TZP. The same settings as in the r2SCAN-3c calculations were used in all calculations. Results for the GMTKN55 benchmark sets calculated with the GTO version of r2SCAN-3c were taken from the GMTKN55 database. The remaining test sets were computed with TURBOMOLE 7.5.1[56,57] using r2SCAN-3c with grid m4 and a radial grid size of 10. The resolution of identity (RI) approximation for the Coulomb energy was used with the same reduced auxiliary basis sets developed originally for B97-3c.[79−81] Default settings were used if not stated otherwise.

Results and Discussion

To compare the STO-based composite r2SCAN-3c method with the original GTO-based approach, the performance of both implementations is assessed on eight geometry and 74 energy benchmark sets. The mean deviations (MDs), mean absolute deviations (MADs), standard deviations (SDs), and root-mean-square deviations (RMSDs) for each test set are listed in the Supporting Information.

Geometries

The performance for the calculation of covalent bond lengths is assessed on benchmark sets that contain light main-group bonds (LMGB35[82]), heavy main-group bonds (HMGB11[82]), long main-group bond lengths (LB12[82]), transition metal complexes (TMC32[83]), and small semirigid organic molecules (CCse21[84,85]). Rotational constants are evaluated with the ROT34[86,87] test set. The results are summarized in Figures and 7.

Figure 6

Gaussian error distributions for a selection of covalent bond length benchmark sets. Negative mean deviations indicate overall too short bond lengths.

Figure 7

Mean absolute deviation (MAD) and standard deviation (SD) of different geometry benchmark sets calculated with both variants of r2SCAN-3c, as well as BP86-D4/TZP and M06-2X-D3(0)/TZP.

Gaussian error distributions for a selection of covalent bond length benchmark sets. Negative mean deviations indicate overall too short bond lengths. Mean absolute deviation (MAD) and standard deviation (SD) of different geometry benchmark sets calculated with both variants of r2SCAN-3c, as well as BP86-D4/TZP and M06-2X-D3(0)/TZP. In general, the GTO- and STO-based approaches of r2SCAN-3c yield similar results for the molecular geometries. Both gravitate toward slightly too long bonds for light and heavy main group bonds and toward too short bonds for transition metal complexes. In general, the error spread of r2SCAN-3c(STO) tends to be slightly larger than that of the GTO-based method, as described by the standard deviation of each test set, but the difference is small and the mean absolute deviation is lower with the STO variant. Overall, it is either on par with or outperforms the commonly used GGA method BP86-D4/TZP as well as the meta-hybrid method M06-2X-D3(0)/TZP, which performs well for small organic molecules. In calculations of bond angles and rotational constants, r2SCAN-3c(STO) yields smaller errors than the GTO-based composite method (about 14% difference in the MAD for both cases). To test noncovalent bonds, center-of-mass distances (RCMA) for the noncovalent interaction (NCI) benchmark sets S66x8[35,36,88] and HB300SPXx10[37] were calculated via a six-point cubic spline-interpolation ({0.90, 0.95, 1.00, 1.05, 1.10, 1.25}Re) of rigid fragment potential energy curves. The results are evaluated with respect to CCSD(T)/CBS and counterpoise-corrected MP2-F12/V{T,Q}Z-F12 reference values, respectively. The MADs and SDs are shown in Figure . In the S66 test set, which contains organic van der Waals and hydrogen-bonded systems, both implementations of r2SCAN-3c perform equally well with very similar statistical measures. For example, both yield too long noncovalent bonds with a MD of 5.3 pm for the STO-based composite method and 5.4 pm for the GTO-based method. Im comparison, BP86-D4/TZP and M06-2X-D3(0)/TZP yield systematically too short NCI contacts with a MD of −2.5 and −8.6 pm, respectively. The performance of both variants of r2SCAN-3c is remarkable, as their SD is about 1 pm smaller than that of BP86/TZP and M06-2X and the MAD is about 2.9 pm smaller than that of M06-2X-D3(0)/TZP. For the hydrogen-bonded systems in HB300SPX, r2SCAN-3c(STO) yields smaller deviations than the GTO-based method, and the respective MAD of 7.3 pm is halved compared to BP86/TZP, representing the smallest value in this study. Here, both versions of r2SCAN-3c yield slightly too short H-bonds with a MD of −1.5 pm for the STO variant and −0.8 pm for the GTO variant. These values are rather small compared to that of BP86-D4/TZP (MD = −12.9 pm) and M06-2X (MD = −6.6 pm), which both drastically underestimate H-bond lengths. Overall, r2SCAN-3c(STO) yields very similar results as the GTO-based method and is, in most cases, even slightly more accurate. It reaches the accuracy of computationally much more demanding hybrid/TZ approaches and can therefore be recommended for geometry optimizations.

Relative Energies

The study on relative energies includes 4405 data points in a range between −363.0 and 1290.7 kcal mol–1 with a mean reaction energy of 19.8 kcal mol–1 covering a broad area of the chemical space with tests for thermochemistry, reaction barriers, noncovalent interactions (NCIs), and conformational energies.

Main-Group Thermochemistry and Reaction Barriers

The extensive GMTKN55 database contains 55 versatile benchmark sets with CCSD(T)/CBS reference data for main-group thermochemistry, kinetics, and noncovalent interactions and represents an ideal base to validate density functional approximations. It consists of 1505 data points that can be categorized in the five subsets basic properties and reaction energies for small systems (basic properties), reaction energies for large systems (reactions), reaction barrier heights (barriers), and inter- as well as intramolecular NCIs. Because the average energies between the test sets vary significantly, the standard weighted MAD (WTMAD-2; see Supporting Information) is taken as a statistical performance measure.[23] The results are shown in Figure and Table .

Figure 8

Weighted mean absolute deviation (WTMAD-2) of the GMTKN55 as well as its subclasses computed with both variants of r2SCAN-3c.

Table 2

Weighted Mean Absolute Deviation (WTMAD-2) of the Entire GTMKN55, As Well As Its Subclasses Computed with r2SCAN-D4 and Both Variants of r2SCAN-3c

	TZP	TZ2P	QZ4P	3c(STO)	3c(GTO)
entire GMTKN55	8.45	7.97	7.41	7.15	7.50
basic properties	5.36	5.22	5.10	5.19	6.40
reactions	7.28	7.86	7.87	7.23	6.89
barriers	15.66	15.10	14.51	13.07	14.15
intermol. NCIs	9.84	9.03	7.33	7.53	7.22
intramol. NCIs	8.20	6.65	6.16	5.96	5.67

Weighted mean absolute deviation (WTMAD-2) of the GMTKN55 as well as its subclasses computed with both variants of r2SCAN-3c. The STO-based r2SCAN-3c composite method surpasses the accuracy of the parent functional r2SCAN-D4 in nearly all categories, independent of the basis set size. This is especially noticeable in the WTMAD-2 of the entire GMTKN55 database, where the STO version of r2SCAN-3c (7.15 kcal mol–1) yields the best results, followed by r2SCAN-D4/QZ4P (7.41 kcal mol–1), with TZ2P (7.97 kcal mol–1) and TZP (8.45 kcal mol–1). The WTMAD-2 of r2SCAN-3c(STO) for the entire database is also lower than that of the original GTO-based r2SCAN-3c (7.50 kcal mol–1). Notably, this performance approaches that of hybrid DFAs with large aug-def2-QZVP AO basis sets such as B3LYP-D4 (6.5 kcal mol–1) and PW6B95-D4 (5.5 kcal mol–1)[12] with a drastically reduced computational cost. The overall accuracy of both r2SCAN-3c implementations is similar, but the GTO-based approach yields slightly more accurate results for reactions and intramolecular NCIs, while the STO-based approach yields better results for basic properties and barrier heights. This behavior is also depicted in Figure , which shows the difference in the MADs and SDs of both r2SCAN-3c implementations. In this comparison, every positive value represents a better description by the GTO version and every negative value a better description by the STO version of r2SCAN-3c. The good performance of r2SCAN-3c(STO) for the basic properties presumably stems from the larger basis set that seemingly reduces the self-interaction error (SIE), as is also observed in the SIE4x4 test set.

Figure 9

Relative mean absolute deviation (MAD) and standard deviation (SD) of the STO version of r2SCAN-3c with respect to r2SCAN-3c (GTO) calculated on the GMTKN55 and several other benchmark sets.

Noncovalent Interactions

In addition to the GMTKN55 benchmark collection, more recent benchmark sets on noncovalent interactions are assessed in this section. Among them are large complexes (S30L, L7), various chalcogen (CHAL336[89]), halogen (X40x10[90]), and hydrogen bonding sets (HB300SPX), ion−π interactions (IONPI19), a blind test for DFT-based methods (NCIBLIND10), and repulsive intermolecular contacts (R160x6[12,91,92]). For the L7 set, average values of the respective LNO-CCSD(T) and fixed-node diffusion Monte Carlo (FN-DMC) interaction energies published by Al-Hamdani et al.[40] were used as reference. Similar to the statistics for the GMTKN55, the MAD values for the additional NCI benchmark sets in Figure show that r2SCAN-3c(STO) is either on par with r2SCAN-D4/QZ4P or even more accurate. In particular, in the S30L benchmark for association energies of realistic host–guest complexes, r2SCAN-3c stands out as the MAD is 56% lower than that of r2SCAN-D4/QZ4P. The dominant contribution to the interaction energies is London dispersion, which might indicate that the parametrization of the D4 correction in r2SCAN-D4 is not optimal for STOs. Unexpectedly, it is also observed that the largest tested basis set, QZ4P, does not always yield more accurate results than the smaller basis sets. Compared to the original GTO-based r2SCAN-3c, the STO approach yields overall similar results (cf. Figure ). The largest difference is observed in the IONPI19 benchmark set, which is shown in Figure . Here, the STO variant of r2SCAN-3c yields a 35% lower MAD of 0.83 kcal mol–1.

Figure 10

MADs of the additional NCI benchmark sets calculated with r2SCAN-3c(STO) and r2SCAN-D4 in combination with different STO basis sets.

Figure 11

Deviations of the IONPI19 benchmark set calculated with both versions of r2SCAN-3c.

MADs of the additional NCI benchmark sets calculated with r2SCAN-3c(STO) and r2SCAN-D4 in combination with different STO basis sets. Deviations of the IONPI19 benchmark set calculated with both versions of r2SCAN-3c.

Conformational Energies

The conformations of a molecule have a direct influence on chemical properties.[93,94] Therefore, it can be crucial to consider a conformer ensemble that is routinely created with methods that apply semiempirical methods, for example, with the CREST algorithm.[95] They still require a subsequent higher-level energy ranking for which DFT is usually employed.[96] One of the remarkable features of the original r2SCAN-3c implementation is the very good performance for conformational energies where it surpasses the accuracy of hybrid-DFT/QZ approaches at a considerably lower computational cost.[12] Thus, the implementation with STOs should ideally perform similarly. In addition to the eight conformer tests sets of the GMTKN55 database, the ACONF12 set with long alkane chains, TMCONF16[97] with transition metal complexes, and MPCONF196,[98] as well as 37CONF8,[99] with large molecules are evaluated in this section. The TMCONF16 set is essentially the TMCONF5 benchmark set without the AYISEG system. The results are depicted in Figures and 13.

Figure 12

MADs of r2SCAN-3c(STO) and r2SCAN-D4 in combination with different STO basis sets for conformational energy benchmark sets.

Figure 13

Conformational energies calculated with both versions of r2SCAN-3c for the ACONF12 benchmark set as well as the Pearson coefficients ρp. The Spearman correlation coefficient is ρs = 1 for both variants of r2SCAN-3c. The reference was calculated at DLPNO–CCSD(T1)/VeryTightPNO/CBS level of theory.[12]

MADs of r2SCAN-3c(STO) and r2SCAN-D4 in combination with different STO basis sets for conformational energy benchmark sets. Conformational energies calculated with both versions of r2SCAN-3c for the ACONF12 benchmark set as well as the Pearson coefficients ρp. The Spearman correlation coefficient is ρs = 1 for both variants of r2SCAN-3c. The reference was calculated at DLPNO–CCSD(T1)/VeryTightPNO/CBS level of theory.[12] For both large molecule test sets (37CONF8 and MPCONF196), r2SCAN-3c(STO) yields slightly better results than r2SCAN-D4, similar to the findings in the previous sections. However, it yields larger deviations for alkane chains and transition metal complexes. These deviations are also observed in the comparison between the results obtained with both implementations (Figure ). Nevertheless, the error is still small and practically negligible. The conformational energies in the ACONF12 test set are depicted in Figure . While the MAD value of 0.14 kcal mol–1) with the GTO approach is lower than with the STOs (MAD = 0.35 kcal mol–1), the relative energy ranking is better described by r2SCAN-3c(STO), which can be derived from the better Pearson correlation coefficient (ρp = 0.999).

Organometallic Thermochemistry

As the GMTKN55 database does not include any transition metal complexes, additional test sets are evaluated in this section. Reaction energies of closed-shell complexes are considered with the MOR41 and WCCR10[100,101] benchmark sets, as well as open-shell systems in the ROST61[26] set. Transition metal barrier heights are tested on the TMBH[102−105] benchmark set, which contains 34 barrier heights and on the revised MOBH35 benchmark set, termed revMOBH35.[106] The original MOBH35[107,108] set is also included to provide comparability to prior works. Binding energies of metal-linked alkyl chains are assessed on the MLA24[109] benchmark set. The results are depicted in Figures and 14.

Figure 14

MADs of r2SCAN-3c(STO) and r2SCAN-D4 in combination with different STO basis sets for organometallic thermochemistry benchmark sets.

MADs of r2SCAN-3c(STO) and r2SCAN-D4 in combination with different STO basis sets for organometallic thermochemistry benchmark sets. In all tests except MOR41, r2SCAN-3c(STO) outperforms r2SCAN-D4, independent of the applied STO basis set. However, the differences between the DFAs are rather small considering the range of energies included. The largest difference is found for reaction barrier heights in the TMBH set where the MAD of r2SCAN-3c(STO) (MAD = 2.61 kcal mol–1) is 19% lower than the MAD of r2SCAN-D4/QZ4P (MAD = 2.91 kcal mol–1). Also, the deviation between both implementations of r2SCAN-3c is fairly small. While r2SCAN-3c(GTO) yields lower errors for open-shell systems (ROST61), the STO-based method yields lower errors for closed-shell systems (MOR41). The analysis for the MOR41 benchmark set in Figure reveals that the STO-based approach is more accurate in this test set due to a better description of systems with π-interactions, which is in line with the findings for the noncovalent interaction benchmark set IONPI19 (cf. Figure ).

Figure 15

Deviations of the MOR41 benchmark set calculated with both versions of r2SCAN-3c.

Computation Time

In this section, the timings for single-point energy calculations within the ADF code are assessed on a small test set that includes eight data points that were taken from the MOR41 (13, 40), ROST61 (R31, R33), S30L (9, 19), and L7 (GGG, C2C2PD) benchmark sets. The performance of all tested STO-based DFAs is depicted in Figure .

Figure 16

MADs and wall-time quotients relative to STO-type r2SCAN-3c (1.0 = 38691 s) in a single-point energy calculation for eight structures from the S30L, MOR41, L7, and ROST61 benchmark sets. All calculations were carried out with ADF and include the D4 correction. Note that the wall-times for the semiclassical D4 and gCP corrections are negligible at 0.35 and 0.15 s, respectively. Computations were done on four Intel Xeon CPU E3-1270 v5@3.60 GHz cores. In this comparison, r2SCAN-3c(STO) is the most efficient DFA. It is faster than PBE0/TZ2P by a factor of 2.51 and faster than PBE0/QZ4P by a factor of 10.65. Surprisingly, the timing difference between meta-GGA and hybrid DFAs is not as large as in common GTO-based codes such as ORCA[110] and TURBOMOLE,[56,57] where hybrid DFAs are by a factor of about 15–20 slower than meta-GGA DFAs. In the STO-based ADF code, r2SCAN/TZ2P is only 1.3 times faster than PBE0/TZ2P, although the latter requires the additional computation of Fock exchange. This might be an effect of the Libxc implementation of r2SCAN(STO), which may slow down the computation. For example, the GGA PBE is about twice as fast in the native implementation compared to the Libxc variant (cf. Supporting Information). To compare the modified basis set of r2SCAN-3c with the underlying TZP and TZ2P basis sets, we also tested the computation time on water clusters of different sizes. The results are depicted in Figure . Overall, the STO basis set applied in r2SCAN-3c lead to similar computation times as the TZP basis set and the mTZ2P basis set is about twice as fast as the TZ2P basis set.

Figure 17

Wall-time for a single-point calculation of water clusters with different sizes applying r2SCAN-3c(STO) as well as r2SCAN-D4 in combination with TZP and TZ2P. All computations were done on four cores with the same CPU as in Figure .

Conclusion

In this work, we presented the Slater-type atomic orbital basis set optimized variant of the composite electronic-structure method r2SCAN-3c. It combines the meta-generalized-gradient-approximation density functional r2SCAN with a tailored triple-ζ all-electron STO basis set and applies the readjusted semiclassical D4 and gCP corrections for London dispersion effects and basis set superposition errors, respectively. Instead of the originally applied effective core potentials in the GTO approach, relativistic effects are treated explicitly with the scalar-relativistic zeroth-order regular approximation (SR-ZORA), keeping the flexibility to also apply spin–orbit relativistic ZORA. For robust and accurate results, the NumericalQuality should be generally set to the good level in the ADF code. In this comprehensive study, the performance of r2SCAN-3c was assessed on a collection of 82 benchmark sets that cover geometries, thermochemistry, barrier heights, noncovalent interactions, and conformational energies of main-group as well as transition metal systems. In total, 621 data points for geometrical properties and 4405 data points for energies were evaluated for both implementations of r2SCAN-3c. In the geometry study, r2SCAN-3c(STO) has proven to be on par with or better than the hybrid M06-2X-D3(0)/TZP approach. In the energy study, r2SCAN-3c was further compared to r2SCAN-D4 in combination with different sizes of STO basis sets. It was shown that r2SCAN-3c(STO), in most cases, provides more accurate results than r2SCAN-D4/QZ4P at a 6-fold speed-up. The most significant improvement over the large basis set was found for noncovalent interactions of large systems (S30L) where the MADs are 1.59 and 3.63 kcal mol–1, respectively. On average, r2SCAN-3c(STO) yields similar results as the original GTO version which was also observed for geometrical properties. Reaction energies and intramolecular NCIs, such as conformational energies, are slightly better described with the GTO approach but basic properties and intermolecular NCIs, such as ion-π interactions, are better described with the STO approach. This results in a lower WTMAD-2 for the GMTKN55 database with the STO version (WTMAD-2 = 7.15 kcal mol–1) instead of the GTO version (WTMAD-2 = 7.50 kcal mol–1). Overall, r2SCAN-3c reaches the accuracy of hybrid DFAs, which apply quadruple-ζ AO basis sets at a significantly reduced computational cost. The fast, robust, and accurate STO-based r2SCAN-3c method can be applied safely for a broad range of quantum chemical problems and therefore represents an efficient choice in many chemical applications.

78 in total

1. Generalized Gradient Approximation Made Simple.

Authors:
Journal: Phys Rev Lett Date: 1996-10-28 Impact factor: 9.161

2. Comparative Assessment of DFT Performances in Ru- and Rh-Promoted σ-Bond Activations.

Authors: Yuanyuan Sun; Lianrui Hu; Hui Chen
Journal: J Chem Theory Comput Date: 2015-03-04 Impact factor: 6.006

3. Effect of the damping function in dispersion corrected density functional theory.

Authors: Stefan Grimme; Stephan Ehrlich; Lars Goerigk
Journal: J Comput Chem Date: 2011-03-01 Impact factor: 3.376

4. Benchmarking London dispersion corrected density functional theory for noncovalent ion-π interactions.

Authors: Sebastian Spicher; Eike Caldeweyher; Andreas Hansen; Stefan Grimme
Journal: Phys Chem Chem Phys Date: 2021-05-26 Impact factor: 3.676

5. r²SCAN-3c: A "Swiss army knife" composite electronic-structure method.

Authors: Stefan Grimme; Andreas Hansen; Sebastian Ehlert; Jan-Michael Mewes
Journal: J Chem Phys Date: 2021-02-14 Impact factor: 3.488

6. Extension of the D3 dispersion coefficient model.

Authors: Eike Caldeweyher; Christoph Bannwarth; Stefan Grimme
Journal: J Chem Phys Date: 2017-07-21 Impact factor: 3.488

7. Blind test of density-functional-based methods on intermolecular interaction energies.

Authors: DeCarlos E Taylor; János G Ángyán; Giulia Galli; Cui Zhang; Francois Gygi; Kimihiko Hirao; Jong Won Song; Kar Rahul; O Anatole von Lilienfeld; Rafał Podeszwa; Ireneusz W Bulik; Thomas M Henderson; Gustavo E Scuseria; Julien Toulouse; Roberto Peverati; Donald G Truhlar; Krzysztof Szalewicz
Journal: J Chem Phys Date: 2016-09-28 Impact factor: 3.488

8. A look at the density functional theory zoo with the advanced GMTKN55 database for general main group thermochemistry, kinetics and noncovalent interactions.

Authors: Lars Goerigk; Andreas Hansen; Christoph Bauer; Stephan Ehrlich; Asim Najibi; Stefan Grimme
Journal: Phys Chem Chem Phys Date: 2017-12-13 Impact factor: 3.676

9. Calculation of Ligand Dissociation Energies in Large Transition-Metal Complexes.

Authors: Tamara Husch; Leon Freitag; Markus Reiher
Journal: J Chem Theory Comput Date: 2018-04-13 Impact factor: 6.006

10. Performance of Density Functionals for Activation Energies of Re-Catalyzed Organic Reactions.

Authors: Yihua Sun; Hui Chen
Journal: J Chem Theory Comput Date: 2014-01-07 Impact factor: 6.006