| Literature DB >> 33200931 |
Emmanouil Semidalas1, Jan M L Martin1.
Abstract
A hierarchy of wavefunction composite methods (cWFT), based on G4-type cWFT methods available for elements H through Rn, was recently reported by the present authors [ J. Chem. Theor. Comput. 2020, 16, 4238]. We extend this hierarchy by considering the inner-shell correlation energy in the second-order Møller-Plesset correction and replacing the Weigend-Ahlrichs def2-mZVPP(D) basis sets used with complete basis set extrapolation from augmented correlation-consistent core-valence triple-ζ, aug-cc-pwCVTZ(-PP), and quadruple-ζ, aug-cc-pwCVQZ(-PP), basis sets, thus creating cc-G4-type methods. For the large and chemically diverse GMTKN55 benchmark suite, they represent a substantial further improvement and bring WTMAD2 (weighted mean absolute deviation) down below 1 kcal/mol. Intriguingly, the lion's share of the improvement comes from better capture of valence correlation; the inclusion of core-valence correlation is almost an order of magnitude less important. These robust correlation-consistent cWFT methods approach the CCSD(T) complete basis limit with just one or a few fitted parameters. Particularly, the DLPNO variants such as cc-G4-T-DLPNO are applicable to fairly large molecules at a modest computational cost, as is (for a reduced range of elements) a different variant using MP2-F12/cc-pVTZ-F12 for the MP2 component.Entities:
Year: 2020 PMID: 33200931 PMCID: PMC7735707 DOI: 10.1021/acs.jctc.0c01106
Source DB: PubMed Journal: J Chem Theory Comput ISSN: 1549-9618 Impact factor: 6.006
Figure 1Naming scheme of the correlation-consistent cc-G4-type methods.
Statistical Errors (kcal/mol) of Recommended Methods and Selected Other WFT, cWFT, and DFT Methods for the GMTKN55 Database with the WTMAD2 Component Breakdown for the Top-Level Subsetsa
| methods | WTMAD2 | thermo | barrier | large | confor | intermol |
|---|---|---|---|---|---|---|
| cc-G4-T-v2 | 0.87 | 0.20 | 0.09 | 0.17 | 0.11 | 0.29 |
| Ditto frozen core | 0.94 | 0.21 | 0.10 | 0.19 | 0.12 | 0.32 |
| 0.90 | 0.19 | 0.11 | 0.176 | 0.125 | 0.30 | |
| Ditto frozen core | 0.99 | 0.21 | 0.12 | 0.20 | 0.13 | 0.33 |
| 1.00 | 0.17 | 0.08 | 0.17 | 0.31 | 0.27 | |
| 1.03 | 0.20 | 0.14 | 0.24 | 0.17 | 0.28 | |
| 1.185 | 0.29 | 0.11 | 0.21 | 0.11 | 0.47 | |
| 1.193 | 0.22 | 0.09 | 0.21 | 0.36 | 0.31 | |
| 1.194 | 0.25 | 0.12 | 0.22 | 0.36 | 0.24 | |
| 1.37 | 0.27 | 0.14 | 0.25 | 0.30 | 0.41 | |
| 1.84 | 0.25 | 0.12 | 0.35 | 0.48 | 0.64 | |
| 2.21 | 0.43 | 0.28 | 0.31 | 0.58 | 0.61 | |
| cc-MP2.X-Q | 2.89 | 0.57 | 0.70 | 0.62 | 0.60 | 0.40 |
| cc-MP2.X-T | 3.09 | 0.60 | 0.64 | 0.63 | 0.61 | 0.61 |
| G4(MP2)-XK-T[ | 1.42 | 0.39 | 0.16 | 0.18 | 0.13 | 0.56 |
| G4-T-v1[ | 1.46 | 0.31 | 0.16 | 0.22 | 0.16 | 0.61 |
| G4-T-v2[ | 1.49 | 0.32 | 0.15 | 0.23 | 0.17 | 0.63 |
| G4-Q-DLPNO[ | 1.52 | 0.25 | 0.12 | 0.20 | 0.46 | 0.49 |
| G4-T-DLPNO[ | 1.66 | 0.26 | 0.12 | 0.24 | 0.52 | 0.52 |
| G4(MP3)-D[ | 1.65 | 0.37 | 0.17 | 0.28 | 0.30 | 0.55 |
| G4(MP3|KS)-D[ | 1.96 | 0.41 | 0.28 | 0.26 | 0.45 | 0.56 |
| G4(MP2)-XK-D[ | 2.56 | 0.46 | 0.29 | 0.34 | 0.68 | 0.79 |
| G4[ | 2.52 | 0.38 | 0.23 | 0.75 | 0.38 | 0.78 |
| G4(MP2)4 | 2.96 | 0.53 | 0.34 | 0.91 | 0.33 | 0.85 |
| CBS-QB3[ | 3.10 | 0.40 | 0.35 | 0.60 | 0.20 | 1.55 |
| MP2.X-Q[ | 3.29 | 0.71 | 0.78 | 0.88 | 0.42 | 0.50 |
| rev-G4MP2XK[ | 3.53 | 0.50 | 0.29 | 0.61 | 1.16 | 0.96 |
| G4(MP2)-XK[ | 3.71 | 0.45 | 0.31 | 0.67 | 1.25 | 1.02 |
| MP2.X-T[ | 3.78 | 0.76 | 0.81 | 0.89 | 0.51 | 0.81 |
| SCS-MP2-D3[ | 5.22 | 1.23 | 0.95 | 1.39 | 0.91 | 0.75 |
| SCS-MP2[ | 5.35 | 0.94 | 1.01 | 1.15 | 1.02 | 1.23 |
| MP2-D3 | 5.83 | 1.21 | 1.21 | 1.66 | 0.87 | 0.87 |
| MP2-D3 | 5.54 | 1.20 | 1.18 | 1.52 | 0.80 | 0.84 |
| MP2 | 6.91 | 1.21 | 1.23 | 1.78 | 1.47 | 1.21 |
| HF-D3 | 13.08 | 5.05 | 2.65 | 2.06 | 1.85 | 1.48 |
| HF | 29.46 | 5.87 | 3.74 | 3.66 | 7.27 | 8.92 |
| ωB97M(2)[ | 2.19 | 0.44 | 0.26 | 0.42 | 0.58 | 0.49 |
| xrevDSD-PBEP86-D4[ | 2.26 | 0.56 | 0.27 | 0.52 | 0.43 | 0.47 |
| revDSD-PBEP86-D4[ | 2.33 | 0.56 | 0.31 | 0.58 | 0.41 | 0.48 |
| revDOD-PBEP86-D4[ | 2.36 | 0.59 | 0.30 | 0.59 | 0.41 | 0.47 |
| revDSD-PBEP86-NL | 2.44 | 0.55 | 0.30 | 0.55 | 0.47 | 0.57 |
| revDSD-PBE-D4[ | 2.46 | 0.65 | 0.35 | 0.53 | 0.43 | 0.50 |
| revDSD-PBEP86-D3[ | 2.42 | 0.54 | 0.31 | 0.55 | 0.46 | 0.57 |
| revDSD-BLYP-D4[ | 2.59 | 0.57 | 0.34 | 0.58 | 0.48 | 0.62 |
| DSD-SCAN-D4[ | 2.64 | 0.60 | 0.40 | 0.62 | 0.45 | 0.56 |
| DSD-PBE-D4[ | 2.64 | 0.61 | 0.39 | 0.56 | 0.53 | 0.54 |
| DSD-PBEP86-D4[ | 2.65 | 0.54 | 0.37 | 0.63 | 0.55 | 0.56 |
| revDSD-PBEB95-D4[ | 2.70 | 0.64 | 0.31 | 0.45 | 0.78 | 0.52 |
| DSD-BLYP-D4[ | 2.83 | 0.58 | 0.38 | 0.59 | 0.68 | 0.60 |
| DSD-PBEP86-D3[ | 3.10 | 0.55 | 0.45 | 0.49 | 0.65 | 0.97 |
| DSD-PBE-D3[ | 3.17 | 0.66 | 0.41 | 0.54 | 0.73 | 0.83 |
| B2GP-PLYP-D3[ | 3.19 | 0.63 | 0.42 | 0.66 | 0.64 | 0.85 |
| ωB97M-V[ | 3.29 | 0.73 | 0.45 | 0.64 | 0.90 | 0.57 |
| ωB97X-V[ | 3.96 | 1.02 | 0.56 | 1.07 | 0.73 | 0.58 |
| M06-2X-D3(0)[ | 4.79 | 0.86 | 0.48 | 1.08 | 1.22 | 1.14 |
| B3LYP-D3 | 6.50 | 1.31 | 1.14 | 1.66 | 1.15 | 1.24 |
D3(BJ) is abbreviated as D3 in this table; M06-2X was evaluated with a D3(0) correction, to make D3BJ parameters and the WTMAD2 value of M06-2X without D3(0) identical. Tabulated data for the DFT methods employing the def2-QZVPP basis set (def2-QZVPPD for subsets AHB21, G21EA, IL16, RG18, and WATER27) were obtained from refs (43, 44), while the WFT (MP2, SCS-MP2, and HF) data in the same basis sets were obtained from ref (38) as were all cWFT results without inner-shell correlation.
The results from the conventional G4,[3] G4(MP2),[4] CBS-QB3,[8,9] and G4(MP2)-XK[35] methods were obtained from ref (38).
D3(BJ) parameters obtained from Table S1 of ref (112).
α1 = 0, α2 = 5.5, s6 = −0.345, s8 = 0 from ref (38).
From ref (38), D3(BJ) parameters from Table 2 of the original D3(BJ) paper.[78]
The WTMAD2 component breakdown is for 1320 reactions (+63 more than in our previous work).[38]
WTMAD2 (kcal/mol) and Optimized Parameters of Selected Standard and Correlation-Consistent cWFT Methodsd,e,f,g,h
| energy
components coefficients | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MP2 | MP3 | CCSD(T) | method | WTMAD2 | HLC | ||||||||
| T | 12 | G4(MP2)-XK | 3.71 | 1.131 | 0.512 | 1.041 | 0.704 | 1.048 | 0.526 | [0] | Y | ||
| 12 | revG4(MP2)-XK-H6-v1 | 3.53 | 1.307 | 0.385 | 1.170 | 0.614 | 0.984 | 0.736 | [0] | Y | |||
| 7 | G4(MP2)-XK-D-v1 | 2.56 | 1.309 | 0.674 | 1.124 | 0.890 | 1.113 | 0.699 | –0.383 | ||||
| 6 | G4(MP2)-XK-D-v2 | 2.73 | 1.482 | 0.271 | 1.320 | 0.457 | 1.052 | 0.778 | [0] | ||||
| 7 | 2.21 | 1.379 | 0.414 | 1.187 | 0.617 | 1.094 | 0.854 | –0.358 | |||||
| 7 | Ditto (with MP2(FC)) | 2.25 | 1.307 | 0.510 | 1.118 | 0.751 | 1.103 | 0.726 | –0.418 | ||||
| 6 | cc-G4(MP2)-XK-D-v2 | 2.46 | 1.392 | 0.220 | 1.239 | 0.384 | 1.061 | 0.713 | [0] | ||||
| 6 | Ditto (with MP2(FC)) | 2.52 | 1.441 | 0.147 | 1.284 | 0.327 | 1.050 | 0.756 | [0] | ||||
| 6 | G4(MP2)-XK-T-v2 | 1.42 | 1.624 | 0.811 | 1.526 | 0.833 | 1.070 | 0.959 | [0] | ||||
| 6 | 1.19 | 1.317 | 0.586 | 1.234 | 0.597 | 1.059 | 0.976 | [0] | |||||
| 6 | Ditto (with MP2(FC)) | 1.24 | 1.442 | 0.438 | 1.345 | 0.453 | 1.059 | 0.995 | [0] | ||||
| 5 | G4(MP2)-D-v1 | 2.68 | 1.226 | 0.977 | 1.111 | 0.776 | –0.456 | ||||||
| 4 | G4(MP2)-D-v2 | 3.01 | 1.045 | 0.968 | 1.033 | 0.796 | [0] | ||||||
| 5 | cc-G4(MP2)-D-v1 | 2.35 | 1.123 | 0.979 | 1.078 | 0.951 | –0.487 | ||||||
| 4 | cc-G4(MP2)-D-v2 | 2.77 | 0.954 | 0.968 | 0.998 | 0.947 | [0] | ||||||
| { | 4 | G4-T-v1 | 1.46 | 0.577 | 1.064 | 1.177 | –0.072 | ||||||
| { | 3 | G4-T-v2 | 1.49 | 0.593 | 1.061 | 1.103 | [0] | ||||||
| { | 4 | cc-G4-T-v1 | 0.87 | 0.668 | 1.053 | 1.128 | –0.006 | ||||||
| { | 3 | cc-G4-T-v2 | 0.87 | 0.671 | 1.054 | 1.126 | [0] | ||||||
| { | 3 | cc-G4-T-v2(FC) | 0.94 | 0.787 | 1.051 | 1.139 | [0] | ||||||
| { | 2 | 0.90 | 0.626 | 1.029 | 1.029 | [0] | |||||||
| { | 2 | cc-G4-T-v6(FC) | 0.99 | 0.707 | 1.017 | 1.017 | [0] | ||||||
| { | 1 | cc-G4-T-v7 | 0.92 | 0.643 | 1.000 | 1.000 | [0] | ||||||
| { | 3 | G4-T-DLPNO-v2 | 1.66 | 0.601 | 1.019 | 1.204 | [0] | ||||||
| { | 3 | G4-Q-DLPNO-v2 | 1.52 | 0.513 | 1.003 | 1.185 | [0] | ||||||
| { | 3 | cc-G4-T-DLPNO-v2 | 1.19 | 0.604 | 0.999 | 1.181 | [0] | ||||||
| { | 3 | Ditto (with MP2(FC)) | 1.21 | 0.708 | 0.999 | 1.193 | [0] | ||||||
| { | 3 | 1.00 | 0.554 | 0.998 | 1.156 | [0] | |||||||
| { | 3 | Ditto (with MP2(FC)) | 1.00 | 0.679 | 0.997 | 1.173 | [0] | ||||||
| 6 | G4(MP3)-D-v1 | 1.65 | 1.284 | 1.018 | 1.089 | 1.119 | 0.411 | –0.185 | |||||
| 6 | 1.37 | 1.196 | 1.010 | 1.021 | 1.121 | 0.572 | –0.235 | ||||||
| 4 | MP2.X-Q | 3.28 | 1.117 | 1.036 | 0.824 | [0] | [0] | –0.062 | |||||
| 4 | cc-MP2.X-Q | 2.89 | 0.879 | 1.133 | 0.689 | [0] | [0] | –0.052 | |||||
| { | 3 | G4(scsMP2.X)-D-v8 | 1.84 | 0.633 | 0.906 | 0.965 | 1.026 | 1.026 | [0] | ||||
| { | 3 | cc-G4(scsMP2.X)-D-v8 | 1.44 | 1.617 | 0.412 | 0.926 | 1.003 | 1.003 | [0] | ||||
| MP2-F12 | MP3 | CCSD(T) | |||||||||||
| 1.03 | 1.000 | 1.000 | 1.000 | [0] | |||||||||
| T(DLPNO) | 1 | 1.19 | 1.000 | 1.000 | 1.204 | [0] | |||||||
| (d) | (g) | 6-31G(d) | 6 | G4 | 2.52 | Y | |||||||
| (e) | 6-31G(d) | 6 | G4(MP2) | 2.96 | Y | ||||||||
| (f) | (h) | 6-31+G(d’) | 3 | CBS-QB3 | 3.10 | ||||||||
aug-cc-pwCVmZ(-PP) for both HF/CBS and E2 correlation energy.
The results from the conventional G4(MP2)-XK[35] and standard cWFT methods were obtained from ref (38) based on the GMTKN55 geometries.
The WTMAD2 component breakdown is for 1320 reactions (+63 more than in our previous work[38]).
G3LargeXP denotes the G3Large basis set used in the G3 method but the 2df and 3d2f polarization functions are replaced by 3df and 4d2f on the second-row atoms, respectively.
G3MP2LargeXP is a variant of G3LargeXP in which the core polarization functions of G3LargeXP are deleted.
CBSB3 stands for the 6-311++G(3d2f,2df,2p) basis set, where 3d2f functions are added on second-row atoms, 2df on first-row atoms, and 2p on hydrogen.
MP4/6-31G(2df,p) and MP4/6-31+G(d).
MP4(SDQ)/CBSB4. CBSB4 denotes the 6-31+G(d(f),p) basis set with a d function on first- and second-row atoms, an f function on selected second-row atoms, and a p function on hydrogen.
Figure 2Contribution of each subset of the GMTKN55 database to the WTMAD2 (kcal/mol) for the most accurate two-tier standard (G4-T-v1, four parameters) and correlation-consistent (cc-G4-T-v1 and cc-G4-T-v6, four and two parameters, respectively) cWFT methods.
Summary of Recommended Correlation-Consistent cWFT Methods
| methods | WTMAD2 (kcal/mol) | parameters |
|---|---|---|
| cc-G4-T | 0.90 | 2 |
| cc-G4-Q-DLPNO | 1.00 | 3;2 |
| cc-G4-F12-T | 1.04 | |
| cc-G4(MP2)-XK-T | 1.185 | 6 |
| cc-G4-T-DLPNO | 1.193 | 3;2 |
| cc-G4-F12-T-DLPNO | 1.194 | 1 |
| cc-G4(MP3)-D | 1.37 | 6 |
| cc-G4-D-DLPNO | 1.84 | 3 |
| cc-G4(MP2)-XK-D | 2.21 | 7 |
If one fixes cCCSD-MP2 = 1; effect on WTMAD2 invisible to the precision given.
Figure 3Overall performance of selected composite methods and double-hybrid DFT over the GMTKN55 database based on the weighted mean absolute deviation (WTMAD2 in kcal/mol).
Wall Clock Times (Min) of the Top-Performing Composite Wavefunction Methods on Two 18-Core Intel Xeon Gold 6240 CPUs (2.30 GHz) for Species from the W4-11 and ADIM6 Datasets
| species | cc-G4-T | cc-G4-T-DLPNO | cc-G4-D-DLPNO | cc-G4-F12-T-DLPNO | W4 |
|---|---|---|---|---|---|
| HCl | 2.4 | 2.9 | 2.3 | 1.3 | 5.9 |
| HS | 4.2 | 5.8 | 4.2 | 3.1 | 7.0 |
| H2O | 2.3 | 3.2 | 2.4 | 1.7 | 18.1 |
| ALH3 | 7.4 | 8.2 | 7.3 | 1.9 | 23.9 |
| BH3 | 3.3 | 3.9 | 3.3 | 1.5 | 42.4 |
| PH3 | 7.1 | 8.9 | 7.3 | 3.0 | 65.7 |
| HCN | 6.9 | 9.1 | 7.2 | 3.7 | 82.0 |
| HOF | 6.8 | 11.6 | 8.4 | 6.1 | 862.2 |
| ethane...ethane | 11.4 | 13.5 | 11.6 | 5.4 | |
| propane...propane | 39.9 | 43.4 | 38.6 | 14.2 | |
| butane...butane | 109.3 | 115.3 | 106.5 | 29.5 | |
| pentane...pentane | 255.7 | 197.4 | 179.4 | 56.7 | |
| hexane...hexane | 582.7 | 334.9 | 306.6 | 95.1 | |
| heptane...heptane | 832.5 | 518.3 | 475.7 | 151.4 |