| Literature DB >> 35394778 |
Ádám Ganyecz1, Mihály Kállay1.
Abstract
In this work, we implemented the embedded cluster reference interaction site model (EC-RISM) originally developed by Kloss, Heil, and Kast (J. Phys. Chem. B 2008, 112, 4337-4343). This method combines quantum mechanical calculations with the 3D reference interaction site model (3D-RISM). Numerous options, such as buffer, grid space, basis set, charge model, water model, closure relation, and so forth, were investigated to find the best settings. Additionally, the small point charges, which are derived from the solvent distribution from the 3D-RISM solution to represent the solvent in the QM calculation, were neglected to reduce the overhead without the loss of accuracy. On the MNSOL[a], MNSOL, and FreeSolv databases, our implemented and optimized method provides solvation free energies in water with 5.70, 6.32, and 6.44 kJ/mol root-mean-square deviations, respectively, but with different settings, 5.22, 6.08, and 6.63 kJ/mol can also be achieved. Only solvent models containing fitting parameters, like COSMO-RS and EC-RISM with universal correction and directly used electrostatic potential, perform better than our EC-RISM implementation with atomic charges.Entities:
Year: 2022 PMID: 35394778 PMCID: PMC9036516 DOI: 10.1021/acs.jpca.1c07904
Source DB: PubMed Journal: J Phys Chem A ISSN: 1089-5639 Impact factor: 2.944
Figure 1Flowchart of our EC-RISM workflow.
Parameters of the Water Models Employed in This Work
| cSPC/E | cTIP3P | cOPC3 | cPOL3 | mSPC/E | |
|---|---|---|---|---|---|
| 1.0000 | 0.9572 | 1.0000 | 0.9789 | 1.0000 | |
| θHOH (deg) | 109.47 | 104.52 | 109.47 | 109.47 | 109.47 |
| –0.8476 | –0.8340 | –0.7300 | –0.8952 | –0.8476 | |
| 0.4238 | 0.4170 | 0.3650 | 0.4476 | 0.4238 | |
| σO (Å) | 3.1658 | 3.1507 | 3.2037 | 3.1743 | 3.1658 |
| σH (Å) | 1.1658 | 1.2363 | 1.2037 | 1.2165 | 1.0000 |
| ϵO (kJ/mol) | 0.64978 | 0.63597 | 0.65270 | 0.68369 | 0.64978 |
| ϵH (kJ/mol) | 0.064978 | 0.063597 | 0.065270 | 0.068369 | 0.234304 |
Calculated using .[80]
Calculated using ϵH = 0. 1ϵO.[80]
Comparison of 3D-RISM Solvation Free Energies (in kJ/mol) and Runtimes (in s) with Different Buffer and Grid Space for Piperazine with PSE-3 Closure, cSPC/E Water Model, and PC+ Correction
| grid space (Å) | 0.1 | 0.2 | 0.3 | 0.4 | 0.5 | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| buffer (Å) | Δ | time | Δ | time | Δ | time | Δ | time | Δ | time |
| 5 | –36.01 | 41.2 | –35.96 | 4.2 | –35.88 | 1.3 | –35.81 | 0.5 | –35.40 | 0.3 |
| 10 | –36.11 | 175.1 | –36.07 | 21.5 | –36.09 | 6.3 | –36.07 | 2.1 | –35.71 | 1.2 |
| 15 | –36.11 | 494.5 | –36.07 | 64.7 | –36.10 | 19.8 | –36.07 | 6.2 | –35.72 | 3.2 |
| 20 | –36.12 | 959.8 | –36.07 | 123.2 | –36.11 | 38.9 | –36.08 | 13.2 | –35.73 | 7.0 |
| 25 | –36.12 | 1960.6 | –36.07 | 240.2 | –36.11 | 78.9 | –36.08 | 26.5 | –35.73 | 14.1 |
| 30 | –36.13 | 3112.4 | –36.07 | 389.1 | –36.11 | 120.0 | –36.09 | 44.7 | –35.73 | 26.8 |
| 35 | –36.13 | 4758.8 | –36.07 | 608.3 | –36.11 | 173.9 | –36.09 | 70.7 | –35.73 | 35.4 |
| 40 | –36.06 | 863.0 | –36.11 | 276.9 | –36.09 | 104.1 | –35.73 | 62.5 | ||
| 45 | –36.03 | 1235.9 | –36.11 | 356.5 | –36.09 | 135.5 | –35.72 | 73.1 | ||
| 50 | –36.02 | 1774.7 | –36.08 | 468.8 | –36.06 | 207.6 | –35.71 | 94.7 | ||
Figure 2Effect of neglecting background point charges below a threshold on the energy and the runtime. The reference is a calculation where there are no point charges dropped from the RISM calculation with applying a 15 Å buffer, 0.3 Å grid space, cSPC/E water, PC+ correction, and PSE-3 closure. For comparison, the DF-MP2/def2-TZVPPD gas-phase calculation of piperazine requires 101 s on an Intel(R) Xeon(R) CPU E5–1650 v2 @ 3.50 GHz processor with 12 cores.
Figure 3Effect of the basis set on the EC-RISM solvation free energies of piperazine using PSE-3 closure, PC+ correction, cSPC/E water, 15 Å buffer, and 0.3 Å grid space with CHELPG charges.
Effect of Various Charges and Corrections on the RMSDs of Solvation Free Energies on the MNSOL[a] Dataset (Values in kJ/mol)a
| corrections | CHELPG | MK | IAO | Mulliken | Löwdin |
|---|---|---|---|---|---|
| PC | 15.08 | 17.05 | 19.28 | ||
| PC+ | 5.75 | 6.40 | 10.76 | ||
| UC | 5.41 | 5.16 | 9.92 | ||
| GF | 21.66 | 21.27 | 19.73 |
Values in italic are only informative but not directly comparable to non-italic ones.
54 out of the 273 species are not converged.
32 out of the 273 species are not converged.
Figure 4Comparison of RMSDs on the MNSOL[a] subset for different water models and closure relations. The paled and hatched bars represent the convergence issues with that pairing.
Error Statistics Using HF, MP2, and Various DFT Approaches as the QM Method in EC-RISM with PSE-3 Closure, cTIP3P Water Model, PC+ Correction, and def2-TZVPPD Basis Set on the MNSOL[a] Dataset
| statistics | HF | MP2 | revPBE | SCAN | ω-B97X-V | B2GPPLYP |
|---|---|---|---|---|---|---|
| MSD | –5.44 | –1.56 | 0.13 | –1.91 | –2.60 | –2.08 |
| MAD | 6.81 | 4.40 | 5.58 | 4.96 | 4.98 | 4.84 |
| RMSD | 9.34 | 5.75 | 6.82 | 6.33 | 6.60 | 6.31 |
Error Statistics on the MNSOL[a] Dataset in kJ/mol along with c Correction Factors in kJ/mol/Å3 Using Different Settings to Compare Our and the Original SAMPL6 Implementationa
| 6-311+G** | def2-TZVPPD | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| ESP | water model | correction | MSD | MAD | RMSD | c | MSD | MAD | RMSD | c |
| HF | mSPC/E | UC | –0.813 | 4.729 | 6.359 | –0.42977 | –0.522 | 4.328 | 5.754 | –0.43269 |
| HF | mSPC/E | PC+ | –5.577 | 6.458 | 8.152 | –0.46081 | –4.846 | 5.802 | 7.385 | –0.46081 |
| HF | cSPC/E | UC | –0.801 | 4.789 | 6.430 | –0.49790 | –0.509 | 4.402 | 5.838 | –0.50087 |
| HF | cSPC/E | PC+ | –2.890 | 5.267 | 6.809 | –0.51181 | –2.155 | 4.715 | 6.099 | –0.51181 |
| HF | cTIP3P | UC | –0.699 | 4.531 | 5.990 | –0.47846 | –0.410 | 4.242 | 5.512 | –0.48141 |
| HF | cTIP3P | PC+ | –1.943 | 4.724 | 6.136 | –0.48673 | –1.212 | 4.339 | 5.578 | –0.48673 |
| MP2 | mSPC/E | UC | –0.924 | 5.463 | 7.255 | –0.41517 | –0.618 | 4.871 | 6.474 | –0.42076 |
| MP2 | mSPC/E | PC+ | –7.974 | 8.472 | 10.468 | –0.46081 | –6.812 | 7.399 | 9.266 | –0.46081 |
| MP2 | cSPC/E | UC | –0.883 | 5.529 | 7.366 | –0.48210 | –0.617 | 4.948 | 6.570 | –0.48829 |
| MP2 | cSPC/E | PC+ | –5.371 | 6.918 | 8.797 | –0.51181 | –4.172 | 5.935 | 7.594 | –0.51181 |
| MP2 | cTIP3P | UC | –0.765 | 5.067 | 6.736 | –0.46397 | –0.500 | 4.639 | 6.063 | –0.46997 |
| MP2 | cTIP3P | PC+ | –4.209 | 6.061 | 7.680 | –0.48673 | –3.039 | 5.201 | 6.644 | –0.48673 |
ESP refers to the electrostatics used for the determination of CHELPG charges and E energy. Gas and solvent phase geometries are from the SAMPL6 work of Kast et al.,[41] which have been corrected.[92] 1D-RISM calculations for solvent susceptibility file were made with HNC closure, while 3D-RISM calculations with PSE-2 closure. The GAFF force field was utilized, and the radius of Br was set to 1.3 Å.
Figure 5Calculated vs reference experimental solvation free energies (left) and histogram of errors (right) on the FreeSolv data set for PSE-3/cTIP3P and PSE-2/cPOL3 EC-RISM settings.
Comparison of RMSDs of Several Solvation Models on the MNSOL[a], MNSOL, and FreeSolv Satasetsa
| method | FreeSolv | MNSOL[a] | MNSOL |
|---|---|---|---|
| EC-RISM, MP2/def2-TZVPPD/ | |||
| PSE-3/cTIP3P/CHELPG/PC+, this work | 6.44 | 5.70 | 6.32 |
| EC-RISM, MP2/def2-TZVPPD/ | |||
| PSE-2/cPOL3/CHELPG/PC+, this work | 6.63 | 5.22 | 6.08 |
| EC-RISM, MP2/6-311+G(d,p)/ | |||
| PSE-2/mSPC/E/φ | – | 6.19 | 6.53 |
| EC-RISM, MP2/6-311+G(d,p)/ | |||
| PSE-2/mSPC/E/CHELPG/UC[ | – | 6.59 | 7.40 |
| EC-RISM, MP2/cc-pVTZ/ | |||
| PSE-2/mSPC/E/φ | – | 4.73 | 5.44 |
| 3D-RISM, PSE-4/cTIP3P/AM1-BCC, this work | 7.12 | 6.33 | 8.18 |
| 3D-RISM, PSE-3/cPOL3/AM1-BCC, this work | 7.39 | 5.92 | 7.50 |
| MD[ | 6.45 | – | – |
| DCOSMO-RS ϵ[ | – | 5.93 | – |
| DCOSMO-RS ∞[ | – | 6.80 | – |
| COSMO[ | – | 10.42 | – |
| COSMO-RS[ | – | 3.48 | – |
| SMD, IEF-PCM/G03/M05–2x[ | – | 3.60 | – |
| SMD, M05-2 | 5.90 | 4.20 | 6.85 |
| IEF-PCM, G03d(UAHF)/HF[ | – | 7.32 | – |
| PCM, M05-2 | 9.00 | 7.95 | 9.53 |
| C-PCM, G06d/B3LYP[ | – | 7.74 | – |
Values are in kJ/mol.