| Literature DB >> 33356255 |
Felipe Ribeiro Dutra1, Cleuton de Souza Silva2, Rogério Custodio1.
Abstract
The direct method (HA(soln) ⇌ A(soln)- + H(soln)+) for calculating pKa of monoprotic acids is as efficient as thermodynamic cycles. A selective adjustment of proton free energy in solution was used with experimental pKa data. The procedure was analyzed at different levels of theory. The solvent was described by the solvation model density (SMD) model, including or not explicit water molecules, and three training sets were tested. The best performance under any condition was obtained by the G4CEP method with a mean absolute error close to 0.5 units of pKa and an uncertainty around ±1 unit of pKa for any training set including or excluding explicit solvent molecules. PM6 and AM1 performed very well with average absolute errors below 0.75 units of pKa but with uncertainties up to ±2 units of pKa, using only the SMD solvent model. Density functional theory (DFT) results were highly dependent on the basis functions and explicit water molecules. The best performance was observed for the local spin density approximation (LSDA) functional in almost all calculations and under certain conditions, as high as those obtained by G4CEP. Basis set complexity and explicit solvent molecules were important factors to control DFT calculations. The training set molecules should consider the diversity of compounds.Entities:
Year: 2020 PMID: 33356255 PMCID: PMC7872415 DOI: 10.1021/acs.jpca.0c08283
Source DB: PubMed Journal: J Phys Chem A ISSN: 1089-5639 Impact factor: 2.781
Experimental pKa Values and Differences between Experimental and Calculated Values for Different Levels of Theory, in Addition to the Mean Absolute Error, Standard Deviation, and the Largest Positive and Negative Deviationsa
| acids | p | G4CEP | AM1 | PM6 | HF | LSDA | PBE | B3LYP | CAM B3LYP | WB97XD | M062X | B2PLYP |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| acetic | 4.76 | 0.12 | –0.53 | 0.04 | –0.73 | –1.32 | –0.59 | –1.45 | –1.46 | –0.53 | –0.67 | –1.31 |
| propanoic | 4.88 | –0.77 | –0.77 | 0.75 | –0.47 | –1.37 | –1.47 | –1.32 | –1.29 | –1.48 | –1.27 | –1.32 |
| butanoic | 4.82 | 0.27 | 0.55 | –1.20 | –0.89 | –1.49 | –1.28 | –0.97 | –0.80 | –1.09 | –1.24 | –0.86 |
| pentanoic | 4.82 | –0.32 | –0.64 | 0.11 | –1.67 | –0.92 | –1.61 | –1.44 | –1.36 | –1.56 | –1.07 | –1.39 |
| hexanoic | 4.85 | –1.61 | 0.64 | 0.20 | –1.69 | –1.12 | –2.16 | –2.50 | –1.50 | –1.01 | –1.75 | –1.60 |
| chloroacetic | 2.86 | 0.79 | 0.19 | –0.38 | 0.98 | 1.13 | 1.23 | 0.08 | –0.07 | 1.26 | 0.99 | 0.07 |
| bromoacetic | 2.90 | 0.21 | –0.61 | –0.42 | 0.68 | 1.22 | 0.71 | 0.81 | 0.68 | 0.58 | 0.50 | 0.67 |
| trichloroacetic | 0.70 | 1.04 | 3.52 | 2.66 | 4.52 | 4.34 | 5.03 | 5.07 | 4.63 | 4.78 | 4.63 | 4.79 |
| 2-chlorobutanoic | 2.83 | 0.78 | 1.17 | 0.01 | 0.76 | 0.45 | 0.71 | 1.17 | 0.68 | 0.61 | 0.40 | 1.27 |
| 3-chlorobutanoic | 3.98 | –0.43 | 0.00 | 0.52 | –0.28 | –0.20 | –0.32 | –0.06 | –0.17 | –0.17 | –0.31 | –0.11 |
| 4-chlorobutanoic | 4.52 | –0.52 | –0.37 | 0.81 | –0.27 | –0.62 | –0.18 | –0.01 | 0.23 | 0.19 | 0.64 | –0.15 |
| 3-butenoic | 4.35 | –0.07 | 0.25 | –0.27 | 0.01 | –0.26 | –0.29 | –0.09 | –1.07 | –0.27 | –1.14 | –0.16 |
| 2-methylpropanoic | 4.84 | –0.17 | –0.09 | 0.72 | –0.96 | 0.05 | –0.01 | –1.15 | –0.78 | –0.83 | –1.13 | –0.80 |
| 2.2-dimethylpropanoic | 5.03 | –0.64 | 0.20 | –0.34 | –1.02 | –1.16 | –0.92 | –0.73 | –0.82 | –0.93 | –0.77 | –0.76 |
| 3-methylbutanoic | 4.77 | –1.29 | 0.09 | 0.25 | –0.89 | –1.09 | –1.17 | –0.95 | –0.75 | –1.81 | –0.90 | –0.99 |
| 2-methylbutanoic | 4.80 | –0.87 | –0.24 | 0.01 | –1.18 | –1.45 | –1.01 | –0.90 | –0.79 | –1.15 | –1.11 | –0.78 |
| 2-butynoic | 2.62 | 1.17 | 1.45 | –0.28 | 1.13 | 1.30 | 0.75 | 1.59 | 1.75 | 1.17 | 1.36 | 0.77 |
| 2-chloropropanoic | 2.83 | 0.82 | –0.19 | –0.06 | 1.01 | 0.75 | 0.72 | 1.29 | 1.15 | 0.92 | 0.36 | 1.07 |
| 3-bromopropanoic | 4.00 | 1.01 | –0.81 | 0.21 | 0.00 | 0.96 | 1.24 | 0.41 | 0.35 | –0.04 | 1.18 | 0.35 |
| 3-chloropropanoic | 3.98 | 0.05 | –0.59 | –0.51 | 0.57 | 0.91 | 0.39 | 0.53 | 0.56 | 1.25 | 1.20 | 0.51 |
| 4.69 | –0.22 | –0.75 | –0.64 | –0.44 | –1.02 | –0.79 | –0.57 | –0.36 | –0.92 | –0.60 | –0.31 | |
| formic | 3.75 | 0.64 | –2.46 | –2.20 | 0.82 | 0.93 | 1.01 | 1.21 | 1.19 | 1.03 | 0.70 | 1.05 |
| MAE | 0.63 | 0.73 | 0.57 | 0.95 | 1.09 | 1.07 | 1.11 | 1.02 | 1.07 | 1.09 | 0.96 | |
| std | 0.42 | 0.81 | 0.66 | 0.89 | 0.81 | 1.00 | 1.04 | 0.91 | 0.93 | 0.85 | 0.94 | |
| max | 1.17 | 3.52 | 2.66 | 4.52 | 4.34 | 5.03 | 5.07 | 4.63 | 4.78 | 4.63 | 4.79 | |
| min | –1.61 | –2.46 | –2.20 | –1.69 | –1.49 | –2.16 | –2.50 | –1.50 | –1.81 | –1.75 | –1.60 |
HF and DFT calculations used aug-cc-pVDZ basis functions. All calculations were performed with solvent represented by SMD and free energies of the solvated proton obtained with training set 1.
Data from refs (72) and (73).
Mean Absolute Error (MAE), Standard Deviation (Std. Dev.), and the Largest Positive (Max) and Negative (Min) Deviations at Different Levels of Theorya
| Training Set 2 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| G4CEP | AM1 | PM6 | HF | LSDA | PBE | B3LYP | CAM B3LYP | WB97XD | M062X | B2PLYP | |
| MAE | 0.63 | 0.73 | 0.59 | 1.01 | 1.41 | 1.30 | 1.41 | 1.26 | 1.21 | 1.18 | 1.25 |
| std. dev. | 0.43 | 0.85 | 0.66 | 1.08 | 1.35 | 1.29 | 1.38 | 1.29 | 1.27 | 1.28 | 1.27 |
| max | 1.30 | 3.77 | 2.80 | 5.22 | 5.73 | 6.14 | 6.32 | 5.81 | 5.81 | 5.69 | 5.95 |
| min | –1.49 | –2.21 | –2.06 | –0.99 | –0.10 | –1.05 | –1.25 | –0.32 | –0.78 | –0.69 | –0.43 |
HF and DFT calculations used aug-cc-pVDZ basis functions. All calculations were performed with the SMD model and training sets 2 and 3.
Experimental pKa, Mean Absolute Error (MAE), Standard Deviations (Std. Dev.), and the Largest Positive (Max) and Negative (Min) Deviation at HF and DFT Levelsa
| Training Set 1 | ||||||||
|---|---|---|---|---|---|---|---|---|
| HF | LSDA | PBE | B3LYP | CAM B3LYP | WB97XD | M062X | B2PLYP | |
| MAE | 1.10 | 0.39 | 1.04 | 1.07 | 1.06 | 1.07 | 0.82 | 0.93 |
| std. dev. | 0.88 | 0.33 | 1.00 | 0.86 | 0.87 | 0.86 | 0.74 | 0.90 |
| max | 4.37 | 1.43 | 5.04 | 4.42 | 4.30 | 4.33 | 3.85 | 4.50 |
| min | –2.04 | –1.12 | –1.96 | –1.58 | –1.79 | –1.72 | –1.17 | –1.70 |
All calculations were carried out with the SMD model, aug-cc-pVTZ basis set, and training sets 1, 2, and 3.
Mean Absolute Error (MAE) and Standard Deviations (Std. Dev.) at Different Levels of Theory Using the SMD Model and One Explicit Water Molecule with Training Sets 1, 2, and 3a
| Training Set 1 + SMD + H2O | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| G4CEP | AM1 | PM6 | HF | LSDA | PBE | B3LYP | CAM B3LYP | WB97XD | M062X | B2PLYP | |
| aug-cc-pVDZ | |||||||||||
| MAE | 0.50 | 0.72 | 0.89 | 1.12 | 0.49 | 0.91 | 0.84 | 0.79 | 0.77 | 0.59 | 0.78 |
| std. dev. | 0.29 | 0.74 | 0.85 | 0.87 | 0.34 | 0.60 | 0.60 | 0.57 | 0.67 | 0.61 | 0.61 |
| aug-cc-pVTZ | |||||||||||
| MAE | 1.01 | 0.47 | 0.65 | 0.71 | 0.73 | 0.72 | 0.63 | 0.68 | |||
| std. dev. | 0.81 | 0.33 | 0.54 | 0.54 | 0.48 | 0.65 | 0.47 | 0.61 | |||
The aug-cc-pVTZ and aug-cc-pVTZ basis sets were used for HF and DFT calculations.
Average Gibbs Energies of the Solvated Proton Calculated at Different Levels of Theory with the aug-cc-pVDZ and aug-cc-pVTZ Basis Functions with Training Sets 1, 2, and 3 and Solvent Effect Represented by SMD and with and without One Explicit Water Molecule. Data in kcal mol−1.
| SMD | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| G4CEP | AM1 | PM6 | HF | LSDA | PBE | B3LYP | CAM B3LYP | WB97XD | M062X | B2PLYP | |
| aug-cc-pVDZ | |||||||||||
| train. 1 | –266.91 | 104.93 | 120.86 | –277.22 | –267.22 | –271.97 | –273.07 | –271.88 | –274.71 | –273.14 | –272.12 |
| train. 2 | –267.08 | 104.59 | 120.67 | –278.17 | –269.12 | –273.49 | –274.78 | –273.49 | –276.12 | –274.59 | –273.71 |
| train. 3 | –267.09 | 105.06 | 120.92 | –278.17 | –268.09 | –272.84 | –273.61 | –272.55 | –275.67 | –273.95 | –272.53 |
| aug-cc-pVTZ | |||||||||||
| train. 1 | –279.02 | –271.67 | –273.43 | –273.95 | –272.84 | –276.11 | –272.65 | –273.18 | |||
| train. 2 | –271.74 | –274.62 | –275.46 | –274.09 | –277.98 | –273.87 | –274.37 | –271.74 | |||
| train. 3 | –279.49 | –271.19 | –274.37 | –274.43 | –273.61 | –277.17 | –273.46 | –273.69 | |||