| Literature DB >> 32052532 |
Minsik Cho1,2, Nitai Sylvetsky1, Sarah Eshafi1,3, Golokesh Santra1, Irena Efremenko1, Jan M L Martin1.
Abstract
Atomic partial charges are among the most commonly used interpretive tools in quantum chemistry. Dozens of different 'population analyses' are in use, which are best seen as proxies (indirect gauges) rather than measurements of a 'general ionicity'. For the GMTKN55 benchmark of nearly 2,500 main-group molecules, which span a broad swathe of chemical space, some two dozen different charge distributions were evaluated at the PBE0 level near the 1-particle basis set limit. The correlation matrix between the different charge distributions exhibits a block structure; blocking is, broadly speaking, by charge distribution class. A principal component analysis on the entire dataset suggests that nearly all variation can be accounted for by just two 'principal components of ionicity': one has all the distributions going in sync, while the second corresponds mainly to Bader QTAIM vs. all others. A weaker third component corresponds to electrostatic charge models in opposition to the orbital-based ones. The single charge distributions that have the greatest statistical similarity to the first principal component are iterated Hirshfeld (Hirshfeld-I) and a minimal-basis projected modification of Bickelhaupt charges. If three individual variables, rather than three principal components, are to be identified that contain most of the information in the whole dataset, one representative for each of the three classes of Corminboeuf et al. is needed: one based on partitioning of the density (such as QTAIM), a second based on orbital partitioning (such as NPA), and a third based on the molecular electrostatic potential (such as HLY or CHELPG).Entities:
Keywords: atoms in molecules; charge distributions; molecular modelling; population analysis; principal components of ionicity
Year: 2020 PMID: 32052532 PMCID: PMC7317385 DOI: 10.1002/cphc.202000040
Source DB: PubMed Journal: Chemphyschem ISSN: 1439-4235 Impact factor: 3.102
Details of software used for various population analyses.
|
Software |
Atomic Partial Charges |
|---|---|
|
Multiwfn |
Atomic Dipole Corrected Hirshfeld (ADCH), |
|
Gaussian 16 |
Mulliken, |
|
MOLPRO 2019 |
Intrinsic Bond Orbitals (IBO) |
|
Horton 3 |
Iterative Stockholder Analysis (ISA), |
|
ACP |
Adjusted Charge Partitioning (ACP), |
|
DFTD4 |
Electronegativity Equilibration Charges (EEQ) |
|
DDEC6 |
DDEC6 (density derived electrostatic and chemical approach, version 6) |
|
IBOView |
Fuzzy Voronoi |
The squared correlation matrix for a subset of all variables. For clarity, the variables have been ordered to maximize blocking. The color gradient runs from white for R2≤0.80 to deep blue for R2≥0.98.
|
|
VDD |
Hirshfeld |
HLY |
ISA |
DDEC6 |
MBIS |
Hirshfeld‐I |
MBS‐ Bickelhaupt |
NPA |
IBO |
MBS‐ Mulliken |
i‐ACP |
ACP |
CM5 |
EEQ |
QTAIM |
APT |
ADCH |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
VDD |
1.00 |
0.96 |
0.64 |
0.78 |
0.81 |
0.77 |
0.78 |
0.78 |
0.74 |
0.78 |
0.70 |
0.78 |
0.78 |
0.76 |
0.73 |
0.64 |
0.68 |
0.68 |
|
Hirshfeld |
0.96 |
1.00 |
0.67 |
0.79 |
0.85 |
0.80 |
0.80 |
0.79 |
0.78 |
0.82 |
0.76 |
0.75 |
0.80 |
0.78 |
0.73 |
0.61 |
0.62 |
0.72 |
|
HLY |
0.64 |
0.67 |
1.00 |
0.87 |
0.84 |
0.84 |
0.79 |
0.76 |
0.73 |
0.73 |
0.72 |
0.71 |
0.77 |
0.73 |
0.70 |
0.48 |
0.51 |
0.68 |
|
ISA |
0.78 |
0.79 |
0.87 |
1.00 |
0.94 |
0.94 |
0.92 |
0.89 |
0.81 |
0.80 |
0.77 |
0.89 |
0.85 |
0.78 |
0.77 |
0.69 |
0.74 |
0.69 |
|
DDEC6 |
0.81 |
0.85 |
0.84 |
0.94 |
1.00 |
.99 |
0.98 |
0.94 |
0.92 |
0.91 |
0.90 |
0.85 |
0.89 |
0.84 |
0.81 |
0.66 |
0.67 |
0.75 |
|
MBIS |
0.77 |
0.80 |
0.84 |
0.94 |
0.99 |
1.00 |
0.97 |
0.95 |
0.93 |
0.89 |
0.90 |
0.86 |
0.90 |
0.84 |
0.82 |
0.66 |
0.66 |
0.74 |
|
Hirshfeld‐I |
0.78 |
0.80 |
0.79 |
0.92 |
0.98 |
0.97 |
1.00 |
0.94 |
0.92 |
0.89 |
0.88 |
0.87 |
0.86 |
0.80 |
0.78 |
0.74 |
0.73 |
0.68 |
|
MBSBickelhaupt |
0.78 |
0.79 |
0.76 |
0.89 |
0.94 |
0.95 |
0.94 |
1.00 |
0.92 |
0.87 |
0.90 |
0.89 |
0.92 |
0.87 |
0.86 |
0.75 |
0.71 |
0.71 |
|
NPA |
0.74 |
0.78 |
0.73 |
0.81 |
0.92 |
0.93 |
0.92 |
0.92 |
1.00 |
0.96 |
0.98 |
0.73 |
0.85 |
0.84 |
0.81 |
0.60 |
0.55 |
0.73 |
|
IBO |
0.78 |
0.82 |
0.73 |
0.80 |
0.91 |
0.89 |
0.89 |
0.87 |
0.96 |
1.00 |
0.96 |
0.71 |
0.81 |
0.82 |
0.80 |
0.56 |
0.55 |
0.75 |
|
MBSMulliken |
0.70 |
0.76 |
0.72 |
0.77 |
0.90 |
0.90 |
0.88 |
0.90 |
0.98 |
0.96 |
1.00 |
0.68 |
0.84 |
0.85 |
0.83 |
0.52 |
0.49 |
0.76 |
|
i‐ACP |
0.78 |
0.75 |
0.71 |
0.89 |
0.85 |
0.86 |
0.87 |
0.89 |
0.73 |
0.71 |
0.68 |
1.00 |
0.88 |
0.77 |
0.77 |
0.84 |
0.83 |
0.59 |
|
ACP |
0.78 |
0.80 |
0.77 |
0.85 |
0.89 |
0.90 |
0.86 |
0.92 |
0.85 |
0.81 |
0.84 |
0.88 |
1.00 |
0.91 |
0.89 |
0.63 |
0.60 |
0.75 |
|
CM5 |
0.76 |
0.78 |
0.73 |
0.78 |
0.84 |
0.84 |
0.80 |
0.87 |
0.84 |
0.82 |
0.85 |
0.77 |
0.91 |
1.00 |
0.97 |
0.51 |
0.48 |
0.81 |
|
EEQ |
0.73 |
0.73 |
0.70 |
0.77 |
0.81 |
0.82 |
0.78 |
0.86 |
0.81 |
0.80 |
0.83 |
0.77 |
0.89 |
0.97 |
1.00 |
0.54 |
0.50 |
0.76 |
|
QTAIM |
0.64 |
0.61 |
0.48 |
0.69 |
0.66 |
0.66 |
0.74 |
0.75 |
0.60 |
0.56 |
0.52 |
0.84 |
0.63 |
0.51 |
0.54 |
1.00 |
0.86 |
0.36 |
|
APT |
0.68 |
0.62 |
0.51 |
0.74 |
0.67 |
0.66 |
0.73 |
0.71 |
0.55 |
0.55 |
0.49 |
0.83 |
0.60 |
0.48 |
0.50 |
0.86 |
1.00 |
0.36 |
|
ADCH |
0.68 |
0.72 |
0.68 |
0.69 |
0.75 |
0.74 |
0.68 |
0.71 |
0.73 |
0.75 |
0.76 |
0.59 |
0.75 |
0.81 |
0.76 |
0.36 |
0.36 |
1.00 |
First six principal components found after PCA on the covariance matrix.
|
Eigenvalues |
2.145 |
0.157 |
0.072 |
0.022 |
0.017 |
0.010 |
|---|---|---|---|---|---|---|
|
VDD |
−0.08 |
0.00 |
−0.01 |
0.15 |
0.22 |
−0.26 |
|
Hirshfeld |
−0.08 |
0.02 |
−0.02 |
0.12 |
0.18 |
−0.27 |
|
CHELPG |
−0.21 |
0.01 |
0.37 |
0.04 |
−0.03 |
−0.03 |
|
MK |
−0.21 |
0.11 |
0.39 |
−0.08 |
−0.11 |
−0.08 |
|
RESP |
−0.21 |
0.10 |
0.37 |
−0.05 |
−0.09 |
−0.11 |
|
HLY |
−0.22 |
0.20 |
0.38 |
−0.16 |
−0.28 |
−0.14 |
|
ISA |
−0.23 |
0.03 |
0.21 |
−0.07 |
0.14 |
0.12 |
|
DDEC6 |
−0.21 |
0.09 |
−0.01 |
−0.09 |
0.10 |
0.10 |
|
MBIS |
−0.26 |
0.11 |
−0.02 |
−0.13 |
0.02 |
0.32 |
|
Hirshfeld‐I |
−0.26 |
0.03 |
−0.08 |
−0.23 |
0.06 |
0.22 |
|
MBSBickelhaupt |
−0.26 |
0.02 |
‐0.14 |
0.12 |
−0.08 |
0.25 |
|
NPA |
−0.28 |
0.20 |
−0.35 |
−0.22 |
−0.05 |
0.01 |
|
IBO |
−0.21 |
0.17 |
−0.23 |
−0.16 |
0.16 |
−0.35 |
|
MBSMulliken |
−0.28 |
0.29 |
−0.37 |
−0.15 |
−0.07 |
−0.08 |
|
i‐ACP |
−0.20 |
−0.13 |
0.07 |
0.28 |
0.09 |
0.32 |
|
ACP |
−0.14 |
0.06 |
−0.02 |
0.31 |
0.01 |
0.21 |
|
CM5 |
−0.14 |
0.13 |
−0.05 |
0.43 |
0.01 |
0.10 |
|
EEQ |
−0.14 |
0.11 |
−0.06 |
0.46 |
−0.02 |
0.09 |
|
QTAIM |
−0.38 |
−0.72 |
−0.18 |
0.05 |
−0.47 |
−0.24 |
|
APT |
−0.24 |
−0.40 |
0.09 |
−0.14 |
0.71 |
−0.02 |
|
ADCH |
−0.13 |
0.20 |
−0.02 |
0.37 |
0.12 |
−0.49 |
R2 for least‐squares fits of the individual charges as linear combinations of the first few principal components.
|
|
C(PC1) |
C(PC2) |
C(PC3) |
R2(PC1) |
R2(PC1, 2) |
R2(PC1,2,3) |
|---|---|---|---|---|---|---|
|
Hirshfeld |
0.085 |
0.027 |
0.017 |
0.8092 |
0.8158 |
0.8164[e] |
|
CM5 |
0.151 |
0.145 |
0.019 |
0.8348 |
0.8981 |
0.8983[c] |
|
Hirshfeld‐I |
0.278 |
0.052 |
0.007 |
|
0.9780 |
0.9780 |
|
NPA |
0.304 |
0.253 |
−0.342 |
0.9160 |
0.9680 |
|
|
HLY |
0.229 |
0.193 |
0.601 |
0.7959 |
0.8419 |
0.9552[a] |
|
ISA |
0.252 |
0.032 |
0.404 |
0.9362 |
0.9338 |
0.9835 |
|
MBIS |
0.277 |
0.137 |
0.111 |
0.9629 |
|
0.9853 |
|
DDEC6 |
0.227 |
0.108 |
0.104 |
0.9638 |
|
0.9860 |
|
MBSMulliken |
0.301 |
0.340 |
−0.365 |
0.8766 |
0.9677 |
|
|
EEQ |
0.155 |
0.122 |
−0.003 |
0.8297 |
0.8718 |
0.8718[d] |
|
QTAIM |
0.419 |
−0.699 |
−0.274 |
0.8022 |
|
|
|
IBO |
0.226 |
0.201 |
−0.216 |
0.8913 |
0.9492 |
0.9661 |
|
ACP |
0.155 |
0.075 |
0.069 |
0.9007 |
0.9180 |
0.9216 |
|
i‐ACP |
0.213 |
−0.121 |
0.185 |
0.9207 |
0.9450 |
0.9594 |
|
APT |
0.258 |
−0.401 |
0.187 |
0.7774 |
0.9303 |
0.9387[b] |
|
MBSBickelhaupt |
0.281 |
0.047 |
−0.075 |
|
0.9787 |
0.9801 |
[a] R2=0.9972 with 6 components: C(PC4)=−0.080, C(PC5)=−0.470,C(PC6)=+0.470. [b] R2=0.9938 with 5 components: C(PC4)=−0.328, C(PC5)=0.666. [c] R2=0.9801 with 4 components: C(PC4)=0.455. [d] R2=0.9681 with 4 components: C(PC4)=0.509. [e] R2=0.9324 with 7 components, R2=0.9927 with 10 components.