| Literature DB >> 32786898 |
Ralf Meyer1, Manuel Weichselbaum1, Andreas W Hauser1.
Abstract
Orbital-free approaches might offer a way to boost the applicability of density functional theory by orders of magnitude in system size. An important ingredient for this endeavor is the kinetic energy density functional. Snyder et al. [ Phys. Rev. Lett. 2012, 108, 253002] presented a machine learning approximation for this functional achieving chemical accuracy on a one-dimensional model system. However, a poor performance with respect to the functional derivative, a crucial element in iterative energy minimization procedures, enforced the application of a computationally expensive projection method. In this work we circumvent this issue by including the functional derivative into the training of various machine learning models. Besides kernel ridge regression, the original method of choice, we also test the performance of convolutional neural network techniques borrowed from the field of image recognition.Entities:
Year: 2020 PMID: 32786898 PMCID: PMC7482319 DOI: 10.1021/acs.jctc.0c00580
Source DB: PubMed Journal: J Chem Theory Comput ISSN: 1549-9618 Impact factor: 6.006
Figure 1Schematic depiction of the NN architectures used for the standard CNN (left) and the ResNet model (right). Note the appearance of skip connections for the latter.
Absolute Error Values on the N = 1 Test Set for All of the Machine Learning Models (in kcal/mol)
| |Δ | ||||||
|---|---|---|---|---|---|---|
| model | mean | std | max | mean | std | max |
| KRR, ref [ | 0.15 | 0.24 | 3.2 | – | – | – |
| KRR, this work | 0.163 | 0.29 | 4.6 | 29313.2 | 345.5 | 30610.9 |
| ext KRR | 0.004 | 0.02 | 0.6 | 3.4 | 4.3 | 50.7 |
| CNN | 0.044 | 0.10 | 2.3 | 31.5 | 25.0 | 370.1 |
| ResNet | 0.015 | 0.02 | 0.3 | 10.1 | 7.0 | 110.7 |
Figure 2Comparison of exact functional derivative (solid lines) and predictions by standard KRR with and without the PCA projection detailed in section (dashed lines) as well as prediction from the ML models trained on derivative information (dashed–dotted lines). The parameters for the shown potential are a = {4.43, 7.18, 9.03}, b = {0.0532, 0.587, 0.568}, and c = {0.0754, 0.0406, 0.0554}.
Absolute Kinetic Energy Errors ΔT for the Iteratively Found Densities (in kcal/mol) as Well as the Integrated Absolute Error of the Densities Δn for the N = 1 Test Set
| |Δ | |Δ | ||||||
|---|---|---|---|---|---|---|---|
| model | mean | std | max | mean | std | max | |
| KRR, ref [ | 5 | 3.0 | 5.3 | 46 | – | – | – |
| KRR, this work | 5 | 2.85 | 7.00 | 87.34 | 45.0 | 54.0 | 503.9 |
| ext KRR | 5 | 0.46 | 1.05 | 15.35 | 14.9 | 11.5 | 85.0 |
| ext KRR | 10 | 0.04 | 0.22 | 5.95 | 0.8 | 0.7 | 10.7 |
| ext KRR | 15 | 0.04 | 0.22 | 5.97 | 0.3 | 0.5 | 10.8 |
| CNN | 5 | 0.57 | 1.40 | 21.69 | 15.2 | 12.0 | 95.5 |
| CNN | 10 | 0.29 | 0.77 | 13.26 | 5.5 | 8.5 | 171.0 |
| ResNet | 5 | 0.51 | 1.25 | 19.59 | 14.9 | 11.5 | 85.5 |
| ResNet | 10 | 0.09 | 0.21 | 5.72 | 1.0 | 0.9 | 14.7 |
| ResNet | 15 | 0.09 | 0.22 | 5.86 | 2.0 | 2.4 | 19.6 |
Error Values of the Machine Learning Approximations on the N = 2 Test Set (in kcal/mol)
| |Δ | ||||||
|---|---|---|---|---|---|---|
| model | mean | std | max | mean | std | max |
| KRR | 0.0355 | 0.0588 | 0.8752 | 2957.85 | 16.00 | 2990.36 |
| ext KRR | 0.0002 | 0.0008 | 0.0233 | 0.12 | 0.15 | 2.18 |
| ResNet | 0.0483 | 0.2837 | 6.8116 | 6.78 | 11.36 | 223.35 |
Absolute Kinetic Energy Error ΔT for the Iteratively Found Densities (in kcal/mol) as Well as the Integrated Absolute Error of the Densities Δn on the N = 2 Test Set, Compared between the KRR variants and ResNet for Increased Search Spaces
| |Δ | |Δ | ||||||
|---|---|---|---|---|---|---|---|
| model | mean | std | max | mean | std | max | |
| KRR | 10 | 8.431 | 1.138 | 16.686 | 109.0 | 23.0 | 222.8 |
| KRR | 20 | 21.365 | 0.826 | 22.456 | 133.9 | 18.4 | 194.9 |
| KRR | 40 | 23.882 | 0.846 | 25.003 | 139.8 | 18.4 | 200.5 |
| ext KRR | 10 | 0.523 | 0.827 | 7.353 | 24.8 | 16.0 | 102.1 |
| ext KRR | 20 | 0.074 | 0.069 | 0.789 | 0.5 | 0.6 | 2.9 |
| ext KRR | 40 | 0.076 | 0.069 | 0.789 | 0.1 | 0.1 | 1.1 |
| ResNet | 10 | 1.239 | 6.537 | 142.649 | 25.8 | 18.6 | 248.9 |
| ResNet | 20 | 0.877 | 6.634 | 146.324 | 2.9 | 11.8 | 253.3 |
| ResNet | 40 | 0.877 | 6.635 | 146.327 | 2.7 | 11.8 | 253.3 |
Absolute Error Values for the Kinetic Energy ΔT and Its Functional Derivative (in kcal/mol) on the N = 2 Test Set, Achieved by the ResNet Model Trained on Sets of Varying Size
| |Δ | ||||||
|---|---|---|---|---|---|---|
| mean | std | max | mean | std | max | |
| 100 | 0.049 | 0.284 | 6.814 | 6.78 | 11.36 | 223.36 |
| 1 000 | 0.012 | 0.063 | 1.922 | 3.12 | 2.81 | 66.31 |
| 10 000 | 0.007 | 0.018 | 0.528 | 2.79 | 1.92 | 45.19 |
| 100 000 | 0.007 | 0.009 | 0.138 | 2.39 | 1.36 | 19.40 |
Absolute Kinetic Energy Error ΔT (in kcal/mol) as Well as the Integrated Absolute Error Δn of the Iteratively Found Densities on the N = 2 Test Set, Employing the ResNet on Training Sets of Increasing Size
| |Δ | |Δ | |||||
|---|---|---|---|---|---|---|
| mean | std | max | mean | std | max | |
| 100 | 0.856 | 6.591 | 145.58 | 2.7 | 11.8 | 253.0 |
| 1 000 | 0.151 | 1.196 | 32.65 | 0.7 | 1.9 | 50.5 |
| 10 000 | 0.062 | 0.228 | 6.48 | 0.6 | 0.6 | 13.7 |
| 100 000 | 0.047 | 0.070 | 0.89 | 0.6 | 0.5 | 5.5 |