| Literature DB >> 35986067 |
Alexander Ryabov1,2, Iskander Akhatov3, Petr Zhilyaev3.
Abstract
Density functional theory (DFT) is one of the primary approaches to solving the many-body Schrodinger equation. The essential part of the DFT theory is the exchange-correlation (XC) functional, which can not be obtained in analytical form. Accordingly, the accuracy improvement of the DFT is mainly based on the development of XC functional approximations. Commonly, they are built upon analytic solutions in low- and high-density limits and result from quantum Monte Carlo or post-Hartree-Fock numerical calculations. However, there is no universal functional form to incorporate these data into XC functional. Instead, various parameterizations use heuristic rules to build a specific XC functional. The neural network (NN) approach to interpolate the data from higher precision theories can give a unified path to parametrize an XC functional. Moreover, data from many existing quantum chemical databases could provide the XC functional with improved accuracy. We develop NN XC functional, which gives exchange potential and energy density without direct derivatives of exchange-correlation energy density. Proposed NN architecture consists of two parts NN-E and NN-V, which could be trained in separate ways, adding new flexibility to XC functional. We also show that the developed NN XC functional converges in the self-consistent cycle and gives reasonable energies when applied to atoms, molecules, and crystals.Entities:
Year: 2022 PMID: 35986067 PMCID: PMC9391383 DOI: 10.1038/s41598-022-18083-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Topology of the XC neural network. It consists of two parts: NN-E predicts and NN-V predicts . Each part of the neural network consists of 4 layers each of 100 neurons. For both parts information on local density and its derivatives is needed.
Figure 2Training curves of XC neural network. The left one corresponds to NN-V training, where log(Loss) is the logarithm of loss (2). The right one corresponds to NN-E training where log(Loss) is the logarithm of loss with boundary conditions (3). Outliers on the training curves are related with adjustable learning rate used for optimization algorithm.
Figure 3Distributions of input features. Each curve is normalized to to the total number of elements in the corresponding dataset.The distributions localized around the values that are typical for the systems used in training dataset: ammonia, benzene, crystalline silicon. One can see that crystalline silicon has relatively small electron density gradients and no region of low electron density due to its periodic structure. If a neural network is given an input value significantly out of bounds of the distribution data, it potentially could output an inappropriate value.
MSE and MAE results for and on a training dataset. The units of MSE are , the units of MAE are .
| MSE ( | MAE ( | MSE ( | MAE ( | |
|---|---|---|---|---|
| Benzene | ||||
| Silicon | ||||
| Ammonia |
Figure 4Spacial distribution of relative error for benzene: (a) spacial distribution of relative error for , (b) spacial distribution of relative error for . The relative local errors have highest values in the region of the nuclei and at the boundary.
Results of testing NN XC functional on a subset of IP13/03 dataset. and denote total energy and exchange-correlation energy correspondingly obtained after convergence.
| Substance | Error ( | |
|---|---|---|
| C | 0.052 | 0.287 |
| S | 0.015 | 0.102 |
| SH | 0.021 | 0.007 |
| Cl | 0.010 | 0.035 |
| OH | 0.554 | 1.036 |
|
| 0.001 | 0.035 |
| O | 1.556 | 4.029 |
| P | 0.008 | 0.071 |
|
| 0.646 | 2.482 |
| PH | 0.014 | 0.050 |
|
| 0.011 | 0.023 |
|
| 0.023 | 0.040 |
| Si | 0.003 | 0.253 |
Figure 5Comparison of distribution for non self-consistent electron density of atomic oxygen (a), molecular oxygen (b), hydrogen sulfide and self-consistent electron density from training ( denotes the Bohr radius). One can see that atomic and molecular oxygen have a significant fraction of samples out of the train distribution. The step of the self-consistent cycle with maximum electron density was chosen for all substances. This example emphasizes the importance of including a wide range of electron density and derivatives in the training dataset because extreme values could occur during the self-consistent cycle.
Results of testing NN XC functional on Alkisomer11 dataset. denotes isomerization energy.
| Isomerization reaction | IE (PBE) | IE (NN) | Abs error, kcal/mol |
|---|---|---|---|
| Butane | − 0.996 | − 1.090 | 0.094 |
| Pentane | − 0.788 | − 0.491 | 0.297 |
| Pentane | − 2.315 | − 1.843 | 0.472 |
| Hexane | − 2.563 | − 2.071 | 0.492 |
| Hexane | − 2.516 | − 1.924 | 0.592 |
| Hexane | − 1.626 | − 0.896 | 0.730 |
| Hexane | − 2.062 | − 1.409 | 0.653 |
| Heptane | − 3.218 | − 2.916 | 0.302 |
| Heptane | − 2.807 | − 2.746 | 0.061 |
| Octane | 5.224 | 6.326 | 1.102 |
| Octane | 0.725 | 1.292 | 0.567 |
Figure 6Enhancement factor at fixed values of : 0, 2, 10, 1000. s is a reduced density gradient. There is a good agreement between the enhancement factors at = 2 densities, which is close to the most frequent densities in the training set. Also one can see that NN XC functional is bad in guessing asymptotic in low ( = 1000) and high ( = 0) limits of electron density.
Comparing NN and PBE energies of rotated SH molecule. All energies in Hartree.
| Rotation (deg.) | Total energy (PBE) | Total energy (NN) | XC energy (PBE) | XC energy (NN) |
|---|---|---|---|---|
| 0 | − 10.69546 | − 10.69320 | − 2.48893 | − 2.48901 |
| 11.25 | − 10.69546 | − 10.69321 | − 2.48891 | − 2.48911 |
| 22.5 | − 10.69542 | − 10.69314 | − 2.48890 | − 2.48892 |
| 45 | − 10.69544 | − 10.69314 | − 2.48892 | − 2.48905 |
| 90 | − 10.69546 | − 10.69321 | − 2.48893 | − 2.48901 |
Figure 7Slices of and along the line connecting the atoms of a hydrogen molecule. Blue lines represent values predicted by our NN, red lines—values obtained by Octopus (where is correct functional derivative).