| Literature DB >> 35169213 |
Abstract
In this work we approach the Schrödinger equation in quantum wells with arbitrary potentials, using the machine learning technique. Two neural networks with different architectures are proposed and trained using a set of potentials, energies, and wave functions previously generated with the classical finite element method. Three accuracy indicators have been proposed for testing the estimates given by the neural networks. The networks are trained by the gradient descent method and the training validation is done with respect to a large training data set. The two networks are then tested for two different potential data sets and the results are compared. Several cases with analytical potential have also been solved.Entities:
Year: 2022 PMID: 35169213 PMCID: PMC8847422 DOI: 10.1038/s41598-022-06442-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Connection diagram for: (a) NN composed of independent subnets, each with different nodes in the HL and the same nodes in the IL; (b) NN with common architecture, having nodes in the OL, nodes in the HL, and nodes in the IL.
Technical aspects of the NNs
| Parameter | NN(a) | NN(b) | Technical details | |
|---|---|---|---|---|
| S | 5 k | 14 k | Each of the | |
| 1501 | For both architectures we set the same IL and OL. The refinements of the two layers differ because, while the potential can have fast variations, the WF has relatively slow variations. | |||
| 301 | ||||
| 37 | 672 | There is no methodical rule for the HL optimal size. Numbers were empirically chosen close to the geometric mean of the IL and OL numbers of nodes (for NN(a) subnet and NN(b)) | ||
| HL neurons | 11,137 | 672 | NN(a) and NN(b) have a total of 16,727,774 and 1,210,944 connections, respectively. An NN(a) subnet has 55,574 connections. | |
| 5 |
| |||
| 1.1 | ||||
| 1.0 | – | |||
| normally distributed random numbers with mean 0 and standard deviation | ||||
| 1 | NN(a) has a larger learning rate since its subnets are trained separately, with smaller batches. Larger the batch, smaller the learning rate must be. | |||
| T | 5000 | For both NNs this value is sufficient to get cvasistationary weights. | ||
| Time (a.u.) | 181 | 153 | For NN(a) the given time cumulates the training of all subnets. | |
Figure 2Logarithmic probability of occurrence of the WF possible values, at several chosen positions, for all 151 k samples in DS1. The inset encodes the same type of information in color gradient.
Figure 3(a) Learning curves of several subnets in NN (a). The lower inset indicates each subnet position in the spatial domain. The upper left inset details the behavior of the loss curves for the first 100 iterations. The upper right inset indicates the values of the relative loss function at the end of the learning cycle. (b) Learning curve of NN (b). The left/right inset shows the descent of the loss curve for the first/last 100 iterations. For both (a) and (b) plots the discrete points have been connected with solid lines.
Figure 4Multiple histograms comparing the NNs estimations for 14 k and 151 k samples in the training DS1: (a) relative deviation of energies; (b) relative “distance” between wave solutions; (c) accuracy of average positions. The height of a histogram bar indicates the probability to get a result in the corresponding bin. The bins cover uniformly the range of possible values on the horizontal axis. The face-colored bars are for the 14 k samples set and the edge-colored bars are for the entire 151 k samples set.
Figure 5Multiple histograms comparing the NNs predictions for the 14 k samples in each of the testing DS2 and DS3 and the estimations for the 14 k samples in the training DS1: (a) relative deviation of energies; (b) relative “distance” between wave functions; (c) accuracy of average positions. The underlying shapes of the probability distributions are indicated by the histogram external perimeters.
Figure 6The FEM and NNs WF solutions and accuracy indicators in 8 particular cases with piecewise analytically-defined potential functions: (a) SQW; (b) PQW; (c) VQW; (d) TQW; (e) SPQW; (f) GQW; (g) LDQW; (h) ASQW.