| Literature DB >> 34593943 |
Xinhai Chen1,2, Rongliang Chen3, Qian Wan1, Rui Xu1, Jie Liu4,5.
Abstract
Partial differential equations (PDEs) are ubiquitous in natural science and engineering problems. Traditional discrete methods for solving PDEs are usually time-consuming and labor-intensive due to the need for tedious mesh generation and numerical iterations. Recently, deep neural networks have shown new promise in cost-effective surrogate modeling because of their universal function approximation abilities. In this paper, we borrow the idea from physics-informed neural networks (PINNs) and propose an improved data-free surrogate model, DFS-Net. Specifically, we devise an attention-based neural structure containing a weighting mechanism to alleviate the problem of unstable or inaccurate predictions by PINNs. The proposed DFS-Net takes expanded spatial and temporal coordinates as the input and directly outputs the observables (quantities of interest). It approximates the PDE solution by minimizing the weighted residuals of the governing equations and data-fit terms, where no simulation or measured data are needed. The experimental results demonstrate that DFS-Net offers a good trade-off between accuracy and efficiency. It outperforms the widely used surrogate models in terms of prediction performance on different numerical benchmarks, including the Helmholtz, Klein-Gordon, and Navier-Stokes equations.Entities:
Year: 2021 PMID: 34593943 PMCID: PMC8484684 DOI: 10.1038/s41598-021-99037-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The weighting mechanism used in the proposed DFS-Net.
Figure 2The overall pipeline of the proposed DFS-Net.
Figure 3The convergence of DFS-Net (on the log scale) on the Helmholtz equation. The Adam optimizer is used before the vertical dashed line, and the L-BFGS-B optimizer is used afterwards.
Figure 4Performance of different surrogate models on the Helmholtz benchmarks.
Comparison of the relative -error of different neural network-based surrogate models on the Helmholtz equation ().
| Surrogate modela | Accuracy ( | Training time (ms)b |
|---|---|---|
| DGM | 7.14e−01 | 44.12 |
| PINN | 2.27e−01 | 4.60 |
| PINN-anneal | 1.83e−02 | 5.41 |
| GP-PINN | 5.59e−03 | 12.28 |
| DFS-Net | 3.27e−03 | 10.11c |
| DFS-Net | 1.48e−03 | 10.66d |
aAll models consists of three hidden layers with 50 neurons in each layer.
bThe training time for each Adam epoch.
c The training time of DFS-Net for each L-BFGS-B epoch is 22.01 ms.
d The training time of DFS-Net for each L-BFGS-B epoch is 23.88 ms.
Comparison of the relative -error of different neural network-based surrogate models on the Helmholtz equation ().
| Surrogate modela | Accuracy ( | Training time (ms)b |
|---|---|---|
| DGM | 6.21e−01 | 43.82 |
| PINN | 1.37e−01 | 4.57 |
| PINN-anneal | 2.95e−02 | 5.34 |
| GP-PINN | 3.48e−03 | 12.12 |
| DFS-Net | 3.14e−03 | 10.08c |
| DFS-Net | 2.95e−03 | 10.68d |
aAll models consists of three hidden layers with 50 neurons in each layer.
b The training time for each Adam epoch.
cThe training time of DFS-Net for each L-BFGS-B epoch is 22.00 ms.
d The training time of DFS-Net for each L-BFGS-B epoch is 23.86 ms.
Figure 5Performance of different surrogate models on the Klein–Gordon equation.
Comparison of the relative -error of different neural network-based surrogate models on the Klein–Gordon equation.
| Surrogate model | Accuracy ( | Training time (ms)a |
|---|---|---|
| DGM | 2.09e−01 | 50.74 |
| PINN | 1.38e−01 | 6.01 |
| PINN-anneal | 8.71e−03 | 7.19 |
| GP-PINN | 2.57e−03 | 21.88 |
| DFS-Net | 2.29e−03 | 17.73b |
| DFS-Net | 1.45e−03 | 19.36c |
a The training time for each Adam epoch.
b The training time of DFS-Net for each L-BFGS-B epoch is 41.14 ms.
c The training time of DFS-Net for each L-BFGS-B epoch is 44.44 ms.
Figure 6Performances of different architectural designs obtained by varying the number of hidden layers and the number of neurons per layer.
Figure 7Lid-driven cavity flow.
Figure 8Lid-driven cavity flow: comparison of the reference solution with the predicted solution given by DFS-Net.
Figure 9Performance of different surrogate models on the lid-driven cavity flow benchmarks.
Comparison of the relative -error of different neural network-based surrogate models on the lid-driven cavity flow benchmark ().
| Surrogate modela | Accuracy ( | Training time (ms)b |
|---|---|---|
| DGM | 6.69e−02 | 275.39 |
| PINN | 3.47e−01 | 13.33 |
| PINN-anneal | 1.42e−01 | 15.44 |
| GP-PINN | 4.01e−02 | 66.07 |
| DFS-Net | 2.91e−02 | 58.68c |
| DFS-Net | 1.34e−02 | 63.53d |
aAll models consists of three hidden layers with 50 neurons in each layer.
bThe training time for each Adam epoch.
cThe training time of DFS-Net for each L-BFGS-B epoch is 103.16 ms.
d The training time of DFS-Net for each L-BFGS-B epoch is 110.38 ms.
Comparison of the relative -error of different neural network-based surrogate models on the lid-driven cavity flow benchmark ().
| Surrogate modela | Accuracy ( | Training time (ms)b |
|---|---|---|
| DGM | 5.96e−01 | 309.31 |
| PINN | 6.28e−01 | 15.11 |
| PINN-anneal | 6.17e−01 | 17.46 |
| GP-PINN | 4.47e−01 | 74.65 |
| DFS-Net | 3.80e−02 | 66.63c |
| DFS-Net | 3.31e−02 | 71.80d |
aAll models consists of three hidden layers with 128 neurons in each layer.
bThe training time for each Adam epoch.
cThe training time of DFS-Net for each L-BFGS-B epoch is 280.56 ms.
dThe training time of DFS-Net for each L-BFGS-B epoch is 297.83 ms.
Comparison of the relative -error of different neural network-based surrogate models on the lid-driven cavity flow benchmark ().
| Surrogate modela | Accuracy ( | Training time (ms)b |
|---|---|---|
| DGM | 5.74e−01 | 410.30 |
| PINN | 7.76e−01 | 19.53 |
| PINN-anneal | 6.99e−01 | 23.14 |
| GP-PINN | 5.92e−01 | 96.25 |
| DFS-Net | 8.50e−02 | 88.09c |
| DFS-Net | 7.25e−02 | 92.83d |
aAll models consists of three hidden layers with 256 neurons in each layer.
bThe training time for each Adam epoch.
cThe training time of DFS-Net for each L-BFGS-B epoch is 364.33 ms.
dThe training time of DFS-Net for each L-BFGS-B epoch is 395.17 ms.
Figure 10A comparison of the -error for different learning rates and batch sizes.