| Literature DB >> 36246669 |
Xin Dong1, Yu-Long Bai1, Yani Lu1, Manhong Fan1.
Abstract
A crucial challenge encountered in diverse areas of engineering applications involves speculating the governing equations based upon partial observations. On this basis, a variant of the sparse identification of nonlinear dynamics (SINDy) algorithm is developed. First, the Akaike information criterion (AIC) is integrated to enforce model selection by hierarchically ranking the most informative model from several manageable candidate models. This integration avoids restricting the number of candidate models, which is a disadvantage of the traditional methods for model selection. The subsequent procedure expands the structure of dynamics from ordinary differential equations (ODEs) to partial differential equations (PDEs), while group sparsity is employed to identify the nonconstant coefficients of partial differential equations. Of practical consideration within an integrated frame is data processing, which tends to treat noise separate from signals and tends to parametrize the noise probability distribution. In particular, the coefficients of a species of canonical ODEs and PDEs, such as the Van der Pol, Rössler, Burgers' and Kuramoto-Sivashinsky equations, can be identified correctly with the introduction of noise. Furthermore, except for normal noise, the proposed approach is able to capture the distribution of uniform noise. In accordance with the results of the experiments, the computational speed is markedly advanced and possesses robustness.Entities:
Keywords: Akaike information criterion; Group sparsity; Identification of noise distribution; Nonlinear dynamical systems; Sparse identification
Year: 2022 PMID: 36246669 PMCID: PMC9552166 DOI: 10.1007/s11071-022-07875-9
Source DB: PubMed Journal: Nonlinear Dyn ISSN: 0924-090X Impact factor: 5.741
Overview of SINDy
| Classification | Literature | Method | Modification/Field | Objective |
|---|---|---|---|---|
| Candidate function library | Brunton. et al. [ | SINDYc (SINDy with control) | Include external actuation and control signals | Generalize the SINDy algorithm to include inputs and control |
| Mangan. et al. [ | Implicit-SINDy | Rational function nonlinearities with cross terms | Infer networked nonlinear dynamical systems | |
| Quade et al. [ | Abrupt-SINDy | Either additional, deleted, or modified model terms | Learn parsimonious models of a system in response to abrupt changes | |
| Kaiser et al. [ | SINDy-MPC (model prediction control) | Include the effects of actuation | Enhance data-driven control of physical systems | |
| Champion et al. [ | Autoencoder-SINDy | Include intrinsic coordinates | Discover a coordinate transformation into a reduced space | |
| Bramburger et al. [ | UPO-SINDy (unstable periodic orbits) | Contain parameters for Poincaré mappings | Produce parameter dependent Poincare mappings to stabilize UPOs | |
| Kaptanoglu et al. [ | Constrained-SINDy | Incorporate known physical laws | Build a connection to the large-scale Galerkin in fluid mechanics | |
| Wei [ | Integral-SINDy | Contain the integral form of state equations | Identify model structure and parameters of nonlinear ODEs | |
| Hoffmann et al. [ | Reactive-SINDy | Incorporate concentration time series | Estimate a parsimonious reaction network | |
| Sparse regression | Champion et al. [ | SR3(sparse relaxed regularized regression) | Additional regularizer for auxiliary variable | Solve a relaxation of the regularized problem |
| Chu et al. [ | Lagrangian-SINDy | Transform the naive formulation as the Legendre form | Learn interpretable and sparse formulations of overall energy and the Lagrangian | |
| Dai et al. [ | Extended-SINDy | Stepwise Sparse Regressor (SSR) | Detect the dynamic behavior from data | |
| Fasel et al. [ | Ensemble-SINDy | Incorporate the bootstrap aggregating (bagging) | Robustify the SINDy algorithm | |
| Hirsh et al. [ | UQ-SINDy (uncertainty quantification) | Add the | Promote robustness against noise measurements for out-of-sample forecast | |
| Ram et al. [ | Bayesian-SINDy | Combine with the neuronized priors | Handle parameter and structural model uncertainties | |
| Modified structure | Brunton et al. [ | HAVOK (Hankel alternative view of Koopman analysis) | Integrate with delay embedding and Koopman theory | Decomposition of chaotic dynamics into linear models using forcing actuation |
| Loiseau et al. [ | SINDy-KNN (K-nearest neighbors) | Incorporate the KNN to map into the low-dimensional space | Yield interpretable nonlinear models from measurement data | |
| Kaheman et al. [ | Modified-SINDy | Integrate with the automatic differential for neural networks | Denoise the data and parametrize the noise probability distribution | |
| Dam et al. [ | Bifurcation-SINDy | Combine with the bifurcation analysis | Learn a underlying model from simulations of a convection system | |
| Mathematical theory | Zhang et al. [ | SINDy | Convergence performance | Make a proof that SINDy approximates to local minimizers of an unconstrained l0-penalty least squares issue |
| de Silva et al. [ | PySINDy (Python for the SINDy) | Code package for the SINDy algorithm | A Python package for SINDy | |
| Messenger et al.[ | Weak-SINDy | Formulate the problem as the weak form | Data-driven reconstruction of model coefficients from measurements | |
| Wu et al. [ | SINDy | Error processing from an optimize perspective | Improve the accuracy of data | |
| Cortiella et al. [ | SINDy | Consider regularized weight | Determine governing equations of nonlinear dynamics from noisy observations | |
| Application | Jiang et al. [ | SINDy-LM (Levenberg–Marquardt) | Disease (especially for COVID-19) | Analyze the infectious disease in both Chinese mainland and other countries |
| Bhadriraju et al. [ | SINDy | Chemistry | Achieve the adaptive model identification | |
| Fukami et al. [ | CNN-LSTM-SINDy | Fluid flows | Investigate influence on the parameter choice | |
| Loiseau et al. [ | SINDy | Thermal convection dynamics | Predict a bifurcation of the high-dimensional system | |
| Jadhav et al. [ | SINDy | Video process | Extract related intelligence from actual videos of strongly stochastic systems |
Fig. 1Schematic of the proposed technique process
Fig. 2Overall calculation flow of the adopted method
The pseudo-code of Algorithm 1
Fig. 3Example of nonlinear dynamics to test the performance of the SINDy algorithm
Fig. 4Selected sparse models for the Rössler system with SINDy-AIC
Fig. 5Selected models for the Van der Pol system with SINDy-AIC
Fig. 6Identified coefficients of parametric Burgers’ equation with or without the introduction of noise
Fig. 7Identified coefficients of the spatial Kuramoto–Sivashinsky equation and its respective performance indices
Fig. 8Comparison results of the augmented SINDy and other methods
A variety of parameters for different test instances
| Models | Initial condition | N | Library order | Max Adam optimizer | ||
|---|---|---|---|---|---|---|
| Duffing | [− 2, 2] | 1 | 0.05 | 5 | 3 | 5000 |
| Van der Pol | [− 2, 1] | 1 | 0.15 | 8 | 3 | 5000 |
| Lotka-Volterra | [1, 2] | 1 | 0.18 | 10 | 3 | 5000 |
| Lorenz | [5, 5, 25] | 1 | 0.20 | 8 | 2 | 15,000 |
| Rössler | [3, 5, 0] | 1 | 0.08 | 8 | 2 | 15,000 |
| SprottB | [1, 1, 1] | 1 | 0.20 | 8 | 2 | 15,000 |
Fig. 9Identification for two kinds of noise distribution and system reconstruction for van der pol system
Error metrics for the Van der Pol system
| Model | Type of noise distribution | Noise percentage (%) | |||
|---|---|---|---|---|---|
| Van der Pol | Normal | 15 | 0.00053 | 0.00024 | 6.14E−05 |
| Uniform | 15 | 1.04E−05 | 8.49E−06 | 5.40E−06 |
Error metrics for the Rössler system
| Model | Type of noise distribution | Noise percentage (%) | |||
|---|---|---|---|---|---|
| Rössler | Normal | 30 | 0.02272 | 0.00046 | 0.01604 |
| Uniform | 30 | 0.00025 | 1.63E−06 | 3.52E−05 |
Fig. 10The results of identification for both noise distribution with different forms and simulation reconstructions for Rossler system
Fig. 11Corrupted systems and identified systems of a set of levels of artificial noise
Error metrics for 5–15% noise of different test cases
| Models | 5% | 10% | 15% | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Duffing | 8.48E−06 | 4.43E−05 | 0.0001 | 3.59E−05 | 0.00018 | 0.00042 | 8.33E−05 | 0.0004 | 0.0009 |
| Van der Pol | 5.41E−05 | 1.77E−05 | 5.07E−06 | 0.00022 | 8.76E−05 | 2.32E−05 | 0.00053 | 0.00024 | 6.14E−05 |
| Lotka-Volterra | 0.00073 | 3.01E−05 | 4.48E−05 | 0.00303 | 0.00016 | 0.00011 | 0.03211 | 0.00018 | 0.50404 |
| Rössler | 0.0005 | 6.28E−06 | 0.00045 | 0.0021 | 3.10E−05 | 0.00181 | 0.00488 | 7.84E−05 | 0.00401 |
| Lorenz | 0.00484 | 0.00011 | 0.48734 | 0.0193 | 0.00042 | 0.46719 | 0.05444 | 0.00117 | 0.44858 |
| SprottB | 6.53E−05 | 1.07E−05 | 1.53695 | 0.00026 | 4.44E−05 | 1.02298 | 0.0006 | 0.00011 | 1.07388 |
Error metrics for 20–30% noise of different test cases
| Models | 20% | 25% | 30% | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Duffing | 0.00015 | 0.00067 | 0.00149 | 0.00024 | 0.00101 | 0.00216 | 0.00035 | 0.00138 | 0.00287 |
| Van der Pol | 0.00099 | 0.00046 | 0.00012 | 0.00163 | 0.00078 | 0.00019 | 0.00245 | 0.00119 | 0.00029 |
| Lotka-Volterra | 0.00765 | 0.00037 | 0.00589 | 0.0183 | 0.00108 | 0.00074 | 0.02883 | 0.00164 | 0.00168 |
| Rössler | 0.00884 | 0.00017 | 0.00722 | 0.0147 | 0.00029 | 0.00029 | 0.02272 | 0.00046 | 0.01604 |
| Lorenz | 0.1267 | 0.00227 | 0.44827 | 1.59086 | 0.01819 | 0.40318 | 1.62864 | 0.01896 | 0.50842 |
| SprottB | 0.00109 | 0.0002 | 1.39116 | 0.00173 | 0.00032 | 1.41288 | 0.00255 | 0.00048 | 1.11184 |
Table: The summation of the variable
| Symbol | Meaning | Symbol | Meaning |
|---|---|---|---|
|
|
| The decay factor in Sect. | |
|
| The training set in Eq. ( |
| The constrained condition in Eq. ( |
|
| The estimated measurements in Eq. ( |
| The parametric dependencies in Eq. ( |
| Y | The validation set in Sect. |
| The number of possible monomials in Sect. |
| U | The actually observed data in Eq. ( |
| The total number of potential models in Sect. |
|
| The estimated noise in Eq. ( |
| The |
|
| The |
| The dimension of state variable in Sect. |
|
| The candidate function collection in Eq. ( |
| The threshold in Sect. |
|
| The measurements for partial differential equations (PDEs) in Eq. ( |
| |
|
| The block diagonal matrix |
| The absolute error of the estimated derivative and system’s vector field in Eq. ( |
|
| The vector of the sparse coefficients in Eq. ( |
| The simulation error in Eq. ( |
|
| The |
| The noise identification error in Eq. ( |
|
| The |
| The vector field error in Eq. ( |
|
| The state variables of distinct dimension in Eq. ( |
| The prediction error in Eq. ( |
|
| The |
| |
|
| The time points in Sect. |
| The function of nonlinear dynamics in Eq. ( |
|
|
| The flow map of the dynamic systems in Eq. ( | |
|
| A collection of groups in Eq. ( | RSS | The residual sum of squares in Sect. |
|
| The estimated parameter in Eq. ( |
| The loss function in Eq. ( |
|
| The degree free parameters in Eq. ( |
| The relative AICc in Sect. |
|
| The dimension of measurements in Eq. ( |
| The minimum AIC value in Sect. |
|
| The noise magnitude in Sect. |
| The regularization function in Eq. ( |
|
| The backward or forward time steps in Eq. ( |
| The nonlinear function for the dynamics in Eq. ( |