| Literature DB >> 35955136 |
Zaimi Xie1,2, Zhenhua Li2,3, Chunmei Mo2,3, Ji Wang2,3.
Abstract
In order to effectively solve the problem of low accuracy of seawater water quality prediction, an optimized water quality parameter prediction model is constructed in this paper. The model first screened the key factors of water quality data with the principal component analysis (PCA) algorithm, then realized the de-noising of the key factors of water quality data with an ensemble empirical mode decomposition (EEMD) algorithm, and the data were input into the two-dimensional convolutional neural network (2D-CNN) module to extract features, which were used for training and learning by attention, gated recurrent unit, and an encoder-decoder (attention-GRU-encoder-decoder, attention-GED) integrated module. The trained prediction model was used to predict the content of key parameters of water quality. In this paper, the water quality data of six typical online monitoring stations from 2017 to 2021 were used to verify the proposed model. The experimental results show that, based on short-term series prediction, the root mean square error (RMSE), mean absolute percentage error (MAPE), and decision coefficient (R2) were 0.246, 0.307, and 97.80%, respectively. Based on the long-term series prediction, RMSE, MAPE, and R2 were 0.878, 0.594, and 92.23%, respectively, which were all better than the prediction model based on an enhanced clustering algorithm and adam with a radial basis function neural network (ECA-Adam-RBFNN), a prediction model based on a softplus extreme learning machine method with partial least squares and particle swarm optimization (PSO-SELM-PLS), and a wavelet transform-depth Bi-S-SRU (Bi-directional Stacked Simple Recurrent Unit) prediction model. The PCA-EEMD-CNN-attention-GED prediction model not only has high prediction accuracy but can also provide a decision-making basis for the water quality control and management of aquaculture in the waters around Zhanjiang Bay.Entities:
Keywords: attention; ensemble empirical modal decomposition algorithm; principal component analysis algorithm; sea area near Zhanjiang Bay; seawater; water quality prediction
Year: 2022 PMID: 35955136 PMCID: PMC9369775 DOI: 10.3390/ma15155200
Source DB: PubMed Journal: Materials (Basel) ISSN: 1996-1944 Impact factor: 3.748
Figure 1Flow chart of the prediction model.
Figure 2Network input module.
Figure 3Feature extraction module.
Figure 4Attention module.
Figure 5Temporal attention.
Figure 6Spatial attention.
Figure 7Output module.
Figure 8Mean square error of the prediction model.
Figure 9Monitoring stations near Zhanjiang Bay.
Comparison of different hidden unit models.
| Models | Model of Hidden Units | RMSE | MAPE | R2 |
|---|---|---|---|---|
| PCA–EEMD–CNN–Attention–GED (GRU Encoder-Decoder) | 16 | 0.821 | 0.564 | 92.67 |
| 64 | 0.778 | 0.549 | 93.36 | |
| 100 | 0.644 | 0.516 | 94.25 | |
| 200 | 0.694 | 0.540 | 93.92 |
Figure 10Mean square error of the model in different layers.
Principal component coefficient matrix.
| Indicators | Component 1 | Component 2 | Component 3 | Component 4 | Component 5 |
|---|---|---|---|---|---|
| pH | −0.395 | 0.813 | 0.217 | −0.011 | 0.027 |
| Turbidity | −0.636 | 0.172 | −0.255 | 0.222 | 0.247 |
| Dissolved oxygen | 0.443 | 0.461 | 0.882 | 0.011 | −0.076 |
| Water temperature | −0.768 | −0.325 | 0.471 | 0.148 | 0.078 |
| Electrical conductivity | 0.571 | 0.219 | 0.092 | 0.687 | 0.232 |
| Chemical oxygen demand | 0.809 | −0.094 | 0.253 | 0.184 | −0.197 |
| Ammonia nitrogen | −0.358 | −0.079 | 0.873 | −0.169 | 0.515 |
| Total phosphorus | 0.067 | 0.893 | 0.522 | −0.294 | 0.149 |
| Potassium permanganate | 0.423 | −0.347 | −0.319 | −0.421 | 0.586 |
| Redox potential | −0.256 | 0.210 | −0.734 | −0.164 | −0.415 |
| Eigenvalue | 2.934 | 2.429 | 2.194 | 0.894 | 0.748 |
| Contribution rate/% | 26.7 | 48.8 | 67.5 | 74.7 | 83.3 |
Figure 11(a) Signal decomposition of water quality parameters (b) Water quality parameter signal denoising.
Algorithm comparison.
| Models | In-Out | RMSE | MAPE | R2 (%) |
|---|---|---|---|---|
| PCA–EEMD–CNN–Attention–GED | 1-1 | 0.246 | 0.307 | 97.80 |
| 7-7 | 0.692 | 0.560 | 94.23 | |
| 15-15 | 0.832 | 0.596 | 93.57 | |
| PCA–EEMD–CNN–Attention–LSTM | 1-1 | 0.347 | 0.347 | 96.94 |
| 7-7 | 0.834 | 0.605 | 93.48 | |
| 15-15 | 0.896 | 0.623 | 92.85 | |
| PCA–EEMD–CNN–GED | 1-1 | 0.338 | 0.353 | 97.07 |
| 7-7 | 0.843 | 0.622 | 93.53 | |
| 15-15 | 1.021 | 0.642 | 91.89 | |
| PCA–EEMD–GED | 1-1 | 0.345 | 0.362 | 96.91 |
| 7-7 | 0.757 | 0.548 | 93.20 | |
| 15-15 | 1.213 | 0.744 | 91.46 | |
| CNN–Attention–GED | 1-1 | 0.394 | 0.418 | 96.46 |
| 7-7 | 0.747 | 0.563 | 93.68 | |
| 15-15 | 0.975 | 0.637 | 91.65 |
Figure 12(a) Real and predicted values of Do concentration; (b) Real and predicted values of pH; (c) Real and predicted values of TP concentration; (d) Real and predicted values of COD concentration; (e) Real and predicted values of AN concentration.
The prediction sequence for a 20–20 model comparison.
| Models | In–Out | RMSE | MAPE | R2 (%) |
|---|---|---|---|---|
| PCA–EEMD–CNN–Attention–GED | 20–20 | 0.878 | 0.594 | 92.23 |
| ECA–Adam–RBFNN [ | 20–20 | 1.173 | 0.740 | 90.80 |
| PSO–SELM–PLS [ | 20–20 | 1.029 | 0.622 | 91.79 |
| Wavelet Transform-Depth Bi–S–SRU [ | 20–20 | 1.363 | 0.748 | 89.27 |