| Literature DB >> 33194437 |
Hafiza Mamona Nazir1, Ijaz Hussain1, Muhammad Faisal2,3, Alaa Mohamd Shoukry4,5, Mohammed Abdel Wahab Sharkawy4, Fares Fawzi Al-Deek4, Muhammad Ismail6.
Abstract
Several data-driven and hybrid models are univariate and not considered the dependance structure of multivariate random variables, especially the multi-site river inflow data, which requires the joint distribution of the same river basin system. In this paper, we proposed a Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) Vine copula-based approach to address this issue. The proposed hybrid model comprised on two stages: In the first stage, the CEEMDAN is used to extract the high dimensional multi-scale features. Further, the multiple models are used to predict multi-scale components and residuals. In the second stage, the residuals obtained from the first stage are used to model the joint uncertainty of multi-site river inflow data by using Canonical Vine. For the application of the proposed two-step architecture, daily river inflow data of the Indus River Basin is used. The proposed two-stage methodology is compared with only the first stage proposed model, Vector Autoregressive and copula-based Autoregressive Integrated Moving Average models. The four evaluation measures, that is, Mean Absolute Relative Error (MARE), Mean Absolute Deviation (MAD), Nash-Sutcliffe Efficiency (NSE) and Mean Square Error (MSE), are used to observe the prediction performance. The results demonstrated that the proposed model outperforms significantly with minimum MARE, MAD, NSE, and MSE for two case studies having significant joint dependance. Therefore, it is concluded that the prediction can be improved by appropriately modeling the dependance structure of the multi-site river inflow data.Entities:
Keywords: Canonical-vine; Complete ensembe empirical mode decomposition with adaptive noises; Pair copula construction
Year: 2020 PMID: 33194437 PMCID: PMC7651478 DOI: 10.7717/peerj.10285
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1The proposed C-Vine based CEEMDAN-R-MM structure to predict multi-site hydrological time series data.
Archimedean and elliptical copula family with corresponding generator, inverse function, Kendall τ and its range.
| Copula | Kendall’s τ | Parameter Range | ||
|---|---|---|---|---|
| Clayton | ||||
| Gumbel | ||||
| Frank | -ln | |||
| Gaussian | ρ > (−1, 1) | |||
| Student-t | ρ > (−1, 1), ν > 2 |
BB family of Copula with their generator, Kendall’s tau and parameter space.
| Copula | τ | Parameter range | |
|---|---|---|---|
| (BB6) | |||
| (BB7) | |||
| (BB8) | |||
Figure 2A structure of C-Vine (A) and D-Vine (B) copulas with four variables and three trees.
Tree 1 has nodes N1 = {1, 2, 3, 4} and edges E1 = {1, 2, 3}, tree 2 has nodes N2 = {1, 2, 3} and E2 = {1, 2} and tree 3 has nodes N3 = {1, 2} and edges E3 = {1}.
Figure 3Network of selected rivers: Indus, Jhelum, Chenab and Kabul.
Figure 4The CEEMDAN based decomposition of Indus rivers inflow where (A–E) first five IMFs of Indus river inflow and (F–J) remaining IMFs of Indus river inflow.
Figure 5The CEEMDAN based decomposition of Jhelum river inflow where (A–E) First five IMFs of Jhelum river inflow and (F–J) remaining IMFs of Jhelum river inflow.
Results of the proposed model (C-Vine based CEEMDAN-R-MM) and benchmark (VAR, ARIMA-COP and CEEMDAN-R-MM models).
| Rivers inflow | Model | MAD | NSE | MARE | MSE |
|---|---|---|---|---|---|
| Indus inflow | VAR | 4.9069 | 0.9899 | 0.0715 | 77.9964 |
| Jhelum inflow | 3.7185 | 0.9042 | 0.0715 | 50.5069 | |
| Chenab inflow | 2.7334 | 0.9885 | 0.0976 | 28.4530 | |
| Kabul inflow | 4.6269 | 0.8849 | 0.1944 | 99.9116 | |
| Indus inflow | ARIMA-COP | 4.3562 | 0.9915 | 0.0744 | 64.4888 |
| Jhelum inflow | 3.6253 | 0.9158 | 0.1358 | 46.7833 | |
| Chenab inflow | 2.6468 | 0.9608 | 0.1043 | 24.9336 | |
| Kabul inflow | 4.7105 | 0.8844 | 0.1429 | 100.271 | |
| Indus inflow | CEEMDAN-R-MM | 2.2145 | 0.9989 | 0.0652 | 8.0779 |
| Kabul inflow | 1.2822 | 0.9967 | 0.0730 | 2.8825 | |
| Chenab inflow | 0.8694 | 0.9980 | 0.0576 | 1.2689 | |
| Jhelum inflow | 0.8664 | 0.9978 | 0.0454 | 1.2971 | |
| Indus inflow | C-Vine based CEEMDAN-R-MM | 2.1771 | 0.9990 | 0.0770 | 7.8407 |
| Kabul inflow | 0.9195 | 0.9978 | 0.0687 | 1.3767 | |
| Chenab inflow | 1.2826 | 0.9966 | 0.1458 | 2.8982 | |
| Jhelum inflow | 0.8985 | 0.9976 | 0.0691 | 1.4090 |
Figure 6Residuals from the first-stage CEEMDAN-R-MM method for (A) Indus and (B) Jhelum river inflow.
Estimated values of Kendall’s correlation.
| Rivers | Indus | Jhelum | Chenab | Kabul |
|---|---|---|---|---|
| Indus | 1.0000 | 0.4410 | 0.5441 | 0.7468 |
| Jhelum | 1.0000 | 0.6794 | 0.5844 | |
| Chenab | 1.0000 | 0.6207 | ||
| Kabul | 1.0000 |
Figure 7The empirical (red and black dotted line) and theoretical (straight black line) normal distribution of errors resulting from CEEMDAN-R-MM (A–D) and (right) ARMA method (E–H).
Figure 8Structure of pair copula decomposition of 4-D C-Vine copula for Indus and Kabul rivers inflow simulation conditioned on Jhelum and Chenab rivers inflow where C is showing Chenab, J for Jhelum, K for Kabul and I for Indus river inflow with its AIC, BIC and log-likelihood values.
Figure 9Structure of pair copula decomposition of 4-D C-Vine copula for Jhelum and Chenab rivers inflow simulation conditioned on Indus and Kabul rivers inflow where C is showing Chenab, J for Jhelum, K for Kabul and I for Indus river inflow with its AIC, BIC and log-likelihood values.
Figure 10The predicted graph for all case studies of proposed two-stage C-Vine based CEEMDAN-R-MM model with the predicted values of benchmark models (VAR, Copula based ARMA, first-stage CEEMDAN-R-MM) for (A) Indus river inflow, (B) Jhelum river inflow.
Figure 11The predicted graph for all case studies of proposed two-stage C-Vine based CEEMDAN-R-MM model with the predicted values of benchmark models (VAR, Copula based ARMA, first-stage CEEMDAN-R-MM) for (A) Chenab river inflow, (B) Kabul river inflow.