| Literature DB >> 32170078 |
Haitham Abdulmohsin Afan1, Mohammed Falah Allawi2, Amr El-Shafie3, Zaher Mundher Yaseen4, Ali Najah Ahmed5, Marlinda Abdul Malek5, Suhana Binti Koting6, Sinan Q Salih1, Wan Hanna Melini Wan Mohtar7, Sai Hin Lai6, Ahmed Sefelnasr8, Mohsen Sherif8, Ahmed El-Shafie6.
Abstract
In nature, streamflow pattern is characterized with high non-linearity and non-stationarity. Developing an accurate forecasting model for a streamflow is highly essential for several applications in the field of water resources engineering. One of the main contributors for the modeling reliability is the optimization of the input variables to achieve an accurate forecasting model. The main step of modeling is the selection of the proper input combinations. Hence, developing an algorithm that can determine the optimal input combinations is crucial. This study introduces the Genetic algorithm (GA) for better input combination selection. Radial basis function neural network (RBFNN) is used for monthly streamflow time series forecasting due to its simplicity and effectiveness of integration with the selection algorithm. In this paper, the RBFNN was integrated with the Genetic algorithm (GA) for streamflow forecasting. The RBFNN-GA was applied to forecast streamflow at the High Aswan Dam on the Nile River. The results showed that the proposed model provided high accuracy. The GA algorithm can successfully determine effective input parameters in streamflow time series forecasting.Entities:
Year: 2020 PMID: 32170078 PMCID: PMC7070020 DOI: 10.1038/s41598-020-61355-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The location of the case study (Aswan High Dam).
Figure 2The Natural stream-flow at AHD for months of (a) August, (b) November and (c) April for years between 1870 and 2000.
Figure 3The conceptual input variables selection by using GA.
Figure 4The flowchart of Genetic input selection method.
Genetic algorithm parameters setup.
| Genetic parameters | |
|---|---|
| Population size | 5 |
| Number of generations | 30 |
| Selection scheme | tournament |
| Tournament size | 0.25 |
| p initialize | 0.5 |
| p crossover | 0.5 |
| Crossover type | uniform |
The evaluation metrics for training phase of the different input combinations for each month.
| Month | RMSE | MAE | MAPE | MBE | d |
|---|---|---|---|---|---|
| August | 2.415 | 1.976 | 11.113 | 0.021 | 0.901 |
| September | 1.789 | 1.435 | 6.581 | 0.006 | 0.948 |
| October | 1.202 | 0.945 | 6.231 | −0.0006 | 0.968 |
| November | 0.620 | 0.475 | 5.760 | 0.003 | 0.979 |
| December | 0.241 | 0.177 | 3.071 | 0.0006 | 0.993 |
| January | 0.161 | 0.127 | 3.212 | 0.002 | 0.994 |
| February | 0.158 | 0.128 | 4.772 | 0.002 | 0.992 |
| March | 0.127 | 0.102 | 4.097 | 0.001 | 0.994 |
| April | 0.146 | 0.112 | 6.022 | 0.009 | 0.991 |
| May | 0.137 | 0.104 | 6.279 | 0.005 | 0.988 |
| June | 0.327 | 0.244 | 13.878 | 0.041 | 0.929 |
| July | 0.969 | 0.747 | 16.646 | 0.048 | 0.899 |
The evaluation metrics for testing phase of the different input combinations for each month.
| Month | RMSE | MAE | MAPE | MBE | d | Max. (RE) | R2 |
|---|---|---|---|---|---|---|---|
| August | 1.477 | 1.210 | 6.803 | −0.010 | 0.964 | −19.364 | 0.882 |
| September | 1.176 | 0.969 | 5.694 | 0.006 | 0.988 | −23.652 | 0.955 |
| October | 1.001 | 0.864 | 8.804 | 0.037 | 0.977 | −37.833 | 0.922 |
| November | 0.472 | 0.365 | 5.208 | −0.003 | 0.982 | −15.193 | 0.937 |
| December | 0.281 | 0.229 | 4.475 | 0.002 | 0.969 | −14.950 | 0.886 |
| January | 0.236 | 0.176 | 3.916 | −0.002 | 0.969 | 15.641 | 0.892 |
| February | 0.167 | 0.130 | 3.655 | 0.003 | 0.978 | −11.051 | 0.927 |
| March | 0.152 | 0.118 | 3.669 | −0.0001 | 0.978 | 11.315 | 0.917 |
| April | 0.184 | 0.149 | 4.107 | −0.006 | 0.967 | −11.839 | 0.882 |
| May | 0.125 | 0.101 | 2.853 | −0.004 | 0.979 | 7.775 | 0.920 |
| June | 0.164 | 0.134 | 4.449 | −0.001 | 0.976 | −9.871 | 0.912 |
| July | 0.566 | 0.413 | 7.121 | 0.014 | 0.964 | −32.878 | 0.876 |
The optimal inputs combination selected by GA for each month.
| Predicted Month | Input Variables | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| January | February | March | April | May | June | July | August | September | October | November | December | |
| January | X | X | X | X | X | X* | ||||||
| February | X* | X | X | X | X | |||||||
| March | X | X* | X** | X | X | X | X | |||||
| April | X* | X | X | X | X | X | X | X | ||||
| May | X | X | X | X* | X | X | X | |||||
| June | X | X | X* | X | X | |||||||
| July | X | X | X** | X | X | X | ||||||
| August | X | X | X | X* | X | X | ||||||
| September | X | X | X | X | X | X* | X | |||||
| October | X | X | X | X | X* | X | ||||||
| November | X | X | X | X* | ||||||||
| December | X | X | X | X | X* | X** | ||||||
Figure 5The convergence of genetic algorithm for January input selection.
Figure 6The scatter plots of the 12-months.
Figure 7Predicted versus observed using the best input variables for each month.
Comparison between the RBFNN-GA and previous study according to the MAE and RMSE indicators values.
| Month | RBFNN-GA | RBF-NN by[ | Accuracy Improvement (AI%) | Accuracy Improvement (AI%) | ||
|---|---|---|---|---|---|---|
| MAE | RMSE | MAE | RMSE | MAE | RMSE | |
| August | 1.210 | 1.477 | 2.4 | 0.552 | 98.34 | −62.62 |
| September | 0.969 | 1.176 | 0.85 | 0.582 | −12.28 | −50.51 |
| October | 0.864 | 1.001 | 1.82 | 2.03 | 110.64 | 102.79 |
| November | 0.365 | 0.472 | 0.92 | 1.71 | 152.05 | 262.28 |
| December | 0.229 | 0.281 | 0.32 | 0.67 | 39.73 | 138.43 |
| January | 0.176 | 0.236 | 0.53 | 0.54 | 201.13 | 128.81 |
| February | 0.130 | 0.167 | 0.28 | 0.37 | 115.38 | 121.55 |
| March | 0.118 | 0.152 | 0.22 | 0.42 | 86.44 | 176.31 |
| April | 0.149 | 0.184 | 0.18 | 0.28 | 20.80 | 52.17 |
| May | 0.101 | 0.125 | 0.21 | 0.51 | 107.92 | 308 |
| June | 0.134 | 0.164 | 0.57 | 1.14 | 325.37 | 595.12 |
| July | 0.413 | 0.566 | 3.25 | 1.02 | 686.92 | 80.21 |