| Literature DB >> 34620910 |
Pakorn Ditthakit1,2, Sirimon Pinthong3,4, Nureehan Salaeh3,4, Fadilah Binnui3,4, Laksanara Khwanchum5,4, Quoc Bao Pham6.
Abstract
Estimating monthly runoff variation, especially in ungauged basins, is inevitable for water resource planning and management. The present study aimed to evaluate the regionalization methods for determining regional parameters of the rainfall-runoff model (i.e., GR2M model). Two regionalization methods (i.e., regression-based methods and distance-based methods) were investigated in this study. Three regression-based methods were selected including Multiple Linear Regression (MLR), Random Forest (RF), and M5 Model Tree (M5), and two distance-based methods included Spatial Proximity Approach and Physical Similarity Approach (PSA). Hydrological data and the basin's physical attributes were analyzed from 37 runoff stations in Thailand's southern basin. The results showed that using hydrological data for estimating the GR2M model parameters is better than using the basin's physical attributes. RF had the most accuracy in estimating regional GR2M model's parameters by giving the lowest error, followed by M5, MLR, SPA, and PSA. Such regional parameters were then applied in estimating monthly runoff using the GR2M model. Then, their performance was evaluated using three performance criteria, i.e., Nash-Sutcliffe Efficiency (NSE), Correlation Coefficient (r), and Overall Index (OI). The regionalized monthly runoff with RF performed the best, followed by SPA, M5, MLR, and PSA. The Taylor diagram was also used to graphically evaluate the obtained results, which indicated that RF provided the products closest to GR2M's results, followed by SPA, M5, PSA, and MLR. Our finding revealed the applicability of machine learning for estimating monthly runoff in the ungauged basins. However, the SPA would be recommended in areas where lacking the basin's physical attributes and hydrological information.Entities:
Year: 2021 PMID: 34620910 PMCID: PMC8497588 DOI: 10.1038/s41598-021-99164-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The station locations of rainfall, runoff, and weather stations of the study area. (This map was created by QGIS version 3.12, which can be accessed on https://qgis.org/en/site/).
Figure 2The GR2M model working process.
Source: Adapted from Bachir et al.[38], Rwasoka et al.[39].
Figure 3Framework for research methodology.
Summary statistical values of hydrological data and physical characteristics of runoff gauged station used in this analysis.
| Statistical value | Runoff (mm) | Rainfall (mm) | ET (mm) |
|---|---|---|---|
| Maximum | 1615.43 | 1562.30 | 248.95 |
| Minimum | 289.45 | 0.00 | 94.37 |
| Average | 124.41 | 197.33 | 148.86 |
| Standard deviation | 119.29 | 155.28 | 19.31 |
Figure 4Physical characteristics information of the 37 runoff gauged stations.
Figure 5The results of model’s calibration and validation.
Figure 6The rainfall and runoff time series at runoff stations X.64 and X.70.
The statistical values of the fitted GR2M model parameters.
| Parameters | Min | Max | Avg | SD | SK | K |
|---|---|---|---|---|---|---|
| X1 | 2.00 mm | 10.00 mm | 5.71 mm | 2.49 mm | − 0.52 | − 1.03 |
| X2 | 0.54 | 1.00 | 0.93 | 0.12 | − 2.01 | 3.69 |
Remark: Min = Minimum; Max = Maximum; Avg = Average; SD = Standard deviation; SK = Skewness coefficient; and K = Kurtosis coefficient.
The suitable group of independent variables for determining two GR2M parameters.
| Methods | Scenario | X1 | X2 | ||||||
|---|---|---|---|---|---|---|---|---|---|
| MAE | RMSE | r | CA | MAE | RMSE | r | CA | ||
| MLR | 1 | 2.032 | 2.383 | 0.237 | 1.787 | 0.083 | 0.110 | 0.312 | 0.365 |
| 2 | 1.462 | 1.820 | 0.671 | 1.277 | 0.059 | 0.078 | 0.733 | 0.200 | |
| 3 | |||||||||
| RF | 1 | 0.658 | 0.861 | 0.963 | 0.531 | 0.036 | 0.045 | 0.971 | 0.046 |
| 2 | 0.595 | 0.790 | 0.976 | 0.478 | |||||
| 3 | 0.028 | 0.039 | 0.983 | 0.033 | |||||
| M5 | 1 | 2.072 | 2.453 | 0.000 | 1.842 | 0.083 | 0.109 | 0.312 | 0.365 |
| 2 | 0.051 | 0.070 | 0.799 | 0.161 | |||||
| 3 | 1.502 | 1.917 | 0.624 | 1.343 | |||||
Statistical indices for estimating X1 and X2 values.
| Statistical indices | X1 | X2 | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| MLR | RF | M5 | SPA | PSA | MLR | RF | M5 | SPA | PSA | |
| MAE | 1.45 | 0.58 | 1.50 | 1.67 | 1.95 | 0.05 | 0.03 | 0.05 | 0.08 | 0.10 |
| RMSE | 1.80 | 0.77 | 1.92 | 2.41 | 2.46 | 0.07 | 0.04 | 0.07 | 0.12 | 0.14 |
| r | 0.68 | 0.97 | 0.62 | 0.39 | 0.30 | 0.79 | 0.98 | 0.83 | 0.27 | − 0.16 |
| CA | 1.26 | 0.47 | 1.34 | 1.64 | 1.77 | 0.16 | 0.03 | 0.14 | 0.37 | 0.41 |
Figure 7Comparative results of applying the estimated X1 and X2 values with five methods in the GR2M model.
The performance comparison of applying the estimated X1 and X2 values in the GR2M model with 6 methods.
| Method | Efficiency criteria | |||||
|---|---|---|---|---|---|---|
| Calibration | Validation | |||||
| NSE | r | OI | NSE | r | OI | |
| MAX | 0.978 | 0.993 | 0.974 | 0.987 | 0.996 | 0.980 |
| MIN | 0.007 | 0.277 | 0.411 | − 0.437 | 0.407 | 0.120 |
| Avg | 0.637 | 0.825 | 0.757 | 0.465 | 0.750 | 0.639 |
| SD | 0.256 | 0.170 | 0.153 | 0.354 | 0.166 | 0.213 |
| MAX | 0.950 | 0.991 | 0.954 | 0.860 | 0.987 | 0.886 |
| MIN | − 0.170 | 0.320 | 0.280 | − 1.906 | 0.249 | − 0.628 |
| Avg | 0.513 | 0.817 | 0.686 | 0.102 | 0.719 | 0.433 |
| SD | 0.325 | 0.171 | 0.188 | 0.713 | 0.188 | 0.399 |
| MAX | 0.970 | 0.995 | 0.968 | 0.979 | 0.995 | 0.972 |
| MIN | − 0.091 | 0.393 | 0.357 | − 0.881 | 0.428 | − 0.125 |
| Avg | 0.613 | 0.830 | 0.744 | 0.384 | 0.752 | 0.592 |
| SD | 0.275 | 0.164 | 0.164 | 0.442 | 0.162 | 0.260 |
| MAX | 0.970 | 0.990 | 0.968 | 0.955 | 0.986 | 0.948 |
| MIN | − 0.748 | 0.315 | − 0.039 | − 2.235 | 0.235 | − 0.804 |
| Avg | 0.497 | 0.820 | 0.678 | 0.004 | 0.726 | 0.378 |
| SD | 0.369 | 0.172 | 0.211 | 0.806 | 0.196 | 0.448 |
| MAX | 0.955 | 0.994 | 0.958 | 0.977 | 0.995 | 0.970 |
| MIN | − 1.819 | 0.307 | − 0.583 | − 3.379 | 0.133 | − 1.534 |
| Avg | 0.420 | 0.818 | 0.636 | 0.041 | 0.733 | 0.398 |
| SD | 0.580 | 0.179 | 0.323 | 0.902 | 0.194 | 0.512 |
| MAX | 0.941 | 0.994 | 0.948 | 0.909 | 0.967 | 0.920 |
| MIN | − 2.084 | 0.383 | − 0.724 | − 3.459 | 0.094 | − 1.577 |
| Avg | 0.377 | 0.822 | 0.609 | − 0.102 | 0.735 | 0.316 |
| SD | 0.567 | 0.156 | 0.320 | 1.035 | 0.194 | 0.577 |
Remark: MAX = Maximum; MIN = Minimum; Avg = Average; SD = Standard deviation.
The performance criteria of the analysis method for estimating parameters.
| Range | Regionalization methods (station) | ||||
|---|---|---|---|---|---|
| MLR | RF | M5 | SPA | PSA | |
| 0.7 ≤ NSE | 9 | 13 | 8 | 10 | 2 |
| 0.5 ≤ NSE < 0.7 | 5 | 7 | 7 | 8 | 12 |
| 0.3 ≤ NSE < 0.5 | 10 | 9 | 8 | 5 | 7 |
| NSE < 0.3 | 13 | 8 | 14 | 14 | 16 |
| 0.7 ≤ r | 24 | 27 | 26 | 27 | 27 |
| 0.5 ≤ r < 0.7 | 10 | 9 | 9 | 6 | 8 |
| 0.3 ≤ r < 0.5 | 3 | 1 | 2 | 4 | 1 |
| r < 0.3 | 0 | 0 | 0 | 0 | 1 |
| 0.7 OI | 13 | 20 | 15 | 16 | 13 |
| 0.5 OI < 0.7 | 13 | 10 | 9 | 9 | 9 |
| 0.3 OI < 0.5 | 5 | 6 | 5 | 5 | 7 |
| OI < 0.3 | 6 | 1 | 8 | 7 | 8 |
Figure 8The relationship between the observed runoff and simulated runoff obtained from six methods in X.44, X.67, and X.234.
Figure 9Taylor diagram for the regionalized GR2M model for training and testing stages.