| Literature DB >> 34671081 |
Mustafa Abed1, Monzur Alam Imteaz1, Ali Najah Ahmed2, Yuk Feng Huang3.
Abstract
Evaporation is a key element for water resource management, hydrological modelling, and irrigation system designing. Monthly evaporation (Ep) was projected by deploying three machine learning (ML) models included Extreme Gradient Boosting, ElasticNet Linear Regression, and Long Short-Term Memory; and two empirical techniques namely Stephens-Stewart and Thornthwaite. The aim of this study is to develop a reliable generalised model to predict evaporation throughout Malaysia. In this context, monthly meteorological statistics from two weather stations in Malaysia were utilised for training and testing the models on the basis of climatic aspects such as maximum temperature, mean temperature, minimum temperature, wind speed, relative humidity, and solar radiation for the period of 2000-2019. For every approach, multiple models were formulated by utilising various combinations of input parameters and other model factors. The performance of models was assessed by utilising standard statistical measures. The outcomes indicated that the three machine learning models formulated outclassed empirical models and could considerably enhance the precision of monthly Ep estimate even with the same combinations of inputs. In addition, the performance assessment showed that Long Short-Term Memory Neural Network (LSTM) offered the most precise monthly Ep estimations from all the studied models for both stations. The LSTM-10 model performance measures were (R2 = 0.970, MAE = 0.135, MSE = 0.027, RMSE = 0.166, RAE = 0.173, RSE = 0.029) for Alor Setar and (R2 = 0.986, MAE = 0.058, MSE = 0.005, RMSE = 0.074, RAE = 0.120, RSE = 0.013) for Kota Bharu.Entities:
Year: 2021 PMID: 34671081 PMCID: PMC8528820 DOI: 10.1038/s41598-021-99999-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Location of case study [Imagery ©2021 TerraMetrics, Map data ©2021 Google].
Various meteorological variables and their descriptive statistics.
| Station | Dataset | Unit | ||||||
|---|---|---|---|---|---|---|---|---|
| Alor Setar | °C | 32.81 | 1.82 | 5.55 | − 0.35 | 24.8 | 39.1 | |
| °C | 24.19 | 1.06 | 4.38 | − 0.36 | 19.2 | 27.7 | ||
| % | 80.91 | 7.38 | 9.12 | − 0.98 | 49.8 | 96.8 | ||
| m/s | 1.66 | 0.57 | 34.33 | 0.59 | 0.1 | 4.7 | ||
| MJ m−2 | 18.44 | 4.72 | 25.59 | − 0.81 | 0.66 | 27.69 | ||
| mm | 4.43 | 1.59 | 35.85 | 0.31 | 0.1 | 9.9 | ||
| Kota Bharu | °C | 31.34 | 1.64 | 5.23 | − 0.56 | 24.4 | 35.9 | |
| °C | 24.22 | 1.06 | 4.41 | − 0.33 | 17.8 | 27.9 | ||
| % | 80.60 | 4.66 | 5.79 | 0.38 | 61.9 | 98.3 | ||
| m/s | 2.31 | 0.90 | 39.03 | 1.99 | 0.4 | 8.6 | ||
| MJ m−2 | 19 | 5.25 | 27.65 | − 0.87 | 0.6 | 28.9 | ||
| mm | 4.22 | 1.32 | 31.35 | − 0.22 | 0 | 9.5 |
In this table, the X, S, C, C, X and X represent the mean, standard deviation, coefficient of variation, skewness, maximum and minimum of the weather variables, respectively.
Figure 2Monthly variations of Ep and associated meteorological parameters used in this study.
Figure 3Partial Autocorrelation for Alor Setar and Kota Bharu stations (Monthly).
Input combinations of meteorological variables used for machine learning models.
| No. | Model | Input combinations | ||
|---|---|---|---|---|
| ElasticNet LR | XGB | LSTM | ||
| 1 | ElasticNet LR-1 | XGB-1 | LSTM-1 | Ta |
| 2 | ElasticNet LR-2 | XGB-2 | LSTM-2 | Tmax, Tmin |
| 3 | ElasticNet LR-3 | XGB-3 | LSTM-3 | Tmax, Tmin, RH |
| 4 | ElasticNet LR-4 | XGB-4 | LSTM-4 | Tmax, Tmin, RH,Sw |
| 5 | ElasticNet LR-5 | XGB-5 | LSTM-5 | Tmax, Tmin, RH, RS |
| 6 | ElasticNet LR-6 | XGB-6 | LSTM-6 | Ta, RS |
| 7 | ElasticNet LR-7 | XGB-7 | LSTM-7 | Tmax, Tmin, Rs |
| 8 | ElasticNet LR-8 | XGB-8 | LSTM-8 | Tmax, Tmin, Rs, Sw |
| 9 | ElasticNet LR-9 | XGB-9 | LSTM-9 | Tmax, Tmin, Rs, Sw, RH |
| 10 | ElasticNet LR-10 | XGB-10 | LSTM-10 | Tmax, Tmin, Rs, Sw, RH, Ep |
Figure 4LSTM neural network cell.
Advantages and disadvantages of the proposed machine learning models.
| AI model type | Advantages | Disadvantages |
|---|---|---|
| ElasticNet LR | ElasticNet is a Lasso/Ridge hybrid that benefits from both the L1 (Lasso) and L2 (Ridge) regularizers Simple model and can be regularised to avoid overfitting Performs well when there are several features that are related to one another | Performs poorly when there are non-linear relationships since they are not naturally flexible enough to capture more complex patterns, and adding the appropriate interaction terms or polynomials can be difficult and time-consuming |
| XGB | Boosting is a persistence and robust method for preventing and mitigating over-fitting Flexible to adapt Extremely fast computation | High-sensitivity to outliers Does not perform very well on large data sets |
| LSTM | Ability to learn extremely complicated patterns Ability to generate new features from limited set of training data, and to easily update them with new data Powerful deep learning algorithm that is able to model complicated and highly non-linear processes without any constraints on the input–output vector relationships | Tuning requires a high level of expertise (i.e. set the architecture and hyperparameters) High-speed processing units and powerful GPUs are required for training the data sets |
Figure 5Flow chart of the proposed methodology to forecast evaporation using machine learning models.
Statistical results of Stephens and Stewart and Thornthwaite empirical models for prediction Ep at Alor Setar and Kota Bharu stations.
| Station | Model | R2 | MAE | MSE | RMSE | RAE | RSE |
|---|---|---|---|---|---|---|---|
| Alor Setar | Stephens and Stewart | 0.522 | 0.535 | 0.458 | 0.677 | 0.681 | 0.477 |
| Thornthwaite | 0.303 | 0.635 | 0.67 | 0.819 | 0.811 | 0.696 | |
| Kota Bharu | Stephens and Stewart | 0.599 | 0.33 | 0.19 | 0.436 | 0.603 | 0.400 |
| Thornthwaite | 0.401 | 0.449 | 0.33 | 0.574 | 0.82 | 0.693 |
Figure 6Scatter plot of measured Ep versus predicted Ep for the proposed empirical modles for Alor Setar station.
Figure 7Scatter plot of measured Ep versus predicted Ep for the proposed empirical modles for Kota Bharu station.
Statistical results (testing period) of the three machine learning models for predicting monthly Ep under 10 input combinations of meteorological variables for Alor Setar and Kota Bharu.
| Station/model | R2 | MAE | MSE | RMSE | RAE | RSE |
|---|---|---|---|---|---|---|
| ElasticNet LR-1 | 0.615 | 0.483 | 0.374 | 0.612 | 0.610 | 0.384 |
| ElasticNet LR-2 | 0.759 | 0.387 | 0.237 | 0.486 | 0.484 | 0.240 |
| ElasticNet LR-3 | 0.849 | 0.304 | 0.149 | 0.387 | 0.378 | 0.150 |
| ElasticNet LR-4 | 0.845 | 0.296 | 0.144 | 0.380 | 0.366 | 0.144 |
| ElasticNet LR-5 | 0.863 | 0.296 | 0.136 | 0.369 | 0.366 | 0.136 |
| ElasticNet LR-6 | 0.734 | 0.410 | 0.261 | 0.511 | 0.514 | 0.265 |
| ElasticNet LR-7 | 0.792 | 0.361 | 0.206 | 0.454 | 0.448 | 0.207 |
| ElasticNet LR-8 | 0.810 | 0.320 | 0.159 | 0.399 | 0.396 | 0.159 |
| ElasticNet LR-9 | 0.862 | 0.299 | 0.137 | 0.371 | 0.369 | 0.137 |
| ElasticNet LR-10 | ||||||
| XGB-1 | 0.666 | 0.439 | 0.325 | 0.57 | 0.555 | 0.333 |
| XGB-2 | 0.762 | 0.372 | 0.233 | 0.483 | 0.466 | 0.237 |
| XGB-3 | 0.824 | 0.325 | 0.174 | 0.417 | 0.403 | 0.175 |
| XGB-4 | 0.838 | 0.309 | 0.161 | 0.401 | 0.383 | 0.162 |
| XGB-5 | 0.845 | 0.309 | 0.154 | 0.393 | 0.383 | 0.154 |
| XGB-6 | 0.766 | 0.380 | 0.230 | 0.479 | 0.476 | 0.233 |
| XGB-7 | 0.786 | 0.358 | 0.213 | 0.461 | 0.445 | 0.213 |
| XGB-8 | 0.833 | 0.327 | 0.166 | 0.408 | 0.404 | 0.166 |
| XGB-9 | 0.812 | 0.349 | 0.187 | 0.433 | 0.431 | 0.187 |
| XGB-10 | ||||||
| LSTM-1 | 0.741 | 0.359 | 0.232 | 0.482 | 0.474 | 0.258 |
| LSTM-2 | 0.766 | 0.364 | 0.212 | 0.461 | 0.475 | 0.233 |
| LSTM-3 | 0.894 | 0.249 | 0.097 | 0.311 | 0.323 | 0.105 |
| LSTM-4 | 0.925 | 0.196 | 0.068 | 0.261 | 0.254 | 0.074 |
| LSTM-5 | 0.947 | 0.178 | 0.048 | 0.219 | 0.230 | 0.052 |
| LSTM-6 | 0.807 | 0.341 | 0.175 | 0.418 | 0.446 | 0.192 |
| LSTM-7 | 0.884 | 0.252 | 0.106 | 0.326 | 0.326 | 0.115 |
| LSTM-8 | 0.914 | 0.232 | 0.079 | 0.281 | 0.300 | 0.085 |
| LSTM-9 | 0.959 | 0.150 | 0.037 | 0.194 | 0.194 | 0.041 |
| LSTM-10 | ||||||
| ElasticNet LR-1 | 0.537 | 0.335 | 0.175 | 0.418 | 0.701 | 0.462 |
| ElasticNet LR-2 | 0.721 | 0.256 | 0.105 | 0.325 | 0.536 | 0.278 |
| ElasticNet LR-3 | 0.796 | 0.211 | 0.073 | 0.270 | 0.453 | 0.203 |
| ElasticNet LR-4 | 0.867 | 0.165 | 0.048 | 0.219 | 0.354 | 0.133 |
| ElasticNet LR-5 | 0.883 | 0.152 | 0.042 | 0.205 | 0.327 | 0.116 |
| ElasticNet LR-6 | 0.821 | 0.197 | 0.067 | 0.260 | 0.414 | 0.179 |
| ElasticNet LR-7 | 0.827 | 0.192 | 0.062 | 0.249 | 0.414 | 0.172 |
| ElasticNet LR-8 | 0.904 | 0.139 | 0.034 | 0.186 | 0.297 | 0.095 |
| ElasticNet LR-9 | 0.923 | 0.114 | 0.028 | 0.167 | 0.243 | 0.076 |
| ElasticNet LR-10 | ||||||
| XGB-1 | 0.550 | 0.330 | 0.170 | 0.412 | 0.691 | 0.449 |
| XGB-2 | 0.745 | 0.240 | 0.096 | 0.310 | 0.503 | 0.254 |
| XGB-3 | 0.803 | 0.211 | 0.070 | 0.265 | 0.453 | 0.196 |
| XGB-4 | 0.774 | 0.228 | 0.082 | 0.286 | 0.489 | 0.225 |
| XGB-5 | 0.849 | 0.179 | 0.054 | 0.233 | 0.384 | 0.150 |
| XGB-6 | 0.833 | 0.190 | 0.063 | 0.251 | 0.399 | 0.166 |
| XGB-7 | 0.819 | 0.198 | 0.065 | 0.255 | 0.425 | 0.180 |
| XGB-8 | 0.906 | 0.142 | 0.033 | 0.184 | 0.304 | 0.093 |
| XGB-9 | 0.917 | 0.131 | 0.030 | 0.174 | 0.279 | 0.082 |
| XGB-10 | ||||||
| LSTM-1 | 0.586 | 0.335 | 0.173 | 0.416 | 0.677 | 0.413 |
| LSTM-2 | 0.796 | 0.225 | 0.085 | 0.292 | 0.455 | 0.203 |
| LSTM-3 | 0.879 | 0.164 | 0.048 | 0.219 | 0.341 | 0.120 |
| LSTM-4 | 0.869 | 0.177 | 0.052 | 0.229 | 0.366 | 0.130 |
| LSTM-5 | 0.915 | 0.142 | 0.034 | 0.184 | 0.295 | 0.084 |
| LSTM-6 | 0.823 | 0.213 | 0.074 | 0.272 | 0.431 | 0.176 |
| LSTM-7 | 0.849 | 0.195 | 0.060 | 0.245 | 0.405 | 0.150 |
| LSTM-8 | 0.906 | 0.161 | 0.037 | 0.194 | 0.333 | 0.093 |
| LSTM-9 | 0.948 | 0.111 | 0.021 | 0.145 | 0.228 | 0.051 |
| LSTM-10 | ||||||
Figure 8Scatter plot of measured Ep versus predicted Ep for the proposed machine learning models for Alor Setar station.
Figure 9Scatter plot of measured Ep versus predicted Ep for the proposed machine learning models for Kota Bharu station.
Statistical results of the empirical and machine learning models under the same input combination for Alor Setar and Kota Bharu weather stations.
| Input combination | Station/model | R2 | MAE | MSE | RMSE | RAE | RSE |
|---|---|---|---|---|---|---|---|
| Ta, Rs | Stephens and Stewart | 0.522 | 0.535 | 0.458 | 0.677 | 0.681 | 0.477 |
| ElasticNet LR-6 | 0.734 | 0.410 | 0.261 | 0.511 | 0.514 | 0.265 | |
| XGB-6 | 0.766 | 0.380 | 0.230 | 0.479 | 0.476 | 0.233 | |
| LSTM-6 | 0.807 | 0.341 | 0.175 | 0.418 | 0.446 | 0.192 | |
| Ta | Thornthwaite | 0.303 | 0.635 | 0.670 | 0.819 | 0.811 | 0.696 |
| ElasticNet LR-1 | 0.615 | 0.483 | 0.374 | 0.612 | 0.610 | 0.384 | |
| XGB-1 | 0.666 | 0.439 | 0.325 | 0.570 | 0.555 | 0.333 | |
| LSTM-1 | 0.741 | 0.359 | 0.232 | 0.482 | 0.474 | 0.258 | |
| Ta, Rs | Stephens and Stewart | 0.599 | 0.330 | 0.190 | 0.436 | 0.603 | 0.400 |
| ElasticNet LR-6 | 0.821 | 0.197 | 0.067 | 0.260 | 0.414 | 0.179 | |
| XGB-6 | 0.833 | 0.190 | 0.063 | 0.251 | 0.399 | 0.166 | |
| LSTM-6 | 0.823 | 0.213 | 0.074 | 0.272 | 0.431 | 0.176 | |
| Ta | Thornthwaite | 0.401 | 0.449 | 0.330 | 0.574 | 0.820 | 0.693 |
| ElasticNet LR-1 | 0.537 | 0.335 | 0.175 | 0.418 | 0.701 | 0.462 | |
| XGB-1 | 0.550 | 0.330 | 0.170 | 0.412 | 0.691 | 0.449 | |
| LSTM-1 | 0.586 | 0.335 | 0.173 | 0.416 | 0.677 | 0.413 | |