| Literature DB >> 35585155 |
Amin Mahdavi-Meymand1, Mohammad Zounemat-Kermani2, Wojciech Sulisz3, Rodolfo Silva4.
Abstract
Wave-induced inundation in coastal zones is a serious problem for residents. Accurate prediction of wave run-up height is a complex phenomenon in coastal engineering. In this study, several machine learning (ML) models are developed to simulate wave run-up height. The developed methods are based on optimization techniques employing the group method of data handling (GMDH). The invasive weed optimization (IWO), firefly algorithm (FA), teaching-learning-based optimization (TLBO), harmony search (HS), and differential evolution (DE) meta-heuristic optimization algorithms are embedded with the GMDH to yield better feasible optimization. Preliminary results indicate that the developed ML models are robust tools for modeling the wave run-up height. All ML models' accuracies are higher than empirical relations. The obtained results show that employing heuristic methods enhances the accuracy of the standard GMDH model. As such, the FA, IWO, DE, TLBO, and HS improve the RMSE criterion of the standard GMDH by the rate of 47.5%, 44.7%, 24.1%, 41.1%, and 34.3%, respectively. The GMDH-FA and GMDH-IWO are recommended for applications in coastal engineering.Entities:
Mesh:
Year: 2022 PMID: 35585155 PMCID: PMC9117196 DOI: 10.1038/s41598-022-12038-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1A schematic presentation of wave run-up on the foreshore.
Figure 2A schematic plot of GMDH structure with three hidden layers.
Figure 3The general flowchart of the constructed hybrid GMDH models.
Presentation of the meta-heuristic algorithms.
| Category | Algorithms | Inspired by | Reason for the selection |
|---|---|---|---|
| Swarm intelligence | IWO | Plants and animals | High performance in combination with ANFIS and ANN models[ |
| FA | Insects movement | A robust and efficient algorithm in optimization problems[ | |
| TLBO | Teaching–learning process | A simple and reliable algorithm that does not need any initial parameters[ | |
| Evolutionary | HS | Harmony in music | Successful application of hybrid GMDH-HS in modeling nonlinear problems[ |
| DE | Biological insights | Popular, simple, effective[ |
Figure 4Bed permeability parameter for different types of bed materials.
Statistical summary of the data used in this study.
| Parameter | Type | Data range | Average | Standard deviation (SD) | Correlation coefficient ( |
|---|---|---|---|---|---|
| Output | [0.043,1.6] | 0.237 | 0.211 | 1 | |
| Input | [1.33,5.1] | 2.542 | 0.884 | 0.351 | |
| Input | [1.24,4.4] | 2.181 | 0.707 | 0.403 | |
| Input | [0.461,1.18] | 0.139 | 0.138 | 0.938 | |
| cot | Input | [1.5,4] | 2.709 | 0.783 | 0.0003 |
| Input | 1.25,2.25 | 1.641 | 0.485 | 0.028 | |
| Input | 0.1,0.5,0.6 | 0.276 | 0.205 | 0.009 | |
| Input | [0.991,7.584] | 3.202 | 1.367 | 0.0154 | |
| Input | [1.047,8.869] | 3.740 | 1.681 | 0.0129 |
Structure of models and initial parameters of meta-heuristic algorithms.
| Model | Structure | Meta-heuristic parameters | |||
|---|---|---|---|---|---|
| Number of middle layers | Maximum neurons in each layer | Parameter | Value | Epoch | |
| GMDH | 5 | 10 | – | – | 300 |
| GMDH-IWO | 5 | 10 | Maximum no of seeds | 10 | 300 |
| Minimum no of seeds | 1 | ||||
| Initial Standard deviation ( | 0.5 | ||||
| Final Standard deviation ( | 0.001 | ||||
| Search Space Range | [−10,10] | ||||
| GMDH-FA | 5 | 10 | Mutation Coefficient ( | 0.2 | 300 |
| Attraction Coefficient ( | 2 | ||||
| Light Absorption Coefficient ( | 1 | ||||
| Search Space Range | [−10,10] | ||||
| GMDH-DE | 5 | 10 | Lower bound of scaling factor | 0.2 | 300 |
| Upper bound of scaling factor | 0.8 | ||||
| Crossover controller ( | 0.9 | ||||
| Search space range | [−10,10] | ||||
| GMDH-HS | 5 | 10 | Fret width damp ratio | 0.995 | 300 |
| Pitch adjustment rate ( | 0.1 | ||||
| Harmony memory consideration rate ( | 0.9 | ||||
| Search space range | [−10,10] | ||||
| GMDH-TLBO* | 5 | 10 | Search space range | [−10,10] | 300 |
*The TLBO algorithm can be applied without allocating any specific primary or adjusting parameter.
Statistical parameters for the evaluation of the performance of derived models at a training stage.
| Method | Statistics | |||
|---|---|---|---|---|
| GMDH-TLBO | 0.0250 | 0.9863 | 0.0184 | 0.9965 |
| GMDH | 0.0254 | 0.9853 | 0.0195 | 0.9963 |
| GMDH-DE | 0.0274 | 0.9829 | 0.0209 | 0.9956 |
| GMDH-FA | 0.028 | 0.9823 | 0.0188 | 0.9955 |
| GMDH-IWO | 0.0355 | 0.9759 | 0.0198 | 0.9932 |
| GMDH-HS | 0.0359 | 0.9730 | 0.0219 | 0.9928 |
Statistical parameters for the evaluation of the performance of derived models at a testing stage.
| Method | Statistics | Enhancement ( +) or deterioration (−) of the | Establishment | Execution time | |||
|---|---|---|---|---|---|---|---|
| GMDH-FA | 0.0209 | 0.9908 | 0.0172 | 0.9977 | + 47.49 | Difficult | High |
| GMDH-IWO | 0.0220 | 0.9908 | 0.0170 | 0.9975 | + 44.74 | Difficult | High |
| GMDH-TLBO | 0.0235 | 0.9888 | 0.0180 | 0.997 | + 41.08 | Difficult | High |
| GMDH-HS | 0.0262 | 0.9864 | 0.0211 | 0.9964 | + 34.26 | Difficult | High |
| GMDH-DE | 0.0302 | 0.984 | 0.0240 | 0.9953 | + 24.12 | Difficult | Medium |
| GMDH | 0.0398 | 0.9674 | 0.028 | 0.9916 | Base | Not Simple | Low |
| Schimmels et al.[ | 0.0763 | 0.9549 | 0.0638 | 0.9711 | −91.52 | Simple | Low |
| Van der Meer and Stam[ | 0.1224 | 0.8178 | 0.0747 | 0.927 | −207.22 | Simple | Low |
Figure 5Scatter plots obtained for the testing stage.
Figure 6Taylor diagram for the tested data set.
The CPU time for 300 epochs of the applied GMDH models.
| Model | Empirical relations and standard GMDH | GMDH-IWO | GMDH-FA | GMDH-DE | GMDH-HS | GMDH-TLBO |
|---|---|---|---|---|---|---|
| CPU* time (min) | < 1 | 116 | 405 | 11 | 43 | 116 |
*Core i7; RAM: 8 GB.
The results of the Kruskal–Wallis test for assessing the significant statistical differences between the applied models.
| Methods | Significantly different (95%) | Significantly different (99%) | |
|---|---|---|---|
| Schimmels et al.[ | 0.7251 | NO | NO |
| GMDH, GMDH-FA, GMDH-DE, GMDH-IWO, GMDH-TLBO, GMDH-HS, Schimmels et al.[ | ˂.0006 | YES | YES |
| GMDH, GMDH-FA, GMDH-DE, GMDH-IWO, GMDH-TLBO, GMDH-HS | 0.994 | NO | NO |
Summary of the studies conducted to predict wave run-up in coastal regions.
| Authors | Methods | Type of the machine learning model | Calculated | Remarks |
|---|---|---|---|---|
| Abolfathi et al.[ | M5’ Decision tree | Decision tree | 0.970 | The results show the general ability of M5’ to simulate wave run-up |
| Bakhtyar et al.[ | ANFIS | Hybrid intelligent systems | 0.960 | The comparison of results confirms the high accuracy of ANFIS in predicting wave run-up |
| Empirical formulas | – | 0.890 | ||
| Erdik and Savci[ | ANFIS | Hybrid intelligent systems | 0.621 | TS Fuzzy is a capable tool for modeling wave run-up |
| Empirical formulas | – | 0.559 | ||
| Bonakdar and Etemad-Shahidi[ | M5 model tree | Decision tree | 0.920 | M5 results are better than TS Fuzzy and empirical formulas |
| TS Fuzzy | Hybrid intelligent systems | – | ||
| Empirical formulas | – | 0.902 | ||
| Elbisy[ | MART* | Decision tree | 0.974 | The MART model is more accurate than the ANN |
| ANN | Neural computing | 0.837 | ||
| Rehman et al.[ | ANN | Neural computing | 0.995 | Both ANN and RSM are robust methods for the prediction of wave run-up |
| Response surface methodology (RSM) | – | 0.999 | ||
| Yao et al.[ | ANN | Neural computing | 0.987 | The ANN performance in predicting wave run-up was confirmed |
| The current study | GMDH | Neural computing | 0.985 | The results show that the GMDH models provide more accurate results than empirical relations. Applications of optimization algorithms increases the accuracy of standard models. The Kruskal–Wallis test shows that there are no significant statistical differences between the classical and hybrid GMDH models |
GMDH-TLBO GMDH-FA GMDH-DE GMDH-HS GMDH-IWO | Hybrid intelligent systems | 0.986 0.991 0.984 0.973 0.976 | ||
Van der Meer and Stam[ Schimmels et al.[ | – | 0.818 0.955 |
*Multiple additive regression trees.