| Literature DB >> 32316191 |
Viet-Ha Nhu1,2, Ataollah Shirzadi3, Himan Shahabi4,5, Sushant K Singh6, Nadhir Al-Ansari7, John J Clague8, Abolfazl Jaafari9, Wei Chen10,11, Shaghayegh Miraki12, Jie Dou13, Chinh Luu14, Krzysztof Górski15, Binh Thai Pham16, Huu Duy Nguyen17, Baharin Bin Ahmad18.
Abstract
Shallow landslides damage buildings and other infrastructure, disrupt agriculture practices, and can cause social upheaval and loss of life. As a result, many scientists study the phenomenon, and some of them have focused on producing landslide susceptibility maps that can be used by land-use managers to reduce injury and damage. This paper contributes to this effort by comparing the power and effectiveness of five machine learning, benchmark algorithms-Logistic Model Tree, Logistic Regression, Naïve Bayes Tree, Artificial Neural Network, and Support Vector Machine-in creating a reliable shallow landslide susceptibility map for Bijar City in Kurdistan province, Iran. Twenty conditioning factors were applied to 111 shallow landslides and tested using the One-R attribute evaluation (ORAE) technique for modeling and validation processes. The performance of the models was assessed by statistical-based indexes including sensitivity, specificity, accuracy, mean absolute error (MAE), root mean square error (RMSE), and area under the receiver operatic characteristic curve (AUC). Results indicate that all the five machine learning models performed well for shallow landslide susceptibility assessment, but the Logistic Model Tree model (AUC = 0.932) had the highest goodness-of-fit and prediction accuracy, followed by the Logistic Regression (AUC = 0.932), Naïve Bayes Tree (AUC = 0.864), ANN (AUC = 0.860), and Support Vector Machine (AUC = 0.834) models. Therefore, we recommend the use of the Logistic Model Tree model in shallow landslide mapping programs in semi-arid regions to help decision makers, planners, land-use managers, and government agencies mitigate the hazard and risk.Entities:
Keywords: Iran; Shallow landslide; artificial intelligence; goodness-of-fit; logistic model tree; prediction accuracy
Mesh:
Year: 2020 PMID: 32316191 PMCID: PMC7215797 DOI: 10.3390/ijerph17082749
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1Location of shallow landslides in the study area. The blue circles denote landslides for training the algorithms, and the red circles denote landslides for validating the algorithms.
Figure 2Landslide conditioning factors used in this study: (a) slope, (b) aspect, (c) elevation, (d) curvature, (e) plan curvature, (f) profile curvature, (g) slope length (SL), (h) rainfall, (i) annual solar radiation (j) stream power index (SPI), (k) topographic wetness index (TWI), (l) distance to rivers, (m) river density, (n) lithology, (o) distance to fault, (p) fault density, (q) land use, (r) normalized difference vegetation index (NDVI), (s) distance to road, (t) road density.
Figure 3Illustration of the Support Vector Machines (SVM) method.
Figure 4Flowchart of the Artificial Neutral Network (ANN) model.
Figure 5The flowchart of methodology used in this study.
Figure 6Average merit of shallow landslide factors calculated by the One Rule Attribute Evaluation (ORAE) method.
Model performances of the applied data-mining approaches for the training and validation datasets.
| Parameters | LMT | NBT | LR | ANN | SVM | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| T * | V * | T | V | T | V | T | V | T | V | |
|
| 78 | 19 | 77 | 18 | 80 | 19 | 76 | 16 | 77 | 18 |
|
| 81 | 19 | 83 | 20 | 81 | 19 | 73 | 17 | 83 | 20 |
|
| 11 | 3 | 12 | 4 | 9 | 3 | 13 | 6 | 12 | 4 |
|
| 8 | 3 | 6 | 2 | 8 | 3 | 16 | 5 | 6 | 2 |
|
| 0.907 | 0.864 | 0.928 | 0.900 | 0.909 | 0.864 | 0.826 | 0.762 | 0.928 | 0.900 |
|
| 0.880 | 0.864 | 0.874 | 0.833 | 0.900 | 0.864 | 0.849 | 0.739 | 0.874 | 0.833 |
|
| 0.893 | 0.864 | 0.899 | 0.864 | 0.904 | 0.864 | 0.837 | 0.750 | 0.899 | 0.864 |
|
| 0.207 | 0.216 | 0.225 | 0.225 | 0.213 | 0.216 | 0.241 | 0.235 | 0.223 | 0.246 |
|
| 0.304 | 0.313 | 0.319 | 0.341 | 0.311 | 0.314 | 0.349 | 0.358 | 0.318 | 0.369 |
|
| 0.944 | 0.936 | 0.918 | 0.874 | 0.939 | 0.936 | 0.911 | 0.871 | 0.899 | 0.864 |
T *: Training, V *: Validation.
Figure 7Landslide susceptibility maps: (a) Logistic Model Tree (LMT), (b) Logistic Regression (LR), (c) Naïve Bayes Tree (NBT), (d) ANN, and (e) SVM.
Figure 8Receiver operating characteristic (ROC) curves and area under the receiver operatic characteristic curve (AUC) for the (a) training dataset and (b) validation dataset.
Performance of the five landslide machine learning models using Wilcoxon signed-rank test (two-tailed).
| No. | Landslide Model | Mean Rank | χ2 | |
|---|---|---|---|---|
| 1 | LMT | 2.80 | 557.912 | 0.000 |
| 2 | LR | 2.93 | ||
| 3 | NBT | 2.88 | ||
| 4 | ANN | 3.07 | ||
| 5 | SVM | 2.32 |
Performance of the five landslide machine learning models using the Wilcoxon signed-rank test (two-tailed).
| No. | Pairwise | Number of Positive | Number of Negative | Significance | ||
|---|---|---|---|---|---|---|
|
|
| 60 | 50 | −1.536 | 0.125 | No |
|
|
| 83 | 27 | −5.590 | 0.000 | Yes |
|
|
| 62 | 46 | −0.878 | 0.080 | Yes |
|
|
| 36 | 74 | −3.677 | 0.000 | Yes |
|
|
| 82 | 29 | −5.589 | 0.000 | Yes |
|
|
| 61 | 49 | −0.605 | 0.015 | Yes |
|
|
| 35 | 75 | −4.081 | 0.000 | Yes |
|
|
| 36 | 73 | −3.958 | 0.000 | Yes |
|
|
| 30 | 80 | −5.711 | 0.000 | Yes |
|
|
| 43 | 67 | −3.140 | 0.002 | Yes |
Note: The standard p-value is 0.05.