| Literature DB >> 31653958 |
Xiangang Luo1, Feikai Lin1, Yihong Chen2, Shuang Zhu1, Zhanya Xu3, Zhibin Huo4, Mengliang Yu1,5, Jing Peng1.
Abstract
Landslide disasters cause huge casualties and economic losses every year, how to accurately forecast the landslides has always been an important issue in geo-environment research. In this paper, a hybrid machine learning approach RSLMT is firstly proposed by coupling Random Subspace (RS) and Logistic Model Tree (LMT) for producing a landslide susceptibility map (LSM). With this method, the uncertainty introduced by input features is considered, the problem of overfitting is solved by reducing dimensions to increase the prediction rate of landslide occurrence. Moreover, the uncertainty of prediction will be deeply discussed with the rank probability score (RPS) series, which is an important evaluation of uncertainty but rarely used in LSM. Qingchuan county, China was taken as a study area. 12 landslide causal factors were selected and their contribution on landslide occurrence was evaluated by ReliefF method. In addition, Logistic Model Tree (LMT), Naive Bayes (NB) and Logistic Regression (LR) were researched for comparison. The results showed that RSLMT (AUC = 0.815) outperformed LMT (AUC = 0.805), NB (AUC = 0.771), LR (AUC = 0.785). LSM of Qingchuan county was produced using the novel model, it indicated that landslides tend to occur along with the fault belts and the middle-low mountain area that is strongly influenced by the large numbers of human engineering activities.Entities:
Year: 2019 PMID: 31653958 PMCID: PMC6814778 DOI: 10.1038/s41598-019-51941-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The study area, Qingchuan County in Sichuan Province China, generated by Arcgis version 10.2 in Windows (https://developers.arcgis.com).
Landslide causal factors with their classes and quantitative value.
| Factors | Classes | Value | Factors | Classes | Value |
|---|---|---|---|---|---|
| Lithology | Weak-semi-hard | 1 | Profile curvature | −58.61–8.89 | 1 |
| thin-medium | −8.89–0.53 | 2 | |||
| phyllite | −0.53–0.12 | 3 | |||
| schist | −0.12–9.50 | 4 | |||
| slate | 9.50–47.51 | 5 | |||
| metamorphic | Plan curvature | −51.17–17.39 | 1 | ||
| sandstone | −17.39–5.03 | 2 | |||
| Hard-semi-hard | 2 | −5.03–0.50 | 3 | ||
| medium-thick layered limestone | −0.50–8.98 | 4 | |||
| Dolomitic limestone | 8.98–53.88 | 5 | |||
| dolomite | Slope (degree) | 0–12.97 | 1 | ||
| debris | 12.97–21.84 | 2 | |||
| 21.84–29.34 | 3 | ||||
| Loosely packed soil | 4 | 29.34–37.88 | 4 | ||
| Hard-thin layered quartz sandstone | 5 | 37.88–87.01 | 5 | ||
| siltstone | Aspect | North | 1 | ||
| conglomerate | Northeast | 2 | |||
| mudstone | East | 3 | |||
| Rainfall (mm) | 0–500 | 1 | Southeast | 4 | |
| 500–800 | 2 | South | 5 | ||
| 800–1000 | 3 | Southwest | 6 | ||
| 1000–1200 | 4 | West | 7 | ||
| >1200 | 5 | Northwest | 8 | ||
| Seismic intensity | VII | 7 | Distance to faults (m) | 0–100 | 1 |
| VIII | 8 | 100–200 | 2 | ||
| IX | 9 | 200–300 | 3 | ||
| X | 10 | >300 | 4 | ||
| Landform | Middle-low mountains | 1 | Distance to rivers (m) | 0–100 | 1 |
| Middle mountains | 2 | 100–200 | 2 | ||
| High-middle mountains | 3 | 200–300 | 3 | ||
| Elevation (m) | 491–922 | 1 | >300 | 4 | |
| 922–1253 | 2 | Distance to roads (m) | 0–100 | 1 | |
| 1253–1671 | 3 | 100–200 | 2 | ||
| 1671–2245 | 4 | 200–300 | 3 | ||
| 2245–3794 | 5 | >300 | 4 |
Figure 2Maps of landslide causal factors. (a) Lithology. (b) Distance to roads. (c) Seismic intensity. (d) Distance to faults. (e) Rainfall. (f) Plan curvature. (g) Slope. (h) Aspect. (i) Profile curvature. (j) Distance to rivers. (k) Elevation. (l) Landform, generated by Arcgis version 10.2 in Windows (https://developers.arcgis.com).
Figure 3Training data and validation data. (a) Training data. (b) Validation data, generated by Arcgis version 10.2 in Windows (https://developers.arcgis.com).
Figure 4Outline of RSLMT after Pham et al.[25], 2018 with modifications, which has been authorized by Elsevier.
Multicollinearity of the causal factors.
| Landslide causal factors | Multicollinearity statistics | |
|---|---|---|
| Tolerance | VIF | |
| Rainfall | 0.896 | 1.116 |
| Seismic intensity | 0.885 | 1.130 |
| Lithology | 0.846 | 1.182 |
| Landform | 0.259 | 3.860 |
| Distance to faults | 0.962 | 1.039 |
| Distance to roads | 0.736 | 1.359 |
| Elevation | 0.257 | 3.891 |
| Plan curvature | 0.981 | 1.019 |
| Slope | 0.853 | 1.172 |
| Aspect | 0.970 | 1.031 |
The calculated parameters of algorithms utilized in this study.
| Algorithm | Parameters | |||||
|---|---|---|---|---|---|---|
| RSLMT | Minimum subspace | 0.5 | Seed | 1 | Iteration | 8 |
| Execution slots | 1 | Instances in node | 21 | LogitBoost iterations | 7 | |
| NB | / | |||||
| LR | Maximum number of iterations | 8 | Ridge value in the log-likelihood | 10−8 | ||
| LMT | Minimum of instances in node | 15 | LogitBoost iterations | 3 | Weight trimming value | 0.0 |
Figure 5AUC of the models. (a) Training data. (b) Validation data.
Performance of models using training and validation data.
| Statistic index | RSLMT | NB | LR | LMT | ||||
|---|---|---|---|---|---|---|---|---|
| T | V | T | V | T | V | T | V | |
| Accuracy | 0.738 | 0.697 | 0.703 | 0.686 | 0.716 | 0.694 | 0.727 | 0.674 |
| Precision | 0.715 | 0.639 | 0.682 | 0.630 | 0.714 | 0.653 | 0.713 | 0.625 |
| Recall | 0.826 | 0.801 | 0.808 | 0.790 | 0.760 | 0.729 | 0.800 | 0.746 |
| Specificity | 0.641 | 0.606 | 0.589 | 0.596 | 0.667 | 0.663 | 0.648 | 0.611 |
| F-measure | 0.766 | 0.711 | 0.740 | 0.701 | 0.736 | 0.689 | 0.754 | 0.680 |
T = training data; V = validation data.
The RPS and RPSS values of the models.
| Model |
|
|
|
|---|---|---|---|
| RSLMT | 0.196 | 0.286 | 0.315 |
| NB | 0.231 | 0.286 | 0.192 |
| LMT | 0.207 | 0.286 | 0.276 |
| LR | 0.203 | 0.286 | 0.290 |
Performance of the RSLMT model compared to other models using Chi-Square test.
| Comparative pairs | Chi-square values | |
|---|---|---|
| RSLMT vs. NB | 604.063 | 0 |
| RSLMT vs. LR | 539.001 | 0 |
| RSLMT vs. LMT | 543.939 | 0 |
Figure 6Landslide susceptibility map in Qingchuan county using the RSLMT model, generated by Arcgis version 10.2 in Windows (https://developers.arcgis.com).
Relative landslide density of each class in LSM.
| Classes | Percentage of area (%) | Percentage of landslide points (%) | Relative landslide density |
|---|---|---|---|
| VLS | 18.35 | 0.47 | 0.0256 |
| LS | 20.11 | 5.78 | 0.2874 |
| MS | 24.98 | 20.16 | 0.8070 |
| HS | 15.32 | 21.56 | 1.4073 |
| VHS | 21.24 | 52.03 | 2.4496 |