| Literature DB >> 31936038 |
Alireza Arabameri1, Thomas Blaschke2, Biswajeet Pradhan3,4, Hamid Reza Pourghasemi5, John P Tiefenbacher6, Dieu Tien Bui7.
Abstract
Gully erosion is a problem; therefore, it must be predicted using highly accurate predictive models to avoid losses caused by gully development and to guarantee sustainable development. This research investigates the predictive performance of seven multiple-criteria decision-making (MCDM), statistical, and machine learning (ML)-based models and their ensembles for gully erosion susceptibility mapping (GESM). A case study of the Dasjard River watershed, Iran uses a database of 306 gully head cuts and 15 conditioning factors. The database was divided 70:30 to train and verify the models. Their performance was assessed with the area under prediction rate curve (AUPRC), the area under success rate curve (AUSRC), accuracy, and kappa. Results show that slope is key to gully formation. The maximum entropy (ME) ML model has the best performance (AUSRC = 0.947, AUPRC = 0.948, accuracy = 0.849 and kappa = 0.699). The second best is the random forest (RF) model (AUSRC = 0.965, AUPRC = 0.932, accuracy = 0.812 and kappa = 0.624). By contrast, the TOPSIS (Technique for Order Preference by Similarity to Ideal Solution) model was the least effective (AUSRC = 0.871, AUPRC = 0.867, accuracy = 0.758 and kappa = 0.516). RF increased the performance of statistical index (SI) and frequency ratio (FR) statistical models. Furthermore, the combination of a generalized linear model (GLM), and functional data analysis (FDA) improved their performances. The results demonstrate that a combination of geographic information systems (GIS) with remote sensing (RS)-based ML models can successfully map gully erosion susceptibility, particularly in low-income and developing regions. This method can aid the analyses and decisions of natural resources managers and local planners to reduce damages by focusing attention and resources on areas prone to the worst and most damaging gully erosion.Entities:
Keywords: GIS; Iran; ensemble; gully erosion; hybrid model; soft computing
Year: 2020 PMID: 31936038 PMCID: PMC7014250 DOI: 10.3390/s20020335
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Location of study area in Semnan province and Iran, and location of training and validation gullies in the study area.
Figure 2Sample of gullies in the study area.
Classes and classification method for the various thematic data layers.
| No. | Factor | Classes | Classification Method | References |
|---|---|---|---|---|
| 1 | Elevation (m) | 1. <1005, 2. 1005–1154, 33. 1154–1319, 4. 1319–1530, 5. 1530–1835, 6. >1835 | Natural break | [ |
| 2 | Slope (°) | 1. <5, 2. 5–10, 3. 10–15, 4. 15–20, 5. 20–30, 6. >30 | Manual | [ |
| 3 | Plan curvature (m−1) | 1. Concave, 2. Flat, 3. Convex | Manual | [ |
| 4 | TWI | 1. <5.84, 2. 5.84–8.18, 3. 8.18–11.69, 4. >11.69 | Natural break | [ |
| 5 | CI | 1. <–53.7, 2. −53.7–−16, 3. −16–17.6, 4. 17.6–53.7, 5. >53.7 | Natural break | [ |
| 6 | TRI (m) | 1. <1.97, 2. 1.97–5.63, 3. 5.63–11.27, 4. 11.27–20.86, 5. >20.86 | Natural break | [ |
| 7 | TPI | 1. <−10.26, 2. −10.26–−2.85, 3. −2.85–2.28, 4. 2.28–11.4, 5. > 11.4 | Natural break | [ |
| 8 | Distance to river (m) | 1. <100, 2. 100–200, 3. 200–300, 4. 300–400, 5. >400 | Manual | [ |
| 9 | Drainage density (km/km2) | 1. < 1.25, 2. 1.25–1.79, 3. 1.79–2.26, 4. >2.26 | Natural break | [ |
| 10 | Distance to road (m) | 1. <500, 2. 500–1000, 3. 1000–1500, 4. 1500–2000, 5. >2000 | Manual | [ |
| 11 | NDVI | 1. <−0.04, 2. −0.04–0.12, 3. >0.12 | Natural break | [ |
| 12 | Rainfall | 1. <114.05, 2.114.05–132.8, 3. 132.8–155.7, 4. 155.7–182.9, 5. <182.9 | Natural break | [ |
| 13 | Soil | 1. Rock Outcrops/Entisols, 2. Aridisols, 3. Entisols/Aridisols | Soil type | |
| 14 | LULC | 1. Abkhan, 2. Agriculture, 3. Bareland, 4. Rangeland, 5. Rock, 6. Urban | Land use type | |
| 15 | Lithology | 1. A, 2. B, 3. C, 4. D, 5. E, 6. F, 7. G, 8. H | Lithology type |
Figure 3Gully erosion conditioning factors. (a) elevation, (b) slope, (c) plan curvature (PC), (d) topography wetness index (TWI), (e) convergence index (CI), (f) Terrain Ruggedness Index (TRI), (g) topography position index (TPI), (h) distance to stream, (i) drainage density, (j) distance to road, (k) Normalized Difference Vegetation Index (NDVI), (l) rainfall, (m) soil type, (n) land use/land cover (LU/LC), (o) lithology.
Multi-collinearity analysis among gully erosion conditioning factors.
| * Factors | Unstandardized Coefficients | Standardized Coefficients | t | Sig. | Collinearity Statistics | ||
|---|---|---|---|---|---|---|---|
| B | Std. Error | Beta | Tolerance | VIF | |||
| (Constant) | −0.272 | 0.164 | −1.666 | 0.096 | |||
| lithology | −0.006 | 0.038 | −0.007 | −0.159 | 0.874 | 0.554 | 1.805 |
| LU/LC | 0.016 | 0.005 | 0.164 | 3.033 | 0.003 | 0.415 | 2.412 |
| Soil type | 0.083 | 0.037 | 0.104 | 2.205 | 0.028 | 0.546 | 1.831 |
| Drainage density | 0.073 | 0.029 | 0.107 | 2.517 | 0.012 | 0.672 | 1.489 |
| Rainfall | 0.002 | 0.029 | 0.005 | 0.076 | 0.939 | 0.245 | 4.078 |
| Slope | 0.143 | 0.110 | 0.092 | 1.296 | 0.196 | 0.241 | 4.153 |
| TRI | −0.079 | 0.108 | −0.055 | −0.729 | 0.466 | 0.214 | 4.668 |
| TPI | 0.048 | 0.079 | 0.027 | 0.601 | 0.548 | 0.586 | 1.706 |
| TWI | −0.082 | 0.059 | −0.055 | −1.404 | 0.161 | 0.784 | 1.276 |
| PC | 0.054 | 0.106 | 0.019 | 0.506 | 0.613 | 0.903 | 1.108 |
| NDVI | 0.102 | 0.040 | 0.107 | 2.526 | 0.012 | 0.685 | 1.459 |
| Dis to stream | 0.126 | 0.041 | 0.116 | 3.084 | 0.002 | 0.863 | 1.158 |
| Dis to road | 0.080 | 0.013 | 0.263 | 6.148 | 0.000 | 0.666 | 1.501 |
| elevation | 0.055 | 0.020 | 0.202 | 2.815 | 0.005 | 0.237 | 4.224 |
| CI | −0.070 | 0.085 | −0.029 | −0.818 | 0.414 | 0.940 | 1.063 |
* LU/LC: land use/land cover, TRI: Terrain Ruggedness Index, TPI: topography position index, TWI: topography wetness index, PC: plan curvature, NDVI: Normalized Difference Vegetation Index, CI: convergence index.
Figure 4Flowchart of research in the study area.
Spatial relationship between conditioning factors and gully locations using frequency ratio and statistical index.
| * Factor | Class | Pixels in Domain | Gullies | FR | SI | ||
|---|---|---|---|---|---|---|---|
| No | % | No | % | ||||
| Elevation (m) | <1005 | 409583 | 16.47 | 143 | 67.45 | 4.09 | 1.41 |
| 1005–1154 | 778,768 | 31.32 | 45 | 21.23 | 0.68 | −0.39 | |
| 1154–1319 | 675,092 | 27.15 | 17 | 8.02 | 0.30 | −1.22 | |
| 1319–1530 | 314,348 | 12.64 | 7 | 3.30 | 0.26 | −1.34 | |
| 1530–1835 | 290,378 | 11.68 | 0 | 0.00 | 0.00 | None | |
| >1835 | 18,056 | 0.73 | 0 | 0.00 | 0.00 | None | |
| Slope (°) | <5 | 2,018,483 | 81.19 | 205 | 96.70 | 1.19 | 0.17 |
| 5–10 | 235,497 | 9.47 | 7 | 3.30 | 0.35 | −1.05 | |
| 10–15 | 98,979 | 3.98 | 0 | 0.00 | 0.00 | None | |
| 15–20 | 46,006 | 1.85 | 0 | 0.00 | 0.00 | None | |
| 20–30 | 44,839 | 1.80 | 0 | 0.00 | 0.00 | None | |
| >30 | 42,421 | 1.71 | 0 | 0.00 | 0.00 | None | |
| PC (100/m) | Concave | 792,994 | 31.90 | 55 | 25.94 | 0.81 | −0.21 |
| Flat | 907,578 | 36.50 | 94 | 44.34 | 1.21 | 0.19 | |
| Convex | 785,652 | 31.60 | 63 | 29.72 | 0.94 | −0.06 | |
| TWI | <5.84 | 805,518 | 32.40 | 33 | 15.57 | 0.48 | −0.73 |
| 5.84–8.18 | 1,120,812 | 45.08 | 123 | 58.02 | 1.29 | 0.25 | |
| 8.18–11.69 | 408,848 | 16.44 | 38 | 17.92 | 1.09 | 0.09 | |
| >11.69 | 151,046 | 6.08 | 18 | 8.49 | 1.40 | 0.33 | |
| CI (100/m) | <−53.7 | 170,770 | 6.98 | 16 | 7.55 | 1.08 | 0.08 |
| −53.7–−16 | 614,268 | 25.10 | 44 | 20.75 | 0.83 | −0.19 | |
| −16–17.6 | 994,363 | 40.63 | 83 | 39.15 | 0.96 | −0.04 | |
| 17.6–53.7 | 535,208 | 21.87 | 54 | 25.47 | 1.16 | 0.15 | |
| >53.7 | 133,049 | 5.44 | 15 | 7.08 | 1.30 | 0.26 | |
| TRI | <1.97 | 1,995,829 | 80.28 | 207 | 97.64 | 1.22 | 0.20 |
| 1.97–5.63 | 314,118 | 12.63 | 5 | 2.36 | 0.19 | −1.68 | |
| 5.63–11.27 | 120,287 | 4.84 | 0 | 0.00 | 0.00 | None | |
| 11.27–20.86 | 45,815 | 1.84 | 0 | 0.00 | 0.00 | None | |
| >20.86 | 10,176 | 0.41 | 0 | 0.00 | 0.00 | None | |
| TPI | <−10.26 | 27,479 | 1.11 | 0 | 0.00 | 0.00 | None |
| −10.26–−2.85 | 202,970 | 8.16 | 5 | 2.36 | 0.29 | −1.24 | |
| −2.85–2.28 | 2,103,205 | 84.59 | 207 | 97.64 | 1.15 | 0.14 | |
| 2.28–11.4 | 130,891 | 5.26 | 0 | 0.00 | 0.00 | None | |
| >11.4 | 21,679 | 0.87 | 0 | 0.00 | 0.00 | None | |
| Dis to stream (m) | <100 | 881,433 | 35.45 | 117 | 55.19 | 1.56 | 0.44 |
| 100–200 | 625,868 | 25.17 | 54 | 25.47 | 1.01 | 0.01 | |
| 200–300 | 443,260 | 17.83 | 25 | 11.79 | 0.66 | −0.41 | |
| 300–400 | 224,458 | 9.03 | 10 | 4.72 | 0.52 | −0.65 | |
| Drainage density (km/km2) | <1.25 | 461689 | 18.57 | 2 | 0.94 | 0.05 | −2.98 |
| 1.25–1.79 | 746549 | 30.03 | 55 | 25.94 | 0.86 | −0.15 | |
| 1.79–2.26 | 712235 | 28.65 | 124 | 58.49 | 2.04 | 0.71 | |
| >2.26 | 565752 | 22.76 | 31 | 14.62 | 0.64 | −0.44 | |
| Dis to road (m) | <500 | 177023 | 7.12 | 32 | 15.09 | 2.12 | 0.75 |
| 500–1000 | 168791 | 6.79 | 68 | 32.08 | 4.72 | 1.55 | |
| 1000–1500 | 159125 | 6.40 | 31 | 14.62 | 2.28 | 0.83 | |
| 1500–2000 | 151080 | 6.08 | 26 | 12.26 | 2.02 | 0.70 | |
| > 2000 | 1830206 | 73.61 | 55 | 25.94 | 0.35 | −1.04 | |
| NDVI | <−0.04 | 918021 | 36.92 | 17 | 8.02 | 0.22 | −1.53 |
| −0.04–0.12 | 1541694 | 62.01 | 192 | 90.57 | 1.46 | 0.38 | |
| >0.12 | 26510 | 1.07 | 3 | 1.42 | 1.33 | 0.28 | |
| Rainfall (mm) | <114.05 | 688309 | 27.68 | 161 | 75.94 | 2.74 | 1.01 |
| 114.05–132.8 | 694011 | 27.91 | 21 | 9.91 | 0.35 | −1.04 | |
| 132.8–155.7 | 619724 | 24.93 | 23 | 10.85 | 0.44 | −0.83 | |
| 155.7–182.9 | 259107 | 10.42 | 7 | 3.30 | 0.32 | −1.15 | |
| <182.9 | 225074 | 9.05 | 0 | 0.00 | 0.00 | None | |
| Soil type | Rock Outcrops/Entisols | 1035170 | 41.64 | 12 | 5.66 | 0.14 | −2.00 |
| Aridisols | 1443591 | 58.06 | 199 | 93.87 | 1.62 | 0.48 | |
| Entisols/Aridisols | 7464 | 0.30 | 1 | 0.47 | 1.57 | 0.45 | |
| LU/LC | Abkhan | 5419 | 0.22 | 0 | 0.00 | 0.00 | None |
| Agriculture | 73837 | 2.97 | 11 | 5.19 | 1.75 | 0.56 | |
| Bareland | 124293 | 5.00 | 124 | 58.49 | 11.70 | 2.46 | |
| Rangeland | 2182822 | 87.80 | 76 | 35.85 | 0.41 | −0.90 | |
| Rock | 98132 | 3.95 | 1 | 0.47 | 0.12 | −2.12 | |
| Urban | 1722 | 0.07 | 0 | 0.00 | 0.00 | None | |
| Lithology | A | 697041 | 28.04 | 20 | 9.43 | 0.34 | −1.09 |
| B | 79375 | 3.19 | 1 | 0.47 | 0.15 | −1.91 | |
| C | 114582 | 4.61 | 3 | 1.42 | 0.31 | −1.18 | |
| D | 190539 | 7.66 | 10 | 4.72 | 0.62 | −0.49 | |
| E | 2747 | 0.11 | 0 | 0.00 | 0.00 | None | |
| F | 153705 | 6.18 | 4 | 1.89 | 0.31 | −1.19 | |
| G | 1236071 | 49.72 | 174 | 82.08 | 1.65 | 0.50 | |
| H | 12165 | 0.49 | 0 | 0.00 | 0.00 | None | |
* LU/LC: land use/land cover, TRI: Terrain Ruggedness Index, TPI: topography position index, TWI: topography wetness index, PC: plan curvature, NDVI: Normalized Difference Vegetation Index, CI: convergence index.
Figure 5Relative importance of gully erosion conditioning factors using the random forest model.
Values of resulted gully erosion maps using ten models.
| * Models | Classification with a Natural Break Model | ||||
|---|---|---|---|---|---|
| Very Low | Low | Moderate | High | Very High | |
| FR | 3.66–9.63 | 9.63–13.79 | 13.79–17.96 | 17.96–26.57 | 26.57–39.06 |
| SI | −19.3–−10.5 | −10.5–−6 | −6–−2.27 | −2.27–2.25 | 2.25–10.25 |
| RF | 0.01–0.21 | 0.21–0.37 | 0.37–0.53 | 0.53–0.72 | 0.72–1 |
| ME | 0.00–0.06 | 0.06–0.17 | 0.17–0.34 | 0.34–0.57 | 0.57–0.97 |
| GLM | 0.00–0.12 | 0.12–0.3 | 0.3–0.49 | 0.49–0.69 | 0.69–0.98 |
| FDA | 0.00–0.13 | 0.13–0.31 | 0.31–0.51 | 0.51–0.73 | 0.73–0.99 |
| TOPSIS | 0.15–0.29 | 0.29–0.38 | 0.38–0.48 | 0.48–0.61 | 0.61–0.78 |
| GLM-FDA | 0.00–0.13 | 0.13–0.31 | 0.31–0.5 | 0.5–0.71 | 0.71–0.99 |
| FR-RF | 21.72–84.7 | 84.7–131.2 | 131.2–183.2 | 183.2–268.1 | 268.1–370.8 |
| SI-RF | −190–−99.9 | −99.9–−59.3 | −59.3–−23.2 | −23.2–18.4 | 18.4–97.4 |
* FR: frequency ratio, SI: statistical index, RF: random forest, ME: maximum entropy, GLM: generalized linear model, FDA: functional data analysis, TOPSIS: Technique for order preference by similarity to ideal solution.
Figure 6Gully erosion susceptibility maps. (a) frequency ration (FR), (b) statistical index (SI), (c) random forest (RF), (d) maximum entropy (ME), (e) generalized linear model (GLM), (f) functional data analysis (FDA), (g) TOPSIS, (h) GLM–FDA, (i) FR–RF, (j) SI–RF.
Figure 7Percentage of each susceptibility classes in individual and hybrid models.
Figure 8Area under curve in individual and ensemble models. (a) training data (success rate curve), (b) validation data (prediction rate curve).
Validation of gully erosion susceptibility maps using cutoff-dependent and independent criteria.
| * Criteria | TN | FP | FN | TP | TPR | TNR | FPR | Cutoff-Dependent Criteria | Cutoff Independent Criteria | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| * Models | AUSRC | AUPRC | Aaccuracy | Kappa | ||||||||
| FR | 71 | 22 | 22 | 71 | 0.76 | 0.76 | 0.23 | 0.890 | 0.900 | 0.763 | 0.527 | |
| SI | 73 | 20 | 22 | 71 | 0.76 | 0.78 | 0.21 | 0.884 | 0.897 | 0.774 | 0.548 | |
| RF | 76 | 17 | 18 | 75 | 0.80 | 0.81 | 0.18 | 0.965 | 0.932 | 0.812 | 0.624 | |
| ME | 79 | 14 | 14 | 79 | 0.84 | 0.84 | 0.15 | 0.947 | 0.948 | 0.849 | 0.699 | |
| GLM | 74 | 19 | 18 | 75 | 0.80 | 0.79 | 0.20 | 0.869 | 0.887 | 0.801 | 0.602 | |
| FDA | 74 | 19 | 21 | 72 | 0.77 | 0.79 | 0.20 | 0.868 | 0.894 | 0.785 | 0.570 | |
| TOPSIS | 71 | 22 | 23 | 70 | 0.75 | 0.76 | 0.23 | 0.871 | 0.867 | 0.758 | 0.516 | |
| GLM-FDA | 75 | 18 | 20 | 73 | 0.78 | 0.80 | 0.19 | 0.870 | 0.891 | 0.796 | 0.591 | |
| FR-RF | 73 | 20 | 21 | 72 | 0.77 | 0.78 | 0.21 | 0.893 | 0.908 | 0.780 | 0.559 | |
| SI-RF | 71 | 22 | 21 | 72 | 0.77 | 0.76 | 0.237 | 0.889 | 0.914 | 0.769 | 0.538 | |
* FR: frequency ratio, SI: statistical index, RF: random forest, ME: maximum entropy, GLM: generalised linear model, FDA: functional data analysis, TOPSIS: Technique for order preference by similarity to ideal solution; * TN: true negative, FP: false positive, TP: true positive, FN: false negative, TNR: true negative rate, FPR: false positive rate, AUSRC: area under success rate curve, AUPRC: area under prediction rate curve.
Figure 9Values of true negative (TN), false positive (FP), true positive (TP) and false negative (FN) for each individual or ensemble model. (a) FR, (b) SI, (c) RF, (d) ME, (e) GLM, (f) FDA, (g) TOPSIS, (h) GLM–FDA, (i) FR–RF, (j) SI–RF.