| Literature DB >> 34199102 |
Iman Salehi Hikouei1, S Sonny Kim2, Deepak R Mishra3.
Abstract
Remotely sensed data from both in situ and satellite platforms in visible, near-infrared, and shortwave infrared (VNIR-SWIR, 400-2500 nm) regions have been widely used to characterize and model soil properties in a direct, cost-effective, and rapid manner at different scales. In this study, we assess the performance of machine-learning algorithms including random forest (RF), extreme gradient boosting machines (XGBoost), and support vector machines (SVM) to model salt marsh soil bulk density using multispectral remote-sensing data from the Landsat-7 Enhanced Thematic Mapper Plus (ETM+) platform. To our knowledge, use of remote-sensing data for estimating salt marsh soil bulk density at the vegetation rooting zone has not been investigated before. Our study reveals that blue (band 1; 450-520 nm) and NIR (band 4; 770-900 nm) bands of Landsat-7 ETM+ ranked as the most important spectral features for bulk density prediction by XGBoost and RF, respectively. According to XGBoost, band 1 and band 4 had relative importance of around 41% and 39%, respectively. We tested two soil bulk density classes in order to differentiate salt marshes in terms of their capability to support vegetation that grows in either low (0.032 to 0.752 g/cm3) or high (0.752 g/cm3 to 1.893 g/cm3) bulk density areas. XGBoost produced a higher classification accuracy (88%) compared to RF (87%) and SVM (86%), although discrepancies in accuracy between these models were small (<2%). XGBoost correctly classified 178 out of 186 soil samples labeled as low bulk density and 37 out of 62 soil samples labeled as high bulk density. We conclude that remote-sensing-based machine-learning models can be a valuable tool for ecologists and engineers to map the soil bulk density in wetlands to select suitable sites for effective restoration and successful re-establishment practices.Entities:
Keywords: Landsat-7 (ETM+); XGBoost; coastal wetlands; machine learning; random forest; soil characterization
Mesh:
Substances:
Year: 2021 PMID: 34199102 PMCID: PMC8271383 DOI: 10.3390/s21134408
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Flowchart illustrating the steps for soil bulk density prediction using Landsat-7 ETM+ data.
General statistical description of the dataset.
| Data Source | Sampling Date | Number of Samples | Minimum | Maximum | Average | Standard Deviation |
|---|---|---|---|---|---|---|
| Our Survey | 2018 | 24 | 0.17 g/cm3 | 1.66 g/cm3 | 0.78 g/cm3 | 0.51 g/cm3 |
| CCRCN | 2007–2013–2016–2018 | 622 | 0.18 g/cm3 | 1.56 g/cm3 | 0.62 g/cm3 | 0.43 g/cm3 |
| GCE-LTER | 2000–2009–2011 | 346 | 0.11 g/cm3 | 1.89 g/cm3 | 0.59 g/cm3 | 0.54 g/cm3 |
Figure 2Bulk density and salt marsh vegetation types.
Figure 3Relative importance of Landsat-7 bands for modeling bulk density.
Figure 4An example of the decision tree for bulk density classification.
Figure 5Relative importance of LandSat-7 bands as well as vegetation indices for modeling bulk density.
SVM, RF, and XGboost models’ assessment results.
| Models | Class | Recall | Precision | Mean Recall | Mean Precision | Accuracy |
|---|---|---|---|---|---|---|
| SVM | Low BD | 0.96 | 0.87 | 0.78 | 0.84 | 0.86 |
| High BD | 0.60 | 0.82 | ||||
| RF | Low BD | 0.88 | 0.96 | 0.85 | 0.79 | 0.87 |
| High BD | 0.83 | 0.62 | ||||
| XGBoost | Low BD | 0.96 | 0.88 | 0.78 | 0.86 | 0.88 |
| High BD | 0.61 | 0.84 |
Confusion matrix corresponding to the machine-learning algorithms.
| SVM | |||
| True | |||
| Predicted | Low BD | High BD | |
| Low BD | 178 | 25 | |
| High BD | 8 | 37 | |
| RF | |||
| True | |||
| Predicted | Low BD | High BD | |
| Low BD | 179 | 25 | |
| High BD | 7 | 38 | |
| XGBoost | |||
| True | |||
| Predicted | Low BD | High BD | |
| Low BD | 178 | 24 | |
| High BD | 8 | 39 | |
Figure 6XGBoost classification error vs. the number of iterations.
Figure 7Learning curves on training and test datasets by (a) SVM and (b) XGBoost algorithms.