| Literature DB >> 25140631 |
Timothy Dube1, Onisimo Mutanga2, Adam Elhadi3, Riyad Ismail4.
Abstract
The quantification of aboveground biomass using remote sensing is critical for better understanding the role of forests in carbon sequestration and for informed sustainable management. Although remote sensing techniques have been proven useful in assessing forest biomass in general, more is required to investigate their capabilities in predicting intra-and-inter species biomass which are mainly characterised by non-linear relationships. In this study, we tested two machine learning algorithms, Stochastic Gradient Boosting (SGB) and Random Forest (RF) regression trees to predict intra-and-inter species biomass using high resolution RapidEye reflectance bands as well as the derived vegetation indices in a commercial plantation. The results showed that the SGB algorithm yielded the best performance for intra-and-inter species biomass prediction; using all the predictor variables as well as based on the most important selected variables. For example using the most important variables the algorithm produced an R² of 0.80 and RMSE of 16.93 t·ha⁻¹ for E. grandis; R² of 0.79, RMSE of 17.27 t·ha⁻¹ for P. taeda and R² of 0.61, RMSE of 43.39 t·ha⁻¹ for the combined species data sets. Comparatively, RF yielded plausible results only for E. dunii (R² of 0.79; RMSE of 7.18 t·ha⁻¹). We demonstrated that although the two statistical methods were able to predict biomass accurately, RF produced weaker results as compared to SGB when applied to combined species dataset. The result underscores the relevance of stochastic models in predicting biomass drawn from different species and genera using the new generation high resolution RapidEye sensor with strategically positioned bands.Entities:
Mesh:
Year: 2014 PMID: 25140631 PMCID: PMC4179085 DOI: 10.3390/s140815348
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1.Location of Sappi Clan forest in the midlands of KwaZulu Natal, South Africa.
Figure 2.Typical field site showing (a) Eucalyptus spp. and (b), P. taeda in early August 2013.
Selected strategically positioned Rapideye spectral parameters and vegetation indices used for this study.
| Blue, green, red, NIR and Red-edge | - | |
|
| ||
| Simple Ratio | NIR/Red | Jordan [ |
| RVI.RE (Ratio Vegetation Index) | Red-edge/NIR | de Sousa, |
| NDVI (Normalized Difference Vegetation Index) | (NIR−Red)/(NIR + Red) | Rouse, |
| NDVI.RE | (NIR − Red-edge)/(NIR + Red-edge) | Mutanga, Adam and Cho [ |
| DVI (Difference Vegetation Index) | NIR − Red | Tucker [ |
| MSR (Modified Simple Ratio) | (NIR/Red)-1/(NIR/Red)∧°.5 + 1 | Qi, |
| MSR.RE | (NIR/Red-edge)-1/(NIR /Red-edge)∧°.5 + 1 | |
| TVI (Triangular Vegetation Index) | 0.5*[120*(NIR − Green)−200*(Red-Green)] | Broge and Leblanc [ |
| TVI.RE | 0.5*[120*(NIR − Green)−200*(Red-edge-Green)] | |
| IPVI (Perpendicular Vegetation Index) | NIR/(NIR + Red) | Crippen [ |
| IPVI.RE | NIR/(NIR + Red-edge) | |
| GI (Greenness Index) | Green/Red | Zarco-Tejada, |
| GI.RE | Green/Red-edge | |
| PSSR (Pigment specific simple ratio) | NIR/Red-edge | Blackburn [ |
Descriptive statistics of the measured above ground biomass (t·ha−1).
| 63 | 33.24 | 96.49 | 52.86 | 16.39 | |
| 65 | 106.03 | 225.07 | 170.30 | 29.94 | |
| 53 | 137.11 | 298.04 | 206.07 | 42.83 | |
| 181 | 33.24 | 298.04 | 139.89 | 72.22 |
Figure 3.One-to-one relationship between measured and predicted intra-species biomass based on (i) SGB and (ii) RF algorithms. a, b, and c represent E. grandis, E. dunii, and P. taeda based on all the predictor variables (p = 19).
Figure 4.The one-to-one relationship between measured and predicted inter-species biomass for all species data combined, based on (i) SGB and (ii) RF algorithms without variable selection.
Inter-and-intra species biomass prediction results using the most important variables selected by the two regression models SGB and RF.
| SGB (n = 5) | 3 | 0.001 | - | 3750 | 0.80 | 16.93 | |
| RF (n = 4) | - | - | 4 | 500 | 0.76 | 18.61 | |
| SGB (n = 6) | 5 | 0.001 | - | 2350 | 0.88 | 09.23 | |
| RF (n = 7) | - | - | 7 | 500 | 0.79 | 07.18 | |
| SGB (n = 5) | 5 | 0.001 | - | 800 | 0.79 | 17.27 | |
| RF (n = 6) | - | - | 6 | 2800 | 0.80 | 22.43 | |
| All species | SGB (n = 19) | 5 | 0.001 | - | 2800 | 0.61 | 43.39 |
| RF (n = 19) | - | - | 19 | 750 | 0.37 | 59.27 |
Illustrates the most important variables retained by SGB and RF after implementing variable selection.
|
|
|
|
| |||||
|---|---|---|---|---|---|---|---|---|
| 1 | RE | NIR | NIR | RE | NIR | NIR | All variables selected | |
| 2 | PSSR | RE | RE | PSSR | Green | Green | ||
| 3 | GI.RE | PSSR | Red | GI.RE | RE | RE | ||
| 4 | NIR | DVI | GI.RE | NIR | Red | Red | ||
| 5 | Green | - | Green | Green | DVI | DVI | ||
| 6 | - | - | Blue | DVI | Blue | |||
| 7 | - | - | - | Blue | - | - | ||
Figure 5.Show the optimal number of variables (spectral bands and VIs) based on the backward feature elimination search function for estimating intra-and-inter species using Random Forest (based on 1000 repetitions). In Figure 5, (a–d) represent E. grandis, E. dunii, P. taeda and inter-species dataset.
* The best number of variables with the lowest error rate is shown by the arrows and the RMSE is calculated from the training dataset.