| Literature DB >> 34084916 |
Sri Yulianto Joko Prasetyo1, Kristoko Dwi Hartomo1, Mila Chrismawati Paseleng1.
Abstract
This study aims to develop a software framework for predicting aridity using vegetation indices (VI) from LANDSAT 8 OLI images. VI data are predicted using machine learning (ml): Random Forest (RF) and Correlation and Regression Trees (CART). Comparison of prediction using Artificial Neural Network (ANN), Support Vector Machine (SVM), k-nearest neighbors (k-nn) and Multivariate Adaptive Regression Spline (MARS). Prediction results are interpolated using Inverse Distance Weight (IDW). This study was conducted in stages: (1) Image preprocessing; (2) calculating numerical data extracted from the LANDSAT band imagery using vegetation indices; (3) analyzing correlation coefficients between VI; (4) prediction using RF and CART; (5) comparing performances between RF and CART using ANN, SVM, k-nn, and MARS; (6) testing the accuracy of prediction using Mean Square Error (MSE) and Mean Absolute Percentage Error (MAPE); (7) interpolating with IDW. Correlation coefficient of VI data shows a positive correlation, the lowest r (0.07) and the highest r (0.98). The experiments show that the RF and CART algorithms have efficiency and effectivity in determining the aridity areas better than the ANN, SVM, k-nn, and MARS algorithm. RF has a difference between the predicted results and 1.04% survey data MAPE and the smallest value close to zero is 0.05 MSE. CART has a difference between the predicted results and 1.05% survey data MAPE and the smallest value approaching to zero which is 0.05 MSE. The prediction results of VI show that in 2020 most of the study areas were low vegetation areas with the Normalized Difference Vegetation Index (NDVI) < 0.21, had an indication of drought with the Vegetation Health Index (VHI) < 31.10, had a Vegetation Condition Index (VCI) in some areas between 35%-50% (moderate drought) and < 35% (high drought). The Burn Area Index (dBAI) values are between -3, 971 and -2,376 that show the areas have a low fire risk, and index values are between -0, 208 and -0,412 that show the areas are starting vegetation growth. The result of this study shows that the machine learning algorithms is an accurate and stable algorithm in predicting the risks of drought and land fire based on the VI data extracted from the LANDSAT 8 OLL imagery. The VI data contain the record of vegetation condition and its environment, including humidity, temperatures, and the environmental vegetation health. ©2021 Prasetyo et al.Entities:
Keywords: Aridity; Machine learning; Remote sensing; Vegetation indices
Year: 2021 PMID: 34084916 PMCID: PMC8157165 DOI: 10.7717/peerj-cs.415
Source DB: PubMed Journal: PeerJ Comput Sci ISSN: 2376-5992
The VI in the proposed framework.
| VI | Equation | Description | Ref. |
|---|---|---|---|
| NDVI is a vegetation index that uses a combination of NIR and RED spectrum reflectance to measure vegetation greenness. Chlorophyll and carotenoid elements of a plant absorb more red spectrum with | | ||
| VCI is used as an indicator to identify changes in vegetation from bad condition to good condition. NDVImin and NDVImax are the mean values of seasonal NDVI. NDVI represents the dynamics of seasonal vegetation, while VCI reflects relative changes in humidity conditions from very bad to optimal condition. | | ||
| TCI is an indicator of drought that gives an indication of stress on vegetation caused by high temperature and excessive humidity. | | ||
| VHI is a vegetation index which is a combination of VCI and TCI. VCI represents humidity in the vegetation canopy which absorbs visible light and NIR spectra. TCI represents the air temperature in the vegetation canopy which absorbs the IR spectrum. VHI is a vegetation index that represents the level of air temperature (TCI) and soil moisture (VCI). A lower VHI value indicates a high drought while a higher VHI value indicates a wet or no drought condition. | | ||
| Normalized Burn Ratio (NBR) is the ratio between NIR and SWIR with both | | ||
| BAI is a special index to distinguish areas affected by fire. This index is calculated from the spectral distance from each pixel to the reference spectral point, where the newly burned area tends to coalesce. | |
The class vegetation status of NDVI (A) and class of indicator VHI, TCI, VCI (B) (Hashim, Abd Latif & Adnan, 2019).
| Vegetation Class | Description | NDVI Value |
|---|---|---|
| Non Vegetation | Barren areas, build up area, road network | −1 to 0.199 |
| Low Vegetation | Shrub and grassland | 0.2 to 0.5 |
| High Vegetation | Temperate and Tropical urban forest | 0.501 to 1.0 |
Band name, bandwidth and resolution of LANDSAT 8 OLI.
| Band | Bandwidth | Resolution |
|---|---|---|
| Band 1 Coastal | 0.43–0.45 | 30 |
| Band 2 Blue | 0.45–0.51 | 30 |
| Band 3 Green | 0.53–0.59 | 30 |
| Band 4 Red | 0.64–0.67 | 30 |
| Band 5 NIR | 0.85–0.88 | 30 |
| Band 6 SWIR 1 | 1.57–1. 65 | 30 |
| Band 7 SWIR 2 | 2.11–2.29 | 30 |
| Band 8 Pan | 0.50–0.68 | 15 |
| Band 9 Cirus | 1.36–1.68 | 30 |
| Band 10 TIRS 1 | 10.6–10. 19 | 100 |
| Band 11 TIRS 2 | 11.5–12.51 | 100 |
Figure 1The land cover study area of Central Java Province of Indonesia.
Figure 2The proposed framework in the study.
Figure 3Analysis of the Vegetation Indices Association in 2018 (A) and 2019 (B).
Prediction of vegetation index using the RF algorithm and the CART algorithm in 2019–2020.
| VI | VI | MAPE | MSE | |||
|---|---|---|---|---|---|---|
| VI | Min | Max | Min | Max | ||
| NDVI | 0.389 | 0.597 | 0.049 | 0.405 | 0.500 | 0.071 |
| TCI | 22.359 | 78.912 | 22.359 | 78.912 | 0.000 | 0.000 |
| VCI | 13.678 | 43.638 | 10.023 | 57.037 | 23.492 | 23.736 |
| VHI | 21.375 | 52.375 | 17.740 | 59.987 | 14.121 | 3.954 |
| BAI | 21.261 | 30.193 | 21.215 | 30.210 | 0.056 | 0.000 |
| NBR | 0.564 | 0.716 | 0.552 | 0.749 | 4.406 | 0.000 |
| NDVI | 0.426 | 0.583 | 0.246 | 0.583 | 0.000 | 0.000 |
| TCI | 30.387 | 71.577 | 84.639 | 71.557 | 0.028 | 0.000 |
| VCI | 14.010 | 37.593 | 14.010 | 38.000 | 0.003 | 0.166 |
| VHI | 22.503 | 51.655 | 22.503 | 52.000 | 0.019 | 0.119 |
| BAI | 21.938 | 28.755 | 21.938 | 29.000 | 0.003 | 0.060 |
| NBR | 0.557 | 0.706 | 0.577 | 0.658 | 7.295 | 0.002 |
(A) Prediction of vegetation index 2019–2020 using the ANN algorithm and the (B) knn algorithm.
| VI | VI | MAPE | MSE | |||
|---|---|---|---|---|---|---|
| VI | Min | Max | Min | Max | ||
| NDVI | 0.331 | 0.602 | 0.348 | 0.615 | 2.114 | 0.000 |
| TCI | 27.465 | 76.772 | 22.359 | 79.531 | 3.469 | 7.612 |
| VCI | 13.210 | 55.443 | 13.032 | 57.037 | 2.795 | 2.541 |
| VHI | 19.801 | 40.861 | 17.740 | 41.711 | 11.893 | 0.723 |
| BAI | 20.979 | 30.492 | 21.214 | 30.210 | 0.933 | 0.080 |
| NBR | 0.555 | 0.695 | 0.552 | 0.749 | 7.210 | 0.003 |
| NDVI | 0.396 | 0.613 | 0.348 | 0.615 | 0.325 | 0.002 |
| TCI | 26.217 | 77.847 | 22.359 | 79.351 | 1.895 | 14.884 |
| VCI | 13.098 | 41.098 | 13.023 | 57.036 | 27.944 | 0.006 |
| VHI | 18.591 | 53.441 | 17.740 | 59.591 | 10.320 | 0.724 |
| BAI | 21.289 | 30.012 | 21.220 | 30.012 | 0.000 | 0.005 |
| NBR | 0.555 | 0.734 | 0.552 | 0.749 | 2.003 | 0.002 |
Prediction of vegetation index using SVM Algorithm (A) and MARS Algorithm (B) in 2019–2020.
| VI | VI | MAPE | MSE | |||
|---|---|---|---|---|---|---|
| VI | Min | Max | Min | Max | ||
| NDVI | 0.351 | 0.617 | 0.040 | 0.504 | 22.421 | 0.045 |
| TCI | 23.569 | 78.495 | 22.358 | 79.351 | 1.079 | 0.732 |
| VCI | 13.276 | 56.479 | 13.023 | 57.036 | 0.977 | 0.310 |
| VHI | 18.701 | 59.289 | 17.740 | 55.711 | 16.915 | 12.802 |
| BAI | 21.390 | 29.835 | 21.213 | 30.210 | 1.241 | 0.140 |
| NBR | 0.552 | 0.741 | 0.552 | 0.749 | 1.068 | 0.003 |
| NDVI | 0.359 | 0.615 | 0.049 | 0.405 | 21.782 | 0.044 |
| TCI | 30.387 | 71.557 | 22.359 | 75.912 | 9.321 | 18.966 |
| VCI | 13.023 | 57.037 | 13.023 | 57.037 | 0.000 | 0.000 |
| VHI | 17.740 | 59.987 | 17.740 | 59.987 | 0.000 | 0.000 |
| BAI | 21.261 | 30.193 | 21.215 | 30.210 | 0.056 | 0.000 |
| NBR | 0.551 | 0.726 | 0.552 | 0.749 | 3.071 | 0.001 |
Average MAPE and MSE on the Machine Learning Algorithm Test.
| Algorithms | MAPE (%) | MSE |
|---|---|---|
| ANN | 5.70 | 2.19 |
| k-nn | 6.07 | 2.07 |
| SVM | 6.28 | 2.01 |
| RF | 1.04 | 0.05 |
| CART | 1.05 | 0.05 |
| MARS | 5.34 | 4.15 |
Figure 4Spatial prediction using IDW in 2019–2020 of NDVI vegetation index (A) and spatial prediction of VHI vegetation index (B).
Figure 5Spatial prediction in 2019–2020 using the IDW of TCI vegetation index (A) and the VCI vegetation index (B)
Figure 6Spatial prediction of the BAI vegetation index (A) and the NBR vegetation index in 2019–2020 using IDW (B).
(A) Prediction of BAI values with Machine Learning algorithm and (B) ΔNBR values with Machine Learning algorithm.
| Algorithms | Min | Average | Max |
|---|---|---|---|
| ΔANN | −11.787 | 0.074 | 3.845 |
| Δk-nn | −6.037 | −0.016 | 6.742 |
| ΔSVM | −3.617 | −0.242 | 2.506 |
| ΔRF | −2.647 | −0.272 | 5.208 |
| ΔCART | −3.067 | −0.283 | 2.376 |
| ΔMARS | −6.923 | −0.283 | 5.028 |
| ΔBAI1819 | −3.971 | −0.216 | 2.376 |
| ΔANN | −0.426 | −0.362 | −0.204 |
| Δk-nn | −0.402 | −0.302 | −0.212 |
| ΔSVM | −0.410 | −0.301 | −0.210 |
| ΔRF | −0.381 | -0302 | −0.237 |
| ΔCART | −0.410 | −0.303 | −0.244 |
| ΔMARS | −0.397 | −0.302 | −0.236 |
| ΔNBR1819 | −0.412 | −0.303 | −0.208 |