| Literature DB >> 30845748 |
Paulo Amador Tavares1, Norma Ely Santos Beltrão2, Ulisses Silva Guimarães3, Ana Cláudia Teodoro4.
Abstract
In tropical regions, such as in the Amazon, the use of optical sensors is limited by high cloud coverage throughout the year. As an alternative, Synthetic Aperture Radar (SAR) products could be used, alone or in combination with optical images, to monitor tropical areas. In this sense, we aimed to select the best Land Use and Land Cover (LULC) classification approach for tropical regions using Sentinel family products. We choose the city of Belém, Brazil, as the study area. Images of close dates from Sentinel-1 (S-1) and Sentinel-2 (S-2) were selected, preprocessed, segmented, and integrated to develop a machine learning LULC classification through a Random Forest (RF) classifier. We also combined textural image analysis (S-1) and vegetation indexes (S-2). A total of six LULC classifications were made. Results showed that the best overall accuracy (OA) was found for the integration of S-1 and S-2 (91.07%) data, followed by S-2 only (89.53%), and S-2 with radiometric indexes (89.45%). The worse result was for S-1 data only (56.01). For our analysis the integration of optical products in the stacking increased de OA in all classifications. However, we suggest the development of more investigations with S-1 products due to its importance for tropical regions.Entities:
Keywords: machine learning; optical data; radar data; random forest; spatial analysis; urban land cover
Year: 2019 PMID: 30845748 PMCID: PMC6427458 DOI: 10.3390/s19051140
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Study area in the municipality of Belém, state of Pará, Brazil *. * Where (A) is the location in the Brazilian territory of the S-1 and S-2 scenes used; (B) shows the relative tracks of the S-1 scenes and S-2 tile used; and (C) is an RGB composition of the S-2 scene where it is illustrated the complexity of the tropical coastal environment chosen (water bodies, different types of vegetation (dense, lowlands, and mangrove), impervious areas, among other ecosystems).
Main attributes from the Synthetic Aperture Radar (SAR) dataset.
|
|
| Since 03/04/2014-current |
|
| 21/07/2017 |
|
| 693 km |
| 250 km | ||
|
| 98.18° |
| 3 | ||
|
| C-band (3.75–7.5 cm) |
| 29.1–46.0° | ||
|
| Dual (VV + VH) |
| 5 × 20 m (single look) | ||
|
| Six days |
| 2.3 × 17.4 m |
Main attributes of Planet Labs imagery and the scene selected.
|
| Daily | |
|
| 24.6 km × 16.4 km | |
|
| 475 (~98° inclination) | |
|
| 0% | |
|
| 12 bits | |
|
| Daily imagery since 14/02/2017-current | |
|
| 28/07/2017 | |
|
|
|
|
| Blue | 455–515 | 3 |
| Green | 500–590 | 3 |
| Red | 590–670 | 3 |
| NIR | 780–860 | 3 |
Figure 2Process flowchart for Random Forest (RF) classification of Land Use and Land Cover (LULC) classes using S-1 and S-2 integration and validation with Planet Labs imagery.
Textural analysis and radiometric indexes employed in this research.
| S-2 Indexes | References | |
|---|---|---|
| Index Applied | Equation * | |
| NDVI |
| [ |
| NDWI |
| [ |
| SAVI |
| [ |
|
| ||
| Mean |
| [ |
| Variance |
| |
| Correlation |
| |
* Where NIR is the near infrared band, 842 nm for S-2, for NDVI, NDWI, and SAVI; Red is 665 nm for S-2 for NDVI and SAVI; MIR (Medium Infrared) is 2190 nm for S-2, for NDWI; P(i,j) is a normalized gray-tone spatial dependence matrix such that SUM(i,j = 0, N − 1) (P(i,j)) = 1; i and j represent the rows and columns, respectively, for the measures of Mean, Variance and Correlation; μ is the mean, for the Variance textural measure; and N is the number of distinct grey levels in the quantized image; μx. μy, σx, and σy are the means and standard deviations of px and py, respectively, for the correlation textural measure.
Keys of interpretation to recognize the different LULC with S-1 and S-2 colored compositions.
| Classes | Class Code | S-1 (R: VV; G: VH; B: VV/VH) * | S-2 (R: B4; G: B3; B: B2) * | S-2 (R: B12; G: B8; B: B4) * |
|---|---|---|---|---|
| Agriculture | C1 |
|
|
|
| Airport | C2 |
|
|
|
| Bare Soil | C3 |
|
|
|
| Beach | C4 |
|
|
|
| Built-up | C5 |
|
|
|
| Grassland | C6 |
|
|
|
| Highway | C7 |
|
|
|
| Mining | C8 |
|
|
|
| Primary Vegetation | C9 |
|
|
|
| Urban Vegetation | C10 |
|
|
|
| Water with Sediments | C11 |
|
|
|
| Water without Sediments | C12 |
|
|
|
* The drawn polygons were the ones produced in the segmentation and classified for each class to perform the LULC classification.
Figure 3Backscattering coefficient by LULC class analyzed of VV and VH polarizations of the processed S-1 product *. * The names of the classes respect the class code of the interpretation key as follows. C1 = Agriculture; C2 = Airport; C3 = Bare Soil; C4 = Beach area; C5 = Built-up; C6 = Grassland; C7 = Highway; C8 = Mining; C9 = Primary vegetation; C10 = Urban vegetation; C11 = Water with sediments; C12 = Water without sediments.
Figure 4Wavelengths analysis by LULC class of S-2 bands.
JM and TD * variability results for the similarity of each class. Values in blue represents the variability for S-1 and values in green is for the S-2 *#.
| C1 | C2 | C3 | C4 | C5 | C6 | C7 | C8 | C9 | C10 | C11 | C12 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| - | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
|
| 0.00 | - | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
|
| 0.00 | 1.85 | - | 0.00 | 1.73 | 1.92 | 1.73 | 0.00 | 2.00 | 1.84 | 2.00 | 2.00 |
|
| 0.00 | 1.52 | 0.83 | - | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
|
| 0.00 | 1.94 | 0.86 | 1.50 | - | 0.00 | 1.53 | 0.00 | 2.00 | 1.95 | 2.00 | 2.00 |
|
| 0.00 | 1.91 | 0.51 | 1.19 | 0.99 | - | 1.99 | 0.00 | 1.98 | 1.67 | 2.00 | 2.00 |
|
| 0.00 | 1.91 | 0.54 | 1.27 | 0.10 | 0.73 | - | 0.00 | 2.00 | 1.92 | 2.00 | 2.00 |
|
| 0.00 | 1.96 | 0.74 | 1.12 | 1.52 | 0.64 | 1.33 | - | 0.00 | 0.00 | 0.00 | 0.00 |
|
| 0.00 | 2.00 | 0.92 | 1.76 | 1.19 | 0.28 | 0.99 | 1.20 | - | 1.78 | 2.00 | 2.00 |
|
| 0.00 | 1.98 | 0.52 | 1.49 | 0.46 | 0.69 | 0.30 | 1.29 | 0.72 | - | 2.00 | 2.00 |
|
| 0.00 | 1.33 | 1.30 | 0.43 | 1.70 | 1.45 | 1.55 | 1.31 | 1.83 | 1.79 | - | 0.18 |
|
| 0.00 | 1.46 | 0.98 | 0.41 | 1.53 | 0.94 | 1.33 | 0.77 | 1.36 | 1.47 | 1.99 | - |
* Where the values range from 0 to 2, the higher the value, the better the chance of the class separating into the LULC classification. # The classes are represented as follows. C1 = Agriculture; C2 = Airport; C3 = Bare Soil; C4 = Beach area; C5 = Built-up; C6 = Grassland; C7 = Highway; C8 = Mining; C9 = Primary vegetation; C10 = Urban vegetation; C11 = Water with sediments; C12 = Water without sediments.
Figure 5LULC classification maps produced by RFs *#. * Where (A) is for S-1 only, (B) is S-2 only, (C) is S-1 with textures, (D) is S-2 with indexes, (E) is S-1 with S-2, and (F) is for all bands. # The classes are represented as follows. C1 = Agriculture; C2 = Airport; C3 = Bare Soil; C4 = Beach area; C5 = Built-up; C6 = Grassland; C7 = Highway; C8 = Mining; C9 = Primary vegetation; C10 = Urban vegetation; C11 = Water with sediments; C12 = Water without sediments.
Figure 6Bands contribution for RF classification *. * Where (A) is the band’s contribution for the S-1 only, (B) is for S-2 only, (C) is S-1 with textures, (D) is for S-2 with indexes, (E) is S-1 with S-2, and (F) is all bands.
PA and UA for each class in the different types of RF classifications produced *.
| Class Code | S-1 Only | S-2 Only | S-1 with Textures | S-2 with Indexes | S-1 with S-2 | All Bands | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PA (%) | UA (%) | PA (%) | UA (%) | PA (%) | UA (%) | PA (%) | UA (%) | PA (%) | UA (%) | PA (%) | UA (%) | |
|
| 50.0 | 7.7 | 100.0 | 18.2 | 0.0 | 0.0 | 100.0 | 20.0 | 100.0 | 20.0 | 50.0 | 20.0 |
|
| 60.0 | 16.7 | 80.0 | 66.7 | 40.0 | 25.0 | 80.0 | 80.0 | 100.0 | 50.0 | 80.0 | 57.1 |
|
| 31.2 | 23.8 | 56.3 | 64.3 | 25.0 | 21.1 | 56.3 | 69.2 | 65.6 | 75.0 | 46.9 | 48.4 |
|
| 40.0 | 21.1 | 70.0 | 77.8 | 80.0 | 47.1 | 80.0 | 100.0 | 80.0 | 88.9 | 40.0 | 66.7 |
|
| 41.8 | 42.2 | 75.5 | 85.1 | 53.1 | 51.5 | 79.6 | 86.7 | 80.6 | 94.1 | 71.4 | 88.6 |
|
| 27.6 | 10.3 | 58.6 | 60.7 | 41.4 | 16.2 | 51.7 | 55.6 | 58.6 | 63.0 | 55.2 | 48.5 |
|
| 12.1 | 9.3 | 75.8 | 59.5 | 15.2 | 11.9 | 78.8 | 59.1 | 78.8 | 70.3 | 69.7 | 46.0 |
|
| 20.0 | 4.0 | 100.0 | 55.6 | 0.0 | 0.0 | 100.0 | 50.0 | 100.0 | 50.0 | 100.0 | 62.5 |
|
| 46.9 | 65.7 | 89.2 | 94.6 | 60.7 | 74.3 | 88.8 | 94.3 | 90.3 | 94.0 | 88.8 | 93.1 |
|
| 16.7 | 18.1 | 68.6 | 64.2 | 18.6 | 21.6 | 66.7 | 62.4 | 73.5 | 70.8 | 60.8 | 60.2 |
|
| 41.4 | 8.6 | 93.1 | 81.8 | 48.3 | 9.4 | 93.1 | 77.1 | 93.1 | 75.0 | 79.3 | 67.7 |
|
| 75.3 | 98.5 | 99.5 | 99.7 | 77.2 | 98.7 | 99.2 | 99.7 | 99.5 | 99.7 | 99.0 | 98.7 |
* The classes are represented as follows. C1 = Agriculture; C2 = Airport; C3 = Bare Soil; C4 = Beach area; C5 = Built-up; C6 = Grassland; C7 = Highway; C8 = Mining; C9 = Primary vegetation; C10 = Urban vegetation; C11 = Water without sediments; C12 = Water with sediments.
OA and Kappa coefficient for each classifier ranked in order of accuracy.
| Data Combination | Overall Accuracy (%) | Kappa Coefficient | Rank |
|---|---|---|---|
| S-1 with S-2 | 91.07 | 0.8709 | 1 |
| S-2 Only | 89.53 | 0.8487 | 2 |
| S-2 with Indexes | 89.45 | 0.8476 | 3 |
| All | 87.09 | 0.8132 | 4 |
| S-1 with Textures | 61.61 | 0.4870 | 5 |
| S-1 Only | 56.01 | 0.4194 | 6 |