| Literature DB >> 36236538 |
Jingzong Zhang1, Shijie Cong1, Gen Zhang1, Yongjun Ma1, Yi Zhang1, Jianping Huang1.
Abstract
Plant pests are the primary biological threats to agricultural and forestry production as well as forest ecosystem. Monitoring forest-pest damage via satellite images is crucial for the development of prevention and control strategies. Previous studies utilizing deep learning to monitor pest-infested damage in satellite imagery adopted RGB images, while multispectral imagery and vegetation indices were not used. Multispectral images and vegetation indices contain a wealth of useful information for detecting plant health, which can improve the precision of pest damage detection. The aim of the study is to further improve forest-pest infestation area segmentation by combining multispectral, vegetation indices and RGB information into deep learning. We also propose a new image segmentation method based on UNet++ with attention mechanism module for detecting forest damage induced by bark beetle and aspen leaf miner in Sentinel-2 images. The ResNeSt101 is used as the feature extraction backbone, and the attention mechanism scSE module is introduced in the decoding phase for improving the image segmentation results. We used Sentinel-2 imagery to produce a dataset based on forest health damage data gathered by the Ministry of Forests, Lands, Natural Resource Operations and Rural Development (FLNRORD) in British Columbia (BC), Canada, during aerial overview surveys (AOS) in 2020. The dataset contains the 11 original Sentinel-2 bands and 13 vegetation indices. The experimental results confirmed that the significance of vegetation indices and multispectral data in enhancing the segmentation effect. The results demonstrated that the proposed method exhibits better segmentation quality and more accurate quantitative indices with overall accuracy of 85.11%, in comparison with the state-of-the-art pest area segmentation methods.Entities:
Keywords: Sentinel-2; attention mechanism; deep learning; pest area detecting; semantic segmentation; vegetation indices
Mesh:
Year: 2022 PMID: 36236538 PMCID: PMC9570766 DOI: 10.3390/s22197440
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1The study area’s location and an illustration of the Sentinel-2 images utilized for the experiment.
The features of Sentinel-2 imagery.
| Band | Band Name | Resolution (m) |
|---|---|---|
| B1 | Coastal aerosol | 60 |
| B2 | Blue | 10 |
| B3 | Green | 10 |
| B4 | Red | 10 |
| B5 | Vegetation Red Edge 1 | 20 |
| B6 | Vegetation Red Edge 2 | 20 |
| B7 | Vegetation Red Edge 3 | 20 |
| B8 | NIR | 10 |
| B8A | Narrow NIR | 20 |
| B9 | Water vapor | 60 |
| B10 | SWIR-Cirrus | 60 |
| B11 | SWIR 1 | 20 |
| B12 | SWIR 2 | 20 |
Method for calculating vegetation indices.
| Vegetation | Calculation Method | Calculation |
|---|---|---|
| NDWI |
|
|
| DWSI |
|
|
| NGRDI |
|
|
| RDI |
|
|
| GLI |
|
|
| NDRE2 |
|
|
| PBI |
|
|
| NDVI |
|
|
| GNDVI |
|
|
| CIG |
|
|
| CVI |
|
|
| NDRE3 |
|
|
| DRS |
|
|
Figure 2The raster label corresponding to the Sentinel-2 image.
Figure 3The structure of RSPR-UNet++.
Figure 4Split-Attention Block.
Figure 5Spatial and Channel Squeeze and Excitation Block(scSE).
Other common semantic segmentation models and their characteristics.
| Model | Characteristics | Reference |
|---|---|---|
| UNet | The architecture contains 2 paths (contraction path and symmetric expanding path). It is an end-to-end fully convolutional network (FCN). | [ |
| DeeplabV3+ | The spatial pyramid pooling module and the encoder–decoder structure were combined. The depthwise separable convolution was applied to both the Atrous Spatial Pyramid Pooling and decoder modules. | [ |
| Feature Pyramid Networks (FPN) | Developed a top-down architecture with lateral connections for building high-level semantic feature maps at all scales. | [ |
| Pyramid Attention Network (PAN) | Exploited the impact of global contextual information in semantic segmentation. | [ |
| UNet++ | The architecture is an encoder–decoder network where the encoder and decoder sub-networks are connected through a series of nested, dense skip pathways. It optimizes the topology of UNet and is an improved version of the UNet network structure. | [ |
Figure 6A flow chart of the process of creating the dataset together with the process of training, validating, and testing the model.
Figure 7Loss value corresponding to different iterations.
Method for calculating the 8 new vegetation indices related to red edge.
| Vegetation | Calculation Method | Calculation |
|---|---|---|
| ND790/670 |
|
|
| NDVI690-710 |
|
|
| NDRE |
|
|
| NDVI65 |
|
|
| GNDVIhyper |
|
|
| RENDVI1 |
|
|
| RENDVI2 |
|
|
| RI |
|
|
The experimental outcomes of RSPR-UNet++ and other methods. BG represents the background, BB the bark beetle, and ALM the aspen leaf miner.
| Model | Category | Precision | Recall | F1 | IoU | mIoU | FWIoU | Accuracy |
|---|---|---|---|---|---|---|---|---|
| UNet | BG | 85.44 | 88.63 | 87.00 | 77.00 | 63.84 | 70.09 | 82.21 |
| BB | 76.64 | 69.74 | 73.03 | 57.52 | ||||
| ALM | 74.03 | 71.26 | 72.62 | 57.01 | ||||
| FPN | BG | 86.33 | 87.93 | 87.12 | 77.18 | 64.51 | 70.60 | 82.52 |
| BB | 75.09 | 74.12 | 74.60 | 59.49 | ||||
| ALM | 75.08 | 70.11 | 72.51 | 56.87 | ||||
| PAN | BG | 86.39 | 88.34 | 87.36 | 77.55 | 64.84 | 70.87 | 82.71 |
| BB | 75.00 | 73.05 | 74.01 | 58.75 | ||||
| ALM | 76.15 | 71.21 | 73.59 | 58.22 | ||||
| DeeplabV3+ | BG | 87.63 | 86.22 | 86.92 | 76.86 | 65.09 | 70.75 | 82.51 |
| BB | 74.85 | 75.81 | 75.33 | 60.42 | ||||
| ALM | 71.49 | 75.44 | 73.41 | 57.99 | ||||
| UNet++ | BG | 86.81 | 87.81 | 87.31 | 77.47 | 65.18 | 71.06 | 82.82 |
| BB | 75.34 | 74.71 | 75.03 | 60.03 | ||||
| ALM | 75.00 | 71.93 | 73.44 | 58.02 | ||||
| RSPR-UNet++ | BG | 89.61 | 87.06 | 88.32 | 79.08 | 68.83 | 73.76 | 84.61 |
| BB | 75.60 | 82.15 | 78.74 | 64.94 | ||||
| ALM | 76.83 | 76.98 | 76.90 | 62.47 | ||||
| RSPR-UNet++ | BG | 89.92 | 87.53 | 88.71 | 79.70 | 69.82 | 74.50 | 85.11 |
| BB | 78.10 | 79.52 | 78.81 | 65.02 | ||||
| ALM | 75.16 | 82.33 | 78.58 | 64.72 |
Figure 8The process of the prediction of the entire Sentinel-2 image.
Figure 9The segmentation effect of the whole Sentinel-2 image.
The overall accuracy for the segmentation results of the Sentinel-2 image in Figure 9.
| Model | Accuracy |
|---|---|
| UNet | 86.79 |
| FPN | 86.31 |
| PAN | 87.40 |
| DeeplabV3+ | 85.87 |
| UNet++ | 87.49 |
| RSPR-UNet++ without scSE | 88.29 |
| RSPR-UNet++ | 89.10 |
Figure 10Comparison of the evaluation metrics after the two additions of the bands.
Effect of different data on the segmentation result of infested areas using RSPR-UNet++.
| Data | Category | Precision | Recall | F1 | IoU | mIoU | FWIoU | Accuracy |
|---|---|---|---|---|---|---|---|---|
| RGB | BG | 87.06 | 85.67 | 86.36 | 75.99 | 64.20 | 69.76 | 81.81 |
| BB | 73.45 | 73.32 | 73.38 | 57.96 | ||||
| ALM | 71.32 | 76.76 | 73.94 | 58.66 | ||||
| 11 bands | BG | 88.96 | 84.98 | 86.93 | 76.87 | 65.76 | 71.03 | 82.63 |
| BB | 70.04 | 81.40 | 75.29 | 60.38 | ||||
| ALM | 76.43 | 73.65 | 75.02 | 60.02 | ||||
| RGB | BG | 89.79 | 84.79 | 87.22 | 77.34 | 67.06 | 72.01 | 83.31 |
| BB | 76.23 | 78.65 | 77.42 | 63.16 | ||||
| ALM | 68.99 | 83.43 | 75.52 | 60.67 | ||||
| 11 bands | BG | 89.92 | 87.53 | 88.71 | 79.70 | 69.82 | 74.50 | 85.11 |
| BB | 78.10 | 79.52 | 78.81 | 65.02 | ||||
| ALM | 75.16 | 82.33 | 78.58 | 64.72 | ||||
| 8 bands | BG | 86.43 | 89.63 | 88.00 | 78.57 | 68.17 | 72.76 | 84.13 |
| BB | 80.66 | 74.73 | 77.58 | 63.37 | ||||
| ALM | 78.76 | 75.28 | 76.98 | 62.57 | ||||
| 11 bands | BG | 84.68 | 90.62 | 87.55 | 77.85 | 68.12 | 72.17 | 83.85 |
| BB | 81.21 | 75.63 | 78.32 | 64.37 | ||||
| ALM | 84.00 | 70.48 | 76.65 | 62.14 |
Figure 11The spectral figure of bark beetle and aspen leaf miner. VRE represents Vegetation Red Edge.
Effects of different attention modules.
| Attention Module | Accuracy (%) |
|---|---|
| scSE | 85.11 |
| cSE | 84.93 |
| sSE | 84.85 |
| None | 84.61 |
The channels’ number of the feature-maps output.
| Parameters | Accuracy (%) |
|---|---|
| 16, 32, 64, 128 and 256 | 85.11 |
| 32, 64, 128, 256 and 512 | 85.03 |
| 64, 128, 256, 512 and 1024 | 83.24 |