| Literature DB >> 35140732 |
Fen Dai1,2,3,4, Fengcheng Wang1,2, Dongzi Yang1,2, Shaoming Lin1,2, Xin Chen1,2,3,4, Yubin Lan1,2,3,4, Xiaoling Deng1,2,3,4.
Abstract
Citrus psyllid is the only insect vector of citrus Huanglongbing (HLB), which is the most destructive disease in the citrus industry. There is no effective treatment for HLB, so detecting citrus psyllids as soon as possible is the key prevention measure for citrus HLB. It is time-consuming and laborious to search for citrus psyllids through artificial patrol, which is inconvenient for the management of citrus orchards. With the development of artificial intelligence technology, a computer vision method instead of the artificial patrol can be adopted for orchard management to reduce the cost and time. The citrus psyllid is small in shape and gray in color, similar to the stem, stump, and withered part of the leaves, leading to difficulty for the traditional target detection algorithm to achieve a good recognition effect. In this work, in order to make the model have good generalization ability under outdoor light condition, a high-definition camera to collect data set of citrus psyllids and citrus fruit flies under natural light condition was used, a method to increase the number of small target pests in citrus based on semantic segmentation algorithm was proposed, and the cascade region-based convolution neural networks (R-CNN) (convolutional neural network) algorithm was improved to enhance the recognition effect of small target pests using multiscale training, combining CBAM attention mechanism with high-resolution feature retention network high-resoultion network (HRNet) as feature extraction network, adding sawtooth atrous spatial pyramid pooling (ASPP) structure to fully extract high-resolution features from different scales, and adding feature pyramid networks (FPN) structure for feature fusion at different scales. To mine difficult samples more deeply, an online hard sample mining strategy was adopted in the process of model sampling. The results show that the improved cascade R-CNN algorithm after training has an average recognition accuracy of 88.78% for citrus psyllids. Compared with VGG16, ResNet50, and other common networks, the improved small target recognition algorithm obtains the highest recognition performance. Experimental results also show that the improved cascade R-CNN algorithm not only performs well in citrus psylla identification but also in other small targets such as citrus fruit flies, which makes it possible and feasible to detect small target pests with a field high-definition camera.Entities:
Keywords: cascade R-CNN; citrus psyllids; deep learning; small target detection; small target enhancement
Year: 2022 PMID: 35140732 PMCID: PMC8819152 DOI: 10.3389/fpls.2021.816272
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
FIGURE 1Experimental data. The blue box is used to mark citrus psyllids.
FIGURE 2Diagram of the relationship between the picture and the target quantity.
FIGURE 3Small sample number enhancement flowchart.
FIGURE 4Small sample number enhancement example diagram.
FIGURE 5Image cropping flowchart.
FIGURE 6High-resoultion network (HRNet) model with CBAM attention mechanism.
FIGURE 7Atrous spatial pyramid pooling (ASPP) structure.
FIGURE 8Feature pyramid networks (FPN) structure.
FIGURE 9Comparison of the number of small targets before and after enhancement (A) image enhancement (B) contrast matrix.
FIGURE 10Comparison of difficult sampling and random sampling.
Different model recognition effects.
| Models | Average precision | Mean average precision | |
| Citrus psyllids | Fruit flies | ||
| ResNet50 + Data enhancement | 67.4% | 78.48% | 72.94% |
| ResNet101 + Data enhancement | 66.45% | 75.33% | 70.89% |
| ResNetXt101 + Data enhancement | 67.78% | 77.82% | 73.3% |
| VGG16 + Data enhancement | 66.89% | 77.37% | 72.13% |
| HRNet + Data enhancement | 73.54% | 80.25% | 76.89% |
| Improved HRNet + Data enhancement | 81.89% | 84.73% | 76.89% |
| Improved HRNet + Offline resampling | 72.89% | 76.25% | 74.57% |
| Improved HRNet + ASPP + FPN + Online hard sample mining strategy + Data enhancement | 88.78% | 91.64% | 90.21% |
Visualization results of different models.
| Model | Data | Visualization of results | Heat map |
| ResNet50 + Data enhancement |
|
|
|
| ResNet101 + Data enhancement |
|
|
|
| ResNetXt101 + Data enhancement |
|
|
|
| VGG16 + Data enhancement |
|
|
|
| Improved HRNet + Data enhancement |
|
|
|
| Improved HRNet + ASPP + FPN + Online hard sample mining strategy + Data enhancement |
|
|
|
FIGURE 11Training loss diagram (A) training total loss (B) training RPN bbox loss.
FIGURE 12Prediction results.