| Literature DB >> 35755692 |
Weikuan Jia1,2, Mengyuan Liu1, Rong Luo3, Chongjing Wang4, Ningning Pan1, Xinbo Yang1, Xinting Ge1,5.
Abstract
Accurate detection and segmentation of the object fruit is the key part of orchard production measurement and automated picking. Affected by light, weather, and operating angle, it brings new challenges to the efficient and accurate detection and segmentation of the green object fruit under complex orchard backgrounds. For the green fruit segmentation, an efficient YOLOF-snake segmentation model is proposed. First, the ResNet101 structure is adopted as the backbone network to achieve feature extraction of the green object fruit. Then, the C5 feature maps are expanded with receptive fields and the decoder is used for classification and regression. Besides, the center point in the regression box is employed to get a diamond-shaped structure and fed into an additional Deep-snake network, which is adjusted to the contours of the target fruit to achieve fast and accurate segmentation of green fruit. The experimental results show that YOLOF-snake is sensitive to the green fruit, and the segmentation accuracy and efficiency are significantly improved. The proposed model can effectively extend the application of agricultural equipment and provide theoretical references for other fruits and vegetable segmentation.Entities:
Keywords: YOLOF-snake; automatic harvesting; deep-snake; fruits segmentation; green fruits
Year: 2022 PMID: 35755692 PMCID: PMC9218684 DOI: 10.3389/fpls.2022.765523
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
FIGURE 1Images of apple and persimmon in front light, backlight, overlap, after rain, and night.
Sample distribution details of green fruit images.
| Condition | Overlapping | Direct sunlight | After the rain | Backlight |
| Number of persimmon images | 523 | 109 | 113 | 207 |
| Number of apple images | 459 | 89 | 103 | 265 |
FIGURE 2YOLOF structure schematic diagram.
FIGURE 3Overall structure of YOLOF-snake. 1. Output the feature map based on the underlying backbone. 2. Enter the encoder. 3. Decoder to obtain the classification results and regression box. 4. Deep-snake network to segment the regression boxes for the object fruit.
FIGURE 4Encoder structure diagram.
FIGURE 5Fine-tuning of the profile of the apple fruit.
FIGURE 6Network structure of the add-on module.
FIGURE 7Training error variation curve.
FIGURE 8AP evolution curve of persimmon dataset.
FIGURE 9AP evolution curve of apple dataset.
FIGURE 10Green apple result images.
FIGURE 11Green persimmon result images.
FIGURE 12Result images of the Cityscapes data set.
Number of residual blocks.
| Number | mAP/% | mAP | mAP | mAP |
| 0 | 62.3 | 47.6 | 79.5 | 72.5 |
| 2 | 63.5 | 47.6 | 70.1 | 75.2 |
| √4 | 64.9 | 47.8 | 70.2 | 77.3 |
| 6 | 65.0 | 47.6 | 70.2 | 78.2 |
| 8 | 66.1 | 48.5 | 71.0 | 79.1 |
Performance comparison of detection and segmentation methods.
| Methods |
|
| Average time/s | ||
|
|
| ||||
| mAP |
| mAP | mAP | ||
| SOLO | — | 57.4 | — | 76.5 | 0.49 |
| PolarMask | 56.3 | 53.8 | 68.3 | 66.1 | 0.52 |
| YOLACT | 57.6 | 60.5 | 66.9 | 71.2 | 0.45 |
| TensorMask | — | 65.3 | — | 72.4 | 0.71 |
| FCOS | 61.2 | — | 78.6 | — | 0.42 |
| RetinaMask | 69.2 | 69.4 | 77.5 | 73.1 | 0.92 |
| 70.3 | 70.9 | 79.9 | 79.3 | 0.48 | |
| 71.2 | 71.6 | 80.3 | 80.9 | 0.56 | |
| OURS | 71.0 | 71.4 | 81.8 | 82.6 | 0.23 |