| Literature DB >> 33921492 |
Suk-Ju Hong1, Il Nam2, Sang-Yeon Kim1, Eungchan Kim1,3, Chang-Hyup Lee1,3, Sebeom Ahn1, Il-Kwon Park2,4, Ghiseok Kim1,3,4.
Abstract
The black pine bast scale, M. thunbergianae, is a major insect pest of black pine and causes serious environmental and economic losses in forests. Therefore, it is essential to monitor the occurrence and population of M. thunbergianae, and a monitoring method using a pheromone trap is commonly employed. Because the counting of insects performed by humans in these pheromone traps is labor intensive and time consuming, this study proposes automated deep learning counting algorithms using pheromone trap images. The pheromone traps collected in the field were photographed in the laboratory, and the images were used for training, validation, and testing of the detection models. In addition, the image cropping method was applied for the successful detection of small objects in the image, considering the small size of M. thunbergianae in trap images. The detection and counting performance were evaluated and compared for a total of 16 models under eight model conditions and two cropping conditions, and a counting accuracy of 95% or more was shown in most models. This result shows that the artificial intelligence-based pest counting method proposed in this study is suitable for constant and accurate monitoring of insect pests.Entities:
Keywords: CNN; Faster R-CNN; Matsucoccus thunbergianae; SSD; deep learning; insect counting; object detection; pest monitoring; sex pheromone trap
Year: 2021 PMID: 33921492 PMCID: PMC8068825 DOI: 10.3390/insects12040342
Source DB: PubMed Journal: Insects ISSN: 2075-4450 Impact factor: 2.769
Figure 1Eight-sided sticky trap: (a) installed pheromone traps; (b) captured M. thunbergianae.
Figure 2Trap photographing system.
Figure 3Samples of pheromone trap images.
Number of images and M. thunbergianae of datasets.
| Train Set | Validation Set | Test Set | |
|---|---|---|---|
| Images | 30 | 10 | 10 |
|
| 13,419 | 5071 | 4566 |
Figure 4Cropped pheromone trap images: (a) 12 × 8 cropping condition; (b) 6 × 4 cropping condition.
Figure 5Flowchart for trap image counting.
Figure 6Schematic of the training and evaluation process.
Test results of M. thunbergianae detectors (12 × 8 cropping condition).
| Model | Input Size | Inference Time (ms) | AP (%) | |
|---|---|---|---|---|
| IoU:0.3 | IoU:0.5 | |||
| Faster R-CNN Resnet 101 | 1024 | 78.26 | 89.78 | 85.63 |
| Faster R-CNN Resnet 101 | 512 | 39.64 | 89.58 | 84.32 |
| EfficientDet D4 | 1024 | 86.74 | 89.26 | 84.79 |
| EfficientDet D0 | 512 | 25.58 | 88.36 | 83.79 |
| Retinanet 50 | 1024 | 30.97 | 89.35 | 84.40 |
| Retinanet 50 | 640 | 20.56 | 89.86 | 86.40 |
| SSD Mobilenet v.2 | 640 | 15.28 | 89.02 | 84.76 |
| SSD Mobilenet v.2 | 320 | 11.82 | 89.46 | 84.54 |
Test results of M. thunbergianae detectors (6 × 4 cropping condition).
| Model | Input Size | Inference Time (ms) | AP (%) | |
|---|---|---|---|---|
| IoU:0.3 | IoU:0.5 | |||
| Faster R-CNN Resnet 101 | 1024 | 79.58 | 87.13 | 82.92 |
| Faster R-CNN Resnet 101 | 512 | 41.48 | 85.04 | 80.18 |
| EfficientDet D4 | 1024 | 90.33 | 84.87 | 81.22 |
| EfficientDet D0 | 512 | 26.12 | 85.30 | 80.21 |
| Retinanet 50 | 1024 | 33.52 | 86.58 | 82.62 |
| Retinanet 50 | 640 | 21.85 | 85.33 | 81.71 |
| SSD Mobilenet v.2 | 640 | 16.83 | 85.75 | 81.35 |
| SSD Mobilenet v.2 | 320 | 12.22 | 79.87 | 72.05 |
Figure 7Detection result images of the 12 × 8 cropping condition (0.5 score threshold).
Figure 8Detection result images of the 6 × 4 cropping condition (0.5 score threshold).
Figure 9Detection errors: (a–c) false negatives; (d) false positives.
Figure 10Counting error–counting time graph of detection models: (a) 12 × 8 cropping condition; (b) 6 × 4 cropping condition.
Counting error results of the 12 × 8 cropping condition.
| Model | Input Size | Counting Time (s) | Counting Error (%) |
|---|---|---|---|
| Faster R-CNN Resnet 101 | 1024 | 14.14 | 2.11 |
| Faster R-CNN Resnet 101 | 512 | 9.17 | 3.69 |
| EfficientDet | 1024 | 14.44 | 3.37 |
| EfficientDet | 512 | 5.29 | 3.42 |
| Retinanet50 | 1024 | 6.58 | 3.30 |
| Retinanet50 | 640 | 4.78 | 2.95 |
| SSD Mobilenet v.2 | 640 | 3.81 | 2.32 |
| SSD Mobilenet v.2 | 320 | 3.63 | 3.32 |
Counting error results of the 6 × 4 cropping condition.
| Model | Input Size | Counting Time (s) | Counting Error (%) |
|---|---|---|---|
| Faster R-CNN Resnet 101 | 1024 | 3.90 | 3.95 |
| Faster R-CNN Resnet 101 | 512 | 2.50 | 4.02 |
| EfficientDet | 1024 | 3.92 | 4.70 |
| EfficientDet | 512 | 1.68 | 4.21 |
| Retinanet50 | 1024 | 1.88 | 3.83 |
| Retinanet50 | 640 | 1.45 | 3.74 |
| SSD Mobilenet v.2 | 640 | 1.40 | 3.65 |
| SSD Mobilenet v.2 | 320 | 1.19 | 6.69 |
Figure 11Detection results of the trap image (Faster R-CNN Resnet101 with a 1024 input size, a 12 × 8 cropping condition, and a 0.5 score threshold).