| Literature DB >> 35957382 |
Yaowen Lv1, Zhiqing Ai1, Manfei Chen1, Xuanrui Gong1, Yuxuan Wang1, Zhenghai Lu1.
Abstract
To solve the problem of low accuracy and slow speed of drone detection in high-resolution images with fixed cameras, we propose a detection method combining background difference and lightweight network SAG-YOLOv5s. First, background difference is used to extract potential drone targets in high-resolution images, eliminating most of the background to reduce computational overhead. Secondly, the Ghost module and SimAM attention mechanism are introduced on the basis of YOLOv5s to reduce the total number of model parameters and improve feature extraction, and α-DIoU loss is used to replace the original DIoU loss to improve the accuracy of bounding box regression. Finally, to verify the effectiveness of our method, a high-resolution drone dataset is made based on the public data set. Experimental results show that the detection accuracy of the proposed method reaches 97.6%, 24.3 percentage points higher than that of YOLOv5s, and the detection speed in 4K video reaches 13.2 FPS, which meets the actual demand and is significantly better than similar algorithms. It achieves a good balance between detection accuracy and detection speed and provides a method benchmark for high-resolution drone detection under a fixed camera.Entities:
Keywords: background difference; drone; high-resolution image; object detection; small target
Mesh:
Year: 2022 PMID: 35957382 PMCID: PMC9527012 DOI: 10.3390/s22155825
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1The steps in this paper to detect high-resolution drones with fixed cameras.
Figure 2Extraction of potential targets: (a) original images; (b) results obtained by background difference.
Figure 3Examples of candidate areas that may contain interference targets, such as birds.
Figure 4SAG-YOLOv5s object detection network.
Figure 5The GhostConv structure.
Figure 6Basic structure of SAG-YOLOv5s: (a) SAGBottleneck structure; (b) SAGCSP_n structure.
Figure 7Target size distribution in the high-resolution drone dataset.
Figure 8PR curves of the proposed algorithm and YOLOv5s.
Ablation experiments.
| Method | mAP@0.5 (%) | Parameters (106) | FLOPs (109) | FPS | Input Size (pixels) |
|---|---|---|---|---|---|
| YOLOv5s | 73.3 | 7.06 | 16.4 | 21.0 | 416 × 416 |
| BD + YOLOv5s | 97.0 | 7.06 | 16.4 | 9.2 | 416 × 416 |
| BD + YOLOv5s + SimAM | 97.5 | 7.06 | 16.4 | 9.0 | 416 × 416 |
| BD + Ghost + YOLOv5s | 95.8 | 3.90 | 8.8 | 13.4 | 416 × 416 |
| BD + SAG-YOLOv5s | 97.6 | 3.90 | 8.8 | 13.2 | 416 × 416 |
| BD + SAG-YOLOv5s | 97.2 | 3.90 | 8.8 | 15.0 | 96 × 96 |
Experimental results of various algorithms on the test set.
| Method | Precision (%) | Recall (%) | mAP@0.5 (%) | FPS |
|---|---|---|---|---|
| CenterNet | 87.2 | 69.5 | 69.8 | 10.5 |
| YOLT | 92.9 | 88.0 | 91.1 | 4.6 |
| YOLT-YOLOV5s | 95.3 | 92.8 | 95.2 | 6.0 |
| Ours |
|
|
|
|
Figure 9Visual heatmaps of different networks.
Figure 10The high-resolution drone detection results obtained by the method in this paper.