| Literature DB >> 35009861 |
Xin Chen1,2, Jinghong Liu1,2, Fang Xu1, Zhihua Xie1,2, Yujia Zuo1, Lihua Cao1,2.
Abstract
Aircraft detection in remote sensing images (RSIs) has drawn widespread attention in recent years, which has been widely used in the military and civilian fields. While the complex background, variations of aircraft pose and size bring great difficulties to the effective detection. In this paper, we propose a novel aircraft target detection scheme based on small training samples. The scheme is coarse-to-fine, which consists of two main stages: region proposal and target identification. First, in the region proposal stage, a circular intensity filter, which is designed based on the characteristics of the aircraft target, can quickly locate the centers of multi-scale suspicious aircraft targets in the RSIs pyramid. Then the target regions can be extracted by adding bounding boxes. This step can get high-quality but few candidate regions. Second, in the stage of target identification, we proposed a novel rotation-invariant feature, which combines rotation-invariant histogram of oriented gradient and vector of locally aggregated descriptors (VLAD). The feature can characterize the aircraft target well by avoiding the impact of its rotation and can be effectively used to remove false alarms. Experiments are conducted on Remote Sensing Object Detection (RSOD) dataset to compare the proposed method with other advanced methods. The results show that the proposed method can quickly and accurately detect aircraft targets in RSIs and achieve a better performance.Entities:
Keywords: aircraft target detection; circular intensity filter; remote sensing images; rotation invariant feature; vector of locally aggregated descriptors (VLAD)
Year: 2022 PMID: 35009861 PMCID: PMC8749598 DOI: 10.3390/s22010319
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Some aircraft examples in RSIs. (a) Complex geo-graphical environmental background; (b) aircraft targets in different poses; (c) aircraft targets in various sizes.
Figure 2The flowchart of the proposed aircraft detection scheme.
Figure 3The aircraft and corresponding circular intensity signal waveform charts. (a,c) Aircraft in RSIs; (b,d) circular intensity signal waveform charts centered at aircraft’s central point.
Figure 4The real part of the constructed convolution kernel. (a) The planar schematic; (b) the three-dimensional schematics.
Figure 5The Original RSI and the process of target center determination. (a) The original RSI with the resolution of 1121 × 957 pixels; (b) the corresponding center-response magnitude map generated by a convolution kernel (r = 20 pixels); (c) the binary image with the center of targets obtained by threshold segmentation.
Figure 6Multi-scale image pyramid and the results of aircraft target center determination. (a) RS image pyramid; (b) center determination on image pyramid; (c) center point aggregation.
Figure 7Illustration of the expansion of gradient images to Fourier coefficient images. (a) The input image; (b) the gradient image; (c) the complex Fourier coefficient images.
Figure 8Illustration of the codebook generation in the VLAD representation.
Figure 9The distribution histogram of the aircraft target long side.
The results of region proposal method under different parameter t0, t1, t2 settings.
| Combinations of thresholds | t0 = 0.2 | t0 = 0.3 |
| t0 = 0.35 | t0 = 0.4 | t0 = 0.4 |
| t1 = 0.2 | t1 = 0.3 |
| t1 = 0.35 | t1 = 0.4 | t1 = 0.4 | |
| t2 = 0.4 | t2 = 0.4 |
| t2 = 0.5 | t2 = 0.4 | t2 = 0.5 | |
| Number of candidate regions/Per image | 235 | 181 |
| 127 | 78 | 73 |
| Recall | 0.984 | 0.976 |
| 0.935 | 0.911 | 0.91 |
The results of the three different region proposal methods.
| Method | Number of Candidate Regions per Image | Recall | Time (s) per Image |
|---|---|---|---|
| EdgeBoxes | 6109 | 0.928 |
|
| Selective Search | 3892 | 0.941 | 11.446 |
| Proposed |
|
| 0.513 |
Figure 10The PR curves of different methods on the RSOD dataset.
The performance comparisons of different methods on the RSOD dataset.
| Method | HOG | ACF-Based | RICNN | YOLOv2 | Fourier HOG | Proposed Method |
|---|---|---|---|---|---|---|
| AP | 0.697 | 0.808 | 0.874 | 0.881 | 0.905 |
|
| Mean Time (s) per Image | 0.72 | 2.23 | 8.84 |
| 2.37 | 1.31 |
Figure 11Some visual detection results of the proposed method in the RSOD dataset.