| Literature DB >> 31426597 |
Anderson Aparecido Dos Santos1, José Marcato Junior2, Márcio Santos Araújo3, David Robledo Di Martini3, Everton Castelão Tetila4, Henrique Lopes Siqueira3, Camila Aoki5, Anette Eltner6, Edson Takashi Matsubara1, Hemerson Pistori1,4, Raul Queiroz Feitosa7, Veraldo Liesenberg8, Wesley Nunes Gonçalves9.
Abstract
Detection and classification of tree species from remote sensing data were performed using mainly multispectral and hyperspectral images and Light Detection And Ranging (LiDAR) data. Despite the comparatively lower cost and higher spatial resolution, few studies focused on images captured by Red-Green-Blue (RGB) sensors. Besides, the recent years have witnessed an impressive progress of deep learning methods for object detection. Motivated by this scenario, we proposed and evaluated the usage of Convolutional Neural Network (CNN)-based methods combined with Unmanned Aerial Vehicle (UAV) high spatial resolution RGB imagery for the detection of law protected tree species. Three state-of-the-art object detection methods were evaluated: Faster Region-based Convolutional Neural Network (Faster R-CNN), YOLOv3 and RetinaNet. A dataset was built to assess the selected methods, comprising 392 RBG images captured from August 2018 to February 2019, over a forested urban area in midwest Brazil. The target object is an important tree species threatened by extinction known as Dipteryx alata Vogel (Fabaceae). The experimental analysis delivered average precision around 92% with an associated processing times below 30 miliseconds.Entities:
Keywords: deep learning; object-detection; remote sensing
Mesh:
Year: 2019 PMID: 31426597 PMCID: PMC6719170 DOI: 10.3390/s19163595
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1General processing chain: (a) UAV images at different seasons were captured and annotated by a specialist. A set of images were selected to train the detection network. (b) Once trained, the network was applied to detect cumbaru trees in test images. The object detection method in this figure corresponds to RetinaNet, although other methods (e.g., Faster-RCNN and YOLOv3) can be applied.
Aircraft and flight specifications.
| Aircraft | Sensor | Field of View | Nominal Focal Length | Image Size | Mean GSD | Mean Flight Height |
|---|---|---|---|---|---|---|
| Phantom4 | 1” CMOS | 84° | 8.8 mm | 5472 × 3648 | 0.82 cm | 30 m |
| Advanced | (20 Mp) |
Figure 2Examples from the dataset: images captured on: (a) 26 August 2018; (b) 21 September 2018; (c) 22 September 2018; and (d) 20 February 2019.
Figure 3Precision–recall curves of detection methods in all five cross validation rounds (a–e).
Average precision (%) for cumbaru tree detection in five cross validation rounds (R1–R5).
| Variant | R1 | R2 | R3 | R4 | R5 | Mean (std) |
|---|---|---|---|---|---|---|
| Faster-RCNN | 86.62 | 84.14 | 86.13 | 77.83 | 77.69 | 82.48 (±3.94) |
| YOLOv3 | 89.08 | 88.64 | 89.74 | 80.99 | 80.93 | 85.88 (±4.03) |
| RetinaNet | 93.13 | 93.92 | 95.65 | 87.82 | 92.66 | 92.64 (±2.61) |
Figure 4Examples of detection results in images captured in different seasons: (a) ground truth; (b) Faster-RCNN; (c) YOLOv3; and (d) RetinaNet.
Figure 5Examples of detection results in images captured for different lighting (average of 67.15 and 130.99 for the brightness channel of the HSB color space) and scale conditions (1:4000 and 1:2500): (a) ground truth; (b) Faster-RCNN; (c) YOLOv3; and (d) RetinaNet.
Figure 6Examples of detection results in images with different capture angles (0° and 30°): (a) ground truth; (b) Faster-RCNN; (c) YOLOv3; and (d) RetinaNet.
Computational cost of the proposed approach variants. The time is the average in seconds to execute the deep learning methods on an image.
| Approach Variation | Time (s) |
|---|---|
| Faster-RCNN | 0.163 (±0.066) |
| YOLOv3 | 0.026 (±0.001) |
| RetinaNet | 0.067 (±0.001) |