| Literature DB >> 35432415 |
Geovanni Figueroa-Mata1, Erick Mata-Montero2, Juan Carlos Valverde-Otárola3,4, Dagoberto Arias-Aguilar3, Nelson Zamora-Villalobos3.
Abstract
Tree species identification is critical to support their conservation, sustainable management and, particularly, the fight against illegal logging. Therefore, it is very important to develop fast and accurate identification systems even for non-experts. In this research we have achieved three main results. First, we developed-from scratch and using new sample collecting and processing protocols-an dataset called CRTreeCuts that comprises macroscopic cross-section images of 147 native tree species from Costa Rica. Secondly, we implemented a CNN for automated tree species identification based on macroscopic images of cross-sections of wood. For this CNN we apply the fine-tuning technique with VGG16 as a base model, pre-trained with the ImageNet data set. This model is trained and tested with a subset of 75 species from CRTreeCuts. The top-1 and top-3 accuracies achieved in the testing phase are 70.5% and 80.3%, respectively. The Same-Specimen-Picture Bias (SSPB), which is known to erroneously increase accuracy, is absent in all experiments. Finally, the third result is Cocobolo, an Android mobile application that uses the developed CNN as back-end to identify Costa Rican tree species from images of cross-sections of wood.Entities:
Keywords: automated image-based tree species identification; convolutional neural network; costa rican tree species; deep learning; plant classification; xylotheques
Year: 2022 PMID: 35432415 PMCID: PMC9011719 DOI: 10.3389/fpls.2022.789227
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1Cutting planes (taken from Hoadley, 2000).
Figure 2Location of forest reserves selected for sample collections.
Figure 3(A–H) Extraction process of a sample of wood.
Figure 4(A–D) Sample processing.
Figure 5Some of the images in the database.
Figure 6Dividing the original image into sub-images (taken from Figueroa-Mata et al., 2018b).
Number of species and specimens in dataset.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| # specimens per species | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 12 | 14 | 15 | 19 |
Figure 7Training accuracy and loss for CNN model.
Species never correctly identified by the CNN.
|
|
|
|---|---|
|
| 5 |
|
| 5 |
|
| 5 |
|
| 9 |
|
| 5 |
|
| 5 |
Figure 8Images of species never correctly identified and the top-1 predicted species.
Figure 9Training accuracy and loss for CNN model with SSPB bias.
Figure 10Training accuracy and loss for CNN model with FSD-M dataset and (possibly) SSPB bias.
Figure 11(A–C) Cocobolo mobil application.