| Literature DB >> 31871441 |
Danzi Wu1, Xue Han2, Guan Wang2, Yu Sun2,3, Haiyan Zhang2, Hongping Fu2.
Abstract
Plant identification is a fine-grained classification task which aims to identify the family, genus, and species according to plant appearance features. Inspired by the hierarchical structure of taxonomic tree, the taxonomic loss was proposed, which could encode the hierarchical relationships among multilevel labels into the deep learning objective function by simple group and sum operation. By training various neural networks on PlantCLEF 2015 and PlantCLEF 2017 datasets, the experimental results demonstrated that the proposed loss function was easy to implement and outperformed the most commonly adopted cross-entropy loss. Eight neural networks were trained, respectively, by two different loss functions on PlantCLEF 2015 dataset, and the models trained by taxonomic loss led to significant performance improvements. On PlantCLEF 2017 dataset with 10,000 species, the SENet-154 model trained by taxonomic loss achieved the accuracies of 84.07%, 79.97%, and 73.61% at family, genus and species levels, which improved those of model trained by cross-entropy loss by 2.23%, 1.34%, and 1.08%, respectively. The taxonomic loss could further facilitate the fine-grained classification task with hierarchical labels.Entities:
Mesh:
Year: 2019 PMID: 31871441 PMCID: PMC6907043 DOI: 10.1155/2019/2015017
Source DB: PubMed Journal: Comput Intell Neurosci
Details of PlantCLEF 2015 and PlantCLEF 2017 dataset.
| Dataset | Number of classes | Number of samples | |||
|---|---|---|---|---|---|
| Family | Genus | Species | Train | Test | |
| PlantCLEF 2015 | 124 | 516 | 1,000 | 91,758 | 21,446 |
| PlantCLEF 2017 | 341 | 2,991 | 10,000 | 226,386 | 29,901 |
Figure 1The end-to-end training pipeline of deep learning plant identification. Two distinct loss modules are shown in the lower part: (a) the cross-entropy loss uses only the species-level labels; (b) the proposed taxonomic loss encodes the hierarchy among three-level labels into the objective function.
Figure 2(a) A brief taxonomic tree of Fagaceae family; (b) the derivation process of genus probabilities and family probabilities according to the taxonomic hierarchy by group and sum operation.
Figure 3The effects of image augmentation. (a) The cropped image, the image with (b) horizontal flipping, (c) rotation (degree = 10), (d) translation (ratio = 0.15), (e) scaling (ratio = 0.8), (f) scaling (ratio = 1.2), (g) shear (degree = 10), and (h) multiple random augmentation.
Accuracies of eight state-of-the-art neural networks trained by cross-entropy and taxonomic loss on PlantCLEF 2015 dataset.
| Neural network | Loss function | Accuracy (%) | ||
|---|---|---|---|---|
| Family | Genus | Species | ||
| GoogLeNet [ | CL | 72.62 | 65.97 | 59.69 |
| TAX |
|
|
| |
|
| ||||
| ResNet-50 [ | CL | 77.48 | 71.59 | 65.07 |
| TAX |
|
|
| |
|
| ||||
| Inception-v3 [ | CL | 77.93 | 74.01 | 67.98 |
| TAX | 80.31 |
| 67.42 | |
|
| ||||
| Inception-ResNet-v2 [ | CL | 80.66 |
| 67.93 |
| TAX |
| 76.85 |
| |
|
| ||||
| MobileNet v2 [ | CL | 72.13 | 65.65 | 59.16 |
| TAX |
|
|
| |
|
| ||||
| ShuffleNet v2 [ | CL | 66.39 | 59.32 | 52.88 |
| TAX |
|
|
| |
|
| ||||
| DenSeNet-169 [ | CL | 78.57 | 73.00 | 66.76 |
| TAX |
|
|
| |
|
| ||||
| SENet-154 [ | CL | 81.25 | 76.81 | 70.08 |
| TAX |
|
|
| |
Figure 4Loss curves of Inception-ResNet-v2 trained by cross-entropy and taxonomic loss.
Accuracies of eight state-of-the-art neural networks trained by cross-entropy and taxonomic loss on PlantCLEF 2017 dataset.
| Neural network | Loss function | Accuracy (%) | ||
|---|---|---|---|---|
| Family | Genus | Species | ||
| GoogLeNet [ | CL | 68.73 | 64.22 | 57.86 |
| TAX |
|
|
| |
|
| ||||
| ResNet-50 [ | CL | 76.32 | 72.49 | 66.68 |
| TAX |
|
|
| |
|
| ||||
| Inception-v3 [ | CL | 77.32 | 73.12 | 67.05 |
| TAX |
|
|
| |
|
| ||||
| Inception-ResNet-v2 [ | CL | 79.98 | 75.43 | 68.97 |
| TAX |
|
|
| |
|
| ||||
| MobileNet-v2 [ | CL | 71.76 | 67.73 | 61.78 |
| TAX |
|
|
| |
|
| ||||
| ShuffleNet-v2 [ | CL | 61.94 | 57.12 | 49.96 |
| TAX |
|
|
| |
|
| ||||
| DenSeNet-169 [ | CL | 76.34 | 72.53 | 66.60 |
| TAX |
|
|
| |
|
| ||||
| SENet-154 [ | CL | 81.84 | 78.63 | 72.53 |
| TAX |
|
|
| |
Typical images in PlantCLEF 2017 testing set and predictions made by ResNet-50 trained by two different loss functions: cross-entropy loss (CL) and taxonomic loss (TAX).
| Loss function | Family | Genus | Species | ||
|---|---|---|---|---|---|
| a |
| GT |
|
|
|
| CL | Orchidaceae |
|
| ||
| TAX |
|
|
| ||
|
| |||||
| b |
| GT |
|
|
|
| CL | Fagaceae |
|
| ||
| TAX |
|
|
| ||
|
| |||||
| c |
| GT |
|
|
|
| CL | Rosaceae |
|
| ||
| TAX |
|
|
| ||
|
| |||||
| d |
| GT |
|
|
|
| CL | Leguminosae |
|
| ||
| TAX |
|
|
| ||
Bold values indicate the ground truth (GT) and the correct predictions.
Accuracies of two neural networks trained by taxonomic loss with different taxonomic hierarchy.
| Dataset | Neural network | Taxonomic hierarchy | Accuracy (%) | ||
|---|---|---|---|---|---|
| Family | Genus | Species | |||
| PlantCLEF 2015 | Inception-ResNet-v2 [ | F-G-S | 83.36 | 76.85 | 70.38 |
| F-S | 82.04 | 76.10 | 69.36 | ||
| G-S | 81.48 | 75.82 | 68.94 | ||
| S | 80.66 | 74.57 | 67.93 | ||
|
| |||||
| PlantCLEF 2017 | ShuffleNet-v2 [ | F-G-S | 66.13 | 60.73 | 53.12 |
| F-S | 64.08 | 57.71 | 50.03 | ||
| G-S | 64.26 | 59.11 | 51.64 | ||
| S | 61.94 | 57.12 | 49.96 | ||