| Literature DB >> 36212336 |
Xing Sheng1,2, Fengyun Wang1, Huaijun Ruan1, Yangyang Fan1, Jiye Zheng1, Yangyang Zhang2, Chen Lyu2.
Abstract
Fruit tree diseases are one of the major agricultural disasters in China. With the popularity of smartphones, there is a trend to use mobile devices to identify agricultural pests and diseases. In order to identify leaf diseases of apples more easily and efficiently, this paper proposes a cascade backbone network-based (CBNet) disease identification method to detect leaf diseases of apple trees in the field. The method first replaces traditional convolutional blocks with MobileViT-based convolutional blocks particularly for feature extraction. Compared with the traditional convolutional block, the MobileViT-based convolutional block is able to mine feature information in the image better. In order to refine the mined feature information, a feature refinement module is proposed in this paper. At the same time, this paper proposes a cascaded backbone network for effective fusion of features using a pyramidal cascaded multiplication operation. The results conducted on field datasets collected using mobile devices showed that the network proposed in this paper can achieve 96.76% accuracy and 96.71% F1-score. To the best of our knowledge, this paper is the first to introduce Transformer into apple leaf disease identification, and the results are promising.Entities:
Keywords: Transformer; applet; cascade backbone network; cascade decoder; disease classification
Year: 2022 PMID: 36212336 PMCID: PMC9539913 DOI: 10.3389/fpls.2022.994227
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
Figure 1Status and classification of the dataset.
Figure 2Schematic diagram of data enhancement. (A) Changing contrast. (B) Adding Gaussian noise. (C) Local zoom. (D) Mirroring operation. (E) Original image. (F) Random flip. (G) Random crop, (H) Increasing brightness. (I) Decreasing brightness.
Figure 3Architecture diagram of our proposed framework.
Figure 4The proposed Feature Refinement (FR) module.
Figure 5The proposed Global Context-aware Block (GCB) module.
Figure 6Introduction to the applet. From left to right, the main interface (A), the catalog page (B), the image selection page (C), and the recognition results display and prevention suggestions (D).
Figure 7Accuracy of the network vs. loss function line graph.
Comparison of the classification results of our model and other models (bold represents the model with the best results).
|
|
|
|
|
|---|---|---|---|
| CNN (Ashqar and Abu-Naser, | 80.6 | 81.2 | 80.6 |
| MobileNetV1 (Howard et al., | 63.43 | 64.2 | 63.43 |
| MobileNetV2 (Sandler et al., | 85.64 | 85.62 | 85.64 |
| ShuffleNetV1 (Zhang et al., | 91.05 | 90.89 | 91.05 |
| ShuffleNetV2 (Ma N. et al., | 64.93 | 64.88 | 64.93 |
| SENet-16 (Xing et al., | 91.04 | 91.02 | 91.04 |
| VGG-16 (Simonyan and Zisserman, | 96.27 |
| 96.27 |
| NIN-16 (Xing et al., | 85.07 | 85.06 | 85.07 |
| WDenseNet (Xing et al., | 89.55 | 89.24 | 89.55 |
| SSCNN (Barman et al., | 88.06 | 87.98 | 88.06 |
| DCNN (Ma J. et al., | 73.88 | 73.75 | 73.88 |
| CBNet |
| 96.71 |
|
Figure 8Histogram comparing CBNet with other methods.
Comparison of the classification results of our model and other models (bold represents the model with the best results).
|
|
|
|
|---|---|---|
| CNN (Ashqar and Abu-Naser, |
| 18.16 |
| MobileNetV2 (Sandler et al., | 8.5 | 13.55 |
| MobileNetV1 (Howard et al., | 12.25 | 10.6 |
| ShuffleNetV1 (Zhang et al., | 20.79 | 17.43 |
| ShuffleNetV2 (Ma N. et al., | 25.82 |
|
| SENet-16 (Xing et al., | 31.01 | 8.2 |
| SSCNN (Barman et al., | 34.2 | 14.1 |
| DCNN (Ma J. et al., | 34.46 | 33.59 |
| WDenseNet (Xing et al., | 62.7 | 15.08 |
| NIN-16 (Xing et al., | 68.32 | 16.68 |
| VGG-16 (Simonyan and Zisserman, | 512.21 | 61.09 |
| CBNet | 10.78 | 12.18 |
Figure 9Confusion matrix diagram for CBNet.
Comparison of the effectiveness of each module of CBNet (bold represents the model with the best results).
|
|
|
|
|
|---|---|---|---|
| Baseline | 95.1 | 95.3 | 95.1 |
| +Transformer | 95.6 | 95.6 | 95.6 |
| +FR | 95.9 | 96.1 | 95.9 |
| +GCB | 96.3 | 96.2 | 96.3 |
| +Cascaded |
|
|
|
Figure 10Line graph of the validity analysis of the proposed model modules on the three quantitative indicators.