| Literature DB >> 35027917 |
Dongzi Yang1,2, Fengcheng Wang1,2, Yuqi Hu1,2, Yubin Lan1,2,3,4, Xiaoling Deng1,2,3,4.
Abstract
Citrus Huanglongbing (HLB), also named citrus greening disease, occurs worldwide and is known as a citrus cancer without an effective treatment. The symptoms of HLB are similar to those of nutritional deficiency or other disease. The methods based on single-source information, such as RGB images or hyperspectral data, are not able to achieve great detection performance. In this study, a multi-modal feature fusion network, combining a RGB image network and hyperspectral band extraction network, was proposed to recognize HLB from four categories (HLB, suspected HLB, Zn-deficient, and healthy). Three contributions including a dimension-reduction scheme for hyperspectral data based on a soft attention mechanism, a feature fusion proposal based on a bilinear fusion method, and auxiliary classifiers to extract more useful information are introduced in this manuscript. The multi-modal feature fusion network can effectively classify the above four types of citrus leaves and is better than single-modal classifiers. In experiments, the highest accuracy of multi-modal network recognition was 97.89% when the amount of data was not very abundant (1,325 images of the four aforementioned types and 1,325 pieces of hyperspectral data), while the single-modal network with RGB images only achieved 87.98% recognition and the single-modal network using hyperspectral information only 89%. Results show that the proposed multi-modal network implementing the concept of multi-source information fusion provides a better way to detect citrus HLB and citrus deficiency.Entities:
Keywords: citrus greening disease; convolutional neural network; hyperspectral images; machine learning; multi-modal feature fusion
Year: 2021 PMID: 35027917 PMCID: PMC8751206 DOI: 10.3389/fpls.2021.809506
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
FIGURE 1Dataset collection location.
FIGURE 2Four categories leaves.
FIGURE 3Data collection equipment. (A) RGB image capture equipment. (B) Hyperspectral image capture equipment.
FIGURE 4Feature area selection during processing in Envi software.
Four different types of data and amounts of each.
| Species | Number of images | Number of spectral images |
| Healthy | 300 | 300 |
| Zn-deficient | 350 | 350 |
HLB, Citrus Huanglongbing.
FIGURE 5Multi-modal network structure.
Single-network classification and multi-modal network classification accuracy.
| Sample | Model | Accuracy (%) |
| ResNet50 | 85 | |
| RGB image + hyperspectral data | Multi-modal network M1 | 96 |
*M1, ResNet50+hyperspectral feature extraction network; M2, VGG16+hyperspectral feature extraction network; M3, ResNeXt101+hyperspectral feature extraction network.
FIGURE 6Hyperspectral band feature extraction network.
FIGURE 7Change with epoch of loss and accuracy of three feature fusion methods used in present. (A) Work Acc-Epochs. (B) Loss-Epochs.
FIGURE 8Loss calculation method based on auxiliary and mixture loss.
Experimental environment.
| Hardware | Brand | Number |
| CPU | I7–10700 | 1 |
| Main board | Dell Precision 3640 tower | 1 |
Four classification results of multi-modal network M1.
| Type | Precision (%) | Recall (%) | F1 score (%) |
| HLB | 96 | 94 | 95 |
| Health | 98 | 99 | 98 |
FIGURE 9Change with epoch of loss and accuracy of different networks in training process. (A) Loss-Epochs curve. (B) Acc-Epochs curve.
FIGURE 10Confusion matrix of the three models. (A) Multi-modal network. (B) RGB image network. (C) Hyperspectral band network.