| Literature DB >> 33936189 |
Taohong Zhang1,2, Suli Fan1,2, Junnan Hu1,2, Xuxu Guo1,2, Qianqian Li1,2, Ying Zhang3, Aziguli Wulamu1,2.
Abstract
In this paper, a feature fusion method with guiding training (FGT-Net) is constructed to fuse image data and numerical data for some specific recognition tasks which cannot be classified accurately only according to images. The proposed structure is divided into the shared weight network part, the feature fused layer part, and the classification layer part. First, the guided training method is proposed to optimize the training process, the representative images and training images are input into the shared weight network to learn the ability that extracts the image features better, and then the image features and numerical features are fused together in the feature fused layer to input into the classification layer for the classification task. Experiments are carried out to verify the effectiveness of the proposed model. Loss is calculated by the output of both the shared weight network and classification layer. The results of experiments show that the proposed FGT-Net achieves the accuracy of 87.8%, which is 15% higher than the CNN model of ShuffleNetv2 (which can process image data only) and 9.8% higher than the DNN method (which processes structured data only).Entities:
Year: 2021 PMID: 33936189 PMCID: PMC8062192 DOI: 10.1155/2021/6647220
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1The image examples of Pomeranian (a), Japanese Spitz (b), and Samoyed (c).
Acronyms and full names in this paper.
| Acronyms | Full names |
|---|---|
| FGT-Net | Feature fusion network with guided training |
| CNN | Convolutional neural network |
| LRN | Local response normalization |
| DDE | Dynamic differential entropy |
| AHNF | Attribute heterogeneous network fusion |
| DFF-ADML | Deep feature fusion method based on adaptive discriminant metric learning |
| GAN | Generative adversarial network |
| MF-Net | Multifeature fusion neural network |
| SWN | Shared weight network |
Figure 2The framework of the FGT-Net model.
Figure 3The FGT-Net model used in test.
Figure 4Data examples in the dataset. (a) Japanese Spitz data example (class 0), (b) Pomeranian data example (class 1), (c) Samoyed data example (class 2), and (d) Husky data example (class 3).
Figure 5Training and verification accuracy with the number of iterations.
Figure 6Training and verification loss with the number of iterations.
The accuracy and other performance evaluation indexes of Experiment 1 and Experiment 2 of guided training.
| Experiment | Class | TP | FP | FN | Precision | Recall | F1-score | Accuracy |
|---|---|---|---|---|---|---|---|---|
| Experiment 1 | Japanese Spitz | 10 | 9 | 36 | 0.526 | 0.217 | 0.308 | 0.707 |
| Pomeranian | 9 | 4 | 18 | 0.692 | 0.333 | 0.450 | ||
| Samoyed | 69 | 49 | 15 | 0.585 | 0.821 | 0.683 | ||
| Husky | 86 | 10 | 3 | 0.896 | 0.966 | 0.930 | ||
|
| ||||||||
| Experiment 2 | Japanese Spitz | 18 | 9 | 28 | 0.667 | 0.391 | 0.493 | 0.728 |
| Pomeranian | 15 | 8 | 12 | 0.652 | 0.556 | 0.600 | ||
| Samoyed | 63 | 38 | 21 | 0.623 | 0.750 | 0.681 | ||
| Husky | 83 | 12 | 6 | 0.873 | 0.933 | 0.902 | ||
The accuracy and other performance evaluation indexes of models with fused data and separate data.
| Model | Class | TP | FP | FN | Precision | Recall | F1-score | Accuracy |
|---|---|---|---|---|---|---|---|---|
| FGT-Net (fused data) | Japanese Spitz | 40 | 6 | 6 | 0.870 | 0.870 | 0.870 | 0.878 |
| Pomeranian | 27 | 6 | 0 | 0.818 | 1.000 | 0.900 | ||
| Samoyed | 71 | 9 | 13 | 0.888 | 0.845 | 0.866 | ||
| Husky | 78 | 9 | 11 | 0.897 | 0.876 | 0.886 | ||
|
| ||||||||
| CNN (ShuffleNetv2) (only image data) | Japanese Spitz | 18 | 9 | 28 | 0.667 | 0.391 | 0.493 | 0.728 |
| Pomeranian | 15 | 8 | 12 | 0.652 | 0.556 | 0.600 | ||
| Samoyed | 63 | 38 | 21 | 0.623 | 0.750 | 0.681 | ||
| Husky | 83 | 12 | 6 | 0.873 | 0.933 | 0.902 | ||
|
| ||||||||
| DNN (only structured data) | Japanese Spitz | 45 | 0 | 1 | 1.000 | 0.978 | 0.989 | 0.780 |
| Pomeranian | 27 | 1 | 0 | 0.964 | 1.000 | 0.982 | ||
| Samoyed | 46 | 15 | 38 | 0.754 | 0.548 | 0.634 | ||
| Husky | 74 | 38 | 15 | 0.661 | 0.831 | 0.736 | ||
Accuracy comparison with other advanced models.
| Model | FGT-Net | AlexNet | VGG16 | ResNet50 |
|---|---|---|---|---|
| Accuracy |
| 0.728 | 0.691 | 0.675 |
Time comparison with other advanced models.
| Model (s) | FGT-Net | AlexNet | ShuffleNetV2 | VGG16 | ResNet50 |
|---|---|---|---|---|---|
| Time | 10.39 | 10.57 |
| 11.66 | 10.91 |