| Literature DB >> 35634050 |
Junying Gan1, Bicheng Wu1, Qi Zou1, Zexin Zheng1, Chaoyun Mai1, Yikui Zhai1, Guohui He1, Zhenfeng Bai1.
Abstract
Datasets usually suffer from supervised information missing and weak generalization ability in deep convolution neural network. In this paper, pseudolabel (PL) of Weakly Supervised Learning (WSL) was used to address the problem of supervised information missing, while Cross Network (CN) of Multitask Learning (MTL) was used to solve the problem of weak generalization ability in deep convolution neural network. In PL, the data of supervised information missing was predicted; thus, PL of the corresponding data was generated. In CN, PL data and labeled data were taken as two tasks to train together. Firstly, the labeled data was divided into training dataset and testing dataset, respectively, and image preprocessing was carried out. Secondly, the network was initialized and trained, and the model with high accuracy and good generalization was selected as the optimal model. Then, the optimal model was used to predict the unlabeled data and generate PL. Finally, the steps above were repeated several times to find a better optimal model. In the experiments of the fusion model of PL and CN, Facial Beauty Prediction was regarded as main task and the others as auxiliary tasks. Experimental results show that the model was suitable for multitask training of different tasks in different or similar datasets, and the accuracy of the main task of Facial Beauty Prediction reaches 64.76%, higher than the highest accuracy by conventional methods.Entities:
Mesh:
Year: 2022 PMID: 35634050 PMCID: PMC9135551 DOI: 10.1155/2022/9986611
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1Block diagram of fusion model of PL and CN.
Figure 2Block diagram of data preprocessing.
Figure 3Block diagram of PL data generation.
Figure 4Block diagram of data cotraining.
Figure 5Fusion model of PL and CN structure diagram.
Experimental results of PL generation part.
| Hyperparameters | Model | ||||
|---|---|---|---|---|---|
| VGG16 [ | ResNet50 [ | Transfer network | |||
| VGG16 (%) | ResNet50 (%) | ||||
| Resolution | 32 × 32 | 53.06 | 48.34 | 53.56 | 53.16 |
| 64 × 64 | 53.36 | 53.36 | 53.51 | 54.57 | |
| 72 × 72 | 54.02 | 53.71 | 53.71 | 52.46 | |
| 96 × 96 | 53.16 | 54.27 | 55.12 | 54.72 | |
| 100 × 100 | 58.84 | 55.47 | 54.82 | 54.92 | |
|
|
|
|
|
| |
|
| |||||
| Learning rate | 0.001 | 53.51 | 53.97 | 53.16 | 52.06 |
|
|
|
|
|
| |
| 0.01 | 53.12 | 54.02 | 54.07 | 53.66 | |
| 0.03 | 56.12 | 52.51 | 54.47 | 53.82 | |
| 0.1 | 44.28 | 54.07 | 53.56 | 53.31 | |
| 0.3 | 44.18 | 44.28 | 44.78 | 44.43 | |
Better results in terms of recognition rates were highlighted in bold.
Experimental results of cotraining part.
| Hyperparameters | Model | ||||||
|---|---|---|---|---|---|---|---|
| VGG16 [ | ResNet50 [ | Transfer network | Multitask network | ||||
| VGG16 (%) | ResNet50 (%) | VGG16 (%) | ResNet50 (%) | ||||
| Revolution | 32 × 32 | 56.04 | 52.45 | 53.15 | 54.55 | 60.94 | 60.29 |
| 64 × 64 | 55.36 | 53.36 | 55.44 | 54.34 | 61.10 | 60.80 | |
| 72 × 72 | 56.37 | 54.71 | 53.71 | 54.59 | 60.25 | 60.10 | |
| 96 × 96 | 55.22 | 54.20 | 55.12 | 55.42 | 60.94 | 60.60 | |
|
|
|
|
|
| 60.10 | 57.43 | |
|
| 59.66 | 56.65 | 55.87 | 54.86 |
|
| |
|
| |||||||
| Learning rate |
| 56.74 | 55.56 | 55.11 | 54.27 |
|
|
|
|
|
|
|
| 60.10 | 57.43 | |
| 0.01 | 56.07 | 55.05 | 56.37 | 55.74 | 55.82 | 59.74 | |
| 0.03 | 57.42 | 54.49 | 56.65 | 56.57 | 54.02 | 50.64 | |
| 0.1 | 46.18 | 55.32 | 55.44 | 55.37 | 48.36 | 48.44 | |
| 0.3 | 48.23 | 46.53 | 47.61 | 45.46 | 47.98 | 45.55 | |
Better results in terms of recognition rates were highlighted in bold.
Experimental results of multitask and single task.
| ( | Multitask | Multitask | Single task | ||
|---|---|---|---|---|---|
| Facial beauty prediction (%) | Gender recognition (%) | Facial beauty prediction (%) | Expression recognition (%) | Facial beauty prediction (%) | |
| (0, 1, 1, 0) | 60.94 | 97.64 | 63.35 | 23.99 |
|
| (0.1, 0.9, 0.9, 0.1) | 62.85 | 96.91 | 62.55 | 45.91 | |
| (0.2, 0.8, 0.8, 0.2) |
| 95.64 | 62.45 | 49.85 | |
| (0.3, 0.7, 0.7, 0.3) | 62.65 | 97.82 | 62.95 | 38.95 | |
| (0.4, 0.6, 0.6, 0.4) | — | 55.82 | 61.75 | 44.85 | |
| (0.5, 0.5, 0.5, 0.5) | 61.65 | 95.45 | 62.75 | 32.80 | |
| (0.6, 0.4, 0.4, 0.6) | 62.35 | 94.73 | 62.65 | 45.67 | |
| (0.7, 0.3, 0.3, 0.7) | 61.55 | 95.82 | 63.45 | 46.21 | |
| (0.8, 0.2, 0.2, 0.8) | 62.85 | 96.55 | 61.35 | 48.91 | |
| (0.9, 0.1, 0.1, 0.9) | 63.45 | 96.55 | 62.35 | 47.94 | |
| (1, 0, 0, 1) | 59.35 | 97.45 | 63.05 | 23.99 | |
Better results in terms of recognition rates were highlighted in bold.
Experimental results of regular hyperparameters α values.
| ( | VGG16 of CN [ | ResNet50 of CN [ | Single-task | ||
|---|---|---|---|---|---|
| Facial Beauty dataset (%) | FaceShape dataset (%) | Facial Beauty dataset (%) | FaceShape dataset (%) | Facial Beauty Prediction (%) | |
| (0, 1, 1, 0) | — | 56.65 | — | 57.40 |
|
| ( | 62.35 | 58.91 |
| 56.35 | |
| (0.2, 0.8, 0.8, 0.2) | 62.25 | 57.65 | 61.70 | 57.20 | |
| (0.3, 0.7, 0.7, 0.3) | 62.25 | 56.85 | 62.60 | 57.45 | |
| (0.4, 0.6, 0.6, 0.4) | 62.10 | 56.80 | 62.40 | 57.40 | |
| (0.5, 0.5, 0.5, 0.5) | 61.80 | 57.05 | 61.55 | 56.80 | |
| (0.6, 0.4, 0.4, 0.6) | 62.30 | 57.35 | 62.50 | 57.25 | |
| (0.7, 0.3, 0.3, 0.7) | 62.40 | 57.60 | 62.05 | 57.90 | |
| (0.8, 0.2, 0.2, 0.8) | 62.30 | 57.45 | 62.90 | 58.71 | |
| (0.9, 0.1, 0.1, 0.9) | 62.80 | 57.10 | 62.70 | 58.20 | |
| (1, 0, 0, 1) | 61.19 | 57.20 | — | 57.95 | |
Better results in terms of recognition rates were highlighted in bold.
Experimental results of irregular hyperparameters α values.
| ( | Facial Beauty Prediction (%) | ( | Facial Beauty Prediction (%) | ( | Facial Beauty Prediction (%) |
|---|---|---|---|---|---|
| (0.1, 0.1, 0.1, 0.1) | 62.55 | (0.4, 0.1, 0.1, 0.4) | 62.55 | (0.7, 0.1, 0.1, 0.7) | 63.21 |
| (0.1, 0.2, 0.2, 0.1) | 62.45 | ( |
| (0.7, 0.2, 0.2, 0.7) | 62.70 |
| (0.1, 0.3, 0.3, 0.1) | 62.25 | ( |
| (0.7, 0.3, 0.3, 0.7) | 63.16 |
| (0.1, 0.4, 0.4, 0.1) | 63.10 | ( |
| (0.7, 0.4, 0.4, 0.7) | 62.60 |
| (0.1, 0.5, 0.5, 0.1) | 64.66 | ( |
| (0.7, 0.5, 0.5, 0.7) | 62.65 |
| (0.1, 0.6, 0.6, 0.1) | 62.95 | (0.4, 0.6, 0.6, 0.4) | 62.35 | (0.7, 0.6, 0.6, 0.7) | 62.00 |
| (0.1, 0.7, 0.7, 0.1) | 62.55 | (0.4, 0.7, 0.7, 0.4) | 62.60 | (0.7, 0.7, 0.7, 0.7) | 60.29 |
| (0.1, 0.8, 0.8, 0.1) | 62.60 | (0.4, 0.8, 0.8, 0.4) | — | (0.7, 0.8, 0.8, 0.7) | — |
| (0.1, 0.9, 0.9, 0.1) | 63.15 | (0.4, 0.9, 0.9, 0.4) | 62.10 | (0.7, 0.9, 0.9, 0.7) | — |
| (0.2, 0.1, 0.1, 0.2) | 62.15 | (0.5, 0.1, 0.1, 0.5) | 63.15 | (0.8, 0.1, 0.1, 0.8) | 62.50 |
| (0.2, 0.2, 0.2, 0.2) | 62.55 | (0.5, 0.2, 0.2, 0.5) | 63.56 | (0.8, 0.2, 0.2, 0.8) | 62.55 |
| (0.2, 0.3, 0.3, 0.2) | 62.42 | (0.5, 0.3, 0.3, 0.5) | 63.00 | (0.8, 0.3, 0.3, 0.8) | 63.20 |
| (0.2, 0.4, 0.4, 0.2) | 62.85 | (0.5, 0.4, 0.4, 0.5) | 61.50 | (0.8, 0.4, 0.4, 0.8) | 63.30 |
| (0.2, 0.5, 0.5, 0.2) | 62.70 | (0.5, 0.5, 0.5, 0.5) | 62.20 | (0.8, 0.5, 0.5, 0.8) | — |
| (0.2, 0.6, 0.6, 0.2) | 62.28 | (0.5, 0.6, 0.6, 0.5) | 63.15 | (0.8, 0.6, 0.6, 0.8) | 60.74 |
| (0.2, 0.7, 0.7, 0.2) | 61.10 | (0.5, 0.7, 0.7, 0.5) | 62.80 | (0.8, 0.7, 0.7, 0.8) | 63.38 |
| (0.2, 0.8, 0.8, 0.2) | 62.55 | (0.5, 0.8, 0.8, 0.5) | 62.85 | (0.8, 0.8, 0.8, 0.8) | 62.35 |
| (0.2, 0.9, 0.9, 0.2) | 62.70 | (0.5, 0.9, 0.9, 0.5) | 61.24 | (0.8, 0.9, 0.9, 0.8) | 62.80 |
| (0.3, 0.1, 0.1, 0.3) | 61.55 | (0.6, 0.1, 0.1, 0.6) | 63.05 | (0.9, 0.1, 0.1, 0.9) | 63.10 |
| (0.3, 0.2, 0.2, 0.3) | 62.80 | (0.6, 0.2, 0.2, 0.6) | 62.75 | (0.9, 0.2, 0.2, 0.9) | 63.00 |
| (0.3, 0.3, 0.3, 0.3) | 62.40 | (0.6, 0.3, 0.3, 0.6) | 63.26 | (0.9, 0.3, 0.3, 0.9) | 62.40 |
| (0.3, 0.4, 0.4, 0.3) | 62.95 | (0.6, 0.4, 0.4, 0.6) | 62.60 | (0.9, 0.4, 0.4, 0.9) | 62.20 |
| (0.3, 0.5, 0.5, 0.3) | 63.76 | (0.6, 0.5, 0.5, 0.6) | 63.20 | (0.9, 0.5, 0.5, 0.9) | 63.71 |
| (0.3, 0.6, 0.6, 0.3) | 63.56 | (0.6, 0.6, 0.6, 0.6) | 62.68 | (0.9, 0.6, 0.6, 0.9) | — |
| (0.3, 0.7, 0.7, 0.3) | 62.32 | (0.6, 0.7, 0.7, 0.6) | 61.60 | (0.9, 0.7, 0.7, 0.9) | — |
| (0.3, 0.8, 0.8, 0.3) | 62.45 | (0.6, 0.8, 0.8, 0.6) | — | (0.9, 0.8, 0.8, 0.9) | 62.39 |
| (0.3, 0.9, 0.9, 0.3) | 62.30 | (0.6, 0.9, 0.9, 0.6) | — | (0.9, 0.9, 0.9, 0.9) | — |
Better results in terms of recognition rates were highlighted in bold.