| Literature DB >> 34335711 |
Jifeng Guo1, Wenbo Sun1, Zhiqi Pang1, Yuxiao Fei1, Yu Chen1.
Abstract
The current unsupervised domain adaptation person re-identification (re-ID) method aims to solve the domain shift problem and applies prior knowledge learned from labelled data in the source domain to unlabelled data in the target domain for person re-ID. At present, the unsupervised domain adaptation person re-ID method based on pseudolabels has obtained state-of-the-art performance. This method obtains pseudolabels via a clustering algorithm and uses these pseudolabels to optimize a CNN model. Although it achieves optimal performance, the model cannot be further optimized due to the existence of noisy labels in the clustering process. In this paper, we propose a stable median centre clustering (SMCC) for the unsupervised domain adaptation person re-ID method. SMCC adaptively mines credible samples for optimization purposes and reduces the impact of label noise and outliers on training to improve the performance of the resulting model. In particular, we use the intracluster distance confidence measure of the sample and its K-reciprocal nearest neighbour cluster proportion in the clustering process to select credible samples and assign different weights according to the intracluster sample distance confidence of samples to measure the distances between different clusters, thereby making the clustering results more robust. The experiments show that our SMCC method can select credible and stable samples for training and improve performance of the unsupervised domain adaptation model. Our code is available at https://github.com/sunburst792/SMCC-method/tree/master.Entities:
Mesh:
Year: 2021 PMID: 34335711 PMCID: PMC8321743 DOI: 10.1155/2021/2883559
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1Iterative adjustment process for credible sample selection. As the number of iterations increases, the number of credible samples gradually increases. The distances between samples belonging to the same cluster in the feature space are smaller, and the distances between samples belonging to different clusters are larger. Finally, the credible samples are used for training to prevent the influence of label noise on the model.
Figure 2Overview of the SMCC method. First, we use SPGAN [51] to transfer the image style of the source domain to images with styles similar to the target domain style while preserving the person identity, and we pretrain the CNN model on these data. After that, HDBSCAN [56] is performed on the feature vectors obtained by feature extraction based on the target domain data of the CNN model. Due to the noise in the labels, we select credible samples according to the IDC and KCP of the samples to optimize the CNN model by the batch hard triple loss [57] and softmax cross-entropy loss.
Algorithm 1Stable median centre clustering.
Ablation studies regarding SMCC on Market-1501. Supervised model: the re-ID model for which training and testing are conducted in the target domain. Direct transfer: the re-ID model pretrained in the source domain is directly transferred to the target domain. Baseline: the re-ID model uses SPGAN to conduct style transfer on the images for pretraining and then performs the HDBSCAN method. IDC : the intracluster distance confidence. KCP : the K-reciprocal nearest neighbour cluster proportion.
| Method | DukeMTMC-reID⟶Market-1501 | |||
|---|---|---|---|---|
| Rank-1 | Rank-5 | Rank-10 | mAP | |
| Supervised model | 92.0 | 97.4 | 98.4 | 80.9 |
| Direct transfer | 48.7 | 69.5 | 76.9 | 25.1 |
| Baseline | 77.8 | 85.7 | 89.1 | 58.3 |
| Baseline + IDC | 83.1 | 92.2 | 95.5 | 62.9 |
| Baseline + KCP | 81.9 | 91.8 | 93.9 | 61.3 |
| Baseline + IDC + KCP | 86.5 | 94.6 | 96.7 | 70.2 |
Ablation studies regarding SMCC on DukeMTMC-reID.
| Method | Market-1501⟶DukeMTMC-reID | |||
|---|---|---|---|---|
| Rank-1 | Rank-5 | Rank-10 | mAP | |
| Supervised model | 82.6 | 92.1 | 94.6 | 70.2 |
| Direct transfer | 29.7 | 44.1 | 50.4 | 16.2 |
| Baseline | 72.6 | 82.0 | 85.3 | 55.4 |
| Baseline + IDC | 76.2 | 84.9 | 87.7 | 59.1 |
| Baseline + KCP | 75.3 | 83.5 | 87.0 | 58.2 |
| Baseline + IDC + KCP | 79.1 | 86.8 | 89.1 | 63.4 |
Figure 3The CMC and accuracy score of the model under different KCP thresholds. (a) Market-1051 dataset. (b) DukeMTMC-reID dataset.
Performance of different clustering algorithms on the Market1501 dataset.
| Clustering method | DukeMTMC-reID⟶Market-1501 | |||
|---|---|---|---|---|
| mAP | Rank-1 | Rank-5 | Rank-10 | |
|
| 63.1 | 81.3 | 88.0 | 90.9 |
| CURE [ | 68.4 | 84.9 | 93.1 | 94.7 |
| BRICH [ | 68.9 | 84.0 | 92.4 | 93.1 |
| DBSCAN [ | 69.5 | 85.6 | 93.3 | 95.6 |
| Ours | 70.2 | 86.5 | 94.6 | 96.7 |
Performance of different clustering algorithms on the DukeMTMC-reID dataset.
| Clustering method | Market-1501⟶DukeMTMC-reID | |||
|---|---|---|---|---|
| mAP | Rank-1 | Rank-5 | Rank-10 | |
|
| 58.4 | 74.1 | 82.3 | 84.9 |
| CURE [ | 61.7 | 77.6 | 84.4 | 87.2 |
| BRICH [ | 61.4 | 77.3 | 84.0 | 86.6 |
| DBSCAN [ | 62.1 | 78.0 | 85.2 | 88.1 |
| Ours | 63.4 | 79.1 | 86.8 | 89.1 |
The comparison of our proposed SMCC method with the state-of-the-art method on Market-1501.
| Method | DukeMTMC-reID⟶Market-1501 | |||
|---|---|---|---|---|
| mAP | Rank-1 | Rank-5 | Rank-10 | |
| PTGAN [ | — | 38.6 | — | 66.1 |
| SPGAN [ | 22.8 | 51.5 | 70.1 | 76.8 |
| HHL [ | 31.4 | 62.2 | 78.8 | 84.0 |
| CR-GAN [ | 54.0 | 77.7 | 89.7 | 92.7 |
| EANet [ | 51.6 | 78.0 | — | — |
| CAMEL [ | 26.3 | 54.5 | — | — |
| PAUL [ | 40.1 | 68.5 | 82.4 | 87.4 |
| UDAP [ | 53.7 | 75.8 | 89.5 | 93.2 |
| PAST [ | 54.6 | 78.4 | — | — |
| SSG [ | 58.3 | 80.0 | 90.0 | 92.4 |
| AD-cluster [ | 68.3 | 86.7 | 94.4 | 96.5 |
| SSG++ [ | 68.7 | 86.2 | 94.6 | 96.5 |
| Ours | 70.2 | 86.5 | 94.6 | 96.7 |
The comparison of our proposed SMCC method with the state-of-the-art method on DukeMTMC-reID.
| Method | Market-1501⟶DukeMTMC-reID | |||
|---|---|---|---|---|
| mAP | Rank-1 | Rank-5 | Rank-10 | |
| PTGAN [ | — | 27.4 | — | 50.7 |
| SPGAN [ | 22.3 | 41.1 | 56.6 | 63.0 |
| HHL [ | 27.2 | 46.9 | 61.0 | 66.7 |
| CR-GAN [ | 48.6 | 68.9 | 80.2 | 84.7 |
| EANet [ | 48.0 | 78.0 | — | — |
| CAMEL [ | — | — | — | — |
| PAUL [ | 53.2 | 72.0 | 82.7 | 86.0 |
| UDAP [ | 49.0 | 68.4 | 80.1 | 83.5 |
| PAST [ | 54.3 | 72.4 | — | — |
| SSG [ | 53.4 | 73.0 | 80.6 | 83.2 |
| AD-cluster [ | 54.1 | 72.6 | 82.5 | 85.5 |
| SSG++ [ | 60.3 | 76.0 | 85.8 | 89.3 |
| Ours | 63.4 | 79.1 | 86.8 | 89.1 |