| Literature DB >> 35494795 |
Tingyi Zheng1,2, Yilin Zhang3, Yuhang Wang4.
Abstract
Multi-view clustering (MVC) is a mainstream task that aims to divide objects into meaningful groups from different perspectives. The quality of data representation is the key issue in MVC. A comprehensive meaningful data representation should be with the discriminant characteristics in a single view and the correlation of multiple views. Considering this, a novel framework called Dynamic Guided Metric Representation Learning for Multi-View Clustering (DGMRL-MVC) is proposed in this paper, which can cluster multi-view data in a learned latent discriminated embedding space. Specifically, in the framework, the data representation can be enhanced by multi-steps. Firstly, the class separability is enforced with Fisher Discriminant Analysis (FDA) within each single view, while the consistence among different views is enhanced based on Hilbert-Schmidt independence criteria (HSIC). Then, the 1st enhanced representation is obtained. In the second step, a dynamic routing mechanism is introduced, in which the location or direction information is added to fulfil the expression. After that, a generalized canonical correlation analysis (GCCA) model is used to get the final ultimate common discriminated representation. The learned fusion representation can substantially improve multi-view clustering performance. Experiments validated the effectiveness of the proposed method for clustering tasks. ©2022 Zheng et al.Entities:
Keywords: Dynamic routing; Fisher discriminant analysis; Generalized canonical correlation analysis; Guided metric representation learning; Hilbert-Schmidt independence criteria; Multi-view clustering
Year: 2022 PMID: 35494795 PMCID: PMC9044235 DOI: 10.7717/peerj-cs.922
Source DB: PubMed Journal: PeerJ Comput Sci ISSN: 2376-5992
Figure 1Illustration of the proposed DGMRL-MVC model.
The differences among baselines.
| Method | Type | Dynamic guided representation | FDA-HSIC | Optimization |
|---|---|---|---|---|
| DCCA | Deep, 2-view | × | × |
|
| DGCCA | Deep, multi-view | × | × |
|
| FISH-MML | Deep, multi-view | × | ✓ |
|
| Dynamic guided representation-MVC | Deep, multi-view | ✓ | × |
|
| DGMRL-MVC | Deep, multi-view | ✓ | ✓ |
|
Improvement performance of dynamic guided representation on different views.
The best results are highlighted in bold.
| Dataset | Method | Number of views | Precision | Recall | F1 | RI | NMI |
|---|---|---|---|---|---|---|---|
| Handwritten | FISH-MML | 6 (full views) | 0.09961 | 0.11099 | 0.10493 | 0.11100 | 0.09643 |
| DGMRL-MVC |
|
|
|
|
| ||
| FISH-MML | 2 views | 0.17843 | 0.20600 | 0.18586 | 0.20600 | 0.21906 | |
| DGMRL-MVC |
|
|
|
|
| ||
| FISH-MML | 3 views | 0.09092 | 0.10100 | 0.09569 | 0.10100 | 0.25037 | |
| DGMRL-MVC |
|
|
|
|
| ||
| Wikipedia | FISH-MML | 2 (full views) | 0.08957 | 0.10149 | 0.09313 | 0.10032 | 0.51780 |
| DGMRL-MVC |
|
|
|
|
| ||
| Football | FISH-MML | 9 (full views) | 0.05679 | 0.05885 | 0.01761 | 0.06048 | 0.22956 |
| DGMRL-MVC |
|
|
|
|
| ||
| FISH-MML | 6 views |
| 0.04903 | 0.03745 | 0.05645 |
| |
| DGMRL-MVC | 0.05619 |
|
|
| 0.27343 | ||
| Pascal | FISH-MML | 6 (full views) | 0.07616 | 0.06700 | 0.06606 | 0.06700 | 0.23351 |
| DGMRL-MVC |
|
|
|
|
| ||
| FISH-MML | 2 views | 0.06266 | 0.05600 | 0.05094 | 0.05600 | 0.18011 | |
| DGMRL-MVC |
|
|
|
|
|
Comparison with latest methods with multiple views.
The best results are highlighted in bold.
| Dataset | Method | Number of views | Precision | Recall | F1 | RI | NMI |
|---|---|---|---|---|---|---|---|
| Handwritten | DCCA | 6 (full views) | 0.10111 | 0.10000 | 0.10047 | 0.10000 |
|
| DGCCA | 0.08222 | 0.05400 | 0.06244 | 0.10800 | 0.54150 | ||
| FISH-MML | 0.09961 | 0.11099 | 0.10493 | 0.11100 | 0.09643 | ||
| Dynamic guided representation-MVC | 0.09835 | 0.11650 | 0.10402 | 0.11650 | 0.52245 | ||
| DGMRL-MVC |
|
|
|
| 0.11521 | ||
| Wikipedia | DCCA | 2 (full views) | 0.09245 | 0.08467 | 0.08289 | 0.09090 | 0.50819 |
| DGCCA | 0.10329 | 0.10567 | 0.10267 | 0.10400 | 0.54222 | ||
| FISH-MML | 0.08957 | 0.10149 | 0.09313 | 0.10032 | 0.51780 | ||
| Dynamic guided representation-MVC | 0.16876 | 0.14224 | 0.13550 | 0.17027 | 0.59470 | ||
| DGMRL-MVC |
|
|
|
|
| ||
| Football | DCCA | 9 (full views) | 0.05625 | 0.04909 | 0.05200 | 0.05241 | 0.21037 |
| DGCCA | 0.01404 | 0.01885 | 0.01603 | 0.02016 | 0.21256 | ||
| FISH-MML | 0.05679 | 0.05885 | 0.01761 | 0.06048 | 0.22956 | ||
| Dynamic guided representation-MVC |
| 0.06449 | 0.05430 | 0.06451 | 0.26470 | ||
| DGMRL-MVC | 0.07540 |
|
|
|
| ||
| Pascal | DCCA | 6 (full views) | 0.02123 | 0.04200 | 0.02814 | 0.04200 | 0.03363 |
| DGCCA | 0.04716 | 0.06600 | 0.05351 | 0.06600 | 0.09261 | ||
| FISH-MML | 0.07616 | 0.06700 | 0.06606 | 0.06700 | 0.23351 | ||
| Dynamic guided representation-MVC | 0.06619 | 0.07200 | 0.04782 | 0.07200 | 0.15842 | ||
| DGMRL-MVC |
|
|
|
|
|
Visualization of clustering result with FISH-MML on the Pascal dataset.
The best results are highlighted in bold.
|
|
Visualization of clustering result with DGMRL-MVC (ours) on the Pascal dataset.
The best results are highlighted in bold.
|
|
Figure 2Parameter analysis on n_clusters in terms of Silhouette_score on the Handwritten dataset.
Figure 3Parameter analysis on n_clusters in terms of Silhouette_score on the Wikipedia dataset.
Figure 4Parameter analysis on n_clusters in terms of Silhouette_score on the Football dataset.
Figure 5Parameter analysis on n_clusters in terms of Silhouette_score on the Pascal dataset.
Figure 6Ablation experiments on four multi-view datasets.