| Literature DB >> 28542234 |
Wenzhang Zhuge1, Chenping Hou1, Yuanyuan Jiao2, Jia Yue3, Hong Tao1, Dongyun Yi1.
Abstract
In many computer vision and machine learning applications, the data sets distribute on certain low-dimensional subspaces. Subspace clustering is a powerful technology to find the underlying subspaces and cluster data points correctly. However, traditional subspace clustering methods can only be applied on data from one source, and how to extend these methods and enable the extensions to combine information from various data sources has become a hot area of research. Previous multi-view subspace methods aim to learn multiple subspace representation matrices simultaneously and these learning task for different views are treated equally. After obtaining representation matrices, they stack up the learned representation matrices as the common underlying subspace structure. However, for many problems, the importance of sources and the importance of features in one source both can be varied, which makes the previous approaches ineffective. In this paper, we propose a novel method called Robust Auto-weighted Multi-view Subspace Clustering (RAMSC). In our method, the weight for both the sources and features can be learned automatically via utilizing a novel trick and introducing a sparse norm. More importantly, the objective of our method is a common representation matrix which directly reflects the common underlying subspace structure. A new efficient algorithm is derived to solve the formulated objective with rigorous theoretical proof on its convergency. Extensive experimental results on five benchmark multi-view datasets well demonstrate that the proposed method consistently outperforms the state-of-the-art methods.Entities:
Mesh:
Year: 2017 PMID: 28542234 PMCID: PMC5441581 DOI: 10.1371/journal.pone.0176769
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Details of the multiview datasets used in our experiments (view type (dimensionality)).
| View type | MSRC-v1 | Caltech101-7 | Digit | NBA-NASCAR | WebKB |
|---|---|---|---|---|---|
| 1 | LBP (256) | LBP (256) | FOU (76) | Gray (1024) | Fulltext (2949) |
| 2 | HOG (100) | PHOG (680) | FAC (216) | TFIDF (296) | Inlinks (334) |
| 3 | GIST (512) | GIST (512) | KAR (64) | - | - |
| 4 | CENTRIST (1302) | Gabor (32) | PIX (240) | - | - |
| 5 | CMT (48) | SURF (200) | ZER (47) | - | - |
| 6 | SIFT (200) | SIFT (200) | MOR (6) | - | - |
| Date points | 210 | 441 | 2000 | 840 | 1051 |
| Classes | 7 | 7 | 10 | 2 | 2 |
Clustering results of different methods on MSRC-v1 data set. (mean(± std)).
(On the following five result tables, two best results of each metrics are bold).
| Method | ACC | NMI | Purity |
|---|---|---|---|
| SC(1) | 0.6022(±0.0510) | 0.4887(±0.0283) | 0.6336(±0.0382) |
| SC(2) | 0.5755(±0.0230) | 0.4936(±0.0262) | 0.5965(±0.0245) |
| SC(3) | 0.6547(±0.0369) | 0.5865(±0.0284) | 0.6877(±0.0377) |
| SC(4) | 0.7002(±0.0567) | 0.6064(±0.0330) | 0.7096(±0.0430) |
| SC(5) | 0.2873(±0.0124) | 0.1448(±0.0146) | 0.3174(±0.0122) |
| SC(6) | 0.5489(±0.0371) | 0.4538(±0.0311) | 0.5750(±0.0280) |
| SMR(1) | 0.6166(±0.0351) | 0.4853(±0.0307) | 0.6388(±0.0356) |
| SMR(2) | 0.6184(±0.0347) | 0.4958(±0.0242) | 0.6283(±0.0296) |
| SMR(3) | 0.7074(±0.0352) | 0.6444(±0.0248) | 0.7490(±0.0246) |
| SMR(4) | 0.7596(±0.0742) | 0.7106(±0.0373) | 0.7733(±0.0556) |
| SMR(5) | 0.4932(±0.0384) | 0.3765(±0.0291) | 0.5101(±0.0330) |
| SMR(6) | 0.5538(±0.0279) | 0.4605(±0.0259) | 0.5752(±0.0264) |
| SC-CON | 0.5983(±0.0337) | 0.4796(±0.0223) | 0.6192(±0.0287) |
| SMR-CON | 0.7338(±0.0495) | 0.6920(±0.0232) | 0.7661(±0.0326) |
| RMKMC | 0.6501(±0.0782) | 0.5700(±0.0537) | 0.6728(±0.0651) |
| PC-SPC | 0.7936(±0.0589) | 0.6965(±0.0278) | 0.8029(±0.0433) |
| CC-SPC | 0.8368(±0.0605) | 0.7799(±0.0306) | 0.8546(±0.0393) |
| MVSC | 0.7444(±0.0754) | 0.7076(±0.0507) | 0.7626(±0.0608) |
| DiMSC | 0.7759(±0.0462) | 0.6788(±0.0361) | 0.7825(±0.0382) |
| RAMSC | |||
| RAMSC( |
Clustering results of different methods on Caltech101-7 data set. (mean(± std)).
| Method | ACC | NMI | Purity |
|---|---|---|---|
| SC(1) | 0.4208(±0.0228) | 0.3188(±0.0253) | 0.5233(±0.0242) |
| SC(2) | 0.4699(±0.0288) | 0.3839(±0.0385) | 0.5653(±0.0299) |
| SC(3) | 0.6116(±0.0515) | 0.5248(±0.0490) | 0.6644(±0.0441) |
| SC(4) | 0.5275(±0.0342) | 0.4348(±0.0379) | 0.5746(±0.0276) |
| SC(5) | 0.6265(±0.0371) | 0.5787(±0.0211) | 0.7031(±0.0274) |
| SC(6) | 0.5208(±0.0225) | 0.4453(±0.0271) | 0.5965(±0.0218) |
| SMR(1) | 0.3288(±0.0197) | 0.1602(±0.0127) | 0.3936(±0.0097) |
| SMR(2) | 0.4946(±0.0303) | 0.4645(±0.0261) | 0.6009(±0.0303) |
| SMR(3) | 0.6874(±0.0336) | 0.6331(±0.0271) | 0.7639(±0.0336) |
| SMR(4) | 0.5498(±0.0218) | 0.4558(±0.0244) | 0.5837(±0.0221) |
| SMR(5) | 0.5212(±0.0587) | 0.4631(±0.0355) | 0.6095(±0.0357) |
| SMR(6) | 0.5191(±0.0229) | 0.4090(±0.0279) | 0.6089(±0.0274) |
| SC-CON | 0.4303(±0.0258) | 0.3230(±0.0284) | 0.5317(±0.0246) |
| SMR-CON | 0.6247(±0.0246) | 0.6049(±0.0288) | 0.7095(±0.0343) |
| RMKMC | 0.6034(±0.0680) | 0.5488(±0.0482) | 0.6846(±0.0541) |
| PC-SPC | 0.6975(±0.0499) | 0.6547(±0.0262) | 0.7581(±0.0288) |
| CC-SPC | 0.7047(±0.0654) | 0.6879(±0.0378) | 0.7972(±0.0389) |
| MVSC | 0.6034(±0.0309) | 0.4766(±0.0373) | 0.6559(±0.0314) |
| DiMSC | 0.7312(±0.0244) | 0.6458(±0.0179) | 0.7698(±0.0268) |
| RAMSC | |||
| RAMSC( |
Clustering results on dight data set. (mean(± std)).
| Method | ACC | NMI | Purity |
|---|---|---|---|
| SC(1) | 0.5714(±0.0484) | 0.6061(±0.0256) | 0.6307(±0.0347) |
| SC(2) | 0.8329(±0.0683) | 0.8181(±0.0337) | 0.8493(±0.0549) |
| SC(3) | 0.6651(±0.0438) | 0.7804(±0.0249) | 0.7457(±0.0355) |
| SC(4) | 0.8144(±0.0705) | 0.8374(±0.0299) | 0.8476(±0.0506) |
| SC(5) | 0.5758(±0.0389) | 0.6273(±0.0217) | 0.6139(±0.0356) |
| SC(6) | 0.5978(±0.0389) | 0.6159(±0.0217) | 0.6179(±0.0356) |
| SMR(1) | 0.6210(±0.0234) | 0.5937(±0.0155) | 0.6374(±0.0167) |
| SMR(2) | 0.8555(±0.0632) | 0.8119(±0.0308) | 0.8666(±0.0504) |
| SMR(3) | 0.7890(±0.0576) | 0.7625(±0.0272) | 0.8007(±0.0476) |
| SMR(4) | 0.8697(±0.0722) | 0.8375(±0.0302) | 0.8852(±0.0522) |
| SMR(5) | 0.3505(±0.0116) | 0.2850(±0.0092) | 0.3573(±0.0071) |
| SMR(6) | 0.4628(±0.0145) | 0.4533(±0.0141) | 0.4889(±0.0138) |
| SC-CON | 0.8729(±0.0698) | 0.8756(±0.0326) | 0.8847(±0.0598) |
| SMR-CON | 0.8496(±0.0725) | 0.8344(±0.0288) | 0.8720(±0.0501) |
| RMKMC | 0.7853(±0.0800) | 0.8125(±0.0384) | 0.8190(±0.0614) |
| PC-SPC | 0.8682(±0.0604) | 0.8267(±0.0303) | 0.8759(±0.0500) |
| CC-SPC | 0.8768(±0.0605) | 0.8234(±0.0338) | 0.8855(±0.0471) |
| MVSC | 0.8242(±0.0686) | 0.8399(±0.0355) | 0.8286(±0.0664) |
| DiMSC | 0.8400(±0.0569) | 0.8076(±0.0347) | 0.8465(±0.0518) |
| RAMSC | |||
| RAMSC( |
Clustering results on NBA-NASCAR data set. (mean(± std)).
| Method | ACC | NMI | Purity |
|---|---|---|---|
| SC(1) | 0.5631(±0.0000) | 0.0139(±0.0000) | 0.5631(±0.0000) |
| SC(2) | 0.5036(±0.0000) | 0.0003(±0.0000) | 0.5036(±0.0000) |
| SMR(1) | 0.6440(±0.0000) | 0.0640(±0.0000) | 0.6440(±0.0000) |
| SMR(2) | 0.5750(±0.0000) | 0.0289(±0.0000) | 0.5750(±0.0000) |
| SC-CON | 0.5631(±0.0000) | 0.0132(±0.0000) | 0.5631(±0.0000) |
| SMR-CON | 0.7952(±0.0000) | 0.2712(±0.0000) | 0.7952(±0.0000) |
| RMKMC | 0.9858(±0.0000) | 0.9005(±0.0000) | 0.9858(±0.0000) |
| PC-SPC | 0.7250(±0.0000) | 0.1521(±0.0000) | 0.7250(±0.0000) |
| CC-SPC | 0.8357(±0.0000) | 0.3555(±0.0000) | 0.8357(±0.0000) |
| MVSC | 0.5131(±0.0000) | 0.0060(±0.0000) | 0.5131(±0.0000) |
| DiMSC | 0.5476(±0.0000) | 0.0071(±0.0000) | 0.5476(±0.0000) |
| RAMSC | |||
| RAMSC( |
Clustering results on WebKB data set. (mean(± std)).
| Method | ACC | NMI | Purity |
|---|---|---|---|
| SC(1) | 0.7774(±0.0000) | 0.0013(±0.0000) | 0.7812(±0.0000) |
| SC(2) | 0.7755(±0.0000) | 0.0027(±0.0000) | 0.7812(±0.0000) |
| SMR(1) | 0.7402(±0.0000) | 0.0573(±0.0000) | 0.7812(±0.0000) |
| SMR(2) | 0.8069(±0.0000) | 0.0665(±0.0000) | 0.8069(±0.0000) |
| SC-CON | 0.7774(±0.0000) | 0.0013(±0.0000) | 0.7812(±0.0000) |
| SMR-CON | 0.7774(±0.0000) | 0.0013(±0.0000) | 0.7812(±0.0000) |
| RMKMC | 0.8049(±0.0000) | 0.1592(±0.0000) | 0.8159(±0.0000) |
| PC-SPC | 0.7659(±0.0000) | 0.0991(±0.0000) | 0.7812(±0.0000) |
| CC-SPC | 0.5785(±0.0000) | 0.0019(±0.0000) | 0.7812(±0.0000) |
| MVSC | 0.7802(±0.0000) | 0.0041(±0.0000) | 0.7812(±0.0000) |
| DiMSC | 0.6147(±0.0000) | 0.0006(±0.0000) | 0.7812(±0.0000) |
| RAMSC | |||
| RAMSC( |
Fig 1Convergence behaviors of RAMSC with λ = 50 on two datasets.
(A) MSRC-v1; (B) Caltech101-7.
Fig 2ACC and NMI of RAMSC with different selection of parameter λ.
(A) NBA-NASCAR; (B) WebKB.
Fig 3ACC and NMI of RAMSC(S2) with different combinations of parameters λ and γ.
(A) NBA-NASCAR; (B) WebKB.