| Literature DB >> 25057510 |
Jing Chen1, Yuan Yan Tang1, C L Philip Chen2, Bin Fang3, Zhaowei Shang3, Yuewei Lin4.
Abstract
Adopting a measure is essential in many multimedia applications. Recently, distance learning is becoming an active research problem. In fact, the distance is the natural measure for dissimilarity. Generally, a pairwise relationship between two objects in learning tasks includes two aspects: similarity and dissimilarity. The similarity measure provides different information for pairwise relationships. However, similarity learning has been paid less attention in learning problems. In this work, firstly, we propose a general framework for similarity measure learning (SML). Additionally, we define a generalized type of correlation as a similarity measure. By a set of parameters, generalized correlation provides flexibility for learning tasks. Based on this similarity measure, we present a specific algorithm under the SML framework, called correlation similarity measure learning (CSML), to learn a parameterized similarity measure over input space. A nonlinear extension version of CSML, kernel CSML, is also proposed. Particularly, we give a closed-form solution avoiding iterative search for a local optimal solution in the high-dimensional space as the previous work did. Finally, classification experiments have been performed on face databases and a handwritten digits database to demonstrate the efficiency and reliability of CSML and KCSML.Entities:
Mesh:
Year: 2014 PMID: 25057510 PMCID: PMC4099089 DOI: 10.1155/2014/747105
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Algorithm 1The details of algorithm CSML.
Computational costs of related algorithms.
| Algorithm | Computational cost |
|---|---|
| The method of Xing et al. (Xing's) |
|
| The method of Xiang et al. (Xiang's) |
|
| Correlation Discriminant Analysis (CDA) |
|
| Correlation Embedding Analysis (CEA) |
|
| Correlation Similarity Measure Learning (CSML) |
|
| Kernel CSML |
|
Figure 1Samples from the Yale database.
Figure 2Samples from the CMU PIE database.
Figure 3Samples from the MNIST database.
Classification performance comparison on the Yale database.
| Method | 2 Train | 3 Train | 4 Train | 5 Train | 6 Train | 7 Train | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Error (%) |
| Error (%) |
| Error (%) |
| Error (%) |
| Error (%) |
| Error (%) |
| |
| PCA | 56.6 ± 6.3 | 29 | 50.6 ± 8.3 | 44 | 47.4 ± 7.2 | 58 | 43.8 ± 6.5 | 74 | 40.8 ± 7.4 | 32 | 39.5 ± 5.1 | 30 |
| RS-2DPCA | 44.2 ± 6.0 | 17 | 32.5 ± 5.4 | 17 | 27.6 ± 5.9 | 17 | 22.4 ± 5.2 | 17 | 17.5 ± 6.3 | 17 | 16.1 ± 5.2 | 17 |
| LDA | 52.8 ± 7.5 | 10 | 35.1 ± 5.9 | 14 | 27.1 ± 5.3 | 14 | 21.2 ± 5.7 | 14 | 18.7 ± 4.9 | 14 | 17.6 ± 4.6 | 14 |
| LPP | 42.6 ± 6.8 | 14 | 31.2 ± 7.0 | 14 | 27.3 ± 6.2 | 19 | 21.1 ± 4.8 | 23 | 17.8 ± 5.8 | 24 | 16.3 ± 5.4 | 21 |
| MFA | 41.7 ± 7.4 | 18 | 33.6 ± 6.5 | 23 | 28.4 ± 5.9 | 27 | 21.5 ± 5.2 | 20 | 16.1 ± 5.3 | 19 | 15.2 ± 4.0 | 25 |
| CDA | 43.2 ± 5.9 | 19 | 32.9 ± 5.8 | 22 | 26.8 ± 6.7 | 23 | 20.3 ± 5.4 | 18 | 16.9 ± 6.8 | 26 | 16.0 ± 6.2 | 19 |
| CEA | 42.0 ± 6.1 | 21 | 30.7 ± 4.3 | 25 | 25.2 ± 4.9 | 18 | 19.2 ± 5.1 | 19 | 15.3 ± 5.4 | 20 | 14.1 ± 4.8 | 18 |
| ISM-GE | 43.1 ± 6.5 | 19 | 29.2 ± 5.9 | 20 | 23.6 ± 6.8 | 19 | 17.5 ± 6.3 | 20 | 14.7 ± 5.6 | 22 | 12.5 ± 5.1 | 19 |
| MSE | 42.4 ± 7.2 | 23 | 32.3 ± 6.7 | 24 | 28.2 ± 5.2 | 22 | 19.6 ± 5.9 | 21 | 16.2 ± 5.1 | 24 | 15.3 ± 5.7 | 22 |
| CSML |
| 14 |
| 19 |
| 13 |
| 20 |
| 19 |
| 15 |
| KCSML |
| 18 |
| 24 |
| 21 |
| 26 |
| 21 |
| 27 |
Classification performance comparison on the CMU PIE database.
| Method | 5 Train | 10 Train | 20 Train | 30 Train | ||||
|---|---|---|---|---|---|---|---|---|
| Error (%) |
| Error (%) |
| Error (%) |
| Error (%) |
| |
| PCA | 76.6 ± 4.3 | 334 | 64.8 ± 4.6 | 673 | 48.6 ± 3.8 | 982 | 37.9 ± 3.5 | 1023 |
| RS-2DPCA | 44.5 ± 4.1 | 18 | 28.3 ± 3.5 | 18 | 20.1 ± 2.6 | 18 | 9.6 ± 2.2 | 18 |
| LDA | 42.0 ± 3.6 | 67 | 29.7 ± 3.7 | 67 | 21.5 ± 2.9 | 67 | 10.9 ± 3.2 | 67 |
| LPP | 38.0 ± 4.8 | 67 | 29.6 ± 3.5 | 139 | 20.2 ± 3.3 | 147 | 10.8 ± 2.7 | 86 |
| MFA | 36.8 ± 4.4 | 72 | 28.2 ± 2.8 | 69 | 17.5 ± 2.6 | 68 | 9.8 ± 3.0 | 77 |
| CDA | 34.7 ± 3.9 | 85 | 23.5 ± 2.5 | 76 | 17.3 ± 2.3 | 79 | 8.9 ± 2.6 | 82 |
| CEA | 33.5 ± 4.2 | 241 | 22.1 ± 2.7 | 196 | 14.8 ± 1.9 | 283 | 8.4 ± 1.7 | 129 |
| ISM-GE | 32.6 ± 4.1 | 76 | 20.7 ± 3.2 | 73 | 11.3 ± 2.7 | 77 | 6.8 ± 1.6 | 79 |
| MSE | 34.9 ± 4.5 | 223 | 25.4 ± 2.5 | 226 | 19.0 ± 2.1 | 230 | 9.2 ± 1.9 | 221 |
| CSML |
| 201 |
| 215 |
| 194 |
| 192 |
| KCSML |
| 211 |
| 253 |
| 200 |
| 203 |
Classification performance comparison on the MNIST database.
| Method | 50 Train | 100 Train | 150 Train | 200 Train | 250 Train | 300 Train | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Error (%) |
| Error (%) |
| Error (%) |
| Error (%) |
| Error (%) |
| Error (%) |
| |
| PCA | 16.1 ± 0.72 | 499 | 10.9 ± 0.46 | 517 | 9.2 ± 0.48 | 561 | 7.8 ± 0.33 | 578 | 7.0 ± 0.35 | 603 | 7.0 ± 0.28 | 610 |
| RS-2DPCA | 11.2 ± 0.42 | 18 | 7.3 ± 0.26 | 18 | 4.5 ± 0.21 | 18 | 3.8 ± 0.19 | 18 | 3.3 ± 0.20 | 18 | 2.0 ± 0.19 | 18 |
| LDA | 12.4 ± 0.53 | 9 | 9.2 ± 0.39 | 9 | 8.6 ± 0.27 | 9 | 7.0 ± 0.22 | 9 | 5.4 ± 0.24 | 9 | 4.6 ± 0.17 | 9 |
| LPP | 10.7 ± 0.47 | 56 | 6.7 ± 0.21 | 51 | 4.8 ± 0.25 | 43 | 3.5 ± 0.17 | 58 | 4.5 ± 0.14 | 69 | 1.9 ± 0.20 | 73 |
| MFA | 10.5 ± 0.42 | 114 | 7.1 ± 0.27 | 108 | 4.0 ± 0.23 | 121 | 3.7 ± 0.19 | 98 | 3.0 ± 0.17 | 105 | 1.8 ± 0.14 | 94 |
| CDA | 11.6 ± 0.38 | 49 | 6.2 ± 0.29 | 63 | 3.4 ± 0.26 | 70 | 3.3 ± 0.24 | 62 | 2.9 ± 0.22 | 54 | 2.2 ± 0.23 | 68 |
| CEA | 12.1 ± 0.43 | 31 | 5.9 ± 0.35 | 52 | 3.1 ± 0.16 | 62 | 3.8 ± 0.18 | 60 | 2.7 ± 0.17 | 83 | 1.6 ± 0.19 | 76 |
| ISM-GE | 10.8 ± 0.39 | 92 | 6.1 ± 0.32 | 91 | 3.3 ± 0.20 | 87 | 2.6 ± 0.17 | 94 | 1.9 ± 0.15 | 90 | 1.8 ± 0.17 | 91 |
| MSE | 11.5 ± 0.41 | 83 | 6.4 ± 0.28 | 85 | 3.7 ± 0.17 | 82 | 3.1 ± 0.22 | 87 | 2.7 ± 0.21 | 84 | 2.3 ± 0.19 | 83 |
| CSML |
| 62 |
| 79 |
| 85 |
| 83 |
| 83 |
| 89 |
| KCSML |
| 74 |
| 82 |
| 85 |
| 89 |
| 93 |
| 90 |
Figure 4The behavior of the proposed methods under various k 1. (a) CSML on Yale database, (b) KCSML on Yale database, (c) CSML on CMU PIE database, (d) KCSML on CMU PIE database, (e) CSML on MNIST database, and (f) KCSML on MNIST database.
Figure 5The best dimension number d for various k 2: (a) on Yale database, (b) on CMU PIE database, and (c) on MNIST database.
Figure 6Comparison on CPU time in log scale.