| Literature DB >> 28934244 |
Yanhong Bi1,2, Bin Fan1, Fuchao Wu1.
Abstract
As a specific kind of non-Euclidean metric lies in projective space, Cayley-Klein metric has been recently introduced in metric learning to deal with the complex data distributions in computer vision tasks. In this paper, we extend the original Cayley-Klein metric to the multiple Cayley-Klein metric, which is defined as a linear combination of several Cayley-Klein metrics. Since Cayley-Klein is a kind of non-linear metric, its combination could model the data space better, thus lead to an improved performance. We show how to learn a multiple Cayley-Klein metric by iterative optimization over single Cayley-Klein metric and their combination coefficients under the objective to maximize the performance on separating inter-class instances and gathering intra-class instances. Our experiments on several benchmarks are quite encouraging.Entities:
Mesh:
Year: 2017 PMID: 28934244 PMCID: PMC5608239 DOI: 10.1371/journal.pone.0184865
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Intuitive illustration of multiple Cayley-Klein metrics and single Cayley-Klein metric by a toy example.
(a) non-linear metric VS. linear metrics. (b) multiple non-linear metrics VS. single non-linear metric.
Characteristics and experiment settings of the UCI datasets.
| Datasets | Data points | Training | Validation | Test | Attributes | Classes |
|---|---|---|---|---|---|---|
| Wine | 178 | 107 | 35 | 36 | 13 | 3 |
| Iono. | 351 | 210 | 70 | 71 | 34 | 2 |
| Vowel | 528 | 317 | 105 | 106 | 10 | 11 |
| Bala. | 625 | 375 | 125 | 125 | 4 | 3 |
| Pima | 768 | 461 | 153 | 154 | 8 | 2 |
| Vehicle | 846 | 507 | 169 | 170 | 18 | 4 |
| Seg. | 2310 | 1386 | 462 | 462 | 19 | 7 |
| Wave | 5000 | 3000 | 1000 | 1000 | 21 | 3 |
| Letter | 20000 | 3000 | 1000 | 1000 | 16 | 26 |
Classification accuracies (mean and standard deviation in %) on UCI datasets.
MCKML achieves the best performance on 6 out of 9 datasets.
| Method | MMC [ | LMNN [ | CKMMC [ | CKLMNN [ | MMLMNN [ | SCML-local [ | MCKML |
|---|---|---|---|---|---|---|---|
| Wine | 94.8±2.8 | 96.2±2.3 | 95.7±2.7 | 96.8±2.4 | 96.9±2.3 | 97.2±2.4 | |
| Iono. | 84.5±1.5 | 86.7±1.4 | 84.8±1.4 | 87.2±1.4 | 88.7±1.4 | 89.7±1.5 | |
| Vowel | 89.4±1.4 | 95.1±1.3 | 92.4±1.5 | 95.5±1.3 | 95.2±1.1 | 95.0±1.2 | |
| Bala. | 86.0±2.0 | 87.3±1.7 | 86.2±1.9 | 88.7±1.7 | 89.6±1.8 | 90.7±1.7 | |
| Pima | 68.6±1.9 | 70.3±1.6 | 71.9±1.8 | 71.5±1.6 | 71.8±1.7 | 70.3±1.6 | |
| Vehicle | 70.1±2.5 | 75.4±2.3 | 78.1±2.4 | 78.0±2.3 | 78.4±2.4 | 79.8±2.4 | |
| Seg. | 94.6±1.4 | 96.2±1.2 | 96.9±1.3 | 97.0±1.2 | 97.0±1.2 | 97.1±1.2 | |
| Wave | 80.9±1.1 | 81.5±0.8 | 81.0±1.2 | 82.8±1.0 | 82.8±1.1 | 82.7±1.1 | |
| Letter | 89.6±1.3 | 91.2±1.2 | 90.9±1.3 | 92.0±1.2 | 92.3±1.2 | 92.3±1.2 | |
| avg. | 84.3±1.8 | 86.6±1.5 | 86.4±1.7 | 87.7±1.6 | 88.1±1.6 | 88.5±1.6 |
Paired t-test for statistical evaluation of the classification results on UCI datasets.
| Datasets | Paired |
|---|---|
| Wine | CKMMC < CKLMNN ∼ MMLMNN < SCML-local ∼ MCKML |
| Iono. | CKMMC < CKLMNN < MMLMNN < SCML-local ∼ MCKML |
| Vowel | CKMMC < SCML-local ∼ MMLMNN < CKLMNN ∼ MCKML |
| Bala. | CKMMC < CKLMNN < MMLMNN < MCKML < SCML-local |
| Pima | SCML-local < CKLMNN ∼ MMLMNN ∼ CKMMC < MCKML |
| Vehicle | CKLMNN ∼ CKMMC ∼ MMLMNN < SCML-local < MCKML |
| Seg. | CKMMC ∼ CKLMNN ∼ MMLMNN ∼ SCML-local < MCKML |
| Wave | CKMMC < MCKML ∼ CKLMNN ∼ MMLMNN ∼ SCML-local |
| Letter | CKMMC < CKLMNN < MMLMNN ∼ MCKML ∼ SCML-local |
Fig 2Illustrative experiment on Segmentation dataset in 2D.
(a)∼(c) Distributions of training points under the learned metrics (MMC, CK-MMC and MCKML) respectively. (d)∼(f) Distributions of test points under the learned metrics (MMC, CK-MMC and MCKML) respectively.
Classification accuracies for each identity (mean in %) and average accuracies (mean and standard deviation in %) obtained on the PubFig dataset.
| Method | MMC [ | LMNN [ | CKMMC [ | CKLMNN [ | MMLMNN [ | SCML-local [ | MCKML |
|---|---|---|---|---|---|---|---|
| Alex | 79.2 | 80.1 | 84.3 | 81.6 | 81.0 | 82.1 | |
| Clive | 72.7 | 75.8 | 83.8 | 82.5 | 80.4 | 83.0 | |
| Hugh | 83.4 | 80.6 | 85.8 | 83.7 | 82.7 | 83.4 | |
| Jared | 79.2 | 80.2 | 79.9 | 80.6 | 80.2 | 80.6 | |
| Miley | 77.9 | 75.4 | 78.2 | 78.7 | 77.9 | 78.9 | |
| Scarlett | 84.3 | 83.1 | 83.0 | 84.1 | 84.8 | 85.2 | |
| Viggo | 77.3 | 77.9 | 78.4 | 78.4 | 79.1 | 79.1 | |
| Zac | 84.9 | 82.1 | 85.2 | 84.3 | 83.5 | 84.9 | |
| avg. | 79.9±1.2 | 79.4±1.9 | 82.3±0.9 | 82.0±1.9 | 81.2±1.8 | 82.1±1.1 |
Classification accuracies for each category (mean in %) and average accuracies (mean and standard deviation in %) obtained on the OSR dataset.
| Method | MMC [ | LMNN [ | CKMMC [ | CKLMNN [ | MMLMNN [ | SCML-local [ | MCKML |
|---|---|---|---|---|---|---|---|
| 69.0 | 72.7 | 74.1 | 75.2 | 74.7 | 75.2 | ||
| 41.2 | 45.8 | 46.3 | 47.4 | 46.9 | 46.5 | ||
| 64.2 | 69.6 | 71.9 | 74.7 | 74.1 | 74.4 | ||
| 74.6 | 75.3 | 75.0 | 75.5 | 76.8 | 77.9 | ||
| 69.1 | 70.9 | 70.1 | 71.4 | 71.1 | 71.9 | ||
| 56.3 | 57.3 | 57.9 | 58.8 | 58.3 | 59.5 | ||
| 60.2 | 62.5 | 64.0 | 65.0 | 64.3 | 63.9 | ||
| 85.4 | 86.3 | 87.3 | 88.3 | 88.2 | 87.8 | ||
| avg. | 65.0±1.2 | 67.5±1.1 | 68.3±1.2 | 70.0±1.1 | 69.2±1.1 | 69.4±1.0 |
Running times on OSR and PubFig.
| Method | Training time | Testing time | ||
|---|---|---|---|---|
| OSR | PubFig | OSR | PubFig | |
| MMC [ | 4.6s | 5.8s | 0.2s | 0.2s |
| CK-MMC [ | 5.3s | 3.3s | 0.3s | 0.3s |
| LMNN [ | 3.3s | 9.2s | 0.3s | 0.2s |
| CK-LMNN [ | 2.7s | 4.5s | 0.4s | 0.3s |
| MM-LMNN [ | 6.5s | 14.7s | 0.3s | 0.3s |
| SCML-local [ | 17.7s | 13.1s | 0.2s | 0.2s |
| MCKML | 59.4s | 77.3s | 0.4s | 0.3s |