| Literature DB >> 34764570 |
Rakesh Kumar Yadav1, Shekhar Verma1, S Venkatesan1.
Abstract
The accuracy of graph based learning techniques relies on the underlying topological structure and affinity between data points, which are assumed to lie on a smooth Riemannian manifold. However, the assumption of local linearity in a neighborhood does not always hold true. Hence, the Euclidean distance based affinity that determines the graph edges may fail to represent the true connectivity strength between data points. Moreover, the affinity between data points is influenced by the distribution of the data around them and must be considered in the affinity measure. In this paper, we propose two techniques, C C G A L and C C G A N that use cross-covariance based graph affinity (CCGA) to represent the relation between data points in a local region. C C G A L also explores the additional connectivity between data points which share a common local neighborhood. C C G A N considers the influence of respective neighborhoods of the two immediately connected data points, which further enhance the affinity measure. Experimental results of manifold learning on synthetic datasets show that CCGA is able to represent the affinity measure between data points more accurately. This results in better low dimensional representation. Manifold regularization experiments on standard image dataset further indicate that the proposed CCGA based affinity is able to accurately identify and include the influence of the data points and its common neighborhood that increase the classification accuracy. The proposed method outperforms the existing state-of-the-art manifold regularization methods by a significant margin. © Springer Science+Business Media, LLC, part of Springer Nature 2020.Entities:
Keywords: Affinity; Cross-Covariance; Euclidean distance; Graph; Manifold regularization; Neighborhoods
Year: 2020 PMID: 34764570 PMCID: PMC7677445 DOI: 10.1007/s10489-020-01986-9
Source DB: PubMed Journal: Appl Intell (Dordr) ISSN: 0924-669X Impact factor: 5.086
Mathematical notations with their descriptions
| Notation | Description |
|---|---|
| Dataset | |
| Ambient Dimension | |
| Space | |
| Manifold | |
| Total number of data instances | |
| Graph | |
| Data instance | |
| Weight matrix | |
| Affinity between data point | |
| Row sum | |
| Column sum | |
| Neighborhood of | |
| Similarity between | |
| Random variables | |
| Reproducing Kernel Hebert spaces | |
| Hilbert-Schmidt Independence Criterion | |
| Distance between | |
| Distance between points in a neighborhood | |
| kernel defined in a neighborhood |
Fig. 1CCGA for Local Region
Fig. 2CCGA for Neighborhood
CCGAL and CCGAN vs state-of-the-art techniques
Fig. 3Real world Datasets Images
Mean error(± Standard deviation) LapSVM (Test) k = 6
| Method | Heat | Binary | EMR24 | EMR72 | CCGAL | CCGAN |
|---|---|---|---|---|---|---|
| Dataset ↓ | ||||||
| 5.9853(± 2.0131) | 6.0026(± 2.0067) | 28.4868(± 1.2284) | 28.4868(± 1.2284) | 3.5657(± 1.4774) | ||
| 22.9591(± 0.65604) | 3.3982(± 2.5612) | 13.1766(± 0.60032) | 13.1766(± 0.60032) | 2.5943(± 0.35458) | ||
| 38.4751(± 0.54756) | 10.4451(± 1.767) | 35.9541(± 0.9027) | 35.9541(± 0.9027) | 8.654(± 1.9719) | ||
| 20.6458(± 0.4408) | 21.6187(± 0.55034) | 21.6187(± 0.55034) | 20.7254(± 0.41862) | 20.7732(± 0.43116) | ||
| 31.1644(± 1.0143) | 31.3531(± 1.0624) | 31.3738(± 1.068) | 31.3738(± 1.068) | 31.2954(± 1.0459) | ||
| 13.9991(± 8.1623) | 10.2096(± 9.077) | 18.5477(± 5.5793) | 17.2964(± 5.9808) | 8.7628(± 9.0811) | ||
| 27.049(± 3.5047) | 25.1724(± 3.6374) | 26.8367(± 3.5862) | 26.8367(± 3.5862) | 24.9548(± 3.4188) | 23.8519(± 2.8291) | |
| 28.7065(± 1.7688) | 28.6331(± 1.9164) | 29.5558(± 2.0614) | 29.5558(± 2.0614) | 25.6428(± 1.8901) | ||
| 21.3264(± 2.336) | 20.1441(± 2.4681) | 20.8939(± 2.179) | 20.7964(± 1.9059) | 17.5111(± 2.6718) | 16.1957(± 2.5134) | |
| 11.4564(± 1.4242) | 12.2342(± 1.7352) | 14.2344(± 1.5345) | 15.3133(± 1.4234) | 10.3131(± 0.9312) |
Mean error(± Standard deviation) LapSVM (Unlabeled) k = 6
| Method | Heat | Binary | EMR24 | EMR72 | CCGAL | CCGAN |
|---|---|---|---|---|---|---|
| Dataset ↓ | ||||||
| 5.9196(± 2.0438) | 5.9352(± 2.059) | 33.4718(± 1.1126) | 33.4718(± 1.1126) | 3.0609(± 1.6038) | ||
| 10.7445(± 1.2973) | 3.2973(± 2.6276) | 13.3647(± 0.63509) | 13.3647(± 0.63509) | 2.4118(± 0.36646) | ||
| 8.2996(± 1.8522) | 1.5(± 2.1197) | 36.2816(± 0.93587) | 36.2816(± 0.93587) | 1.1498(± 1.4258) | ||
| 19.4467(± 0.44692) | 20.2633(± 0.52944) | 20.2633(± 0.52944) | 19.5453(± 0.45401) | 19.6514(± 0.43513) | ||
| 31.0024(± 1.0302) | 31.1762(± 1.0589) | 31.2028(± 1.0641) | 31.2028(± 1.0641) | 31.1123(± 1.0555) | ||
| 12.2481(± 8.4792) | 9.2706(± 9.1341) | 19.0064(± 5.8736) | 17.6866(± 6.0596) | 7.771(± 9.0185) | ||
| 18.6597(± 2.0702) | 22.1499(± 1.8301) | 18.8922(± 2.311) | 26.5602(± 2.622) | 29.4454(± 2.6659) | ||
| 25.6905(± 1.6005) | 25.6246(± 1.8902) | 27.0735(± 1.7924) | 27.0735(± 1.7924) | 22.6165(± 1.7756) | ||
| 19.6906(± 1.2029) | 17.8456(± 1.652) | 19.441(± 1.7698) | 19.4345(± 1.7707) | 16.8485(± 1.3853) | ||
| 12.4564(± 1.9845) | 12.4241(± 1.0995) | 17.5433(± 2.6344) | 18.2423(± 1.2342) | 12.4566(± 0.9353) |
Mean error (± Standard deviation) LapRLSC (Test) with k = 6
| Method | Heat | Binary | EMR24 | EMR72 | CCGAL | CCGAN |
|---|---|---|---|---|---|---|
| Dataset ↓ | ||||||
| 6.0525(± 1.9381) | 6.062(± 1.9493) | 27.7172(± 1.7853) | 27.7172(± 1.7853) | 3.6006(± 1.4074) | ||
| 25.2406(± 0.96617) | 12.2475(± 0.69306) | 12.2475(± 0.69306) | 2.4943(± 0.34876) | 2.3303(± 0.34844) | ||
| 38.4103(± 0.57778) | 35.5803(± 0.94473) | 35.5803(± 0.94473) | 13.4018(± 2.9515) | 18.0368(± 3.0749) | ||
| 19.5864(± 0.58403) | 19.5775(± 0.56837) | 20.2675(± 0.61025) | 20.2675(± 0.61025) | 18.73(± 0.52904) | ||
| 28.6935(± 0.7116) | 28.7648(± 0.71153) | 28.7747(± 0.70966) | 28.7747(± 0.70966) | 28.7566(± 0.71979) | ||
| 13.9378(± 8.4856) | 10.7887(± 9.2475) | 18.6991(± 5.8643) | 17.2439(± 6.6636) | 7.9037(± 9.1091) | ||
| 27.0733(± 3.4778) | 26.1643(± 3.7012) | 26.4614(± 3.4997) | 26.4581(± 3.4752) | 23.5705(± 3.6623) | ||
| 28.7285(± 1.8327) | 29.5571(± 2.065) | 29.5571(± 2.065) | 28.7167(± 1.9946) | 28.7214(± 1.8789) | ||
| 21.1371(± 1.6499) | 20.0112(± 2.5415) | 20.991(± 2.2732) | 21.0072(± 2.2715) | 18.911(± 2.7065) | ||
| 13.4234(± 1.4232) | 16.4322(± 2.5545) | 17.3133(± 2.9891) | 14.3131(± 0.93423) | 15.2342(± 0.9859) |
Mean error (± Standard deviation) LapRLSC (Unlabeled) k = 6
| Method | Heat | Binary | EMR24 | EMR72 | CCGAL | CCGAN |
|---|---|---|---|---|---|---|
| Dataset ↓ | ||||||
| 5.8922(± 2.0385) | 5.8928(± 2.0405) | 30.2328(± 1.5825) | 30.2328(± 1.5825) | 3.0793(± 1.4541) | ||
| 10.5111(± 1.2068) | 12.3712(± 0.71244) | 12.3712(± 0.71244) | 2.4037(± 0.35836) | 2.2252(± 0.36416) | ||
| 8.0237(± 1.8893) | 1.3303(± 2.0178) | 35.6724(± 1.034) | 35.6724(± 1.034) | 1.1592(± 1.4827) | ||
| 18.7148(± 0.58304) | 19.1057(± 0.46685) | 19.1057(± 0.46685) | 18.6926(± 0.59944) | 18.796(± 0.58017) | ||
| 28.6764(± 0.67734) | 28.7455(± 0.66441) | 28.7507(± 0.66312) | 28.7507(± 0.66312) | 28.7434(± 0.66381) | ||
| 11.5328(± 8.1146) | 8.9066(± 8.7905) | 18.6421(± 5.3925) | 18.4514(± 5.9489) | 7.198(± 8.9484) | ||
| 18.6723(± 2.067) | 28.2353(± 2.1595) | 18.3936(± 2.277) | 24.0322(± 2.5972) | 25.5714(± 2.8975) | ||
| 25.8091(± 1.6133) | 25.7981(± 1.7799) | 27.0735(± 1.7923) | 27.0735(± 1.7923) | 24.7966(± 1.8538) | ||
| 19.7043(± 1.2225) | 18.0463(± 2.4952) | 19.419(± 1.777) | 19.4072(± 1.7758) | 18.7154(± 2.436) | ||
| 15.3423(± 1.4564) | 14.5435(± 1.0543) | 14.6453(± 1.5354) | 13.5433(± 1.5453) | 14.2342(± 1.5859) |
Mean error(± Standard deviation) LapSVM (Test) k = 10
| Method | Heat | Binary | EMR24 | EMR72 | CCGAL | CCGAN |
|---|---|---|---|---|---|---|
| Dataset ↓ | ||||||
| 5.0859(± 2.1001) | 5.0326(± 1.9367) | 27.8768(± 1.1293) | 26.4564(± 0.9583) | 3.1552(± 1.2774) | ||
| 10.4718(± 0.69454) | 3.1924(± 2.2765) | 12.872(± 0.70001) | 12.2332(± 0.50045) | 2.0301(± 0.13543) | ||
| 36.7321(± 0.32343) | 9.1432(± 1.773) | 32.1237(± 0.6037) | 33.7543(± 0.7047) | 6.3654(± 2.001) | ||
| 18.4658(± 0.3206) | 19.4881(± 0.5932) | 20.5057(± 0.51053) | 18.1453(± 0.43827) | 17.7732(± 0.43116) | ||
| 30.0544(± 1.0093) | 30.5231(± 1.0443) | 30.2723(± 1.068) | 31.5732(± 1.108) | 30.054(± 1.0309) | ||
| 12.3784(± 7.2413) | 9.1209(± 8.134) | 16.3517(± 4.4453) | 17.1943(± 5.1908) | 7.4651(± 7.0421) | ||
| 25.105(± 3.4276) | 24.0922(± 3.6374) | 25.1233(± 2.1352) | 25.1267(± 3.1532) | 23.1344(± 3.4188) | 22.4519(± 1.92191) | |
| 26.5405(± 1.7556) | 26.2122(± 1.8141) | 27.3788(± 2.7789) | 28.1258(± 2.0614) | 27.1232(± 1.7821) | ||
| 20.1624(± 2.1326) | 19.1214(± 2.4323) | 19.1282(± 2.0238) | 19.8732(± 1.8009) | 18.0051(± 2.6718) | ||
| 10.6564(± 1.3672) | 12.3696(± 1.45784) | 13.9689(± 2.0123) | 17.2343(± 2.1533) | 10.7565(± 0.8312) |
Mean error(± Standard deviation) LapSVM (Unlabeled) k = 10
| Method | Heat | Binary | EMR24 | EMR72 | CCGAL | CCGAN |
|---|---|---|---|---|---|---|
| Dataset ↓ | ||||||
| 5.1132(± 2.1423) | 5.0642(± 2.059) | 31.3341(± 1.1214) | 31.4123(± 1.1335) | 2.0103(± 1.3448) | ||
| 10.1125(± 1.2334) | 2.9936(± 1.5244) | 12.3347(± 0.5389) | 12.3347(± 0.43509) | 2.4118(± 0.36646) | ||
| 7.1294(± 1.3452) | 1.4345(± 2.0497) | 35.2312(± 0.73327) | 35.2336(± 0.9232) | 1.3434(± 1.4348) | ||
| 18.3251(± 0.3445) | 19.4343(± 0.52944) | 19.3453(± 0.5434) | 18.5443(± 0.45401) | 19.6514(± 0.43513) | ||
| 30.1334(± 1.1342) | 30.1342(± 1.1549) | 30.3434(± 1.0445) | 30.1048(± 1.0441) | 30.0120(± 1.04233) | ||
| 11.348(± 8.1272) | 8.3206(± 8.2311) | 17.7764(± 5.5343) | 17.5466(± 6.0496) | 6.4231(± 8.0435) | ||
| 17.23597(± 2.0212) | 20.1323(± 1.2323) | 17.5422(± 2.551) | 24.4302(± 2.6656) | 27.6654(± 2.6566) | ||
| 24.1105(± 1.2305) | 24.2346(± 1.3202) | 26.4555(± 1.7454) | 26.0554(± 1.7543) | 24.6543(± 1.5335) | ||
| 18.6546(± 1.5353) | 16.84536(± 1.4622) | 18.6541(± 1.5548) | 18.4665(± 1.6457) | 16.2634(± 1.36455) | ||
| 11.764(± 1.6445) | 13.46411(± 1.0645) | 15.5644(± 2.7664) | 17.6463(± 1.7552) | 10.34434(± 0.7647) |
Mean error (± Standard deviation) LapRLSC (Test) with k = 10
| Method | Heat | Binary | EMR24 | EMR72 | CCGAL | CCGAN |
|---|---|---|---|---|---|---|
| Dataset ↓ | ||||||
| 5.03425(± 1.3243) | 5.0432(± 1.3443) | 26.6642(± 1.7424) | 26.7432(± 1.4353) | 2.1734(± 1.30755) | ||
| 24.2456(± 0.953417) | 11.25455(± 0.6454) | 11.2454(± 0.7656) | 2.8937(± 0.34876) | 2.3324(± 0.34564) | ||
| 36.3134(± 0.44578) | 34.4403(± 0.94473) | 34.5423(± 0.8443) | 12.4434(± 2.0955) | 17.3448(± 2.34749) | ||
| 19.5864(± 0.58403) | 19.5775(± 0.56837) | 20.2675(± 0.61025) | 20.2675(± 0.61025) | 19.63(± 0.52904) | ||
| 26.6434(± 0.6766) | 27.4348(± 0.6453) | 26.7437(± 0.70966) | 27.4237(± 0.70966) | 27.7436(± 0.6146) | ||
| 12.6487(± 6.46456) | 9.64487(± 8.2665) | 17.65641(± 4.8664) | 16.6459(± 6.5666) | 7.637(± 8.16446) | 6.5645(± 7.4456) | |
| 26.5345(± 3.5345) | 25.15656(± 2.4534) | 25.5345(± 2.4557) | 25.45381(± 2.4442) | 23.5445(± 2.64523) | ||
| 27.7565(± 1.3236) | 27.4564(± 1.7715) | 28.6453(± 1.8565) | 28.6772(± 1.7665) | 24.6567(± 2.0046) | ||
| 20.14471(± 1.6543) | 19.052(± 1.5535) | 19.4491(± 1.9732) | 20.0552(± 2.4515) | 17.9331(± 2.7065) | ||
| 12.2426(± 1.5345) | 17.6444(± 2.6423) | 18.4543(± 1.9644) | 15.4531(± 0.9354) | 14.5154(± 1.9565) |
Mean error (± Standard deviation) LapRLSC (Unlabeled) k = 10
| Method | Heat | Binary | EMR24 | EMR72 | CCGAL | CCGAN |
|---|---|---|---|---|---|---|
| Dataset ↓ | ||||||
| 5.0491(± 2.1335) | 5.8348(± 2.1465) | 29.2432(± 1.3425) | 28.2248(± 1.4245) | 2.4553(± 1.0541) | ||
| 9.7811(± 1.2068) | 10.42342(± 0.64243) | 11.4422(± 0.61424) | 1.4237(± 0.35836) | 2.2122(± 0.3123) | ||
| 7.31237(± 1.3893) | 1.9903(± 2.9178) | 33.6231(± 1.3384) | 32.4244(± 1.3324) | 1.4394(± 1.43447) | ||
| 17.42348(± 0.48423) | 18.4457(± 0.3645) | 18.1237(± 0.5623) | 17.4232(± 0.5234) | 17.7423(± 0.48047) | ||
| 27.6424(± 0.62374) | 27.7425(± 0.5241) | 27.75237(± 0.56312) | 27.72307(± 0.36312) | 26.7434(± 0.4481) | 25.5794(± 0.6446) | |
| 10.5428(± 7.1426) | 7.9446(± 8.79423) | 17.6321(± 4.425) | 17.444(± 5.94239) | 7.23438(± 7.92344) | 5.7424(± 7.64224) | |
| 17.6323(± 2.5547) | 26.2443(± 2.1595) | 17.536(± 2.277) | 23.5622(± 2.5372) | 24.5524(± 1.93975) | ||
| 24.5391(± 1.5553) | 24.7531(± 1.7534) | 26.0355(± 1.534523) | 26.053(± 1.7923) | 22.3426(± 1.8442) | ||
| 18.7423(± 1.22425) | 17.2423(± 2.4422) | 17.2424(± 1.4234) | 18.4534(± 1.5335) | 16.7534(± 2.5356) | ||
| 12.9423(± 1.5345) | 14.4272(± 1.9554) | 15.3533(± 2.7232) | 12.3645(± 0.65434) | 14.2534(± 0.5485) |
Fig. 4Random walk on BCI 5F dataset
Mean error (± Standard deviation) for Test set (k = 6)
| Method | KTA | HHG | CCGAN | |||
|---|---|---|---|---|---|---|
| Dataset ↓ | ||||||
| LapSVM | LapRLSC | LapSVM | LapRLSC | LapSVM | LapRLSC | |
| 3.3(± 1.5344) | 3.8447(1.9252) | 24.0332(± 1.6126) | 23.4384(± 1.8533) | 2.5867(± 1.1814) | ||
| 5.0219(± 0.3899) | 4.7491(0.39956) | 3.5963(± 2.5182) | 2.2731(± 0.3316) | 2.4104(± 0.33458) | 2.3303(± 0.34844) | |
| 9.2015(± 2.0432) | 18.511(3.0134) | 10.4607(± 1.7727) | 10.214(± 1.4854) | 18.0368(± 3.0749) | ||
| 24.8703(± 0.39415) | 29.7855(0.57972) | 20.7107(± 0.42489) | 19.5775(± 0.56837) | 20.7732(± 0.43116) | ||
| 33.7413(± 1.3908) | 35.212(± 1.454) | 31.3532(± 1.0618) | 30.7642(± 0.7105) | 30.5624(± 0.93737) | ||
| 13.321(± 9.323) | 12.983(± 8.546) | 10.1609(± 9.0374) | 10.0636(± 9.0922) | 8.5574(± 9.3981) | ||
| 26.763(± 3.2439) | 26.1543(± 3.6545) | 25.0995(± 3.5483) | 26.0824(± 3.7675) | 23.8519(± 2.8291) | ||
| 32.765(± 1.652) | 28.6259(± 1.8406) | 26.0824(± 3.7675) | 25.6428(± 1.8901) | 28.7214(± 1.8789) | ||
| 25.1902(± 3.978) | 26.1624(3.6115) | 19.9342(± 2.1742) | 20.3912(± 2.7141) | 17.8757(± 2.3333) | ||
Mean error (± Standard deviation) for Unlabeled set (k = 6)
| Method | KTA | HHG | CCGAN | |||
|---|---|---|---|---|---|---|
| Dataset ↓ | ||||||
| LapSVM | LapRLSC | LapSVM | LapRLSC | LapSVM | LapRLSC | |
| 2.7856(± 1.065) | 2.8805(± 1.0663) | 26.2032(± 1.3588) | 25.6689(± 1.4892) | 1.7102(± 0.9751) | ||
| 3.8451(± 0.411) | 4.8413(± 0.41825) | 3.5037(± 2.5778) | 2.176(± 0.35363) | 2.2367(± 0.33978) | ||
| 1.106(± 4.3124) | 4.1049(± 1.3093) | 3.5013(± 2.1274) | 2.3337(± 2.0545) | 1.0551(± 1.3974) | ||
| 19.832(± 0.40552) | 19.0049(± 0.56849) | 19.5145(± 0.45644) | 18.6783(± 0.59023) | 19.6514(± 0.43513) | 18.796(± 0.58017) | |
| 32.834(± 1.178) | 30.512(± 1.768) | 31.1762(± 1.0589) | 28.7446(± 0.6641) | 30.3794(± 0.95679) | ||
| 10.418(± 7.432) | 11.623(± 6.445) | 8.9558(± 8.8238) | 8.9379(± 8.6683) | 6.9222(± 8.9307) | 6.7162(± 8.6574) | |
| 25.765(± 1.232) | 26.141(± 1.242) | 27.3655(± 2.183) | 29.4454(± 2.6659) | 25.5714(± 2.8975) | ||
| 27.758(± 1.653) | 28.637(± 1.23) | 25.6181(± 1.8376) | 25.8012(± 1.8334) | 23.7501(± 1.7327) | ||
| 22.9309(± 3.1456) | 23.5219(± 2.6424) | 17.8556(± 1.6498) | 17.8501(± 1.693) | 16.8485(± 1.3853) | ||