| Literature DB >> 35572050 |
Nasrullah Khan1,2, Zongmin Ma1,3, Li Yan1, Aman Ullah4.
Abstract
Knowledge graph embedding (KGE) is effectively exploited in providing precise and accurate recommendations from many perspectives in different application scenarios. However, such methods that utilize entire embedded Knowledge Graph (KG) without applying information-relevance regulatory constraints fail to stop the noise penetration into the underlying information. Moreover, higher computational time complexity is a CPU overhead in KG-enhanced systems and applications. The occurrence of these limitations significantly degrade the recommendation performance. Therefore, to cope with these challenges we proposed novel KGEE (Knowledge Graph Embedding Enhancement) approach of Hashing-based Semantic-relevance Attributed Graph-embedding Enhancement (H-SAGE) to model semantically-relevant higher-order entities and relations into the unique Meta-paths. For this purpose, we introduced Node Relevance-based Guided-walk (NRG) modeling technique. Further, to deal with the computational time-complexity, we converted the relevant information to the Hash-codes and proposed Deep-Probabilistic (dProb) technique to place hash-codes in the relevant hash-buckets. Again, we used dProb to generate guided function-calls to maximize the possibility of Hash-Hits in the hash-buckets. In case of Hash-Miss, we applied Locality Sensitive (LS) hashing to retrieve the required information. We performed experiments on three benchmark datasets and compared the empirical as well as the computational performance of H-SAGE with the baseline approaches. The achieved results and comparisons demonstrate that the proposed approach has outperformed the-state-of-the-art methods in the mentioned facets of evaluation.Entities:
Keywords: DNN; Hashing; Information relevance; KGEE; Knowledge graph; Recommendation
Year: 2022 PMID: 35572050 PMCID: PMC9075930 DOI: 10.1007/s10489-022-03235-7
Source DB: PubMed Journal: Appl Intell (Dordr) ISSN: 0924-669X Impact factor: 5.086
Fig. 1News Recommendation’s – an exemplary scenario; where Babar interacts with “Aljazeera News” and “Afghanistan News” whereas “Kamal” visited “eco-news” published by “Bing-News”. In response, “Covid-News” and “Economy News” are recommended to both of them
Fig. 2Conversion of prominent Meta-Paths to Hashed Tracks of Binarized Information. Abbreviations used: MP – Meta Path, ep – Entity Path (one relation distance between two nodes), r – relation, eh & et – Head & Tail entities respectively
The Statistics of Datasets Utilized
| Literals | Datasets | ||
|---|---|---|---|
| Amazon-Book | Last-FM | Bing-News | |
| Domain | Books | Music | News |
| Users | 55,255 | 1865 | 40,237 |
| Items | 20,235 | 6526 | 32,562 |
| Interactions | 232,562 | 68,456 | 192,356 |
| Entities | 77,253 | 12,039 | 76,586 |
| 29 | 58 | 45 | |
| 122,562 | 29,650 | 115,620 | |
| Sparsity | 0.999792 | 0.994375 | 0.999853 |
| Training | 162,793 | 47,919 | 134,649 |
| Testing | 46,512 | 13,691 | 38,471 |
| Validation | 23,256 | 6845 | 19,235 |
The settings of hyper parameters wrt the experimental Environment. Literals: μ – Dropout Ration, b – Batch Size, η – Learning Rate, λ – L2 Regularization Weight, ε – KGE Weight, s – hop-length wrt path-steps, d – Dimensions of Embedding, K – The sampling size of influential neighboring
| Datasets | Environment Setting |
|---|---|
| Amazon-Book | |
| Last-FM | |
| Bing-News |
CTRP Results: Evaluated wrt AUC and Acc. Terms: Upper-Bound (α), Lower-Bound (β), Mean ()
| Approaches | Amazon-Book | Last-FM | Bing-News | ||||
|---|---|---|---|---|---|---|---|
| AUC | Acc | AUC | Acc | AUC | Acc | ||
| PER | 0.6210 | 0.5812 | 0.6129 | 0.5866 | 0.5153 | 0.4932 | |
| CKE | 0.6419 | 0.6056 | 0.7389 | 0.6632 | 0.5432 | 0.5011 | |
| MCRec | 0.6421 | 0.6287 | 0.7412 | 0.6701 | 0.5814 | 0.5633 | |
| RippleNet | 0.6646 | 0.6419 | 0.7611 | 0.6787 | 0.6418 | 0.6002 | |
| KGAT | 0.6789 | 0.6498 | 0.7691 | 0.6823 | 0.6796 | 0.6437 | |
| AKGE | 0.6641 | 0.6399 | 0.7785 | 0.6891 | 0.6632 | 0.6411 | |
| NACF | 0.7063 | 0.6734 | 0.7913 | 0.7232 | 0.6952 | 0.6741 | |
| DKEN | |||||||
| Improved: (%)-age | 03.79 | 04.12 | 03.54 | 04.04 | 02.97 | 03.20 | |
| 03.94 | 04.30 | 03.67 | 04.21 | 03.06 | 03.31 | ||
*The numbers in bold represent the most significant values among the identical comparing outcomes, and the numbers in italic with '*' describe the second important values accordingly
Top-k Recommendations via Prec, Rec & NDCG. Terms: Upper-Bound (α), Lower-Bound (β), Mean ()
| Datasets & Evaluation @ | Comparison and Improvements of H-SAGE wrt the Baseline Approaches | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PER | CKE | MCRec | RNeta | KGAT | AKGE | NACF | DKEN | H-SAGE | Improved: (%)-age | ||||
| Amazon-Book | Prec@05 | 0.057 | 0.060 | 0.067 | 0.074 | 0.077 | 0.078 | 0.090 | 13.64 | 15.79 | |||
| Prec@10 | 0.052 | 0.050 | 0.060 | 0.076 | 0.080 | 0.080 | 0.088 | 10.48 | 11.70 | ||||
| Rec@05 | 0.027 | 0.031 | 0.030 | 0.032 | 0.035 | 0.035 | 0.039 | 04.76 | 05.00 | ||||
| Rec@10 | 0.029 | 0.032 | 0.034 | 0.035 | 0.037 | 0.040 | 0.043 | 06.25 | 06.67 | ||||
| NDCG@05 | 0.031 | 0.031 | 0.031 | 0.032 | 0.033 | 05.56 | 05.88 | ||||||
| NDCG@10 | 0.032 | 0.032 | 0.034 | 0.035 | 0.035 | 05.13 | 05.41 | ||||||
| Last-FM | Prec@05 | 0.057 | 0.060 | 0.067 | 0.064 | 0.073 | 0.075 | 0.073 | 0.077* | 04.94 | 05.19 | ||
| Prec@10 | 0.050 | 0.053 | 0.060 | 0.061 | 0.064 | 0.064 | 0.066 | 0.068* | 04.23 | 04.41 | |||
| Rec@05 | 0.020 | 0.030 | 0.040 | 0.050 | 0.050 | 0.040 | 0.050 | 0.053* | 08.62 | 09.43 | |||
| Rec@10 | 0.030 | 0.040 | 0.050 | 0.060 | 0.070 | 0.060 | 0.080* | 0.077 | 12.09 | 13.75 | |||
| NDCG@05 | 0.030 | 0.033 | 0.045 | 0.045 | 0.050 | 0.045 | 0.055 | 0.057* | 09.52 | 10.53 | |||
| NDCG@10 | 0.060 | 0.080 | 0.090 | 0.100 | 0.110 | 0.114 | 0.119* | 0.113 | 08.46 | 09.24 | |||
| Bing-News | Prec@05 | 0.004 | 0.004 | 0.006 | 0.007 | 0.007 | 0.007 | 0.009 | 09.09 | 10.00 | |||
| Prec@10 | 0.004 | 0.004 | 0.006 | 0.007 | 0.008 | 0.008 | 09.09 | 10.00 | |||||
| Rec@05 | 0.027 | 0.027 | 0.028 | 0.032 | 0.035 | 0.035 | 0.039 | 11.11 | 12.50 | ||||
| Rec@10 | 0.028 | 0.029 | 0.030 | 0.035 | 0.037 | 0.036 | 0.043 | 18.18 | 22.22 | ||||
| NDCG@05 | 0.010 | 0.011 | 0.015 | 0.017 | 0.023 | 0.025 | 0.029 | 10.81 | 12.12 | ||||
| NDCG@10 | 0.050 | 0.070 | 0.100 | 0.110 | 0.110 | 0.115 | 0.120 | 05.30 | 05.60 | ||||
aRippleNet. *The numbers in bold represent the most significant values among the identical comparing outcomes, and the numbers in italic with '*' describe the second important values accordingly
Fig. 3Result-Analysis of top-K recommendations wrt (Prec, Rec, NDCG)@K on the three Mentioned Datasets
Study of Time Complexity Comparison of the proposed Approach with the baseline approaches. Terms used: M – Modules, L – Layers, T – Type of the mentioned item, X – No. of Modules or Layers
| Approaches | M / L Mentioned | Defined Time Complexity | Time Complexity of M / L | |||||
|---|---|---|---|---|---|---|---|---|
| No | Yes | ≥ | ||||||
| T | X | |||||||
| PER | ||||||||
| CKE | Not Defined | |||||||
| MCRec | ||||||||
| RippleNet | M | 2 | 2 | |||||
| KGAT | M | 3 | 1 | 1 | 1 | |||
| AKGE | M | 3 | 1 | 1 | 1 | |||
| NACF | Not Defined | |||||||
| DKEN | L | 4 | Not Defined | 1 | 2 | 1 | ||
| L | 2 | 1 | 1 | |||||
|
| ||||||||
| L | 2 | 1 | 1 | |||||
Fig. 4AUC and Acc results analysis wrt the Data-Sparsity on the proposed datasets
The comparison of performance among variations of H-SAGE based on path selection techniques
| H-SAGE Variants | Amazon-Book | Last-FM | Bing-News | |||
|---|---|---|---|---|---|---|
| AUC | Acc | AUC | Acc | AUC | Acc | |
| H-SAGEMS | 0.6832 | 0.6601 | 0.7532 | 0.6786 | 0.6656 | 0.6515 |
| H-SAGEMR | 0.7011 | 0.6689 | 0.7609 | 0.6910 | 0.6708 | 0.6602 |
| H-SAGEMW | 0.7219 | 0.6799 | 0.7723 | 0.6987 | 0.6898 | 0.6692 |
| H-SAGESN | 0.7342 | 0.6957 | 0.8011 | 0.7235 | 0.7029 | 0.6720 |
The numbers in bold represent the most significant values among the comparing outcomes
The comparison of performance wrt different variations of H-SAGE. Abbreviations: H-SAGELS is H-SAGE through LS (Locality Sensitive-hashing), LH (Learning to Hash), NR (Node Relevance), H-SAGELS + LH H-SAGE LS and LH and so on, & H-SAGE means H-SAGEALL i.e., NR + LS + LH
| H-SAGE Variants | Amazon-Book | Last-FM | Bing-News | |||
|---|---|---|---|---|---|---|
| AUC | Acc | AUC | Acc | AUC | Acc | |
| H-SAGELS | 0.6875 | 0.6578 | 0.7698 | 0.7192 | 0.6912 | 0.6434 |
| H-SAGELH | 0.7099 | 0.6626 | 0.7768 | 0.7278 | 0.7007 | 0.6508 |
| H-SAGELS + LH | 0.7120 | 0.6791 | 0.7811 | 0.7332 | 0.7065 | 0.6621 |
| H-SAGENR + LS | 0.7331 | 0.6902 | 0.8023 | 0.7519 | 0.7203 | 0.6842 |
| H-SAGENR + LH | 0.7482 | 0.7099 | 0.8134 | 0.7589 | 0.7289 | 0.6923 |
The numbers in bold represent the most significant values among the comparing outcomes
The result analysis of H-SAGE’s performance wrt the Meta-path length of higher-order relations
| s | Amazon-Book | Last-FM | Bing-News | |||
|---|---|---|---|---|---|---|
| AUC | Acc | AUC | Acc | AUC | Acc | |
| 1 | 0.7326 | 0.7023 | 0.8091 | 0.7485 | 0.7101 | 0.6819 |
| 2 | 0.7506 | 0.7179 | 0.8259 | 0.7622 | 0.7298 | 0.7011 |
| 0.8309 | 0.7181 | |||||
| 0.7547 | 0.7122 | 0.7681 | 0.7388 | |||
| 5 | 0.7101 | 0.6599 | 0.7546 | 0.6697 | 0.6755 | 0.6257 |
The numbers in bold represent the most significant values among the comparing outcomes
The result analysis of H-SAGE’s performance wrt the embedding length of entity representations
| Amazon-Book | Last-FM | Bing-News | ||||
|---|---|---|---|---|---|---|
| AUC | Acc | AUC | Acc | AUC | Acc | |
| 21 | 0.7124 | 0.6675 | 0.7839 | 0.7314 | 0.6892 | 0.6723 |
| 22 | 0.7299 | 0.6835 | 0.8036 | 0.7497 | 0.7036 | 0.6860 |
| 23 | 0.7432 | 0.7018 | 0.8199 | 0.7536 | 0.7212 | 0.7011 |
| 0.7521 | 0.7156 | 0.7622 | 0.7376 | 0.7127 | ||
| 0.8311 | 0.7724 | |||||
| 0.7501 | 0.7125 | 0.8232 | 0.7388 | 0.7117 | ||
| 27 | 0.6855 | 0.6432 | 0.7623 | 0.7109 | 0.6645 | 0.6433 |
The numbers in bold represent the most significant values among the comparing outcomes
The result analysis of H-SAGE’s performance wrt the sampling length of influential neighboring
| K | Amazon-Book | Last-FM | Bing-News | |||
|---|---|---|---|---|---|---|
| AUC | Acc | AUC | Acc | AUC | Acc | |
| 21 | 0.7367 | 0.6731 | 0.7823 | 0.7278 | 0.7065 | 0.6802 |
| 22 | 0.7432 | 0.6891 | 0.8067 | 0.7412 | 0.7208 | 0.6898 |
| 23 | 0.7509 | 0.7030 | 0.8235 | 0.7546 | 0.7316 | 0.7032 |
| 0.7156 | 0.7622 | 0.7376 | 0.7127 | |||
| 0.7552 | 0.8311 | 0.7724 | ||||
| 0.7501 | 0.7125 | 0.8232 | 0.7388 | 0.7117 | ||
| 27 | 0.6835 | 0.6445 | 0.7456 | 0.6875 | 0.6643 | 0.6457 |
The numbers in bold represent the most significant values among the comparing outcomes
Fig. 5Case Study of a daily-based news recommendation scenario