| Literature DB >> 35327841 |
Xiaohui Li1, Zhiliang Wang1, Zhaohui Zhang2.
Abstract
Large-scale knowledge graphs not only store entities and relations but also provide ontology-based information about them. Type constraints that exist in this information are of great importance for link prediction. In this paper, we proposed a novel complex embedding method, CHolE, in which complex circular correlation was introduced to extend the classic real-valued compositional representation HolE to complex domains, and type constraints were integrated into complex representational embeddings for improving link prediction. The proposed model consisted of two functional components, the type constraint model and the relation learning model, to form type constraints such as modulus constraints and acquire the relatedness between entities accurately by capturing rich interactions in the modulus and phase angles of complex embeddings. Experimental results on benchmark datasets showed that CHolE outperformed previous state-of-the-art methods, and the impartment of type constraints improved its performance on link prediction effectively.Entities:
Keywords: complex circular correlation; complex embedding; link prediction; type constraint
Year: 2022 PMID: 35327841 PMCID: PMC8947114 DOI: 10.3390/e24030330
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1Example of TCE and TCR in Freebase.
Figure 2Circular correlation as the compression of the tensor product.
Figure 3Summation processes over a fixed partition in the complex circular correlation.
Figure 4Mechanisms of Modulus Constraint and Phase Interaction.
Symbols and descriptions.
| Symbols | Descriptions | Symbols | Descriptions |
|---|---|---|---|
| KG | knowledge graph |
| TCR relation |
| E | entity set | S | triple set |
| C | type (concept) set | S | general triple set |
| R | relation set | S | type constraint triple set |
| R | instance-level relation set | S | TCE triple set |
| R | type constraint relation set | S | TCR triple set |
|
| TCE ( |
Figure 5An overview of CHolE model. The leftmost part contains a part of KG that includes seven entities, their types, and one relationship between them. The TCM learns the TCE and TCR with the modulus of complex vectors (green box). The RLM models the detailed interactions with the modulus and phase angles of complex (blue box). The solid circle denotes the real number, and the slash-marked circle with arrow line denotes the complex number (slash-marked circle without arrow line is the moduli of complex number). For a brief description: let the relationship of phase angles be zero and simplify the phase angles of golden entity pairs to be the same to make the phase difference of entries in head and tail entities be zero (this can be also regarded as a special case in Equation (9).
Statistics of FB15K-571 and FB15K-237-TC.
| Dataset | FB15K-571 | FB15K-237-TC |
|---|---|---|
| #Entity * | 14,951 | 14,541 |
| #Type | 571 | 542 |
| #General (Instance-level) Relation | 1345 | 237 |
| #General Relation Triple | 592,213 | 310,116 |
| #TCE ( | 123,842 | 121,287 |
| #TCR Triple | 1345 | 237 |
| #Train (General Relation Triple) | 483,142 | 272,115 |
| #Valid (General Relation Triple) | 50,000 | 17,535 |
| #Test (General Relation Triple) | 59,071 | 20,466 |
* The #X represents the number of elements in the X set.
Link prediction results on FB15K-571 and FB15K-237-TC *.
| Dataset | FB15K-571 | FB15K-237-TC | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Metrics | MRR | Hits@N | MRR | Hits@N | ||||||
| Setting | Raw | Filter | N = 1 | N = 3 | N = 10 | Raw | Filter | N = 1 | N = 3 | N = 10 |
| TransE | 0.417 | 0.150 | 0.314 | 0.476 | 0.144 | 0.233 | 0.147 | 0.263 | 0.398 | |
| TransH | 0.495 | 0.284 | 0.535 | 0.641 | 0.136 | 0.041 | 0.160 | 0.331 | ||
| RESCAL | 0.189 | 0.354 | 0.235 | 0.409 | 0.587 | 0.255 | 0.185 | 0.278 | 0.397 | |
| DistMult | 0.350 | 0.577 | 0.100 | 0.191 | 0.106 | 0.207 | 0.376 | |||
| HolE |
| 0.524 | 0.402 | 0.613 | 0.739 | 0.124 | 0.222 | 0.133 | 0.253 | 0.391 |
| ComplEx | 0.223 | 0.485 | 0.347 | 0.577 | 0.729 | 0.109 | 0.201 | 0.112 | 0.213 | 0.388 |
| CHolE (RL only) |
| 0.510 | 0.387 | 0.601 | 0.725 |
|
|
|
|
|
| CHolE (TC+RL) | 0.231 |
|
|
|
|
|
|
|
|
|
* Best score is in bold, and scores that are underlined represent better results than the original model HolE. For FB15K-571, the scores of DistMult [14] and HolE [17] are taken from the corresponding original papers, the results of TransE [11] and TransH [12] are taken from [39], and the result of RESACAL [15] comes from [17]. For FB15K-237-TC, the scores of TransE [11], DistMult [14], HolE [17], and ComplEx [16] are taken from [29], and the results of TransH [12] and RESCAL [15] come from [39,40], respectively.
Figure 6Effect of the strength of type constraints on the results of link prediction.
Figure 7Effect of the scales of type constraints on the results of link prediction.