Chengchun Shi, Wenbin Lu, Rui Song.
Abstract
Statistical relational learning is primarily concerned with learning and inferring relationships between entities in large-scale knowledge graphs. Nickel et al. (2011) proposed the RESCAL tensor factorization model for statistical relational learning, which achieves better or at least comparable results on common benchmark data sets compared with other state-of-the-art methods. Given a positive integer s, RESCAL computes an s-dimensional latent vector for each entity. These latent factors can then be used to solve relational learning tasks such as collective classification, collective entity resolution and link-based clustering. The focus of this paper is to determine the number of latent factors in the RESCAL model. Because of the structure of the RESCAL model, its log-likelihood function is not concave. As a result, the corresponding maximum likelihood estimators (MLEs) may not be consistent. Nonetheless, we design a specific pseudometric, prove the consistency of the MLEs under this pseudometric and establish their rate of convergence. Based on these results, we propose a general class of information criteria and prove their model selection consistency when the number of relations is either bounded or diverges at a suitable rate relative to the number of entities. Simulations and real data examples show that the proposed information criteria have good finite-sample properties.
Keywords: Information criteria; Knowledge graph; Model selection consistency; RESCAL model; Statistical relational learning; Tensor factorization
Year: 2019 PMID: 31983896 PMCID: PMC6980192
Source DB: PubMed Journal: J Mach Learn Res ISSN: 1532-4435 Impact factor: 5.177
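The abstract describes RESCAL as assigning an s-dimensional latent vector to each entity. A minimal sketch of the bilinear score from Nickel et al. (2011), a_i^T R_k a_j, is given below. The Bernoulli log-likelihood with a logistic link is an assumption made here for illustration, since the abstract does not spell out the likelihood, and the names `rescal_scores`, `bernoulli_loglik`, `A`, `R` and `Q` are hypothetical. The last lines show that the scores are unchanged when the latent vectors are transformed by an invertible matrix `Q`, which is one plausible reason a pseudometric, rather than a metric on the raw parameters, is the natural way to measure estimation error.

```python
import numpy as np

def rescal_scores(A, R):
    """Bilinear RESCAL scores.

    A: (n, s) array, one s-dimensional latent vector per entity.
    R: (K, s, s) array, one s-by-s matrix per relation.
    Returns a (K, n, n) array whose [k, i, j] entry is a_i^T R_k a_j.
    """
    return np.einsum("is,kst,jt->kij", A, R, A)

def bernoulli_loglik(Y, A, R):
    """Log-likelihood of binary links Y under a logistic link (an assumption)."""
    theta = rescal_scores(A, R)
    # sum over all (k, i, j) of y * theta - log(1 + exp(theta)), computed stably
    return float(np.sum(Y * theta - np.logaddexp(0.0, theta)))

rng = np.random.default_rng(0)
n, K, s = 5, 2, 3                      # toy sizes, chosen arbitrarily
A = rng.normal(size=(n, s))
R = rng.normal(size=(K, s, s))
scores = rescal_scores(A, R)
Y = rng.binomial(1, 1.0 / (1.0 + np.exp(-scores)))
loglik = bernoulli_loglik(Y, A, R)

# Non-identifiability: (A Q, Q^{-1} R_k Q^{-T}) yields the same scores as (A, R_k).
Q = np.array([[2.0, 1.0, 0.0], [0.0, 1.0, 1.0], [1.0, 0.0, 3.0]])  # det = 7
Q_inv = np.linalg.inv(Q)
A2 = A @ Q
R2 = np.einsum("ab,kbc,dc->kad", Q_inv, R, Q_inv)
same_scores = np.allclose(rescal_scores(A2, R2), scores)
```

Because the score is quadratic in `A`, the log-likelihood is not jointly concave in `(A, R)` even though it is concave in the scores themselves.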
Simulation results for Settings I, II and III (standard errors in parentheses)

| Criterion | TP | | TP | | TP | |
|---|---|---|---|---|---|---|
| IC0 | 0.97 (0.02) | 2.03 (0.02) | 0.97 (0.02) | 4.03 (0.02) | 0.90 (0.03) | 7.90 (0.03) |
| IC0.5 | 0.97 (0.02) | 2.03 (0.02) | 0.98 (0.01) | 4.02 (0.01) | 0.90 (0.03) | 7.90 (0.03) |
| IC1 | 0.97 (0.02) | 2.03 (0.02) | 0.98 (0.01) | 4.02 (0.01) | 0.89 (0.03) | 7.89 (0.03) |
| BIC | 0.00 (0.00) | 11.99 (0.01) | 0.00 (0.00) | 12.00 (0.00) | 0.00 (0.00) | 11.99 (0.01) |

| Criterion | TP | | TP | | TP | |
|---|---|---|---|---|---|---|
| IC0 | 0.99 (0.01) | 2.01 (0.01) | 0.97 (0.02) | 4.03 (0.02) | 0.96 (0.02) | 8.04 (0.02) |
| IC0.5 | 0.99 (0.01) | 2.01 (0.01) | 0.97 (0.02) | 4.03 (0.02) | 0.96 (0.02) | 8.04 (0.02) |
| IC1 | 0.99 (0.01) | 2.01 (0.01) | 0.97 (0.02) | 4.03 (0.02) | 0.96 (0.02) | 8.04 (0.02) |
| BIC | 0.00 (0.00) | 12.00 (0.00) | 0.00 (0.00) | 12.00 (0.00) | 0.00 (0.00) | 11.98 (0.01) |

| Criterion | TP | | TP | | TP | |
|---|---|---|---|---|---|---|
| IC0 | 0.99 (0.01) | 2.01 (0.01) | 0.95 (0.02) | 4.05 (0.02) | 0.95 (0.02) | 8.05 (0.02) |
| IC0.5 | 0.99 (0.01) | 2.01 (0.01) | 0.95 (0.02) | 4.05 (0.02) | 0.95 (0.02) | 8.05 (0.02) |
| IC1 | 0.99 (0.01) | 2.01 (0.01) | 0.95 (0.02) | 4.05 (0.02) | 0.95 (0.02) | 8.05 (0.02) |
| BIC | 0.00 (0.00) | 12.00 (0.00) | 0.00 (0.00) | 11.99 (0.01) | 0.00 (0.00) | 11.98 (0.01) |

Simulation results for Settings IV, V and VI (standard errors in parentheses)

| Criterion | TP | | TP | | TP | |
|---|---|---|---|---|---|---|
| IC0 | 1.00 (0.00) | 2.00 (0.00) | 0.97 (0.02) | 4.03 (0.02) | 0.69 (0.05) | 7.91 (0.06) |
| IC0.5 | 1.00 (0.00) | 2.00 (0.00) | 0.97 (0.02) | 4.03 (0.02) | 0.66 (0.05) | 7.75 (0.06) |
| IC1 | 1.00 (0.00) | 2.00 (0.00) | 0.98 (0.01) | 4.02 (0.01) | 0.60 (0.05) | 7.62 (0.06) |
| BIC | 0.00 (0.00) | 11.81 (0.06) | 0.00 (0.00) | 11.60 (0.06) | 0.01 (0.01) | 11.67 (0.07) |

| Criterion | TP | | TP | | TP | |
|---|---|---|---|---|---|---|
| IC0 | 0.97 (0.02) | 2.03 (0.02) | 0.95 (0.02) | 4.05 (0.02) | 0.73 (0.04) | 8.46 (0.10) |
| IC0.5 | 0.97 (0.02) | 2.03 (0.02) | 0.98 (0.01) | 4.02 (0.01) | 0.87 (0.03) | 8.09 (0.03) |
| IC1 | 0.98 (0.01) | 2.02 (0.02) | 1.00 (0.00) | 4.00 (0.00) | 0.79 (0.04) | 7.99 (0.05) |
| BIC | 0.00 (0.00) | 12.00 (0.00) | 0.00 (0.00) | 11.92 (0.03) | 0.00 (0.00) | 11.99 (0.01) |

| Criterion | TP | | TP | | TP | |
|---|---|---|---|---|---|---|
| IC0 | 0.98 (0.01) | 2.02 (0.01) | 0.93 (0.03) | 4.07 (0.03) | 0.17 (0.04) | 11.24 (0.15) |
| IC0.5 | 0.99 (0.01) | 2.01 (0.01) | 0.97 (0.02) | 4.03 (0.02) | 0.76 (0.04) | 8.24 (0.05) |
| IC1 | 1.00 (0.00) | 2.00 (0.00) | 0.98 (0.01) | 4.02 (0.01) | 0.79 (0.04) | 7.99 (0.05) |
| BIC | 0.00 (0.00) | 12.00 (0.00) | 0.00 (0.00) | 12.00 (0.00) | 0.00 (0.00) | 11.99 (0.01) |

AUC scores

| | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AUC | 0.7201 | 0.8341 | 0.8952 | 0.9095 | 0.9257 | 0.9364 | 0.9444 | 0.9486 | 0.9513 | 0.9518 | 0.9485 | 0.9467 |

| Subject | Predicate | Object |
|---|---|---|
| Jon Snow | character in | A Song of Ice and Fire |
| Jon Snow | character in | Game of Thrones |
| A Song of Ice and Fire | genre | novel |
| Game of Thrones | genre | television series |
| George R.R. Martin | author of | A Song of Ice and Fire |
| George R.R. Martin | profession | novelist |
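The triples above can be encoded as the binary relation-by-entity-by-entity tensor on which tensor factorization models such as RESCAL operate. The following is a minimal sketch; the variable names (`triples`, `ent_idx`, `rel_idx`, `X`) and the alphabetical index ordering are illustrative choices, not taken from the source.

```python
import numpy as np

# (subject, predicate, object) triples copied from the table above
triples = [
    ("Jon Snow", "character in", "A Song of Ice and Fire"),
    ("Jon Snow", "character in", "Game of Thrones"),
    ("A Song of Ice and Fire", "genre", "novel"),
    ("Game of Thrones", "genre", "television series"),
    ("George R.R. Martin", "author of", "A Song of Ice and Fire"),
    ("George R.R. Martin", "profession", "novelist"),
]

# Index every distinct entity (subjects and objects) and every distinct relation.
entities = sorted({e for subj, _, obj in triples for e in (subj, obj)})
relations = sorted({pred for _, pred, _ in triples})
ent_idx = {e: i for i, e in enumerate(entities)}
rel_idx = {r: k for k, r in enumerate(relations)}

# X[k, i, j] = 1 when relation k holds from entity i to entity j, else 0.
X = np.zeros((len(relations), len(entities), len(entities)), dtype=np.int8)
for subj, pred, obj in triples:
    X[rel_idx[pred], ent_idx[subj], ent_idx[obj]] = 1
```

Here each slice `X[k]` is the adjacency matrix of one relation, so the six triples produce exactly six nonzero entries spread over four slices.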
Simulation results for Settings I, II and III (standard errors in parentheses)

| Criterion | TP | | TP | | TP | |
|---|---|---|---|---|---|---|
| IC0 | 1.00(0.00) | 2.00(0.00) | 0.96(0.02) | 3.98(0.02) | 0.88(0.03) | 5.87(0.04) |
| IC0.5 | 1.00(0.00) | 2.00(0.00) | 0.96(0.02) | 3.98(0.02) | 0.88(0.03) | 5.87(0.04) |
| IC1 | 1.00(0.00) | 2.00(0.00) | 0.96(0.02) | 3.98(0.02) | 0.85(0.04) | 5.81(0.05) |
| BIC | 0.00(0.00) | 11.98(0.01) | 0.00(0.00) | 11.99(0.01) | 0.00(0.00) | 12.00(0.00) |

| Criterion | TP | | TP | | TP | |
|---|---|---|---|---|---|---|
| IC0 | 1.00(0.00) | 2.00(0.00) | 0.97(0.02) | 4.03(0.02) | 0.94(0.02) | 6.04(0.02) |
| IC0.5 | 1.00(0.00) | 2.00(0.00) | 0.97(0.02) | 4.03(0.02) | 0.94(0.02) | 6.04(0.02) |
| IC1 | 1.00(0.00) | 2.00(0.00) | 0.97(0.02) | 4.03(0.02) | 0.94(0.02) | 6.04(0.02) |
| BIC | 0.00(0.00) | 12.00(0.00) | 0.00(0.00) | 12.00(0.00) | 0.00(0.00) | 11.99(0.01) |

| Criterion | TP | | TP | | TP | |
|---|---|---|---|---|---|---|
| IC0 | 1.00(0.00) | 2.00(0.00) | 0.97(0.02) | 4.03(0.02) | 0.98(0.01) | 6.02(0.01) |
| IC0.5 | 1.00(0.00) | 2.00(0.00) | 0.97(0.02) | 4.03(0.02) | 0.98(0.01) | 6.02(0.01) |
| IC1 | 1.00(0.00) | 2.00(0.00) | 0.97(0.02) | 4.03(0.02) | 0.98(0.01) | 6.02(0.01) |
| BIC | 0.00(0.00) | 12.00(0.00) | 0.00(0.00) | 12.00(0.00) | 0.00(0.00) | 11.99(0.01) |

Simulation results for Settings IV, V and VI (standard errors in parentheses)

| Criterion | TP | | TP | | TP | |
|---|---|---|---|---|---|---|
| IC0 | 1.00(0.00) | 2.00(0.00) | 0.96(0.02) | 3.98(0.02) | 0.73(0.04) | 5.83(0.06) |
| IC0.5 | 1.00(0.00) | 2.00(0.00) | 0.95(0.02) | 3.97(0.02) | 0.69(0.05) | 5.77(0.06) |
| IC1 | 1.00(0.00) | 2.00(0.00) | 0.93(0.03) | 3.93(0.03) | 0.63(0.05) | 5.57(0.07) |
| BIC | 0.00(0.00) | 11.83(0.05) | 0.00(0.00) | 11.82(0.04) | 0.00(0.00) | 11.86(0.04) |

| Criterion | TP | | TP | | TP | |
|---|---|---|---|---|---|---|
| IC0 | 0.98(0.01) | 2.02(0.01) | 0.90(0.03) | 4.10(0.03) | 0.76(0.04) | 6.06(0.05) |
| IC0.5 | 0.98(0.01) | 2.02(0.01) | 0.94(0.02) | 3.98(0.02) | 0.81(0.04) | 5.99(0.04) |
| IC1 | 0.98(0.01) | 2.02(0.01) | 0.94(0.02) | 3.94(0.02) | 0.74(0.04) | 5.81(0.05) |
| BIC | 0.00(0.00) | 12.00(0.00) | 0.00(0.00) | 12.00(0.00) | 0.00(0.00) | 11.99(0.01) |

| Criterion | TP | | TP | | TP | |
|---|---|---|---|---|---|---|
| IC0 | 0.96(0.02) | 2.04(0.02) | 0.88(0.03) | 4.12(0.03) | 0.68(0.05) | 6.57(0.13) |
| IC0.5 | 0.98(0.01) | 2.02(0.01) | 0.94(0.02) | 4.04(0.02) | 0.82(0.04) | 6.06(0.04) |
| IC1 | 0.98(0.01) | 2.02(0.01) | 0.94(0.02) | 4.02(0.02) | 0.74(0.04) | 5.75(0.05) |
| BIC | 0.00(0.00) | 12.00(0.00) | 0.00(0.00) | 12.00(0.00) | 0.00(0.00) | 12.00(0.00) |