| Literature DB >> 35174275 |
Ilya Makarov1,2,3, Andrey Savchenko4, Arseny Korovko1, Leonid Sherstyuk1, Nikita Severin5, Dmitrii Kiselev1,3, Aleksandr Mikheev1,6, Dmitrii Babaev3,6.
Abstract
Many tasks in graph machine learning, such as link prediction and node classification, are typically solved using representation learning. Each node or edge in the network is encoded via an embedding. Though there exists a lot of network embeddings for static graphs, the task becomes much more complicated when the dynamic (i.e., temporal) network is analyzed. In this paper, we propose a novel approach for dynamic network representation learning based on Temporal Graph Network by using a highly custom message generating function by extracting Causal Anonymous Walks. We provide a benchmark pipeline for the evaluation of temporal network embeddings. This work provides the first comprehensive comparison framework for temporal network representation learning for graph machine learning problems involving node classification and link prediction in every available setting. The proposed model outperforms state-of-the-art baseline models. The work also justifies their difference based on evaluation in various transductive/inductive edge/node classification tasks. In addition, we show the applicability and superior performance of our model in the real-world downstream graph machine learning task provided by one of the top European banks, involving credit scoring based on transaction data. ©2022 Makarov et al.Entities:
Keywords: Dynamic networks; Temporal graph attention; Temporal network embedding; Temporal networks; Temporal random walks
Year: 2022 PMID: 35174275 PMCID: PMC8802774 DOI: 10.7717/peerj-cs.858
Source DB: PubMed Journal: PeerJ Comput Sci ISSN: 2376-5992
Figure 1Flowchart of the evaluation framework.
Figure 2The proposed model.
Numbers denote the order of steps.
Descriptive statistics for the datasets.
Left to right: whether the graph is bipartite, number of unique nodes (representing users and items) and edges, labels, average degree, number of edge updates per node as source/target, and setting for splitting the temporal network into batches for snapshot DTDG models.
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|
|
| Yes | 10000/984 | 672447 | 366 | 61.37 | 67.24/683.38 | Daily |
|
| Yes | 8227/1000 | 157474 | 217 | 17.19 | 19.14/157.47 | Daily |
|
| No | 185 | 125236 | – | 29.06 | 675.78 | 1% |
|
| No | 1900 | 59836 | – | 13.04 | 106.41 | 1% |
|
| No | 231288 | 300000 | – | 2.34 | 2.59 | 1% |
Ablation study on Reddit dataset for inductive edge prediction.
| Enabled modules | AUC-ROC |
|---|---|
| (Msg)+(Emb)+(RNN) | 0.894 ± 0.034 |
| (Emb) | 0.893 ± 0.035 |
| (Msg) | 0.888 ± 0.042 |
| (Emb)+(Msg) | 0.878 ± 0.051 |
| (Msg)+(RNN) | 0.876 ± 0.047 |
| (Emb)+(RNN) | 0.870 ± 0.055 |
| TGN baseline | 0.865 ± 0.065 |
The average precision of our model depends on parameters.
| Data | Number of walks | Hops | Dimension | AP |
|---|---|---|---|---|
| UCI | 32 | 1 | 10 | 0.759 ± 0.000 |
| UCI | 32 | 1 | 100 | 0.757 ± 0.009 |
| UCI | 8 | 2 | 10 | 0.764 ± 0.001 |
| UCI | 8 | 2 | 100 | 0.767 ± 0.010 |
| Wikipedia | 32 | 1 | 10 | 0.898 ± 0.012 |
| Wikipedia | 32 | 1 | 100 | 0.909 ± 0.002 |
| Wikipedia | 8 | 2 | 10 | 0.909 ± 0.007 |
| Wikipedia | 8 | 2 | 100 | 0.912 ± 0.011 |
Figure 3Inductive average precision depends on the epoch number on Reddit dataset.
Transductive edge prediction, AUC-ROC.
Best results in bold, second-best underlined.
| Node mask | Edge mask | DyRep | Jodie | TGAT | TigeCMN | APAN | HiLi | TGN | Ours | |
|---|---|---|---|---|---|---|---|---|---|---|
| 10% | 75% | 0.774 | 0.534 |
| – | 0.744 | 0.714 | 0.755 |
| |
| Enron | 75% | 10% | 0.647 | 0.528 | 0.711 | – |
| 0.636 | 0.660 |
|
| 75% | 75% | 0.595 | 0.537 | 0.590 | – |
| 0.709 | 0.534 |
| |
| 10% | 75% | 0.964 | 0.704 | 0.973 | 0.961 | 0.497 | 0.948 |
|
| |
| 75% | 10% | 0.973 | 0.825 | 0.962 | 0.824 | 0.502 | 0.961 |
|
| |
| 75% | 75% | 0.959 | 0.720 | 0.936 |
| 0.500 | 0.950 |
| 0.920 | |
| 10% | 75% | 0.770 | 0.496 | 0.835 | – | 0.858 | 0.546 |
|
| |
| UCI | 75% | 10% | 0.731 | 0.573 | 0.826 | – | 0.867 | 0.556 |
|
|
| 75% | 75% | 0.781 | 0.489 | 0.834 | – |
| 0.539 | 0.776 |
| |
| 10% | 75% | 0.966 | 0.728 | 0.970 | 0.824 | 0.556 | 0.863 |
|
| |
| Wikipedia | 75% | 10% | 0.964 | 0.737 | 0.961 | 0.835 | 0.504 | 0.874 |
|
|
| 75% | 75% | 0.962 | 0.719 | 0.931 | 0.818 | 0.494 | 0.865 |
|
| |
| 10% | 75% | 0.728 | 0.914 | 0.924 | – |
| – | 0.939 |
| |
| Ethereum | 75% | 10% | 0.868 | 0.929 | 0.917 | – |
| – | 0.942 |
|
| 75% | 75% | 0.730 | 0.919 | 0.920 | – |
| – | 0.928 |
|
Inductive edge prediction, AUC-ROC.
Best results in bold, second-best underlined.
| Node mask | Edge mask | DyRep | Jodie | TGAT | TigeCMN | APAN | HiLi | TGN | Ours | |
|---|---|---|---|---|---|---|---|---|---|---|
| 10% | 75% | 0.613 | 0.453 | 0.761 | – |
| 0.552 | 0.723 |
| |
| Enron | 75% | 10% | 0.672 | 0.508 |
| – | 0.691 | 0.567 | 0.627 |
|
| 75% | 75% | 0.666 | 0.573 | 0.586 | – |
| 0.556 | 0.536 |
| |
| 10% | 75% | 0.832 | 0.588 | 0.959 | 0.803 | 0.527 | 0.847 |
|
| |
| 75% | 10% | 0.910 | 0.294 | 0.960 | 0.644 | 0.511 | 0.928 |
|
| |
| 75% | 75% | 0.880 | 0.298 |
| 0.680 | 0.494 | 0.924 |
| 0.915 | |
| 10% | 75% | 0.602 | 0.576 |
| – | 0.675 | 0.546 | 0.825 |
| |
| UCI | 75% | 10% | 0.723 | 0.431 | 0.816 | – | 0.745 | 0.517 |
|
|
| 75% | 75% | 0.718 | 0.423 |
| – | 0.750 | 0.601 | 0.775 |
| |
| 10% | 75% | 0.950 | 0.584 | 0.964 | 0.644 | 0.500 | 0.824 |
|
| |
| Wikipedia | 75% | 10% | 0.902 | 0.462 | 0.960 | 0.456 | 0.517 | 0.839 |
|
|
| 75% | 75% | 0.888 | 0.470 |
| 0.498 | 0.498 | 0.859 | 0.914 |
| |
| 10% | 75% | 0.519 | 0.655 | 0.777 | – | 0.772 | – |
|
| |
| Ethereum | 75% | 10% | 0.483 | 0.629 | 0.751 | – | 0.750 | – |
|
|
| 75% | 75% | 0.494 | 0.616 | 0.763 | – | 0.760 | – |
|
|
Node classification, AUC-ROC.
Best results in bold, second-best underlined.
| Node mask | Edge mask | DyRep | Jodie | TGAT | APAN | HiLi | TGN | Ours | |
|---|---|---|---|---|---|---|---|---|---|
| 10% | 75% | 0.531 | 0.421 | 0.589 | 0.500 | 0.509 |
|
| |
| Reddit (Transductive) | 75% | 10% | 0.601 | 0.435 |
| 0.500 | 0.491 | 0.584 |
|
| 75% | 75% | 0.597 | 0.439 | 0.589 | 0.379 | 0.491 |
|
| |
| 10% | 75% | 0.510 | 0.456 | 0.555 | 0.500 | 0.551 |
|
| |
| Reddit (Inductive) | 75% | 10% |
| 0.539 | 0.512 | 0.542 | 0.476 | 0.495 |
|
| 75% | 75% | 0.521 | 0.567 | 0.435 |
| 0.524 |
| 0.584 |
Figure 4Inductive edge prediction on transactions of regional subdivision of major European bank.
Node classification results based on node embeddings obtained from TGN and our proposed model with feature dimension d being multiple of 8 × 2 shape.
| Node Embeddings | AUC-ROC |
|---|---|
| TGN | 0.621 ± 0.066 |
| Our | 0.678 ± 0.059 |