| Literature DB >> 33800642 |
Xun Zhang1, Lanyan Yang1, Bin Zhang1, Ying Liu1, Dong Jiang2, Xiaohai Qin1, Mengmeng Hao2.
Abstract
The problem of extracting meaningful data through graph analysis spans a range of different fields, such as social networks, knowledge graphs, citation networks, the World Wide Web, and so on. As increasingly structured data become available, the importance of being able to effectively mine and learn from such data continues to grow. In this paper, we propose the multi-scale aggregation graph neural network based on feature similarity (MAGN), a novel graph neural network defined in the vertex domain. Our model provides a simple and general semi-supervised learning method for graph-structured data, in which only a very small part of the data is labeled as the training set. We first construct a similarity matrix by calculating the similarity of original features between all adjacent node pairs, and then generate a set of feature extractors utilizing the similarity matrix to perform multi-scale feature propagation on graphs. The output of multi-scale feature propagation is finally aggregated by using the mean-pooling operation. Our method aims to improve the model representation ability via multi-scale neighborhood aggregation based on feature similarity. Extensive experimental evaluation on various open benchmarks shows the competitive performance of our method compared to a variety of popular architectures.Entities:
Keywords: graph analysis; graph neural network; neighborhood aggregation; semi-supervised learning
Year: 2021 PMID: 33800642 PMCID: PMC8067278 DOI: 10.3390/e23040403
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1Flow illustration of the proposed architecture.
Figure 2Diagrammatic representation of the proposed architecture.
Figure 3Schematic depiction of the MAGN network for semi-supervised learning.
The Statistics of Datasets.
| Dataset | Type | Classes | Features | Nodes | Edges |
|---|---|---|---|---|---|
| Cora | Citation | 7 | 1433 | 2708 | 5429 |
| CiteSeer | Citation | 6 | 3703 | 3327 | 4732 |
| PubMed | Citation | 3 | 500 | 19,717 | 44,338 |
| Coauthor CS | Co-author | 15 | 6805 | 18,333 | 81,894 |
Experimental settings for fixed data splits.
| Setting | Hidden Sizes | Learning Rate | Dropout Rate |
| Epochs | Early Stopping | Layers |
| |
|---|---|---|---|---|---|---|---|---|---|
| Dataset | |||||||||
| Cora | 64 | 0.002 | 0.5 | 0.0005 | 250 | 10 | 2 | 5 | |
| PubMed | 64 | 0.01 | 0.5 | 0.0005 | 100 | 10 | 2 | 6 | |
| CiteSeer | 64 | 0.01 | 0.5 | 0.0005 | 150 | 10 | 2 | 4 | |
Experimental settings for random data splits.
| Setting | Hidden Sizes | Learning Rate | Dropout Rate |
| Epochs | Early Stopping | Layers | |
|---|---|---|---|---|---|---|---|---|
| Method | ||||||||
| MLP | 64 | 0.01 | 0.5 | 0.0005 | 200 | 10 | 2 | |
| Geniepath | 64 | 0.005 | 0.4 | 0.0005 | 10,000 | 200 | 2 | |
| SGC | -- | 0.2 | 0.5 | 0.000005 | 200 | 10 | 1 | |
| GCN | 64 | 0.01 | 0.5 | 0.0005 | 200 | 10 | 2 | |
| SIGN | 128 | 0.0005 | 0.5 | -- | 200 | 10 | 5 | |
| DGI | 512 | 0.001 | -- | -- | 5000 | 20 | 2 | |
| GAT | 64 | 0.01 | 0.6 | 0.0005 | 10,000 | 100 | 2 | |
| MAGN | 64 | 0.005 | 0.5 | 0.0005 | 200 | 10 | 2 | |
Classification accuracy with a fixed split of data. The highest accuracy in each column is highlighted in bold.
| Dataset | Cora | CiteSeer | PubMed | |
|---|---|---|---|---|
| Method | ||||
| MLP | 55.1 | 46.5 | 71.4 | |
| ManiReg | 59.5 | 60.1 | 70.7 | |
| SemiEmb | 59.0 | 59.6 | 71.1 | |
| LP | 68.0 | 45.3 | 63.0 | |
| Planetoid | 75.7 | 64.7 | 77.2 | |
| GCN | 81.5 | 70.3 | 79.0 | |
| GeniePath | -- | -- | 78.5 | |
| MoNet | 81.7 ± 0.5 | -- | 78.8 ± 0.3 | |
| SGC | 81.0 ± 0.0 | 71.9 ± 0.1 | 78.9 ± 0.0 | |
| DGI | 82.3 ± 0.6 | 71.8 ± 0.7 | 76.8 ± 0.6 | |
| GAT | 83.0 ± 0.7 |
| 79.0 ± 0.3 | |
| MAGN |
| 72.0 ± 0.2 |
| |
Figure 4Average accuracy on different datasets when varying K and t. (a) MAGN on Cora. (b) MAGN on CiteSeer. (c) MAGN on PubMed.
Figure 5Average accuracy of MAGN for varying smoothing parameter . (a) MAGN on Cora. (b) MAGN on CiteSeer. (c) MAGN on PubMed.
Figure 6t-SNE visualization of the nodes from the Cora dataset. (a) Raw Cora dataset. (b) Trained Cora dataset.
Classification accuracy with random split of the data. The highest accuracy in each column is highlighted in bold.
| Dataset | Cora | CiteSeer | PubMed | Coauthor CS | |
|---|---|---|---|---|---|
| Method | |||||
| MLP | 59.23 ± 1.09 | 57.87 ± 1.61 | 58.94 ± 1.12 | 88.11 ± 0.76 | |
| GCN | 80.77 ± 1.14 | 70.89 ± 1.22 | 79.33 ± 1.34 | 91.68 ± 0.64 | |
| GeniePath | 79.04 ± 1.46 | 70.34 ± 1.16 | 78.52 ± 1.61 | 91.01 ± 0.97 | |
| SGC | 80.32 ± 1.17 | 70.48 ± 0.94 | 78.39 ± 1.35 | 90.93 ± 0.96 | |
| SIGN | 80.85 ± 1.65 | 68.13 ± 1.52 | 79.57 ± 1.94 | 92.02 ± 0.86 | |
| DGI | 81.84 ± 1.21 | 71.55 ± 0.62 | 78.25 ± 1.53 | 91.09 ± 0.85 | |
| GAT | 81.19 ± 1.27 | 71.01 ± 0.72 | 79.42 ± 1.21 | 91.33 ± 0.74 | |
| MAGN |
|
|
|
| |
Number of parameters for different methods.
| Dataset | Cora | CiteSeer | PubMed | Coauthor CS | |
|---|---|---|---|---|---|
| Method | |||||
| MLP | 92,160 | 237,376 | 32,192 | 436,480 | |
| GeniePath | 149,632 | 294,784 | 89,664 | 493,952 | |
| SGC | 8598 | 22,218 | 1500 | 102,075 | |
| GCN | 92,160 | 237,376 | 32,192 | 436,480 | |
| SIGN | 1,297,920 | 3,041,280 | 580,992 | 5,424,768 | |
| DGI | 999,424 | 2,161,152 | 194,304 | 1,811,456 | |
| GAT | 92,302 | 237,516 | 32,326 | 436,638 | |
| MAGN | 92,160 | 237,376 | 32,192 | 436,480 | |
Figure 7Average accuracy of MAGN and GCN for varying numbers of network layers. (a) Cora. (b) CiteSeer. (c) PubMed. (d) Coauthor CS.
Classification accuracy for different training set sizes on Cora. The highest accuracy in each column is highlighted in bold.
| Size | 5 Per Class | 10 Per Class | 15 Per Class | |
|---|---|---|---|---|
| Method | ||||
| MLP | 38.35 ± 3.88 | 55.04 ± 2.74 | 55.99 ± 1.07 | |
| Geniepath | 64.20 ± 3.89 | 73.30 ± 2.44 | 77.43 ± 1.34 | |
| SGC | 66.28 ± 3.66 | 75.45 ± 2.37 | 78.65 ± 1.38 | |
| GCN | 66.87 ± 3.89 | 75.43 ± 2.23 | 78.73 ± 1.43 | |
| SIGN | 64.53 ± 4.81 | 74.96 ± 2.65 | 78.18 ± 1.61 | |
| DGI | 71.01 ± 2.82 | 77.65 ± 2.08 | 79.77 ± 1.38 | |
| GAT | 68.12 ± 3.91 | 77.51 ± 2.16 | 79.68 ± 1.11 | |
| MAGN |
|
|
| |
Classification accuracy for different training set sizes on CiteSeer. The highest accuracy in each column is highlighted in bold.
| Size | 5 Per Class | 10 Per Class | 15 Per Class | |
|---|---|---|---|---|
| Method | ||||
| MLP | 41.79 ± 5.29 | 51.11 ± 2.78 | 54.29 ± 2.28 | |
| Geniepath | 56.99 ± 4.13 | 64.47 ± 2.27 | 67.98 ± 1.32 | |
| SGC | 56.01 ± 6.36 | 64.44 ± 2.43 | 67.49 ± 1.41 | |
| GCN | 58.31 ± 5.41 | 65.97 ± 2.28 | 68.54 ± 1.47 | |
| SIGN | 52.67 ± 5.05 | 61.19 ± 2.45 | 64.84 ± 1.69 | |
| DGI | 59.68 ± 4.82 | 66.09 ± 2.11 | 69.13 ± 1.51 | |
| GAT | 59.19 ± 5.67 | 65.81 ± 2.23 | 68.38 ± 1.31 | |
| MAGN |
|
|
| |
Classification accuracy for different training set sizes on PubMed. The highest accuracy in each column is highlighted in bold.
| Size | 5 Per Class | 10 Per Class | 15 Per Class | |
|---|---|---|---|---|
| Method | ||||
| MLP | 37.78 ± 3.86 | 49.30 ± 3.26 | 54.36 ± 1.94 | |
| Geniepath | 67.43 ± 5.52 | 73.84 ± 4.35 | 77.11 ± 2.43 | |
| SGC | 67.95 ± 5.12 | 73.64 ± 4.12 | 76.89 ± 2.10 | |
| GCN | 67.92 ± 5.19 | 74.44 ± 3.92 | 77.33 ± 2.72 | |
| SIGN | 65.95 ± 5.77 | 74.35 ± 2.91 | 77.51 ± 1.69 | |
| DGI | 66.35 ± 6.14 | 73.64 ± 4.01 | 76.72 ± 2.32 | |
| GAT | 68.13 ± 5.32 | 74.52 ± 3.64 | 77.42 ± 2.14 | |
| MAGN |
|
|
| |
Classification accuracy for different training set sizes on Coauthor CS. The highest accuracy in each column is highlighted in bold.
| Size | 5 Per Class | 10 Per Class | 15 Per Class | |
|---|---|---|---|---|
| Method | ||||
| MLP | 72.93 ± 2.90 | 82.92 ± 1.63 | 86.40 ± 1.23 | |
| Geniepath | 88.45 ± 1.43 | 89.85 ± 1.32 | 90.72 ± 1.01 | |
| SGC | 88.27 ± 1.51 | 89.89 ± 1.11 | 90.60 ± 1.24 | |
| GCN | 88.44 ± 1.22 | 90.10 ± 1.16 | 91.17 ± 0.87 | |
| SIGN | 88.34 ± 1.57 | 90.37 ± 1.14 | 91.34 ± 0.69 | |
| DGI | 88.54 ± 1.12 | 90.45 ± 0.76 | 90.91 ± 0.74 | |
| GAT | 88.52 ± 1.38 | 90.23 ± 1.05 | 91.05 ± 0.82 | |
| MAGN |
|
|
| |