| Literature DB >> 33286933 |
Kaushalya Madhawa1, Tsuyoshi Murata1.
Abstract
Current breakthroughs in the field of machine learning are fueled by the deployment of deep neural network models. Deep neural networks models are notorious for their dependence on large amounts of labeled data for training them. Active learning is being used as a solution to train classification models with less labeled instances by selecting only the most informative instances for labeling. This is especially important when the labeled data are scarce or the labeling process is expensive. In this paper, we study the application of active learning on attributed graphs. In this setting, the data instances are represented as nodes of an attributed graph. Graph neural networks achieve the current state-of-the-art classification performance on attributed graphs. The performance of graph neural networks relies on the careful tuning of their hyperparameters, usually performed using a validation set, an additional set of labeled instances. In label scarce problems, it is realistic to use all labeled instances for training the model. In this setting, we perform a fair comparison of the existing active learning algorithms proposed for graph neural networks as well as other data types such as images and text. With empirical results, we demonstrate that state-of-the-art active learning algorithms designed for other data types do not perform well on graph-structured data. We study the problem within the framework of the exploration-vs.-exploitation trade-off and propose a new count-based exploration term. With empirical evidence on multiple benchmark graphs, we highlight the importance of complementing uncertainty-based active learning models with an exploration term.Entities:
Keywords: active learning; graph neural networks; graph representation learning; machine learning; node classification
Year: 2020 PMID: 33286933 PMCID: PMC7597335 DOI: 10.3390/e22101164
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Summary of existing work for active node classification on attributed graphs. The work by Gadde et al. [43] does not use the labels of the nodes. Therefore, this method does not use a classifier. We use the following abbreviations in the table. LR—Logistic Regression, GRF—Gaussian Random Fields, LP—Label Propagation, SC—Spectral Clustering, NA—Not Applicable.
| Work | AL Approach | Classifier | Attributes | Adaptive | Labels | Year |
|---|---|---|---|---|---|---|
| Zhu et al. [ | EER | GRF | No | No | Yes | 2003 |
| Macskassy [ | EER + Heuristics | GRF | No | Yes | Yes | 2009 |
| Bilgic et al. [ | QBC | LR | No | Yes | Yes | 2010 |
| Gu and Han [ | EER | LP | No | No | Yes | 2012 |
| Ji and Han [ | Variation Minimization | GRF | No | No | No | 2012 |
| Ma et al. [ | Uncertainty | GRF | No | No | Yes | 2013 |
| Gadde et al. [ | SC | NA | No | No | No | 2015 |
| Cai et al. [ | Uncertainty + Heuristics | GCN | Yes | Yes | Yes | 2017 |
Dataset statistics. Labeling rate as a percentage of total nodes is shown within brackets. Avg. deg.: Average degree, Avg. CC: Average clustering coefficient, : Degree assortativity, : Label assortativity.
| Dataset | Nodes | Classes | Avg. Deg. | Avg. CC |
|
| Features | Labels (%) |
|---|---|---|---|---|---|---|---|---|
| CiteSeer | 2110 | 6 | 2.84 | 0.17 | 0.007 | 0.67 | 3703 | 12 (0.56) |
| PubMed | 19,717 | 3 | 6.34 | 0.06 | −0.044 | 0.69 | 500 | 6 (0.03) |
| n CORA | 2485 | 7 | 4.00 | 0.24 | −0.071 | 0.76 | 1433 | 14 (0.56) |
| Amazon Comp. | 13,752 | 10 | 36.74 | 0.35 | −0.057 | 0.68 | 767 | 20 (0.14) |
| Co-author Phy | 34,493 | 5 | 14.38 | 0.38 | 0.201 | 0.87 | 8415 | 10 (0.03) |
| Co-author CS | 18,333 | 15 | 8.93 | 0.34 | 0.113 | 0.79 | 6805 | 30 (0.16) |
| Disease | 1044 | 2 | 2.00 | 0.0 | −0.544 | 0.68 | 1000 | 4 (0.38) |
| Wiki-CS | 11,701 | 10 | 36.94 | 0.47 | −0.065 | 0.58 | 300 | 20 (0.17) |
| PPI-Brain | 3480 | 121 | 31.94 | 0.17 | −0.064 | 0.09 | 50 | 35 (1.0) |
| PPI-Blood | 3312 | 121 | 32.91 | 0.18 | −0.061 | 0.09 | 50 | 33 (1.0) |
| PPI-Kidney | 3284 | 121 | 31.70 | 0.18 | −0.067 | 0.09 | 50 | 33 (1.0) |
| Github | 37,700 | 2 | 15.33 | 0.17 | −0.075 | 0.38 | 4005 | 4 (0.01) |
Figure 1Macro-F1 score (test) of active learning algorithms with number of acquisitions. A two-layer graph convolutional network (GCN) is used as the graph neural network (GNN) model. (a) CiteSeer. (b) PubMed. (c) CORA. (d) Amazon Computers. (e) Co-author CS. (f) Co-author Physics. (g) Disease. (h) Wiki-CS. (i) PPI-Brain. (j) PPI-Blood. (k) PPI-Kidney. (l) Github.
Figure 2Macro-F1 score (test) of active learning algorithms with number of acquisitions. SGC model is used as the GNN model. (a) CiteSeer. (b) PubMed. (c) CORA. (d) Amazon Computers. (e) Co-author CS. (f) Co-author Physics. (g) Disease. (h) Wiki-CS. (i) PPI-Brain. (j) PPI-Blood. (k) PPI-Kidney. (l) Github.
Average F1-score of different acquisition functions. Forty query instances are selected (average of 30 runs). Standard deviation is shown underneath the macro-averaged F1-score. Classifier: GCN. Rand—Random, Ent—Entropy, PR—PageRank, Deg—Degree, CC: Clustering coefficient.
| Dataset | Rand | Ent | BALD | PR | Deg | CC | Ent | BALD | Ent Count | BALD Count | AGE |
|---|---|---|---|---|---|---|---|---|---|---|---|
| CiteSeer | 58.4 ± 6.9 | 60.1 ± 8.1 | 58.6 ± 5.1 | 54.4 ± 3.4 | 53.8 ± 4.3 | 53.6 ± 7.0 | 59.4 ± 4.2 | 52.0 ± 5.7 | 60.3 ± 4.2 | 59.1 ± 4.7 | |
| CORA | 74.1 ± 5.1 | 73.9 ± 6.6 | 71.4 ± 7.4 | 71.8 ± 5.1 | 70.2 ± 4.9 | 70.4 ± 6.4 | 73.3 ± 5.4 | 72.9 ± 3.6 | |||
| PubMed | 76.4 ± 4.0 | 74.1 ± 3.0 | 75.7 ± 3.8 | 75.3 ± 3.3 | 71.8 ± 3.7 | 76.8 ± 1.4 | 74.2 ± 4.2 | 74.1 ± 4.1 | 75.7 ± 4.4 | 74.5 ± 2.2 | |
| Coauthor CS | 82.4 ± 2.3 | 78.9 ± 3.4 | 80.6 ± 3.6 | 79.7 ± 3.6 | 80.7 ± 2.5 | 81.9 ± 3.7 | 78.2 ± 4.4 | 81.2 ± 2.3 | 80.3 ± 4.9 | 82.1 ± 2.5 | |
| Coauthor Phy | 85.2 ± 3.3 | 84.1 ± 3.2 | 83.8 ± 2.8 | 85.2 ± 1.8 | 77.1 ± 2.1 | 86.4 ± 2.9 | 85.5 ± 2.7 | 83.4 ± 3.5 | 86.9 ± 2.9 | 83.7 ± 2.8 | |
| Amazon Comp. | 74.2 ± 3.4 | 66.8 ± 7.6 | 65.2 ± 8.1 | 60.2 ± 15.6 | 73.1 ± 6.0 | 70.8 ± 8.1 | 75.4 ± 3.8 | 73.3 ± 7.5 | 74.2 ± 5.9 | ||
| Disease | 57.1 ± 7.1 | 59.4 ± 8.8 | 53.2 ± 9.1 | 20.8 ± 5.1 | 61.0 ± 10.7 | 65.8 ± 9.2 | 63.3 ± 8.0 | ||||
| Wiki-CS | 57.1 ± 7.1 | 55.0 ± 5.1 | 59.4 ± 3.1 | 58.2 ± 2.2 | 60.5 ± 3.7 | 61.0 ± 10.7 | 57.0 ± 3.3 | 57.7 ± 4.9 | |||
| PPI Brain | 25.6 ± 6.5 | 21.4 ± 6.3 | 31.6 ± 6.1 | 19.3 ± 6.6 | 22.3 ± 6.0 | 30.0 ± 8.9 | 22.2 ± 5.0 | 35.3 ± 6.1 | 22.3 ± 6.2 | ||
| PPI Blood | 27.7 ± 3.2 | 22.9 ± 5.7 | 31.0 ± 6.5 | 41.4 ± 1.9 | 21.0 ± 5.8 | 26.5 ± 4.9 | 36.9 ± 4.6 | 23.6 ± 5.4 | 37.4 ± 4.3 | 23.3 ± 5.6 | |
| PPI Kidney | 25.7 ± 2.9 | 18.7 ± 6.8 | 27.9 ± 9.6 | 41.1 ± 2.2 | 16.3 ± 5.9 | 18.8 ± 7.0 | 33.5 ± 3.3 | 29.2 ± 1.7 | 37.6 ± 3.4 | 19.4 ± 4.8 | |
| Github | 74.0 ± 8.0 | 74.5 ± 2.4 | 71.1 ± 2.9 | 62.3 ± 4.8 | 75.4 ± 1.8 | 74.4 ± 2.2 | 76.4 ± 2.2 | 73.8 ± 2.3 | 73.9 ± 2.1 |
Average F1-score of different acquisition functions. Forty query instances are selected (average of 30 runs). Standard deviation is shown underneath the macro-averaged F1-score. Classifier: SGC. Rand - Random, Ent—Entropy, PR—PageRank, Deg—Degree, CC: Clustering coefficient.
| Dataset | Rand | Ent | BALD | PR | Deg | CC | Ent | BALD | Ent Count | BALD Count | AGE |
|---|---|---|---|---|---|---|---|---|---|---|---|
| CiteSeer | 55.5 ± 4.6 | 58.0 ± 4.0 | 55.0 ± 3.4 | 53.4 ± 5.3 | 53.4 ± 7.4 | 56.3 ± 5.6 | 56.0 ± 4.6 | 56.6 ± 4.8 | |||
| CORA | 75.4 ± 4.0 | 71.4 ± 2.3 | 71.4 ± 5.1 | 69.3 ± 3.4 | 74.9 ± 6.1 | 73.9 ± 6.1 | 73.8 ± 4.4 | 74.2 ± 3.1 | 74.7 ± 6.4 | ||
| PubMed | 75.8 ± 3.6 | 74.8 ± 2.3 | 77.5 ± 2.6 | 76.7 ± 2.5 | 72.3 ± 6.4 | 60.7 ± 7.9 | 75.3 ± 3.7 | 77.2 ± 2.4 | 76.6 ± 2.8 | ||
| Coauthor CS | 81.7 ± 2.9 | 76.8 ± 3.4 | 81.9 ± 3.9 | 81.3 ± 4.1 | 81.4 ± 4.0 | 81.9 ± 3.7 | 76.9 ± 4.1 | 82.6 ± 3.7 | 77.2 ± 4.7 | ||
| Coauthor Phy | 86.5 ± 3.3 | 84.1 ± 2.4 | 86.7 ± 2.9 | 79.3 ± 3.7 | 88.1 ± 2.7 | 84.1 ± 3.1 | 89.6 ± 2.6 | 87.5 ± 3.6 | 88.9 ± 2.1 | ||
| Amazon Comp. | 77.3 ± 4.1 | 73.4 ± 4.2 | 74.2 ± 5.3 | 71.9 ± 3.5 | 73.5 ± 6.1 | 75.8 ± 5.4 | 74.5 ± 6.7 | 74.3 ± 3.2 | 74.9 ± 5.3 | 75.6 ± 3.8 | |
| Disease | 55.4 ± 8.7 | 59.7 9.5 | 58.5 ± 8.9 | 17.8 ± 4.5 | 63.4 ± 7.5 | 67.4 ± 8.5 | 67.1 ± 9.7 | 66.2 ± 8.4 | 66.4 ± 11.1 | ||
| Wiki-CS | 59.8 ± 6.3 | 55.5 ± 3.6 | 64.7 ± 4.0 | 62.9 ± 3.6 | 61.3 ± 3.1 | 55.4 ± 6.6 | 57.5 ± 5.3 | 63.8 ± 2.4 | 56.5 ± 5.8 | 50.4 ± 5.7 | |
| PPI Brain | 36.9 ± 2.2 | 38.4 ± 2.4 | 40.0 ± 1.4 | 41.0 ± 1.4 | 34.6 ± 3.6 | 38.2 ± 2.0 | 40.3 ± 1.4 | 40.6 ± 1.0 | 33.2 ± 2.7 | ||
| PPI Blood | 34.6 ± 2.2 | 37.2 ± 3.7 | 39.5 ± 2.6 | 41.7 ± 2.1 | 35.7 ± 1.7 | 37.0 ± 3.6 | 39.0 ± 2.9 | 39.8 ± 2.0 | 40.8 ± 2.1 | 39.4 ± 2.2 | |
| PPI Kidney | 39.1 ± 1.8 | 38.8 ± 2.6 | 39.9 ± 1.4 | 41.7 ± 2.0 | 37.0 ± 1.4 | 40.0 ± 1.7 | 39.9 ± 1.4 | 40.4 ± 2.1 | 41.0 ± 1.8 | 41.0 ± 1.7 | |
| Github | 76.4 ± 2.5 | 71.4 ± 2.5 | 69.7 ± 2.8 | 58.0 ± 5.6 | 76.8 ± 1.4 | 72.8 ± 1.5 | 75.8 ± 2.7 | 72.9 ± 1.5 | 73.3 ± 4.0 |
The best performing model according to Table 3 and Table 4.
| Data | Without Exploration | With Exploration | ||||
|---|---|---|---|---|---|---|
| Macro-F1 | Model | Classifier | Macro-F1 | Model | Classifier | |
| CiteSeer | 61.5 | AGE | GCN | 61.5 | AGE | GCN |
| CORA | 76.1 | Random | SGC | 76.7 | Entropy Count | SGC |
| PubMed | 77.7 | AGE | SGC | 78.0 | BALD Count | SGC |
| Coauthor CS | 83.9 | AGE | GCN | 83.9 | AGE | GCN |
| Coauthor Phy | 90.2 | BALD | SGC | 90.4 | BALD Count | SGC |
| Amazon Comp. | 78.3 | Clustering | SGC | 78.3 | Clustering | SGC |
| Disease | 68.2 | Entropy | SGC | 68.2 | Entropy | SGC |
| Wiki-CS | 64.7 | BALD | SGC | 65.6 | BALD Count | SGC |
| PPI Brain | 41.8 | Degree | SGC | 41.8 | Degree | SGC |
| PPI Blood | 42.4 | PageRank | GCN | 42.4 | PageRank | GCN |
| PPI Kidney | 42.3 | PageRank | SGC | 42.3 | PageRank | SGC |
| Github | 77.4 | Entropy | SGC | 77.4 | Entropy | SGC |
Running time (seconds): average execution time to acquire 40 unlabeled instances. We run all experiments on a single NVIDIA GTX 1080-Ti GPU. PR: PageRank, CC: Clustering coefficient.
| Clf. | Dataset | Rand | Ent | PR | Deg | CC | AGE | BALD | Count | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ent | BALD | Ent | BALD | |||||||||
| GCN | CiteSeer | 4.2 | 4.8 | 4.8 | 4.7 | 4.9 | 4.8 | 4.8 | 4.8 | 4.8 | 4.8 | 4.8 |
| PubMed | 6.9 | 7.6 | 25.4 | 7.3 | 32 | 1125.9 | 7.9 | 7.5 | 7.8 | 7.6 | 7.9 | |
| CORA | 4.2 | 4.5 | 4.6 | 4.4 | 14.5 | 26.8 | 4.5 | 4.5 | 4.5 | 4.5 | 4.5 | |
| Coauthor CS | 20.4 | 22.3 | 40.8 | 21.9 | 39.3 | 2154.2 | 23.7 | 22.3 | 23.6 | 22.4 | 23.6 | |
| Coauthor Phy | 46.1 | 50.5 | 116.4 | 48.5 | 98.6 | 2436.9 | 50.8 | 50.4 | 50.7 | 50.5 | 50.8 | |
| Amazon Comp. | 17.5 | 19.1 | 31.8 | 18.8 | 33.8 | 1688.9 | 19.2 | 19.1 | 19.1 | 19.1 | 19.2 | |
| Disease | 4.1 | 4.3 | 4.2 | 4.1 | 4.2 | 8.7 | 4.3 | 4.3 | 4.3 | 4.3 | 4.3 | |
| Wiki-CS | 15.3 | 16.6 | 30.0 | 28.3 | 33.0 | 410.8 | 16.7 | 16.6 | 16.6 | 16.7 | 16.7 | |
| PPI Brain | 8.3 | 8.9 | 11.5 | 10.2 | 10.9 | 133.3 | 9.0 | 8.4 | 8.6 | 8.4 | 8.7 | |
| PPI Blood | 7.9 | 8.2 | 10.4 | 9.4 | 9.9 | 130.2 | 8.4 | 8.2 | 8.4 | 8.3 | 8.5 | |
| PPI Kidney | 7.3 | 7.8 | 9.8 | 8.0 | 8.8 | 129.4 | 7.7 | 7.7 | 7.7 | 7.8 | 7.9 | |
| Github | 57.1 | 69.2 | 211.8 | 102.9 | 121.4 | 6810.0 | 72.1 | 69.6 | 71.1 | 70.5 | 73.2 | |
| SGC | CiteSeer | 1.7 | 1.9 | 5.6 | 1.8 | 2.7 | 18.3 | 1.9 | 1.9 | 1.9 | 1.9 | 1.9 |
| PubMed | 2.0 | 2.2 | 3.9 | 2.2 | 21.1 | 1229.2 | 2.2 | 2.2 | 2.2 | 2.2 | 2.2 | |
| CORA | 3.8 | 4.8 | 5.8 | 4.7 | 2.3 | 23.7 | 4.9 | 4.8 | 4.8 | 4.8 | 4.9 | |
| Coauthor CS | 16.8 | 19.8 | 33.2 | 19.3 | 37.9 | 2098.2 | 19.8 | 19.8 | 19.8 | 19.8 | 19.8 | |
| Coauthor Phy | 35.6 | 40.7 | 90.4 | 39.8 | 88.7 | 2232.3 | 40.8 | 40.4 | 40.5 | 40.7 | 40.7 | |
| Amazon Comp. | 12.2 | 14.7 | 17.2 | 16.9 | 17.1 | 1134.6 | 14.8 | 14.6 | 14.7 | 14.8 | 14.8 | |
| Disease | 1.4 | 1.4 | 1.5 | 1.4 | 1.4 | 6.0 | 1.4 | 1.4 | 1.4 | 1.4 | 1.4 | |
| Wiki-CS | 1.9 | 2.0 | 13.6 | 8.2 | 18.3 | 400.5 | 2.1 | 2.0 | 2.0 | 2.1 | 2.1 | |
| PPI Brain | 4.4 | 4.5 | 5.1 | 4.8 | 4.9 | 142.2 | 4.6 | 4.4 | 4.6 | 4.5 | 4.7 | |
| PPI Blood | 4.1 | 4.3 | 4.9 | 4.7 | 4.8 | 139.4 | 4.4 | 4.3 | 4.3 | 4.4 | 4.5 | |
| PPI Kidney | 3.9 | 4.1 | 4.4 | 4.3 | 4.5 | 135.6 | 4.1 | 4.1 | 4.1 | 4.1 | 4.2 | |
| Github | 22.3 | 24.5 | 166 | 78.3 | 106.2 | 4905.1 | 25.8 | 24.4 | 25.4 | 24.6 | 26.0 | |