| Literature DB >> 34179767 |
Hoseung Song1, Jayaraman J Thiagarajan2, Bhavya Kailkhura2.
Abstract
Dataset shift refers to the problem where the input data distribution may change over time (e.g., between training and test stages). Since this can be a critical bottleneck in several safety-critical applications such as healthcare, drug-discovery, etc., dataset shift detection has become an important research issue in machine learning. Though several existing efforts have focused on image/video data, applications with graph-structured data have not received sufficient attention. Therefore, in this paper, we investigate the problem of detecting shifts in graph structured data through the lens of statistical hypothesis testing. Specifically, we propose a practical two-sample test based approach for shift detection in large-scale graph structured data. Our approach is very flexible in that it is suitable for both undirected and directed graphs, and eliminates the need for equal sample sizes. Using empirical studies, we demonstrate the effectiveness of the proposed test in detecting dataset shifts. We also corroborate these findings using real-world datasets, characterized by directed graphs and a large number of nodes.Entities:
Keywords: dataset shift; graph learning; random graph models; safety; two-sample testing
Year: 2021 PMID: 34179767 PMCID: PMC8223254 DOI: 10.3389/frai.2021.589632
Source DB: PubMed Journal: Front Artif Intell ISSN: 2624-8212
FIGURE 1Performance comparison of different tests for undirected graphs.
FIGURE 2Performance comparison of proposed test for directed graphs.
Power comparison of different tests for undirected graphs with varying sparsity levels.
|
|
|
|
| ||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
| 100 | 0.09 | 0.05 |
|
| 0.03 | 0.08 |
| 0.05 |
|
| 200 |
| 0.05 | 0.07 |
| 0.10 |
|
| 0.22 |
|
| 300 |
| 0.03 |
| 0.34 | 0.19 |
| 0.50 | 0.40 |
|
| 400 | 0.11 | 0.09 |
| 0.40 | 0.26 |
| 0.78 | 0.71 |
|
| 500 |
| 0.08 |
| 0.63 | 0.48 |
| 0.91 | 0.89 |
|
Bold values indicate the largest power of the test under each condition.
Power of the proposed test for directed graphs with varying sparsity levels.
|
|
|
|
| |||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| 100 |
| 0.09 |
|
| 0.21 |
|
| 200 | 0.11 |
| 0.25 |
| 0.49 |
|
| 300 | 0.17 |
| 0.46 |
| 0.76 |
|
| 400 |
|
| 0.60 |
| 0.95 |
|
| 500 | 0.36 |
| 0.77 |
|
|
|
Bold values indicate the largest power of the test under each condition.
Power comparison of different tests for undirected graphs with varying sample sizes.
|
|
|
| |||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
| 50 | 0.08 | 0.08 |
| 0.11 | 0.04 |
| 0.28 | 0.15 |
|
| 100 | 0.16 | 0.08 |
| 0.18 | 0.05 |
| 0.61 | 0.42 |
|
| 150 |
| 0.03 | 0.15 | 0.21 | 0.14 |
| 0.70 | 0.52 |
|
| 200 | 0.14 | 0.06 |
| 0.37 | 0.21 |
| 0.94 | 0.89 |
|
Bold values indicate the largest power of the test under each condition.
Power comparison of different tests for directed graphs with varying sample sizes.
| Directed |
|
|
| |||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| 50 | 0.05 |
| 0.12 |
| 0.49 |
|
| 100 | 0.15 |
| 0.29 |
| 0.82 |
|
| 150 | 0.15 |
| 0.39 |
| 0.95 |
|
| 200 | 0.28 |
| 0.66 |
|
|
|
Bold values indicate the largest power of the test under each condition.
Test summary on the phone-call network.
| Test statistic |
|
|---|---|
| 15.8131 |
|
Distribution of graphs. “M” and “F” indicate male and female, respectively. ‘<20’ and ‘>20’ represent age less than 20 and over 20, respectively.
FIGURE 3Example networks from Normal-Male and Normal-Female groups.
p-values of the tests on the ABIDE dataset.
Estimated power of the tests with the significance level at 5%. Black numbers indicate the power of test based on and red numbers represent the power of test based on .