| Literature DB >> 35668720 |
Łukasz Korycki1, Bartosz Krawczyk1.
Abstract
Continuous learning from streaming data is among the most challenging topics in the contemporary machine learning. In this domain, learning algorithms must not only be able to handle massive volume of rapidly arriving data, but also adapt themselves to potential emerging changes. The phenomenon of evolving nature of data streams is known as concept drift. While there is a plethora of methods designed for detecting its occurrence, all of them assume that the drift is connected with underlying changes in the source of data. However, one must consider the possibility of a malicious injection of false data that simulates a concept drift. This adversarial setting assumes a poisoning attack that may be conducted in order to damage the underlying classification system by forcing an adaptation to false data. Existing drift detectors are not capable of differentiating between real and adversarial concept drift. In this paper, we propose a framework for robust concept drift detection in the presence of adversarial and poisoning attacks. We introduce the taxonomy for two types of adversarial concept drifts, as well as a robust trainable drift detector. It is based on the augmented restricted Boltzmann machine with improved gradient computation and energy function. We also introduce Relative Loss of Robustness-a novel measure for evaluating the performance of concept drift detectors under poisoning attacks. Extensive computational experiments, conducted on both fully and sparsely labeled data streams, prove the high robustness and efficacy of the proposed drift detection framework in adversarial scenarios.Entities:
Keywords: Adversarial learning; Boltzmann machine; Concept drift; Data stream mining; Poisoning attacks; Robust machine learning
Year: 2022 PMID: 35668720 PMCID: PMC9162121 DOI: 10.1007/s10994-022-06177-w
Source DB: PubMed Journal: Mach Learn ISSN: 0885-6125 Impact factor: 5.414
Fig. 1Accurate adaptation to valid concept drift
Fig. 2Adversarial drift via instance-based poisoning attacks hinders (b) or exaggerates (c) the adaptation process
Fig. 3Adversarial drift via concept-based poisoning attacks critically misguide (b) or completely nullifies (c) the adaptation process
Properties of artificial and real data stream benchmarks
| Abbr. | Dataset | Instances | Features | Classes | Drift |
|---|---|---|---|---|---|
|
| |||||
| HYP | Hyperplane | 1 000 000 | 10 | 2 | Incremental |
| LED | LED | 1 000 000 | 24 | 10 | Sudden |
| RBF | RBF | 1 000 000 | 40 | 20 | Gradual |
| RBF | RBF | 1 000 000 | 20 | 10 | Sudden |
| SEA | SEA | 3 000 000 | 3 | 4 | Gradual |
| TRE | RandomTree | 2 000 000 | 10 | 6 | Sudden |
|
| |||||
| ecbdl14 | Protein structure prediction | 9 600 000 | 631 | 2 | Mixed |
| higgs | High-energy physics classification | 4 954 752 | 28 | 2 | Unknown |
| IntelLab | Intel lab sensors | 2 313 153 | 6 | 58 | Mixed |
| iot | IoT botnet attacks | 7 062 606 | 115 | 11 | Mixed |
| kddcup | KDD intrusion detection | 3 107 709 | 41 | 24 | Mixed |
| susy | Supersymmetric particle detection | 2 305 347 | 18 | 2 | Unknown |
Examined drift detectors and their parameters
| Abbr. | Name | Parameters |
|---|---|---|
| EDDM | Early drift detection | Warning threshold |
| Drift threshold | ||
| Min. no. of errors | ||
| ECDD | EWMA for drift detection | Differentiation weights |
| Min. no. of errors | ||
| FHDDM | Fast hoeffding drift detection | Sliding window size |
| Allowed error | ||
| RDDM | Reactive drift detection | Warning threshold |
| Drift threshold | ||
| Min. no. of errors | ||
| Min. no. of instances | ||
| Max. no. of instances | ||
| Warning limit | ||
| WSTD | Wilcoxon rank sum test | Sliding window size |
| Drift detection | Warning significance | |
| Drift significance | ||
| Max. no of old instances | ||
| RRBM–DD | Robust RBM drift detection | Mini–batch size |
| Visible neurons | ||
| Hidden neurons | ||
| Class neurons | ||
| Learning rate | ||
| Gibbs sampling steps | ||
| Robust gradient confidence |
Fig. 4Relationship between prequential accuracy and ratio of injected adversarial concept drift via instance-based poisoning attacks
RLR for RRBM–DD and reference drift detectors under instance-based poisoning attacks
| Dataset | EDDM | ECDD | FHDDM | RDDM | WSTD | RRBM–DD |
|---|---|---|---|---|---|---|
| HYP | 0.55 | 0.58 | 0.67 | 0.64 | 0.63 | 0.85 |
| LED | 0.61 | 0.62 | 0.71 | 0.71 | 0.74 | 0.90 |
| RBF | 0.54 | 0.55 | 0.58 | 0.56 | 0.61 | 0.77 |
| RBF | 0.49 | 0.47 | 0.50 | 0.54 | 0.52 | 0.73 |
| SEA | 0.67 | 0.71 | 0.70 | 0.74 | 0.77 | 0.86 |
| TRE | 0.44 | 0.43 | 0.48 | 0.49 | 0.51 | 0.72 |
| ecbdl14 | 0.40 | 0.45 | 0.52 | 0.56 | 0.53 | 0.69 |
| higgs | 0.71 | 0.73 | 0.77 | 0.81 | 0.82 | 0.94 |
| IntelLab | 0.41 | 0.44 | 0.46 | 0.48 | 0.49 | 0.69 |
| iot | 0.73 | 0.77 | 0.81 | 0.84 | 0.87 | 0.95 |
| kddcup | 0.63 | 0.60 | 0.67 | 0.65 | 0.69 | 0.83 |
| susy | 0.65 | 0.71 | 0.74 | 0.76 | 0.73 | 0.85 |
| avg. rank | 5.88 | 4.12 | 3.73 | 3.22 | 3.05 | 1.00 |
Fig. 5The Bonferroni–Dunn test for comparison among drift detectors under instance-based poisoning attacks, based on prequential accuracy
Fig. 6The Bonferroni–Dunn test for comparison among drift detectors under instance-based poisoning attacks, based on RLR
Fig. 7Relationship between prequential accuracy and number of injected adversarial concept drift via concept-based poisoning attacks
RLR for RRBM–DD and reference drift detectors under concept-based poisoning attacks
| Dataset | EDDM | ECDD | FHDDM | RDDM | WSTD | RRBM–DD |
|---|---|---|---|---|---|---|
| HYP | 0.44 | 0.45 | 0.50 | 0.52 | 0.51 | 0.78 |
| LED | 0.48 | 0.52 | 0.56 | 0.55 | 0.59 | 0.84 |
| RBF | 0.35 | 0.38 | 0.42 | 0.44 | 0.43 | 0.71 |
| RBF | 0.27 | 0.28 | 0.30 | 0.33 | 0.32 | 0.68 |
| SEA | 0.51 | 0.55 | 0.52 | 0.55 | 0.56 | 0.82 |
| TRE | 0.22 | 0.20 | 0.25 | 0.27 | 0.27 | 0.63 |
| ecbdl14 | 0.18 | 0.17 | 0.23 | 0.26 | 0.26 | 0.60 |
| higgs | 0.52 | 0.56 | 0.57 | 0.61 | 0.63 | 0.88 |
| IntelLab | 0.24 | 0.26 | 0.25 | 0.30 | 0.32 | 0.65 |
| iot | 0.63 | 0.65 | 0.69 | 0.71 | 0.74 | 0.90 |
| kddcup | 0.39 | 0.35 | 0.44 | 0.40 | 0.47 | 0.76 |
| susy | 0.42 | 0.45 | 0.46 | 0.51 | 0.47 | 0.80 |
| avg. rank | 5.60 | 4.55 | 3.10 | 3.90 | 2.95 | 1.00 |
Fig. 8The Bonferroni–Dunn test for comparison among drift detectors under concept-based poisoning attacks, based on prequential accuracy
Fig. 9The Bonferroni–Dunn test for comparison among drift detectors under concept-based poisoning attacks, based on RLR
Fig. 10Comparison of RRBM–DD with reference drift detectors under instance-based poisoning attacks and sparsely labeled assumption (only 10% labeled instances) with respect to the number of wins (green), ties (yellow), and losses (red), according to a pairwise F-test with statistical significance level . Prequential accuracy used as a metric. 60 runs per dataset were obtained from six different labeling budgets, each repeated 10 times with random selection of instances to be labeled
Fig. 11Comparison of RRBM–DD with reference drift detectors under concept-based poisoning attacks and sparsely labeled assumption with respect to the number of wins (green), ties (yellow), and losses (red), according to a pairwise F-test with statistical significance level . Prequential accuracy used as a metric. 60 runs per dataset were obtained from six different labeling budgets, each repeated 10 times with random selection of instances to be labeled
RLR for RRBM–DD and reference drift detectors under instance-based poisoning attacks and sparsely labeled data. Results averaged over all labeling budgets, presented with standard deviation
| Dataset | EDDM | ECDD | FHDDM | RDDM | WSTD | RRBM–DD |
|---|---|---|---|---|---|---|
| HYP | 0.47 ± 0.11 | 0.49 ± 0.10 | 0.61 ± 0.12 | 0.55 ± 0.16 | 0.57 ± 0.13 | 0.80 ± 0.08 |
| LED | 0.52 ± 0.09 | 0.51 ± 0.09 | 0.63 ± 0.10 | 0.61 ± 0.14 | 0.63 ± 0.11 | 0.83 ± 0.05 |
| RBF | 0.40 ± 0.16 | 0.42 ± 0.12 | 0.47 ± 0.09 | 0.45 ± 0.15 | 0.50 ± 0.11 | 0.69 ± 0.09 |
| RBF | 0.37 ± 0.06 | 0.38 ± 0.05 | 0.41 ± 0.07 | 0.40 ± 0.09 | 0.44 ± 0.06 | 0.63 ± 0.07 |
| SEA | 0.56 ± 0.12 | 0.62 ± 0.09 | 0.59 ± 0.09 | 0.57 ± 0.14 | 0.66 ± 0.10 | 0.81 ± 0.08 |
| TRE | 0.30 ± 0.05 | 0.32 ± 0.04 | 0.35 ± 0.06 | 0.32 ± 0.08 | 0.38 ± 0.08 | 0.63 ± 0.04 |
| ecbdl14 | 0.33 ± 0.07 | 0.36 ± 0.06 | 0.45 ± 0.09 | 0.42 ± 0.11 | 0.46 ± 0.10 | 0.60 ± 0.05 |
| higgs | 0.58 ± 0.14 | 0.62 ± 0.10 | 0.65 ± 0.12 | 0.64 ± 0.16 | 0.70 ± 0.11 | 0.82 ± 0.07 |
| IntelLab | 0.28 ± 0.08 | 0.30 ± 0.06 | 0.34 ± 0.06 | 0.31 ± 0.10 | 0.36 ± 0.08 | 0.60 ± 0.04 |
| iot | 0.65 ± 0.09 | 0.68 ± 0.06 | 0.72 ± 0.07 | 0.71 ± 0.11 | 0.77 ± 0.10 | 0.88 ± 0.07 |
| kddcup | 0.55 ± 0.05 | 0.53 ± 0.09 | 0.57 ± 0.10 | 0.55 ± 0.13 | 0.60 ± 0.10 | 0.77 ± 0.06 |
| susy | 0.60 ± 0.11 | 0.63 ± 0.08 | 0.67 ± 0.09 | 0.65 ± 0.13 | 0.68 ± 0.13 | 0.79 ± 0.07 |
| Avg. rank | 5.17 | 4.55 | 4.13 | 3.25 | 2.90 | 1.00 |
RLR for RRBM–DD and reference drift detectors under concept-based poisoning attacks and sparsely labeled data. Results averaged over all labeling budgets, presented with standard deviation
| Dataset | EDDM | ECDD | FHDDM | RDDM | WSTD | RRBM–DD |
|---|---|---|---|---|---|---|
| HYP | 0.28 ± 0.07 | 0.29 ± 0.06 | 0.37 ± 0.10 | 0.34 ± 0.13 | 0.37 ± 0.13 | 0.69 ± 0.06 |
| LED | 0.25 ± 0.05 | 0.32 ± 0.05 | 0.44 ± 0.08 | 0.41 ± 0.10 | 0.46 ± 0.08 | 0.75 ± 0.07 |
| RBF | 0.23 ± 0.06 | 0.26 ± 0.07 | 0.33 ± 0.07 | 0.29 ± 0.12 | 0.34 ± 0.14 | 0.62 ± 0.06 |
| RBF | 0.18 ± 0.04 | 0.19 ± 0.06 | 0.20 ± 0.04 | 0.18 ± 0.05 | 0.22 ± 0.04 | 0.59 ± 0.04 |
| SEA | 0.40 ± 0.12 | 0.42 ± 0.13 | 0.44 ± 0.10 | 0.41 ± 0.15 | 0.45 ± 0.12 | 0.74 ± 0.08 |
| TRE | 0.09 ± 0.03 | 0.11 ± 0.04 | 0.14 ± 0.04 | 0.13 ± 0.04 | 0.17 ± 0.04 | 0.51 ± 0.05 |
| ecbdl14 | 0.08 ± 0.02 | 0.07 ± 0.02 | 0.11 ± 0.02 | 0.09 ± 0.03 | 0.13 ± 0.02 | 0.48 ± 0.05 |
| higgs | 0.39 ± 0.10 | 0.40 ± 0.08 | 0.44 ± 0.11 | 0.41 ± 0.13 | 0.48 ± 0.15 | 0.79 ± 0.08 |
| IntelLab | 0.12 ± 0.04 | 0.14 ± 0.04 | 0.16 ± 0.05 | 0.13 ± 0.05 | 0.18 ± 0.04 | 0.58 ± 0.09 |
| iot | 0.51 ± 0.13 | 0.52 ± 0.16 | 0.56 ± 0.12 | 0.55 ± 0.16 | 0.60 ± 0.13 | 0.81 ± 0.11 |
| kddcup | 0.23 ± 0.03 | 0.20 ± 0.05 | 0.24 ± 0.05 | 0.19 ± 0.05 | 0.28 ± 0.03 | 0.68 ± 0.05 |
| susy | 0.31 ± 0.07 | 0.34 ± 0.07 | 0.36 ± 0.09 | 0.37 ± 0.10 | 0.39 ± 0.10 | 0.70 ± 0.07 |
| avg. rank | 5.38 | 4.14 | 4.66 | 3.72 | 2.10 | 1.00 |
Fig. 12The Bonferroni–Dunn test for comparison among drift detectors under instance-based poisoning attacks and sparsely labeled data, based on RLR and all examined labeling ratios
Fig. 13The Bonferroni–Dunn test for comparison among drift detectors under concept-based poisoning attacks and sparsely labeled data, based on RLR and all examined labeling ratios
Ablation results for RRBM–DD according to RLR under instance-based poisoning attacks. Results averaged over fully and sparsely labeled benchmarks, presented with standard deviation
| Dataset | RBM-DD | RBM–DD | RBM–DD | RRBM-DD |
|---|---|---|---|---|
| HYP | 0.58 ± 0.10 | 0.77 ± 0.09 | 0.64 ± 0.09 | 0.82 ± 0.09 |
| LED | 0.66 ± 0.12 | 0.80 ± 0.07 | 0.72 ± 0.10 | 0.84 ± 0.06 |
| RBF | 0.52 ± 0.11 | 0.66 ± 0.09 | 0.59 ± 0.10 | 0.72 ± 0.09 |
| RBF | 0.47 ± 0.08 | 0.58 ± 0.09 | 0.53 ± 0.07 | 0.66 ± 0.08 |
| SEA | 0.68 ± 0.11 | 0.78 ± 0.10 | 0.74 ± 0.10 | 0.82 ± 0.10 |
| TRE | 0.41 ± 0.10 | 0.59 ± 0.06 | 0.48 ± 0.08 | 0.65 ± 0.05 |
| ecbdl14 | 0.48 ± 0.11 | 0.58 ± 0.07 | 0.53 ± 0.09 | 0.61 ± 0.07 |
| higgs | 0.73 ± 0.11 | 0.82 ± 0.09 | 0.77 ± 0.10 | 0.85 ± 0.08 |
| IntelLab | 0.38 ± 0.08 | 0.59 ± 0.08 | 0.52 ± 0.08 | 0.66 ± 0.08 |
| iot | 0.77 ± 0.10 | 0.86 ± 0.05 | 0.83 ± 0.08 | 0.90 ± 0.05 |
| kddcup | 0.61 ± 0.11 | 0.72 ± 0.08 | 0.66 ± 0.10 | 0.79 ± 0.07 |
| susy | 0.70 ± 0.12 | 0.80 ± 0.09 | 0.73 ± 0.09 | 0.83 ± 0.09 |
| avg. rank | 4.00 | 2.05 | 2.95 | 1.00 |
Ablation results for RRBM–DD according to RLR under concept-based poisoning attacks. Results averaged over fully and sparsely labeled benchmarks, presented with standard deviation
| Dataset | RBM-DD | RBM–DD | RBM–DD | RRBM-DD |
|---|---|---|---|---|
| HYP | 0.38 ± 0.11 | 0.44 ± 0.10 | 0.65 ± 0.06 | 0.72 ± 0.07 |
| LED | 0.50 ± 0.09 | 0.57 ± 0.10 | 0.68 ± 0.12 | 0.78 ± 0.11 |
| RBF | 0.36 ± 0.15 | 0.45 ± 0.11 | 0.56 ± 0.07 | 0.64 ± 0.08 |
| RBF | 0.25 ± 0.05 | 0.38 ± 0.05 | 0.50 ± 0.05 | 0.61 ± 0.05 |
| SEA | 0.43 ± 0.10 | 0.57 ± 0.11 | 0.68 ± 0.10 | 0.78 ± 0.10 |
| TRE | 0.16 ± 0.04 | 0.31 ± 0.06 | 0.42 ± 0.09 | 0.55 ± 0.08 |
| ecbdl14 | 0.14 ± 0.04 | 0.29 ± 0.05 | 0.40 ± 0.08 | 0.51 ± 0.07 |
| higgs | 0.51 ± 0.13 | 0.65 ± 0.11 | 0.76 ± 0.10 | 0.83 ± 0.11 |
| IntelLab | 0.20 ± 0.04 | 0.41 ± 0.07 | 0.53 ± 0.08 | 0.64 ± 0.08 |
| iot | 0.64 ± 0.13 | 0.69 ± 0.12 | 0.74 ± 0.10 | 0.83 ± 0.11 |
| kddcup | 0.26 ± 0.06 | 0.53 ± 0.05 | 0.62 ± 0.05 | 0.72 ± 0.05 |
| susy | 0.37 ± 0.11 | 0.50 ± 0.09 | 0.63 ± 0.07 | 0.75 ± 0.08 |
| avg. rank | 4.00 | 2.90 | 2.10 | 1.00 |