| Literature DB >> 31146357 |
Mohsin Munir1,2, Shoaib Ahmed Siddiqui3,4, Muhammad Ali Chattha5,6,7, Andreas Dengel8,9, Sheraz Ahmed10.
Abstract
The need for robust unsupervised anomaly detection in streaming data is increasing rapidly in the current era of smart devices, where enormous data are gathered from numerous sensors. These sensors record the internal state of a machine, the external environment, and the interaction of machines with other machines and humans. It is of prime importance to leverage this information in order to minimize downtime of machines, or even avoid downtime completely by constant monitoring. Since each device generates a different type of streaming data, it is normally the case that a specific kind of anomaly detection technique performs better than the others depending on the data type. For some types of data and use-cases, statistical anomaly detection techniques work better, whereas for others, deep learning-based techniques are preferred. In this paper, we present a novel anomaly detection technique, FuseAD, which takes advantage of both statistical and deep-learning-based approaches by fusing them together in a residual fashion. The obtained results show an increase in area under the curve (AUC) as compared to state-of-the-art anomaly detection methods when FuseAD is tested on a publicly available dataset (Yahoo Webscope benchmark). The obtained results advocate that this fusion-based technique can obtain the best of both worlds by combining their strengths and complementing their weaknesses. We also perform an ablation study to quantify the contribution of the individual components in FuseAD, i.e., the statistical ARIMA model as well as the deep-learning-based convolutional neural network (CNN) model.Entities:
Keywords: anomaly detection; deep neural networks; model fusion; sensor data; statistical models; time-series analysis
Year: 2019 PMID: 31146357 PMCID: PMC6603659 DOI: 10.3390/s19112451
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1FuseAD overview: The system consists of two modules, the forecasting pipeline and an anomaly detector.
Figure 2FuseAD forecasting pipeline.
Figure 3(a) Real Tweets (Twitter_volume_AMZN), (b) Artificial With Anomaly (art _increase_spike_density); (c) Real Ad Exchange (exchange-3_cpm_results); (d) Real Traffic (TravelTime_451_whole). Snippets of Numenta Anomaly Benchmark (NAB) time-series from different domains are plotted. Actual time-series are shown in blue, whereas the highlighted area shows an anomaly window.
Figure 4Forecasting and anomaly detection results of FuseAD on the TS11 time-series from the Yahoo A3 sub-benchmark. The upper plot shows the actual time-series and forecasting results on test data, whereas the lower plot shows the anomaly score at each time-stamp. The anomaly label (i.e., 1) is assigned to data points that have a high anomaly score.
Figure 5Zoomed-in plots of two out of three anomalies detected in Figure 4. It shows that FuseAD is capable of correctly detecting point anomalies in streaming data where traditional anomaly detection methods fail normally.
Comparative evaluation of state-of-the-art anomaly detection methods on the Yahoo Webscope dataset. Average AUC per sub-benchmark is shown in this table.
| Benchmark | iForest [ | OCSVM [ | LOF [ | PCA [ | TwitterAD [ | DeepAnT [ | FuseAD |
|---|---|---|---|---|---|---|---|
| A1 | 0.8888 | 0.8159 | 0.9037 | 0.8363 | 0.8239 | 0.8976 |
|
| A2 | 0.6620 | 0.6172 | 0.9011 | 0.9234 | 0.5000 | 0.9614 |
|
| A3 | 0.6279 | 0.5972 | 0.6405 | 0.6278 | 0.6176 | 0.9283 |
|
| A4 | 0.6327 | 0.6036 | 0.6403 | 0.6100 | 0.6534 | 0.8597 |
|
Comparative evaluation of anomaly detection methods on the NAB dataset. Average AUC per domain is reported here. Bold numbers show highest AUC in a particular domain.
| Bayes ChangePT [ | Context OSE [ | EXPoSE [ | HTM Java [ | NUMENTA [ | Relative Entropy [ | Skyline [ | Twitter ADVec [ | Windowed Gaussian [ | DeepAnt [ | FuseAD | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Artificial-nA | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Artificial-wA | 0.502 | 0.316 | 0.5144 |
| 0.531 | 0.505 | 0.558 | 0.503 | 0.406 | 0.555 | 0.544 |
| Real-AdE | 0.509 | 0.307 | 0.581 | 0.568 | 0.576 | 0.505 | 0.534 | 0.504 | 0.538 | 0.562 |
|
| Real-AWS | 0.499 | 0.311 | 0.594 | 0.587 | 0.542 | 0.506 | 0.602 | 0.503 |
| 0.583 | 0.572 |
| Real-KC | 0.501 | 0.486 | 0.533 | 0.584 | 0.590 | 0.503 |
| 0.504 | 0.572 | 0.601 | 0.587 |
| Real-Tr | 0.507 | 0.310 | 0.613 |
| 0.679 | 0.508 | 0.556 | 0.505 | 0.553 | 0.637 | 0.619 |
| Real-Tw | 0.498 | 0.304 |
| 0.549 | 0.586 | 0.500 | 0.559 | 0.505 | 0.560 | 0.554 | 0.546 |
Ablation study on the Yahoo Webscope dataset.
| A1 | A2 | A3 | A4 | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ARIMA | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||
| CNN | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||
| AUC | 0.920 | 0.936 |
|
|
|
| 0.992 | 0.986 |
| 0.949 | 0.928 |
|
Ablation study on the NAB dataset.
| Artificial-nA | Artificial-wA | Read-AdE | Real-AWS | Real-KC | Real-Tr | Real-Tw | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ARIMA | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||||||
| CNN | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||||||
| AUC | 0 | 0 | 0 | 0.49 | 0.53 |
| 0.56 |
|
| 0.55 |
| 0.57 | 0.50 |
| 0.58 | 0.58 |
|
|
|
| 0.54 |
Figure 6Comparative analysis of FuseAD and anomalies detected by ARIMA and CNN models on two sample time-series. The first row shows results on TS29 and the second row shows results on TS18 from the A3 Yahoo benchmark. Respective F-scores are shown in brackets.