| Literature DB >> 34177356 |
Lin Zhang1, Wenyu Zhang2, Maxwell J McNeil1, Nachuan Chengwang1, David S Matteson2, Petko Bogdanov1.
Abstract
The ability to accurately and consistently discover anomalies in time series is important in many applications. Fields such as finance (fraud detection), information security (intrusion detection), healthcare, and others all benefit from anomaly detection. Intuitively, anomalies in time series are time points or sequences of time points that deviate from normal behavior characterized by periodic oscillations and long-term trends. For example, the typical activity on e-commerce websites exhibits weekly periodicity and grows steadily before holidays. Similarly, domestic usage of electricity exhibits daily and weekly oscillations combined with long-term season-dependent trends. How can we accurately detect anomalies in such domains while simultaneously learning a model for normal behavior? We propose a robust offline unsupervised framework for anomaly detection in seasonal multivariate time series, called AURORA. A key innovation in our framework is a general background behavior model that unifies periodicity and long-term trends. To this end, we leverage a Ramanujan periodic dictionary and a spline-based dictionary to capture both seasonal and trend patterns. We conduct experiments on both synthetic and real-world datasets and demonstrate the effectiveness of our method. AURORA has significant advantages over existing models for anomaly detection, including high accuracy (AUC of up to 0.98), interpretability of recovered normal behavior ( 100 % accuracy in period detection), and the ability to detect both point and contextual anomalies. In addition, AURORA is orders of magnitude faster than baselines.Entities:
Keywords: Alternating optimization; Multivariate time series; Offline anomaly detection; Periodic dictionary; Spline dictionary
Year: 2021 PMID: 34177356 PMCID: PMC8220123 DOI: 10.1007/s10618-021-00771-7
Source DB: PubMed Journal: Data Min Knowl Discov ISSN: 1384-5810 Impact factor: 3.670
Fig. 1An example of multivariate time series with seasonality, trends, and different types of anomalies: a The map shows three neighbouring countries in south America: Peru, Brazil and Bolivia; b the graph shows weekly time series of Google flu searches for these three countries spanning from 2002 to 2015. Point and segment annotations are predicted anomalies
Fig. 2Comparison of anomaly detection quality for different types of anomalies in synthetic and varying signal-to-noise ratio (SNR) taking values of [10, 40, 70, 100]. We consider a point anomalies; b contextual anomalies (size range: [3, 4, 5, 6]); c mixture of both types
Fig. 3Comparison of AUC for anomaly detection by varying the magnitude of injected anomalies in the a Google flu; b Power plant datasets
AUC comparison for anomaly detection in the Yahoo benchmark. The performance of the best method in each benchmark dataset are marked in bold font
| 1 | 2 | 4 | 5 | 6 | 8 | 10 | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| AURORA | ||||||||||
| Donut | 0.58 | 0.04 | 0.02 | 0.34 | 0.61 | 0 | 0.47 | 0.42 | 0.63 | 0.46 |
| TwitterR | 0.52 | 0.6 | 0.56 | 0.42 | 0.52 | 0.39 | 0.59 | 0.61 | 0.61 | 0.75 |
| 11 | 12 | 14 | 17 | 19 | ||||||
| AURORA | 0 | |||||||||
| Donut | 0 | 0.11 | 0.3 | 0.17 | 0.17 | 0.41 | 0.45 | 0.41 | 0.77 | 0.57 |
| TwitterR | 0.35 | 0.35 | 0.42 | 0.55 | 0.46 | 0.51 | 0.48 | 0.56 | 0.46 | 0.54 |
| 21 | 22 | 23 | 24 | 25 | ||||||
| AURORA | 0 | 0 | 0 | 0 | 0.3 | |||||
| Donut | 0.98 | 0.78 | 0.49 | 0.05 | 0.31 | 0.5 | 0.42 | 0.44 | 0.03 | |
| TwitterR | 0.85 | 0.45 | 0.50 | 0.38 | 0.13 | 0.52 | 0.55 | 0.92 | ||
| 31 | 32 | 33 | 40 | |||||||
| AURORA | 0 | 0 | 0.43 | 0.01 | ||||||
| Donut | 0.71 | 0 | 0.72 | 0 | 0.38 | 0.71 | 0.45 | 0.47 | ||
| TwitterR | 0.5 | 0.5 | 0.25 | 0.37 | 0 | 0.72 | 0.74 | 0.66 | 0.45 | |
| 41 | 42 | 43 | 45 | 48 | 50 | |||||
| AURORA | 0.25 | 0.02 | 0.35 | 0 | ||||||
| Donut | 0.41 | 0.42 | 0.64 | 0.97 | 0.49 | 0.42 | 0.36 | 0.41 | 0 | |
| TwitterR | 0.53 | 0.47 | 0.45 | 0.52 | 0.17 | 0.44 | 0.6 | |||
| 53 | 58 | |||||||||
| AURORA | 0.39 | 0.11 | 0.5 | 0.23 | 0.34 | 0 | ||||
| Donut | 0.70 | 0.58 | 0.1 | 0.4 | 0.53 | 0.53 | 0.48 | 0 | 0.3 | |
| TwitterR | 0.47 | 0.3 | 0.52 | 0.58 | 0.48 | 0 | 0.52 | |||
| 61 | 62 | 63 | 66 | 67 | ||||||
| AURORA | 0 | |||||||||
| Donut | 0.43 | 0.44 | 0 | 0 | 0 | 0.41 | 0.51 | |||
| TwitterR | 0.54 | 0.36 | 0.12 | 0 | 0.48 | 0.44 | 0.56 |
Indices marked with asterisk (*) have been reported to have questionable anomaly labels in recent work (De Paepe et al. 2020), but we include them for completeness
AUC comparison for anomaly detection on the NAB benchmark, realTwitter series. The performance of the best method in each benchmark dataset are marked in bold font
| APPL | AMZN | CRM | CVS | FB | GOOG | IBM | KO | PFE | UPS | |
|---|---|---|---|---|---|---|---|---|---|---|
| AURORA | 0.76 | |||||||||
| Donut | 0.64 | 0.93 | 0.42 | 0.52 | 0.48 | 0.49 | 0.85 | 0.56 | 0.77 | |
| TwitterR | 0.72 | 0.97 | 0.94 | 0.36 | 0.82 | 0.59 | 0.40 | 0.76 | 0.73 | 0.85 |
Fig. 4Comparison between AURORA and a univariate alternative AURORAuni on a synthetic and b the Google Flu datasets
Fig. 5Case study on Google flu. Blue diamonds and red circles annotate anomalies detected within and outside of flu season, respectively
Fig. 6The comparison of period learning by varying SNR under different types of anomalies
Fig. 7Parameter sensitivity of AURORA
Fig. 8Comparison of CPU running time as a function of the time series length and the number of samples (univariate time series) between a AURORA, b TwitterR and c Donut. Note, that AURORA’s time is reported in seconds while those of baselines in hours