| Literature DB >> 35401030 |
Raisa Dzhamtyrova1,2, Carsten Maple1,3.
Abstract
The increasing value of data held in enterprises makes it an attractive target to attackers. The increasing likelihood and impact of a cyber attack have highlighted the importance of effective cyber risk estimation. We propose two methods for modelling Value-at-Risk (VaR) which can be used for any time-series data. The first approach is based on Quantile Autoregression (QAR), which can estimate VaR for different quantiles, i. e. confidence levels. The second method, we term Competitive Quantile Autoregression (CQAR), dynamically re-estimates cyber risk as soon as new data becomes available. This method provides a theoretical guarantee that it asymptotically performs as well as any QAR at any time point in the future. We show that these methods can predict the size and inter-arrival time of cyber hacking breaches by running coverage tests. The proposed approaches allow to model a separate stochastic process for each significance level and therefore provide more flexibility compared to previously proposed techniques. We provide a fully reproducible code used for conducting the experiments.Entities:
Keywords: Competitive prediction; Cyber breach modelling; Cyber risk; Dynamic risk estimation; Quantile Autoregression; Time-series
Year: 2022 PMID: 35401030 PMCID: PMC8964664 DOI: 10.1007/s10618-021-00814-z
Source DB: PubMed Journal: Data Min Knowl Discov ISSN: 1384-5810 Impact factor: 3.670
Fig. 1Visualisation of breach sizes and inter-arrival times
Summary statistics of breach sizes
| Type of organisation | Min | Median ( | Mean ( | Sd ( | Max ( | Number of observations |
|---|---|---|---|---|---|---|
| BSF | 6 | 1.7 | 4.8 | 21.3 | 145.5 | 111 |
| BSO | 2 | 10.4 | 26.4 | 214.8 | 3000.0 | 208 |
| BSR | 1 | 2.1 | 6.7 | 33.3 | 327.0 | 138 |
| EDU | 12 | 8.5 | 222.5 | 2.7 | 40.0 | 223 |
| GOV | 8 | 6.0 | 457.7 | 2.4 | 21.5 | 93 |
| MED | 1 | 4.0 | 200.1 | 2.9 | 78.8 | 805 |
| NGO | 13 | 4.0 | 142.1 | 0.6 | 3.0 | 24 |
| Total | 1 | 4.6 | 4.5 | 78.6 | 3000 | 1602 |
Summary statistics of breach inter-arrival times
| Type of organisation | Min | Median | Mean | Sd | Max | Number of observations |
|---|---|---|---|---|---|---|
| BSF | 0.0111 | 2.00 | 4.16 | 5.78 | 36 | 111 |
| BSO | 0.0480 | 1.00 | 3.08 | 4.18 | 38 | 208 |
| BSR | 0.0233 | 2.00 | 3.52 | 5.09 | 33 | 138 |
| EDU | 0.0134 | 3.00 | 5.86 | 8.12 | 59 | 223 |
| GOV | 0.0842 | 2.00 | 3.66 | 5.06 | 28 | 93 |
| MED | 0.0019 | 1.00 | 2.85 | 4.10 | 37 | 805 |
| NGO | 0.0131 | 1.00 | 2.70 | 3.56 | 13 | 24 |
| Total | 0.0019 | 2.00 | 3.49 | 5.20 | 59 | 1602 |
Fig. 2ACF and PACF
Fig. 3BIC for different lags
Fig. 4Predictions of QAR
Coverage tests for QAR for breach sizes at test data
| Method | Quantile | Exp | Act | uc.LRp | cc.LRp | uc.D | cc.D |
|---|---|---|---|---|---|---|---|
| QAR(6) | 0.90 | 63 | 55 | 0.2509 | 0.4784 | FR | FR |
| QAR(6) | 0.92 | 50 | 44 | 0.3095 | 0.5103 | FR | FR |
| QAR(6) | 0.95 | 31 | 29 | 0.6116 | 0.7456 | FR | FR |
Coverage tests for QAR for inter-arrival times at test data
| Method | Quantile | Exp | Act | uc.LRp | cc.LRp | uc.D | cc.D |
|---|---|---|---|---|---|---|---|
| QAR(5) | 0.90 | 63 | 56 | 0.3062 | 0.3539 | FR | FR |
| QAR(5) | 0.92 | 50 | 41 | 0.1360 | 0.0146 | FR | R |
| QAR(5) | 0.95 | 31 | 26 | 0.2765 | 0.1463 | FR | FR |
Parameters of CQAR on training
| (a) Acceptance ratio | (b) Pinball losses | ||||||
|---|---|---|---|---|---|---|---|
| a \ | 0.5 | 0.7 | 1 | a \ | 0.5 | 0.7 | 1 |
| 0.1 | 0.69 | 0.47 | 0.22 | 0.1 | 281.69 | 281.74 | 268.00 |
| 0.5 | 0.61 | 0.36 | 0.12 | 0.5 | 177.52 | 171.76 | 172.50 |
| 1 | 0.53 | 0.27 | 0.06 | 1 | 137.20 | 138.21 | |
Coverage tests for CQAR for inter-arrival times at test data
| Method | Quantile | Exp | Act | uc.LRp | cc.LRp | uc.D | cc.D |
|---|---|---|---|---|---|---|---|
| CQAR(5) | 0.90 | 63 | 69 | 0.4808 | 0.0785 | FR | FR |
| CQAR(5) | 0.92 | 50 | 54 | 0.6514 | 0.1025 | FR | FR |
| CQAR(5) | 0.95 | 31 | 27 | 0.3705 | 0.4844 | FR | FR |
Fig. 5CQAR
BIC of different orders of ARMA(p, q)
| p \ q | 0 | 1 | 2 |
|---|---|---|---|
| 0 | 3608.2 | 3604.9 | 3605.1 |
| 1 | 3603.1 | 3586.7 | |
| 2 | 3602.1 | 3586.7 | 3593.0 |
Fig. 6Predictions of ARMA(1, 1)-GARCH(1, 1)
Coverage tests for ARMA(1, 1)-GARCH(1, 1) for breach sizes at test data
| Method | Quantile | Exp | Act | uc.LRp | cc.LRp | uc.D | cc.D |
|---|---|---|---|---|---|---|---|
| ARMA-GARCH | 0.90 | 58 | 51 | 0.2737 | 0.2510 | FR | FR |
| ARMA-GARCH | 0.92 | 47 | 40 | 0.2730 | 0.5399 | FR | FR |
| ARMA-GARCH | 0.95 | 29 | 25 | 0.3933 | 0.4830 | FR | FR |
Coverage tests for CQAR(2) for breach sizes at test data
| Method | Quantile | Exp | Act | uc.LRp | cc.LRp | uc.D | cc.D |
|---|---|---|---|---|---|---|---|
| CQAR(2) | 0.90 | 58 | 41 | 0.0101 | 0.0361 | R | R |
| CQAR(2) | 0.92 | 47 | 35 | 0.0561 | 0.0674 | FR | FR |
| CQAR(2) | 0.95 | 29 | 22 | 0.1435 | 0.3333 | FR | FR |
BIC of different orders of ARMA(p, q)
| p \ q | 0 | 1 | 2 |
|---|---|---|---|
| 0 | 3004.5 | 3002.2 | 2985.4 |
| 1 | 2999.0 | 2926.9 | |
| 2 | 2978.0 | 2926.8 | 2931.4 |
Coverage tests of ARMA(1, 1)-GARCH(1, 1) and CQAR(2) for inter-arrival times at test data
| method | quantile | exp | act | uc.LRp | cc.LRp | uc.D | cc.D |
|---|---|---|---|---|---|---|---|
| ARMA-GARCH | 0.90 | 61 | 65 | 0.6801 | 0.4293 | FR | FR |
| ARMA-GARCH | 0.92 | 49 | 45 | 0.4969 | 0.7878 | FR | FR |
| ARMA-GARCH | 0.95 | 30 | 18 | 0.0097 | 0.0207 | R | R |
| CQAR(2) | 0.90 | 61 | 70 | 0.2867 | 0.3782 | FR | FR |
| CQAR(2) | 0.92 | 49 | 54 | 0.5125 | 0.0954 | FR | FR |
| CQAR(2) | 0.95 | 30 | 31 | 0.9926 | 0.0519 | FR | FR |