| Literature DB >> 25673036 |
Abstract
This paper proposes a new algorithm to estimate the maximum likelihood parameters of an Epidemic Type Aftershock Sequences (ETAS) model. It is based on Simulated Annealing, a versatile method that solves problems of global optimization and ensures convergence to a global optimum. The procedure is tested on both simulated and real catalogs. The main conclusion is that the method performs poorly as the size of the catalog decreases because the effect of the correlation of the ETAS parameters is more significant. These results give new insights into the ETAS model and the efficiency of the maximum-likelihood method within this context.Entities:
Year: 2015 PMID: 25673036 PMCID: PMC4325320 DOI: 10.1038/srep08417
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Background spatial probabilities ui (see eq. 9) adopted for ETAS simulations.
The map was created using the software Generic Mapping Tools (http://gmt.soest.hawaii.edu/).
Figure 2Plot of median estimates versus the pseudo-real values for the 8 parameters of the ETAS model {μ, k, p, c, α, d, q, γ} and for all the simulated catalogs.
The color is scaled with the size of the catalog (see text for details).
Figure 3Analysis of bias, accuracy and precision of ML estimation of the ETAS parameters, obtained by the VFSA algorithm proposed in the present study.
(a) Distribution of the bias for all the simulated ETAS catalogs and for the 8 parameters of the ETAS model. Symbols mark the median values (circles for 1 year catalogs, stars for 5 year catalogs). The bounds show the 5-th and the 95-th percentiles (of the values obtained for each catalog). (b) The same of (a), except for accuracy. (c) The same of (a), except for precision. (d) Results of statistical tests applied to check the correlation/dependence between the number of earthquakes N and the accuracy/precision of the 8 parameters of the ETAS model. The hypothesis of independence is rejected for all parameters, except for μ.
Figure 4Plot of the estimated median (on 100 runs) background probabilities u versus the pseudo real values for all cells with at least an event and for all catalogs.
The color is scaled with the size of the catalog N. (a) Plot for catalogs with N ≤ 1000. (b) The same as a), but for 1000 < N ≤ 2000. (c) The same as (a), but for N > 2000.
Figure 5Analysis of log-likelihood values.
(a) Difference of log-likelihoods, computed on the pseudo-real and the estimated parameters (ΔLL), as a function of the size of the catalog (N). (b) Plot of the estimated values of the 8 parameters of the ETAS model, obtained by assuming the background probabilities u known/unknown. The x and y axes are equal and are scaled with the range of each parameter. On the right of each x-axis, the corresponding parameter and symbol are reported. (c) Range of variability of log-likelihoods (RG, eq.16) versus the number of events (N) for all catalogs.
Results of model estimations for the smallest simulated catalog. The first column lists the pseudo-real parameters. In the second column, the median (on 100 runs) estimated values are reported. The third and the fourth columns list the log-likelihood values for pseudo-real and estimated parameters. The fifth column reports the expected number of events. The values in brackets are the 5-th and the 95-th percentiles of relative variables. The numbers in boldface mark the results obtained by fixing the background probabilities u to pseudo-real values. The last column lists significant iterations of the run giving the higher maximum likelihood (the first 8 values, in curly braces, are the parameters; the last is the log-likelihood)
| 1 year, 102 events | |||||
|---|---|---|---|---|---|
| Nev | Run Hist | ||||
| −665.3 | −682.6 (−682.7, −682.6) | 102 (101, 103) | {5.43E−2, 1.19E−2, 1.64, 8.95E−2, 0.98, 0.44, 1.34, 0.44}; −694.0 | ||
| 102 (101, 103) | {7.41E−2 4.76E−3 1.66 7.19E−2 1.07 0.75 1.79 0.82}; −687.7 | ||||
| {0.14, 4.70E−3, 1.16, 8.64E−2, 1.53, 0.71, 1.87, 0.75}; −687.6 | |||||
| {0.13, 4.48E−3, 1.55, 8.77E−2, 1.82, 0.85, 1.32, 0.43}; −685.7 | |||||
| {9.40E−2, 5.94E−3, 1.44, 4.25E−2, 1.59, 0.90, 1.67, 0.80}; −683.7 | |||||
| {9.37E−2, 5.110E−3, 1.52, 6.75E−2, 1.70, 0.87, 1.88, 0.78}; −683.4 | |||||
| {0.10, 5.88E−3, 1.46, 6.26E−2, 1.82, 0.83, 1.85, 0.77}; −683.2 | |||||
| {0.10, 4.55E−3, 1.46, 6.17E−2, 1.82, 0.84, 1.76, 0.77}; −683.1 | |||||
| {0.10, 4.60E−3, 1.48, 7.46E−2, 1.80, 0.83, 1.82, 0.90}; −683.1 | |||||
| {0.10, 6.21E−3, 1.47, 7.44E−2, 1.90, 0.93, 1.84, 0.83}; −683.1 | |||||
| {0.10 6.02E−3 1.49 7.42E−2 1.89 0.97 1.83 0.79}; −683.0 | |||||
| {0.10, 5.06E−3, 1.48, 7.36E−2, 1.89, 0.97, 1.84, 0.79}; −682.9 | |||||
| {0.11, 5.19E−3, 1.47, 7.80E−2, 1.91, 0.96, 1.84, 0.81}; −682.9 | |||||
| {0.10, 4.48E−3, 1.47, 6.26E−2, 1.91, 0.96, 1.84, 0.81}; −682.8 | |||||
| {0.10, 4.43E−3, 1.46, 6.62E−2, 1.91, 0.96, 1.95, 0.81}; −682.7 | |||||
| {0.11, 4.70E−3, 1.46, 6.63E−2, 1.90, 0.94, 1.97, 0.81}; −682.7 | |||||
| {0.11, 4.70E−3, 1.49, 7.77E−2, 1.90, 0.94, 1.97, 0.81}; −682.7 | |||||
| {0.11, 4.80E−3, 1.47, 7.80E−2, 1.92, 0.99, 1.97, 0.81}; −682.6 | |||||
| {0.11, 4.94E−3, 1.48, 7.97E−2, 1.86, 0.98, 1.98, 0.76}; −682.6 | |||||
| {0.10, 4.91E−3, 1.49, 8.01E−2, 1.87, 0.98, 1.98, 0.76}; −682.6 | |||||
| {0.11, 4.54E−3, 1.49, 8.07E−2, 1.92, 0.99, 1.99, 0.74}; −682.6 | |||||
Results of model estimations for the largest simulated catalog. The first column lists the pseudo-real parameters. In the second column, the median (on 100 runs) estimated values are reported. The third and the fourth columns list the log-likelihood values for pseudo-real and estimated parameters. The fifth column reports the expected number of events. The values in brackets are the 5-th and the 95-th percentiles of relative variables. The last column lists significant iterations of the run giving the higher maximum likelihood (the first 8 values, in curly braces, are the parameters; the last is the log-likelihood)
| 5 years, 3889 events | |||||
|---|---|---|---|---|---|
| Nev | Run Hist | ||||
| −31145 | −31018 (−31020, −31017) | 3890 (3865,3905) | {0.77, 1.68E−2, 1.85, 4.35E−2, 0.89, 0.90, 1.26, 2.98E−2}; −31725.0 | ||
| {0.56, 3.68E−2, 1.54, 6.25E−2, 0.84, 0.58, 1.86, 0.69}; −31343.1 | |||||
| {0.91, 3.50E−2, 1.38, 7.60E−2, 1.08, 0.58, 1.83, 0.29}; −31325.7 | |||||
| {0.95, 6.23E−2, 1.56, 9.76E−2, 0.58, 0.90, 1.96, 0.30}; −31228.5 | |||||
| {0.98, 2.95E−2, 1.85, 9.39E−2, 1.36, 0.68, 1.47, 0.50}; −31227.4 | |||||
| {0.96, 5.17E−2, 1.48, 8.41E−2, 0.97, 0.73, 1.56, 0.24}; −31197.4 | |||||
| {0.95, 3.42E−2, 1.47, 4.39E−2, 1.20, 0.78, 1.53, 0.27}; −31174.1 | |||||
| {0.83, 2.24E−2, 1.45, 2.93E−2, 1.27, 0.78, 1.62, 0.18}; −31110.2 | |||||
| {0.91, 2.37E−2, 1.55, 5.38E−2, 1.20, 0.75, 1.62, 2.40E−2}; −31094.5 | |||||
| {0.91, 2.21E−2, 1.53, 4.57E−2, 1.20, 0.79, 1.62, 4.25E−2 }; −31088.3 | |||||
| {0.87, 2.56E−2, 1.53, 5.29E−2, 1.24, 0.75, 1.83, 0.57}; −31067.4 | |||||
| {0.86, 2.52E−2, 1.53, 5.38E−2, 1.14, 0.72, 1.83, 0.51}; −31062.8 | |||||
| {0.94, 2.51E−2, 1.50, 5.35E−2, 1.32, 0.64, 1.78, 0.52}; −31055.3 | |||||
| {0.93, 2.24E−2, 1.61, 5.18E−2, 1.35, 0.64, 1.80, 0.52}; −31041.4 | |||||
| {0.94, 2.11E−2, 1.65, 6.13E−2, 1.32, 0.64, 1.74, 0.52}; −31032.5 | |||||
| {0.93, 2.27E−2, 1.72, 7.10E−2, 1.28, 0.61, 1.72, 0.48}; −31026.4 | |||||
| {0.99, 2.32E−2, 1.73, 7.27E−2, 1.29, 0.60, 1.72, 0.48}; −31025.4 | |||||
| {0.99, 2.53E−2, 1.74, 7.97E−2, 1.28, 0.67, 1.78, 0.45}; −31020.3 | |||||
| {0.97, 2.52E−2, 1.74, 8.03E−2, 1.27, 0.69, 1.76, 0.42}; −31018.2 | |||||
| {0.97, 2.58E−2, 1.74, 7.95E−2, 1.25, 0.71, 1.76, 0.39}; −31017.0 | |||||
Estimation of ETAS parameters for the Italian catalog. The results obtained by using the VFSA and the Quasi-Newton algorithms are compared. For the VFSA algorithm, the median (on 100 runs) values are reported. The values in the brackets are the 5-th and the 95-th percentiles of the relative parameter
| Parameter | VFSA algorithm | Quasi-Newton algorithm |
|---|---|---|
| 2.0 (2.0, 2.1) ·10−1 | 2.1 ·10−1 | |
| 2.3 (2.2, 2.6) ·10−2 | 2.2 ·10−2 | |
| 1.13 (1.12, 1.15) | 1.15 | |
| 7.0 (6.0, 9.0) ·10−3 | 8.0 ·10−3 | |
| 1.4 (1.4, 1.5) | 1.5 | |
| 9.3 (9.0, 9.7) ·10−1 | 8.8 ·10−1 | |
| 1.78 (1.75, 1.80) | 1.78 | |
| 4.8 (4.4, 4.9) ·10−1 | 4.9 ·10−1 | |
| Nev | 1789 (1777, 1806) | 1793 |
Figure 6Analysis of dependence/correlation for all possible pairs of the 8 parameters of the ETAS model.
The probability of independence/non-correlation is computed on values obtained by 100 runs for each catalog, each pair of parameters and each statistical test. The proportion of catalogs with p-values < 0.05 is plotted as a function of the pairs of parameters (the parameters are labeled on the top and bottom x-axes).
Figure 7“Partial” estimation of the ETAS model on 1-year simulated catalogs.
Each parameter is estimated one at a time by keeping the remaining parameters and the background probabilities fixed to the pseudo-real values. (a) Distribution of accuracy and precision. The symbols mark the median values (circles for accuracy, stars for precision). The bounds indicate the 5-th and the 95-th percentiles (of the values obtained for each catalog). (b) Difference of the median log-likelihoods computed on pseudo-real and estimated parameters (ΔLL), as a function of the “free” parameter. Due to the high precision of this type of estimation, the log-likelihoods of 100 runs for each catalog are close.