| Literature DB >> 26900339 |
L Evers1, D A Molinari1, A W Bowman1, W R Jones2, M J Spence2.
Abstract
Fitting statistical models to spatiotemporal data requires finding the right balance between imposing smoothness and following the data. In the context of P-splines, we propose a Bayesian framework for choosing the smoothing parameter, which allows the construction of fully automatic data-driven methods for fitting flexible models to spatiotemporal data. An implementation, which is highly computationally efficient and exploits the sparsity of the design and penalty matrices, is proposed. The findings are illustrated using a simulation study and two examples, all concerned with the modelling of contaminants in groundwater. This suggests that the proposed strategy is more stable that competing methods based on the use of criteria such as generalised cross-validation and Akaike's Information Criterion.Entities:
Keywords: Bayesian methods; groundwater; smoothing; spatiotemporal models; splines
Year: 2015 PMID: 26900339 PMCID: PMC4744788 DOI: 10.1002/env.2347
Source DB: PubMed Journal: Environmetrics ISSN: 1099-095X Impact factor: 1.900
Figure 1Predictions of the concentration (in μ g/ℓ) for the benzene data on one particular day (Section 6.1) using the penalisation parameter chosen by optimising the different criteria, as well as fully Bayesian model averaging. The left column was obtained by using all wells, the right column was obtained after removing four wells (marked by crosses). Using generalised cross‐validation or observation‐based cross‐validation gives results very similar to AICc
Integrated squared errors of the predictions averaged over the convex hull of the data for the three well scenarios
| Criterion used to select smoothness | Scenario 1 | Scenario 2 | Scenario 3 | |||
|---|---|---|---|---|---|---|
| mean | (std. err.) | mean | (std. err) | mean | (std. err) | |
| AICc | 313.326 | (258.319) | 0.137 | (0.003) | 0.870 | (0.009) |
| Generalised cross‐validation | 348.947 | (276.003) | 0.139 | (0.001) | 3.780 | (2.282) |
| obs.‐based CV | 3.552 | (0.216) | 0.141 | (0.001) | 1.023 | (0.016) |
| well‐based CV | 0.809 | (0.012) | 0.138 | (0.001) | 0.902 | (0.012) |
| BIC | 1.028 | (0.013) | 0.179 | (0.001) | 0.875 | (0.003) |
| Bayesian MAP | 1.662 | (0.020) | 0.136 | (0.001) | 0.863 | (0.005) |
| Bayesian model avg. | 1.639 | (0.019) | 0.136 | (0.001) | 0.858 | (0.005) |
Figure 2Density strip plots of the smoothing parameters chosen by the different methods for both scenarios. The dashed red line indicates the median
Figure 3Different objective functions that can be used to determine the optimal amount of smoothing, applied to the benzene data. The solid lines were obtained using all wells. The dotted lines were obtained after removing four wells. The vertical dashed lines indicate the location of the minimum when all wells are used
Figure 4Plan of the refinery site and wells. The wells are colour‐coded according to observed concentrations of methyl tertiary butyl ether immediately after release
Figure 5Predicted levels of methyl tertiary butyl ether concentration across space obtained using the maximum a posteori estimate of the smoothing parameter for four time points. The colour scale is the same as that used in Figure 4