| Literature DB >> 36151389 |
Abstract
Spatially referenced data arise in many fields, including imaging, ecology, public health, and marketing. Although principled smoothing or interpolation is paramount for many practitioners, regression, too, can be an important (or even the only or most important) goal of a spatial analysis. When doing spatial regression it is crucial to accommodate spatial variation in the response variable that cannot be explained by the spatially patterned explanatory variables included in the model. Failure to model both sources of spatial dependence-regression and extra-regression, if you will-can lead to erroneous inference for the regression coefficients. In this article I highlight an under-appreciated spatial regression model, namely, the spatial Gaussian copula regression model (SGCRM), and describe said model's advantages. Then I develop an intuitive, unified, and computationally efficient approach to inference for the SGCRM. I demonstrate the efficacy of the proposed methodology by way of an extensive simulation study along with analyses of a well-known dataset from disease mapping.Entities:
Mesh:
Year: 2022 PMID: 36151389 PMCID: PMC9508247 DOI: 10.1038/s41598-022-20171-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Stomach cancer incidence and socioeconomic status for the municipalities of Slovenia. Figure created using R version 4.1.2 (https://www.r-project.org).
Figure 2The adjacency structure for the municipalities of Slovenia. Figure created using R version 4.1.2 (https://www.r-project.org).
Scenarios for the simulation study.
| Scenario | Response | Copula | Sample size | Copula parameter(s) | |
|---|---|---|---|---|---|
| 1 | Poisson | Proper CAR | |||
| 2 | |||||
| 3 | Binomial | Leroux | |||
| 4 | Bernoulli | Exponential | |||
| 5 | Negative binomial | Matérn | |||
| 6 |
Figure 3Left panel: the mean structure for the Poisson (Scenarios 1 and 2) and negative binomial (Scenarios 5 and 6) simulation studies. Right panel: the mean structure for the binomial (Scenario 3) and ungrouped binary (Scenario 4) simulation studies. Figure created using R version 4.1.2 (https://www.r-project.org).
Regression results for the simulation scenarios described in Table 1.
| Scenario | Interval (%) | Parameter | Ordinary coverage rate (%) | Spatial coverage rate (%) | Ordinary type II rate (%) | Spatial type II rate (%) | Running time (s) |
|---|---|---|---|---|---|---|---|
| 1 | 95 | 27 | 84 | 0 | 0 | 11 | |
| 32 | 88 | 3 | 29 | ||||
| 99 | – | 95 | – | 0 | |||
| – | 95 | – | 50 | ||||
| 2 | 95 | 70 | 94 | 0 | 0 | 10 | |
| 69 | 94 | 0 | 0 | ||||
| 3 | 95 | 29 | 91 | 0 | 0 | 48 | |
| 31 | 90 | 2 | 26 | ||||
| 99 | – | 98 | – | 0 | |||
| – | 98 | – | 48 | ||||
| 4 | 95 | 31 | 59 | 6 | 13 | 16 | |
| 36 | 63 | 30 | 58 | ||||
| 5 | 95 | 31 | 89 | 0 | 7 | 101 | |
| 31 | 88 | 16 | 68 | ||||
| 99 | – | 95 | – | 12 | |||
| – | 94 | – | 80 | ||||
| 6 | 95 | 72 | 97 | 0 | 0 | 90 | |
| 70 | 95 | 2 | 19 |
Results for nuisance parameters for the simulation scenarios described in Table 1.
| Scenario | Parameter | Median estimate | Remarks |
|---|---|---|---|
| 1 | 0.976 | Small negative bias; | |
| 2 | 0.763 | Small negative bias; | |
| 3 | 0.931 | Small negative bias; | |
| 4 | 0.109 | Large negative bias (63%); | |
| 5 | 0.114 | Small bias; | |
| 0.654 | Substantial negative bias; | ||
| 5.533 | Positive bias; distribution of | ||
| 6 | 0.047 | Positive bias; | |
| 0.509 | Large negative bias; | ||
| 3.286 | Small positive bias; |
Figure 4Standardized residuals for an ordinary GLM fit (Poisson regression with offset) to the Slovenia stomach cancer data. Darker gray means larger value. The residuals exhibit substantial, but short-range, positive dependence. Figure created using R version 4.1.2 (https://www.r-project.org).
Results for analyses of the Slovenia stomach cancer data.
| Approach | Intercept | Effect | Copula parameter | Running time |
|---|---|---|---|---|
| Ordinary GLM | – | |||
| SGCRM (CAR) | 18 s | |||
| SGCRM (Leroux) | 11 s | |||
| SGLMM (CAR) | 1 h |
The first row shows results for an ordinary Poisson regression with offset. The second row shows results for the two-stage SGCRM procedure, where the first stage employed an ordinary Poisson regression with offset. The third row shows results for the SGCRM procedure with Leroux copula. The fourth row shows results for an SGLMM fit such that the linear predictor for the Poisson regression with offset was augmented with proper CAR spatial random effects.