| Literature DB >> 30481193 |
Juan C Duque1,2, Henry Laniado1, Adriano Polo2,3.
Abstract
This work presents a nonparametric statistical test, S-maup, to measure the sensitivity of a spatially intensive variable to the effects of the Modifiable Areal Unit Problem (MAUP). To the best of our knowledge, S-maup is the first statistic of its type and focuses on determining how much the distribution of the variable, at its highest level of spatial disaggregation, will change when it is spatially aggregated. Through a computational experiment, we obtain the basis for the design of the statistical test under the null hypothesis of non-sensitivity to MAUP. We performed an exhaustive simulation study for approaching the empirical distribution of the statistical test, obtaining its critical values, and computing its power and size. The results indicate that, in general, both the statistical size and power improve with increasing sample size. Finally, for illustrative purposes, an empirical application is made using the Mincer equation in South Africa, where starting from 206 municipalities, the S-maup statistic is used to find the maximum level of spatial aggregation that avoids the negative consequences of the MAUP.Entities:
Mesh:
Year: 2018 PMID: 30481193 PMCID: PMC6258515 DOI: 10.1371/journal.pone.0207377
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Computational experiments on MAUP.
| Author (Year) | Dimension / Effect on… | Grouping operator | Data | Variable | Size |
|---|---|---|---|---|---|
| [ | Scale / | Sum | Census Tracts in Cleveland | Male juvenile delinquency and monthly income. Agricultural products and the number of farmers | 1) 252 areas into 200, 175, 150, 125, 100, 50, and 25 regions 2) 1,000 areas into 63, 40, 31, and 8 regions |
| [ | Scale / | Proportions | Nine geographic divisions of the USA in 1930 | Race and illiteracy | 97,272 individuals into 9 regions |
| [ | Scale / | Mean | Agricultural counties in England | Production of wheat and potatoes per acre | 48 areas into 24, 12, 6, and 3 regions |
| [ | Scale / | Mean | Metropolitan area of Los Angeles | Household income and education level of the head of household | 1,556 census tracts into 134 Welfare Planning Council Study areas and 35 Regional Planning Commission Statistical Areas |
| [ | Scale—Zoning / | - | Counties in Iowa and simulated data with | % of Republican votes and % population over 60 years. | 99 areas into 6, 12, 18, 24, 30, 36, 42, 48, 54, 60, 66, and 72 regions |
| [ | Scale—Zoning / | Mean | Quadrat in Hukuno Town, Japan and weights of wheat plots of grain | Quadrat counts of houses and weights of wheat plots of grain | 1) Regular lattice of 32x32 into 16x16, 8x8, 4x4, and 2x2 regions 2) Regular lattice of 25x20 cells into 8x8, 4x4, and 2x2 regions |
| [ | Scale—Zoning / | Mean and Proportion | Metropolitan area of Buffalo | Household income, % of population per area, % of population over 65 years | 871 areas into 800, 400, 200, 100, 50, and 25 regions |
| [ | Zoning / | Mean and weighted average | Regular lattices | Simulated data with Uniform, Normal and Poisson distribution | 10,000 areas into 10x10, 7x7, and 3x3 regions |
| [ | Zoning / | Mean | City of Adelaide, Australia | 82 socioeconomic variables | 917,000 people into 1,584 districts |
| [ | Zoning / | Mean | Lancashire, UK | 8 census variables | 304 areas into 137, 122, 106, 91, 76, 61, 46, and 30 regions |
| [ | Scale—Zoning / | Mean | UK | Census variables and simulated variables | Regular lattice of 120x120 into 1x1, 2x2, 3x3, 4x4, and 5x5 regions |
| [ | Scale / | Mean | Malasia | Biomass areas and elevation data | Regular Lattice of 220x188 into 2x2, 3x3, 4x4, …, and 20x20 regions |
| [ | Scale / | Mean | Manitoba, Canada | Normalized Difference Vegetation Index (NDVI) | Regular lattice of 300x300 into 3x3, 5x5, 7x7, 9x9, 11x11, 13x13, and 15x15 areas |
| [ | Zoning / | Mean | Regular lattices | Simulated variables with different levels of spatial autocorrelation and variance | 400 areas into 180, 160, 140, 120, 100, 80, 60, and 40 regions |
| [ | Zoning / | Mean and Median | Regular lattices | Simulated data with different levels of spatial autocorrelation | Regular lattice of 512x512 into 3x3, 9x9, 11x11, 21x21, 31x31, 41x41, 51x51, 61x61, 71x71, and 81x81 pixel window sizes |
| [ | Scale / | Mean | Regular lattice | Simulated data | Regular lattice of 64x64 into 32x32, 16x16, 8x8, and 4x4. |
r: Correlation, μ: Media, σ2: Variance, σ: Covariance, ρ: Spatial autocorrelation, β: Regression coefficients.
Fig 1Instance of the experiment.
Fig 2Example of spatial autocorrelation generation.
Fig 3Relative change in mean—Average effect.
(a) N = 25; (b) N = 100; (c) N = 225; (d) N = 400; (e) N = 625; (f) N = 900.
Effect on mean.
| Number of areas | ||||||
|---|---|---|---|---|---|---|
| Proportion | 0 | 0.00063 | 0.00014 | 0.00041 | 0.00063 | 0.0012 |
*Number of t-test rejections divided by |i| * |ρ| * |K| * r, where |⋅| indicates the cardinality. It includes aggregations with k ≥ 10.
Fig 4Relative change in variance—Average effect.
(a) N = 25; (b) N = 100; (c) N = 225; (d) N = 400; (e) N = 625; (f) N = 900.
Fig 5Proportion of instances for which the Levene test rejects the null hypothesis of equality of variance, with a level of significance α = 0.05.
(a) N = 25; (b) N = 100; (c) N = 225; (d) N = 400; (e) N = 625; (f) N = 900.
Fig 6MAUP effects at three levels of spatial autocorrelation, (a) ρ = −0.9, (b) ρ = 0, and (c) ρ = 0.9.
Solid line: original variable with N = 900; dashed lines: 30 aggregations with k = 240. The vertical lines indicate μ and μ.
Fig 7Median for N = 100.
Fig 8Adjustments of robust linear regression models.
(a) Linearized logistic function (L); (b) Linearized power function (η); (c) Linear function (τ).
Critical values (M).
| Number of areas ( | |||||||
|---|---|---|---|---|---|---|---|
| 25 | 100 | 225 | 400 | 625 | 900 | ||
| -0.9 | 0.01 | 0.83702 | 0.09218 | 0.23808 | 0.05488 | 0.07218 | 0.02621 |
| 0.05 | 0.83699 | 0.08023 | 0.10962 | 0.04894 | 0.04641 | 0.02423 | |
| 0.1 | 0.69331 | 0.06545 | 0.07858 | 0.04015 | 0.03374 | 0.02187 | |
| -0.7 | 0.01 | 0.83676 | 0.16134 | 0.13402 | 0.06737 | 0.05486 | 0.02858 |
| 0.05 | 0.83662 | 0.12492 | 0.08643 | 0.05900 | 0.04280 | 0.02459 | |
| 0.1 | 0.79421 | 0.09566 | 0.06777 | 0.05058 | 0.03392 | 0.02272 | |
| -0.5 | 0.01 | 0.83597 | 0.16524 | 0.13446 | 0.06616 | 0.06247 | 0.02851 |
| 0.05 | 0.83578 | 0.13796 | 0.08679 | 0.05927 | 0.04260 | 0.02658 | |
| 0.1 | 0.68900 | 0.10707 | 0.07039 | 0.05151 | 0.03609 | 0.02411 | |
| -0.3 | 0.01 | 0.83316 | 0.19276 | 0.13396 | 0.06330 | 0.06090 | 0.03696 |
| 0.05 | 0.78849 | 0.16932 | 0.08775 | 0.05464 | 0.04787 | 0.03042 | |
| 0.1 | 0.73592 | 0.14282 | 0.07076 | 0.04649 | 0.04001 | 0.02614 | |
| 0.0 | 0.01 | 0.82370 | 0.17925 | 0.15514 | 0.07732 | 0.07988 | 0.09301 |
| 0.05 | 0.81952 | 0.15746 | 0.11126 | 0.06961 | 0.06066 | 0.05234 | |
| 0.1 | 0.71632 | 0.13621 | 0.08801 | 0.06112 | 0.04937 | 0.03759 | |
| 0.3 | 0.01 | 0.76472 | 0.23404 | 0.24640 | 0.11588 | 0.10715 | 0.07070 |
| 0.05 | 0.70466 | 0.21088 | 0.15360 | 0.09766 | 0.07938 | 0.06461 | |
| 0.1 | 0.63718 | 0.18239 | 0.12101 | 0.08324 | 0.06347 | 0.05549 | |
| 0.5 | 0.01 | 0.67337 | 0.28921 | 0.25535 | 0.13992 | 0.12975 | 0.09856 |
| 0.05 | 0.59461 | 0.23497 | 0.18244 | 0.11682 | 0.10129 | 0.08860 | |
| 0.1 | 0.46548 | 0.17541 | 0.14248 | 0.10008 | 0.08137 | 0.07701 | |
| 0.7 | 0.01 | 0.52155 | 0.47399 | 0.29351 | 0.23923 | 0.20321 | 0.16250 |
| 0.05 | 0.48958 | 0.37226 | 0.22280 | 0.20540 | 0.16144 | 0.14123 | |
| 0.1 | 0.34720 | 0.28774 | 0.18170 | 0.16442 | 0.13395 | 0.12354 | |
| 0.9 | 0.01 | 0.28599 | 0.28938 | 0.43520 | 0.44060 | 0.34437 | 0.55967 |
| 0.05 | 0.21580 | 0.22532 | 0.27122 | 0.29043 | 0.23648 | 0.31424 | |
| 0.1 | 0.17640 | 0.18835 | 0.21695 | 0.23031 | 0.19435 | 0.22411 | |
Example S-maup.
| Pseudo-v | ||||||
|---|---|---|---|---|---|---|
| 1,000 | 400 | 0.007 | 0.24002 | 0.05234 | 0.0 | |
| 1,000 | 600 | 0.007 | 0.05871 | 0.05234 | 0.034 | |
| 1,000 | 800 | 0.007 | 0.01187 | 0.05234 | 0.616 | |
| 500 | 100 | -0.634 | 0.09237 | 0.05900 | 0.0 | |
| 500 | 280 | -0.634 | 0.05466 | 0.05900 | 0.078 | |
| 500 | 380 | -0.634 | 0.00767 | 0.05900 | 0.852 | |
| 220 | 60 | 0.562 | 0.32197 | 0.18244 | 0.00 | |
| 220 | 90 | 0.562 | 0.18513 | 0.18244 | 0.046 | |
| 220 | 150 | 0.562 | 0.04357 | 0.18244 | 0.443 | |
| 150 | 15 | 0.801 | 0.29201 | 0.22532 | 0.009 | |
| 150 | 50 | 0.801 | 0.08072 | 0.22532 | 0.366 | |
| 150 | 90 | 0.801 | 0.00997 | 0.22532 | 0.883 |
*** p < 0.01,
** p < 0.05,
* p < 0.1.
Estimated power of S-maup.
| Number of areas ( | |||
|---|---|---|---|
| -0.9 | 0.989 | 0.985 | 0.997 |
| -0.7 | 0.986 | 0.996 | 1.000 |
| -0.5 | 0.981 | 0.998 | 1.000 |
| -0.3 | 0.982 | 0.998 | 1.000 |
| 0.0 | 0.997 | 0.999 | 0.999 |
| 0.3 | 0.986 | 0.996 | 1.000 |
| 0.5 | 0.986 | 0.996 | 0.999 |
| 0.7 | 0.783 | 0.985 | 0.995 |
| 0.9 | 0.977 | 0.703 | 0.492 |
Level of significance α = 0.05.
Estimated size of S-maup.
| Number of areas ( | |||
|---|---|---|---|
| -0.9 | 0.163 | 0.087 | 0.065 |
| -0.7 | 0.080 | 0.037 | 0.080 |
| -0.5 | 0.091 | 0.043 | 0.083 |
| -0.3 | 0.073 | 0.097 | 0.136 |
| 0 | 0.102 | 0.066 | 0.026 |
| 0.3 | 0.081 | 0.057 | 0.038 |
| 0.5 | 0.098 | 0.062 | 0.032 |
| 0.7 | 0.043 | 0.032 | 0.045 |
| 0.9 | 0.110 | 0.024 | 0.009 |
Level of significance α = 0.05.
Descriptive statistics.
| Variable | Municipalities | ||||
|---|---|---|---|---|---|
| Obs. | Mean | Desv. Std. | Mín. | Máx. | |
| LNW | 206 | 10.51 | 0.35 | 9.64 | 11.73 |
| YRSCHOOL | 206 | 9.95 | 0.81 | 7.43 | 11.87 |
| EXP | 206 | 21.69 | 1.71 | 15.28 | 26.64 |
| Districts | |||||
| LNW | 52 | 10.56 | 0.25 | 10.15 | 11.20 |
| YRSCHOOL | 52 | 10.06 | 0.61 | 8.28 | 10.99 |
| EXP | 52 | 21.59 | 1.21 | 18.66 | 24.24 |
| Provinces | |||||
| LNW | 9 | 10.57 | 0.19 | 10.31 | 10.86 |
| YRSCHOOL | 9 | 10.00 | 0.44 | 9.35 | 10.59 |
| EXP | 9 | 21.77 | 0.77 | 20.54 | 23.09 |
Fig 9Municipalities: (a), (b) and (c). Districts: (d), (e) and (f). Provinces: (g), (h) and (i).
Mincer model estimate: South Africa.
| LNW | Coef. | Desv. Std | Confidence Interval at 95% | ||
|---|---|---|---|---|---|
| YRSCHOOL | 0.3364 | 0.0259 | 0.000 | 0.2852 | 0.3876 |
| EXP | 0.4008 | 0.1499 | 0.008 | 0.1051 | 0.6965 |
| EXP2 | -0.0085 | 0.0034 | 0.016 | -0.0153 | -0.0016 |
| CONST. | 2.4796 | 1.6243 | 0.128 | -0.7232 | 5.6825 |
| Num. Obs. 206 | |||||
| F(3,202) = 68.84 | |||||
*** p < 0.01,
** p < 0.05,
*p < 0.1.
Estimator of the statistic S -maup: South Africa.
| LNW | YRSCHOOL | EXP | EXP2 | |||||
|---|---|---|---|---|---|---|---|---|
| Ps-v | Ps-v | Ps-v | Ps-v | |||||
| 200 | 0.011 | 0.806 | 0.011 | 0.820 | 0.011 | 0.819 | 0.011 | 0.833 |
| 180 | 0.022 | 0.589 | 0.021 | 0.619 | 0.021 | 0.619 | 0.020 | 0.656 |
| 150 | 0.057 | 0.242 | 0.052 | 0.330 | 0.053 | 0.327 | 0.048 | 0.414 |
| 136 | 0.087 | 0.101 | 0.079 | 0.197 | 0.079 | 0.194 | 0.072 | 0.302 |
| 135 | 0.091 | 0.094 | 0.081 | 0.187 | 0.082 | 0.185 | 0.073 | 0.295 |
| 134 | 0.093 | 0.089 | 0.083 | 0.181 | 0.084 | 0.179 | 0.076 | 0.290 |
| 132 | 0.099 | 0.077 | 0.088 | 0.166 | 0.089 | 0.166 | 0.079 | 0.273 |
| 124 | 0.124 | 0.036 | 0.111 | 0.115 | 0.112 | 0.114 | 0.099 | 0.208 |
| 122 | 0.131 | 0.032 | 0.117 | 0.107 | 0.118 | 0.104 | 0.104 | 0.186 |
| 120 | 0.139 | 0.025 | 0.123 | 0.094 | 0.125 | 0.091 | 0.110 | 0.167 |
| 118 | 0.147 | 0.019 | 0.131 | 0.081 | 0.132 | 0.080 | 0.142 | 0.101 |
| 110 | 0.182 | 0.003 | 0.161 | 0.043 | 0.163 | 0.042 | 0.149 | 0.093 |
| 108 | 0.192 | 0.001 | 0.169 | 0.034 | 0.172 | 0.033 | 0.149 | 0.093 |
| 52 | 0.584 | 0.000 | 0.527 | 0.001 | 0.533 | 0.001 | 0.461 | 0.00 |
| 9 | 0.863 | 0.000 | 0.847 | 0.001 | 0.849 | 0.001 | 0.822 | 0.00 |
*** p < 0.01,
** p < 0.05,
*p < 0.1.
Fig 10Distribution of coefficients, k = 136: (a) YRSCHOOL; (b) EXP; (c) EXP2.
Horizontal black line: coefficient (206 municipalities), dashed lines are the respective confidence intervals 95%.
Fig 11Distribution of coefficients.
Line:k = 136, dotted line:k = 52: (a) YRSCHOOL; (b) EXP; (c) EXP2. horizontal black line: coefficient (206 municipalities). horizontal dotted line: coefficient (52 districts).
| Number of areas. | |
| SAR process with | |
| Index and set of number of regions, such that: | |
| for | |
| for | |
| for | |
| for | |
| for | |
| for | |
| Number of random spatial aggregations. |