| Literature DB >> 27774035 |
Rebecca Allen1, Simon Burgess2, Russell Davidson3, Frank Windmeijer2.
Abstract
The most widely used measure of segregation is the so-called dissimilarity index. It is now well understood that this measure also reflects randomness in the allocation of individuals to units (i.e. it measures deviations from evenness, not deviations from randomness). This leads to potentially large values of the segregation index when unit sizes and/or minority proportions are small, even if there is no underlying systematic segregation. Our response to this is to produce adjustments to the index, based on an underlying statistical model. We specify the assignment problem in a very general way, with differences in conditional assignment probabilities underlying the resulting segregation. From this, we derive a likelihood ratio test for the presence of any systematic segregation, and bias adjustments to the dissimilarity index. We further develop the asymptotic distribution theory for testing hypotheses concerning the magnitude of the segregation index and show that the use of bootstrap methods can improve the size and power properties of test procedures considerably. We illustrate these methods by comparing dissimilarity indices across school districts in England to measure social segregation.Entities:
Keywords: Bootstrap methods; Dissimilarity index; Hypothesis testing; Segregation
Year: 2015 PMID: 27774035 PMCID: PMC5054828 DOI: 10.1111/ectj.12039
Source DB: PubMed Journal: Econom J ISSN: 1368-4221 Impact factor: 4.571
Figure 1Bias , , , equal expected unit sizes.
Bias and rmse of D and bias‐corrected estimators for , and combinations of p and D pop
|
| ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.056 | 0.127 | 0.225 | 0.382 | 0.634 | |||||||
|
| bias | rmse | bias | rmse | bias | rmse | bias | rmse | bias | rmse | bias | rmse |
|
| ||||||||||||
|
| 0.60 | 0.61 | 0.55 | 0.55 | 0.48 | 0.49 | 0.40 | 0.40 | 0.29 | 0.29 | 0.15 | 0.15 |
|
| 0.48 | 0.49 | 0.43 | 0.43 | 0.37 | 0.37 | 0.29 | 0.30 | 0.20 | 0.20 | 0.097 | 0.11 |
|
| 0.40 | 0.41 | 0.35 | 0.35 | 0.29 | 0.29 | 0.21 | 0.22 | 0.13 | 0.14 | 0.058 | 0.086 |
|
| ||||||||||||
|
| 0.41 | 0.42 | 0.36 | 0.36 | 0.30 | 0.30 | 0.23 | 0.24 | 0.15 | 0.16 | 0.071 | 0.084 |
|
| 0.26 | 0.27 | 0.21 | 0.22 | 0.15 | 0.17 | 0.097 | 0.12 | 0.043 | 0.077 | 0.009 | 0.058 |
|
| 0.26 | 0.27 | 0.21 | 0.22 | 0.15 | 0.16 | 0.094 | 0.11 | 0.040 | 0.072 | 0.011 | 0.056 |
|
| ||||||||||||
|
| 0.31 | 0.31 | 0.26 | 0.26 | 0.20 | 0.21 | 0.15 | 0.15 | 0.089 | 0.097 | 0.039 | 0.053 |
|
| 0.19 | 0.20 | 0.14 | 0.15 | 0.090 | 0.11 | 0.046 | 0.067 | 0.011 | 0.051 | −0.002 | 0.044 |
|
| 0.17 | 0.18 | 0.12 | 0.13 | 0.070 | 0.082 | 0.024 | 0.052 | −0.009 | 0.050 | −0.015 | 0.047 |
|
| ||||||||||||
|
| 0.26 | 0.26 | 0.21 | 0.21 | 0.16 | 0.16 | 0.11 | 0.11 | 0.063 | 0.072 | 0.026 | 0.042 |
|
| 0.16 | 0.16 | 0.11 | 0.12 | 0.063 | 0.074 | 0.027 | 0.050 | 0.004 | 0.043 | −0.002 | 0.038 |
|
| 0.15 | 0.15 | 0.095 | 0.10 | 0.048 | 0.060 | 0.009 | 0.041 | −0.013 | 0.045 | −0.012 | 0.040 |
Bias and rmse reported for 5,000 replications. Number of bootstrap repetitions 250.
Bias and rmse of D and bias‐corrected estimators for , and combinations of p and D pop
|
| ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.056 | 0.127 | 0.225 | 0.382 | 0.634 | |||||||
|
| bias | rmse | bias | rmse | bias | rmse | bias | rmse | bias | rmse | bias | rmse |
|
| ||||||||||||
|
| 0.33 | 0.34 | 0.28 | 0.28 | 0.22 | 0.23 | 0.16 | 0.17 | 0.099 | 0.11 | 0.044 | 0.057 |
|
| 0.21 | 0.21 | 0.16 | 0.16 | 0.10 | 0.11 | 0.055 | 0.074 | 0.015 | 0.054 | −0.003 | 0.046 |
|
| 0.18 | 0.18 | 0.13 | 0.13 | 0.072 | 0.084 | 0.024 | 0.054 | −0.009 | 0.053 | −0.015 | 0.049 |
|
| ||||||||||||
|
| 0.24 | 0.24 | 0.19 | 0.19 | 0.14 | 0.14 | 0.093 | 0.098 | 0.052 | 0.061 | 0.022 | 0.036 |
|
| 0.14 | 0.15 | 0.095 | 0.10 | 0.051 | 0.063 | 0.019 | 0.043 | 0.000 | 0.040 | −0.003 | 0.034 |
|
| 0.13 | 0.14 | 0.084 | 0.089 | 0.038 | 0.051 | 0.005 | 0.038 | −0.010 | 0.041 | −0.008 | 0.035 |
|
| ||||||||||||
|
| 0.18 | 0.18 | 0.13 | 0.13 | 0.088 | 0.090 | 0.054 | 0.059 | 0.029 | 0.039 | 0.012 | 0.026 |
|
| 0.11 | 0.11 | 0.060 | 0.065 | 0.024 | 0.038 | 0.005 | 0.031 | −0.001 | 0.031 | −0.001 | 0.025 |
|
| 0.099 | 0.10 | 0.051 | 0.056 | 0.014 | 0.030 | −0.006 | 0.031 | −0.008 | 0.032 | −0.004 | 0.026 |
|
| ||||||||||||
|
| 0.15 | 0.15 | 0.10 | 0.11 | 0.065 | 0.068 | 0.038 | 0.044 | 0.020 | 0.030 | 0.008 | 0.021 |
|
| 0.090 | 0.092 | 0.045 | 0.050 | 0.014 | 0.029 | 0.002 | 0.027 | −0.001 | 0.026 | −0.000 | 0.021 |
|
| 0.083 | 0.086 | 0.038 | 0.043 | 0.005 | 0.024 | −0.007 | 0.027 | −0.006 | 0.027 | −0.003 | 0.021 |
Bias and rmse reported for 5,000 replications. Number of bootstrap repetitions 250.
Bias and rmse of D and bias‐corrected estimators for , and combinations of p and D pop
|
| ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.056 | 0.127 | 0.225 | 0.382 | 0.634 | |||||||
|
| bias | rmse | bias | rmse | bias | rmse | bias | rmse | bias | rmse | bias | rmse |
|
| ||||||||||||
|
| 0.26 | 0.26 | 0.21 | 0.21 | 0.15 | 0.16 | 0.11 | 0.11 | 0.060 | 0.069 | 0.026 | 0.040 |
|
| 0.15 | 0.16 | 0.11 | 0.11 | 0.061 | 0.072 | 0.024 | 0.048 | 0.003 | 0.042 | −0.003 | 0.035 |
|
| 0.15 | 0.15 | 0.098 | 0.10 | 0.052 | 0.063 | 0.016 | 0.042 | −0.003 | 0.041 | −0.005 | 0.035 |
|
| ||||||||||||
|
| 0.19 | 0.19 | 0.14 | 0.14 | 0.093 | 0.096 | 0.058 | 0.063 | 0.031 | 0.041 | 0.013 | 0.027 |
|
| 0.11 | 0.11 | 0.064 | 0.070 | 0.027 | 0.040 | 0.007 | 0.032 | −0.001 | 0.031 | −0.001 | 0.026 |
|
| 0.10 | 0.11 | 0.056 | 0.061 | 0.017 | 0.032 | −0.003 | 0.031 | −0.007 | 0.032 | −0.004 | 0.027 |
|
| ||||||||||||
|
| 0.14 | 0.14 | 0.093 | 0.094 | 0.057 | 0.060 | 0.033 | 0.038 | 0.017 | 0.027 | 0.008 | 0.020 |
|
| 0.082 | 0.085 | 0.039 | 0.044 | 0.011 | 0.026 | 0.001 | 0.024 | −0.001 | 0.023 | 0.000 | 0.020 |
|
| 0.076 | 0.078 | 0.032 | 0.037 | 0.003 | 0.023 | −0.006 | 0.025 | −0.005 | 0.024 | −0.002 | 0.020 |
|
| ||||||||||||
|
| 0.12 | 0.12 | 0.072 | 0.073 | 0.041 | 0.044 | 0.023 | 0.029 | 0.012 | 0.022 | 0.005 | 0.016 |
|
| 0.069 | 0.071 | 0.027 | 0.032 | 0.005 | 0.021 | −0.000 | 0.020 | −0.000 | 0.020 | −0.000 | 0.017 |
|
| 0.064 | 0.066 | 0.021 | 0.027 | −0.002 | 0.020 | −0.006 | 0.022 | −0.003 | 0.020 | −0.001 | 0.017 |
Bias and rmse reported for 5,000 replications. Number of bootstrap repetitions 250.
Rejection frequencies of D randomization and likelihood ratio tests, for , level
|
| |||||||
|---|---|---|---|---|---|---|---|
|
| Test | 0 | 0.056 | 0.127 | 0.225 | 0.382 | 0.634 |
|
| |||||||
| 0.05 |
| 0.096 | 0.104 | 0.131 | 0.237 | 0.619 | 0.998 |
|
| 0.066 | 0.074 | 0.098 | 0.194 | 0.594 | 0.999 | |
| 0.10 |
| 0.056 | 0.069 | 0.112 | 0.307 | 0.878 | 1.000 |
|
| 0.069 | 0.083 | 0.132 | 0.362 | 0.919 | 1.000 | |
| 0.20 |
| 0.067 | 0.086 | 0.192 | 0.618 | 0.999 | 1.000 |
|
| 0.062 | 0.080 | 0.183 | 0.606 | 0.998 | 1.000 | |
| 0.35 |
| 0.065 | 0.090 | 0.269 | 0.827 | 1.000 | 1.000 |
|
| 0.053 | 0.077 | 0.232 | 0.791 | 1.000 | 1.000 | |
|
| |||||||
| 0.05 |
| 0.060 | 0.071 | 0.165 | 0.534 | 0.992 | 1.000 |
|
| 0.051 | 0.067 | 0.160 | 0.546 | 0.995 | 1.000 | |
| 0.10 |
| 0.056 | 0.086 | 0.285 | 0.882 | 1.000 | 1.000 |
|
| 0.054 | 0.080 | 0.275 | 0.877 | 1.000 | 1.000 | |
| 0.20 |
| 0.057 | 0.117 | 0.553 | 0.997 | 1.000 | 1.000 |
|
| 0.050 | 0.108 | 0.537 | 0.997 | 1.000 | 1.000 | |
| 0.35 |
| 0.055 | 0.147 | 0.775 | 1.000 | 1.000 | 1.000 |
|
| 0.050 | 0.138 | 0.777 | 1.000 | 1.000 | 1.000 | |
Rejection frequencies reported for 10,000 replications. Number of bootstrap repetitions 599.
Figure 2P‐value plot, , , , .
Bias and rmse of D and bias‐corrected estimators
|
|
|
| ||||
|---|---|---|---|---|---|---|
|
|
|
| ||||
| bias | rmse | bias | rmse | bias | rmse | |
|
| 0.031 | 0.038 | 0.106 | 0.111 | 0.022 | 0.032 |
|
| −0.000 | 0.027 | 0.020 | 0.051 | −0.001 | 0.026 |
|
| −0.008 | 0.029 | −0.000 | 0.046 | −0.007 | 0.028 |
in all designs. Results from 10,000 Monte Carlo replications.
Figure 3P‐value plot, , , , .
Average lower limit, upper limit and length of 95% confidence intervals
| Test | Lower limit | Upper limit | Length |
|---|---|---|---|
|
| 0.228 | 0.568 | 0.340 |
|
| 0.212 | 0.378 | 0.166 |
|
| 0.209 | 0.374 | 0.167 |
, , , .
Figure 4P‐value plot, , size properties.
Figure 5P‐value plot, , power properties.
Bias and rmse of D beta1 and D ct
|
| ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.056 | 0.127 | 0.225 | 0.382 | 0.634 | |||||||
| bias | rmse | bias | rmse | bias | rmse | bias | rmse | bias | rmse | bias | rmse | |
|
| 0.180 | 0.180 | 0.130 | 0.130 | 0.088 | 0.090 | 0.054 | 0.059 | 0.029 | 0.039 | 0.012 | 0.026 |
|
| 0.099 | 0.100 | 0.051 | 0.056 | 0.014 | 0.030 | −0.006 | 0.031 | −0.008 | 0.032 | −0.004 | 0.026 |
|
| −0.012 | 0.037 | −0.020 | 0.037 | −0.033 | 0.042 | −0.054 | 0.060 | ||||
|
| −0.032 | 0.075 | −0.067 | 0.089 | −0.085 | 0.091 | −0.100 | 0.110 | −0.099 | 0.100 | −0.064 | 0.085 |
, , . No results reported for D beta1 in first two columns owing to the convergence problems of the estimator.
Results for designs with unequal expected unit sizes
|
|
| |||
|---|---|---|---|---|
| bias | rmse | bias | rmse | |
| Design 1 | ||||
|
| 0.022 | 0.032 | 0.083 | 0.091 |
|
| −0.003 | 0.027 | 0.018 | 0.050 |
|
| −0.010 | 0.029 | 0.009 | 0.047 |
| Design 2 | ||||
|
| 0.027 | 0.035 | 0.094 | 0.100 |
|
| −0.002 | 0.027 | 0.016 | 0.049 |
|
| −0.009 | 0.028 | 0.005 | 0.045 |
| Rejection frequencies for tests of | ||||
| Nominal size | 0.10 | 0.05 | 0.10 | 0.05 |
| Design 1 | ||||
|
| 0.192 | 0.116 | 0.596 | 0.457 |
|
| 0.106 | 0.056 | 0.200 | 0.119 |
|
| 0.162 | 0.100 | 0.175 | 0.100 |
|
| 0.135 | 0.078 | 0.127 | 0.071 |
| Design 2 | ||||
|
| 0.246 | 0.154 | 0.726 | 0.596 |
|
| 0.115 | 0.064 | 0.167 | 0.096 |
|
| 0.161 | 0.095 | 0.167 | 0.095 |
|
| 0.134 | 0.075 | 0.134 | 0.076 |
, ; 10,000 Monte Carlo replications, 599 bootstrap repetitions.
Test results for
| Design 1 | Design 2 | |||||||
|---|---|---|---|---|---|---|---|---|
| Size |
|
|
|
|
|
|
|
|
| 0.10 | 0.293 | 0.221 | 0.193 | 0.139 | 0.352 | 0.208 | 0.186 | 0.138 |
| 0.05 | 0.184 | 0.140 | 0.119 | 0.079 | 0.234 | 0.125 | 0.118 | 0.081 |
See Table 6.
Key parameters of primary schools across English LAs
| Number | Number | Average | % | ||
|---|---|---|---|---|---|
| LA name | of pupils | of schools | cohort size | FSM |
|
| North‐East Lincolnshire | 2005 | 46 | 44 | 21 | 0.43 |
| North Lincolnshire | 2011 | 57 | 35 | 13 | 0.36 |
| Blackburn | 2105 | 51 | 41 | 26 | 0.34 |
| Oldham | 2990 | 86 | 35 | 21 | 0.47 |
| Camden | 1394 | 41 | 34 | 42 | 0.23 |
| Greenwich | 2666 | 66 | 40 | 36 | 0.29 |
| Hackney | 2194 | 54 | 41 | 43 | 0.22 |
| Hammersmith and Fulham | 1177 | 39 | 30 | 45 | 0.30 |
| Islington | 1845 | 48 | 38 | 41 | 0.26 |
| Kensington and Chelsea | 881 | 27 | 33 | 36 | 0.32 |
| Lambeth | 2428 | 60 | 40 | 40 | 0.24 |
| Lewisham | 2833 | 70 | 40 | 29 | 0.30 |
| Southwark | 2929 | 72 | 41 | 36 | 0.21 |
| Tower Hamlets | 2703 | 68 | 40 | 61 | 0.20 |
| Wandsworth | 2124 | 60 | 35 | 27 | 0.29 |
| Westminster | 1336 | 39 | 34 | 39 | 0.33 |
Bias‐corrected dissimilarity indices, confidence intervals and test statistics for North‐East and North Lincolnshire
| North‐East Lincolnshire | North Lincolnshire | |
|---|---|---|
|
| 0.433 | 0.364 |
|
| 0.420 | 0.322 |
|
| 0.416 | 0.334 |
| LR‐test, bootstrap | 0 | 0 |
| CI‐ | [0.386–0.481] | [0.306–0.421] |
| CI‐ | [0.380–0.487] | [0.275–0.452] |
| CI‐ | [0.371–0.466] | [0.265–0.371] |
| CI‐ | [0.367–0.465] | [0.278–0.390] |
|
| ||
|
| 0.067 | |
|
| 0.114 | |
|
| 0.000 | |
|
| 0.032 | |
CI are 95% confidence intervals. Number of bootstrap repetitions 999.
Bias‐corrected dissimilarity indices, confidence intervals and test statistics for Blackburn and Oldham
| Blackburn | Oldham | |
|---|---|---|
|
| 0.342 | 0.472 |
|
| 0.306 | 0.446 |
| LR test, bootstrap | 0 | 0 |
| CI‐ | [0.288–0.362] | [0.420–0.485] |
| CI‐ | [0.263–0.348] | [0.410–0.483] |
|
| ||
|
| 0.000 | |
|
| 0.000 | |
CI are 95% confidence intervals. Number of bootstrap repetitions 999.
Bias‐corrected dissimilarity indices and confidence intervals for Inner London
|
|
| LR( | CI– | CI– | |
|---|---|---|---|---|---|
| Tower Hamlets | 0.197 | 0.162 | 0 | [0.126–0.192] | [0.125–0.198] |
| Southwark | 0.206 | 0.165 | 0 | [0.137–0.201] | [0.128–0.202] |
| Hackney | 0.219 | 0.184 | 0 | [0.154–0.225] | [0.142–0.226] |
| Camden | 0.231 | 0.188 | 0 | [0.153–0.236] | [0.140–0.237] |
| Lambeth | 0.240 | 0.209 | 0 | [0.172–0.241] | [0.170–0.248] |
| Islington | 0.257 | 0.231 | 0 | [0.183–0.258] | [0.188–0.273] |
| Wandsworth | 0.290 | 0.243 | 0 | [0.219–0.292] | [0.200–0.286] |
| Greenwich | 0.286 | 0.251 | 0 | [0.226–0.291] | [0.213–0.288] |
| Hammersmith and Fulham | 0.303 | 0.264 | 0 | [0.226–0.323] | [0.208–0.319] |
| Lewisham | 0.304 | 0.274 | 0 | [0.244–0.312] | [0.235–0.312] |
| Kensington and Chelsea | 0.317 | 0.296 | 0 | [0.231–0.347] | [0.230–0.361] |
| Westminster | 0.328 | 0.302 | 0 | [0.257–0.347] | [0.252–0.352] |
CI are 95% confidence intervals. Number of bootstrap repetitions 999.
P‐values for tests of equivalence of D pop for Inner London
| Sou | Hac | Cam | Lam | Isl | Wan | Gre | Ham | Lew | Ken | Wes | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Tower | 0.725 | 0.202 | 0.176 | 0.040 | 0.014 | 0.003 | 0.000 | 0.001 | 0.000 | 0.000 | 0.000 |
| Hamlets | 0.899 | 0.426 | 0.389 | 0.081 | 0.016 | 0.005 | 0.001 | 0.003 | 0.000 | 0.000 | 0.000 |
| Southwark | 0.368 | 0.310 | 0.094 | 0.030 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |
| 0.502 | 0.454 | 0.107 | 0.022 | 0.007 | 0.001 | 0.004 | 0.000 | 0.001 | 0.000 | ||
| Hackney | 0.875 | 0.466 | 0.234 | 0.016 | 0.000 | 0.006 | 0.000 | 0.002 | 0.000 | ||
| 0.897 | 0.388 | 0.123 | 0.054 | 0.019 | 0.024 | 0.002 | 0.005 | 0.000 | |||
| Camden | 0.631 | 0.338 | 0.020 | 0.016 | 0.008 | 0.002 | 0.004 | 0.000 | |||
| 0.511 | 0.195 | 0.098 | 0.045 | 0.044 | 0.007 | 0.009 | 0.001 | ||||
| Lambeth | 0.555 | 0.046 | 0.034 | 0.028 | 0.002 | 0.014 | 0.002 | ||||
| 0.457 | 0.250 | 0.128 | 0.111 | 0.020 | 0.025 | 0.004 | |||||
| Islington | 0.214 | 0.126 | 0.098 | 0.038 | 0.064 | 0.008 | |||||
| 0.690 | 0.489 | 0.351 | 0.142 | 0.101 | 0.033 | ||||||
| Wandsworth | 0.853 | 0.587 | 0.376 | 0.314 | 0.124 | ||||||
| 0.795 | 0.561 | 0.300 | 0.187 | 0.081 | |||||||
| Greenwich | 0.655 | 0.494 | 0.350 | 0.150 | |||||||
| 0.696 | 0.398 | 0.239 | 0.106 | ||||||||
| Hammersmith | 0.911 | 0.653 | 0.396 | ||||||||
| and Fulham | 0.776 | 0.467 | 0.318 | ||||||||
| Lewisham | 0.663 | 0.388 | |||||||||
| 0.571 | 0.382 | ||||||||||
| Kensington | 0.779 | ||||||||||
| and Chelsea | 0.880 |
Top and bottom rows are for T pb and W dc, respectively. Number of bootstrap repetitions 999.