| Literature DB >> 24605061 |
Abstract
We consider two types of spatial symmetry, namely, symmetry in the mixed or shared nearest neighbor (NN) structures. We use Pielou's and Dixon's symmetry tests which are defined using contingency tables based on the NN relationships between the data points. We generalize these tests to multiple classes and demonstrate that both the asymptotic and exact versions of Pielou's first type of symmetry test are extremely conservative in rejecting symmetry in the mixed NN structure and hence should be avoided or only the Monte Carlo randomized version should be used. Under RL, we derive the asymptotic distribution for Dixon's symmetry test and also observe that the usual independence test seems to be appropriate for Pielou's second type of test. Moreover, we apply variants of Fisher's exact test on the shared NN contingency table for Pielou's second test and determine the most appropriate version for our setting. We also consider pairwise and one-versus-rest type tests in post hoc analysis after a significant overall symmetry test. We investigate the asymptotic properties of the tests, prove their consistency under appropriate null hypotheses, and investigate finite sample performance of them by extensive Monte Carlo simulations. The methods are illustrated on a real-life ecological data set.Entities:
Mesh:
Year: 2014 PMID: 24605061 PMCID: PMC3926298 DOI: 10.1155/2014/698296
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
The NNCT for two classes.
| NN class | Total | ||
|---|---|---|---|
| Class 1 | Class 2 | ||
| Base class | |||
| Class 1 |
|
|
|
| Class 2 |
|
|
|
|
| |||
| Total |
|
|
|
The empirical significance levels of the symmetry tests under CSR independence Case 1: n 1 = n 2 = n = 10,20,…, 50 and Case 2: n 1 = 20, n 2 = 20,30,…, 60 with N mc = 10000 at α = .05. and stand for the empirical significance levels for Pielou's first type of symmetry test using χ 2 approximation with and without Yates' continuity correction, respectively; stands for the exact binomial version of Pielou's first type of symmetry test conditional on N 12 + N 21 = n ; stands for the empirical significance level for Pielou's second type of symmetry test; stands for Dixon's symmetry test.
| CSR independence Case 1 | |||||
|---|---|---|---|---|---|
|
|
|
|
|
|
|
| 10 | .0002 | .0011 | .0111 | .0483 | .0466 |
| 20 | .0001 | .0006 | .0080 | .0533 | .0480 |
| 30 | .0000 | .0008 | .0073 | .0487 | .0492 |
| 40 | .0001 | .0006 | .0061 | .0501 | .0514 |
| 50 | .0000 | .0007 | .0044 | .0522 | .0484 |
|
| |||||
| CSR independence Case 2 | |||||
|
|
|
|
|
|
|
|
| |||||
| 20 | .0001 | .0006 | .0093 | .0533 | .0473 |
| 30 | .0002 | .0001 | .0088 | .0527 | .0500 |
| 40 | .0001 | .0008 | .0089 | .0536 | .0539 |
| 50 | .0002 | .0008 | .0092 | .0491 | .0483 |
| 60 | .0002 | .0010 | .0080 | .0508 | .0483 |
The empirical significance levels for Fisher's two-sided exact tests for the Q-symmetry contingency tables under CSR independence Cases 1 and 2 with N mc = 10000, for some combinations of n 1, n 2 at α = .05. is for the empirical significance level for the table-inclusive version of the two-sided test, is for table-exclusive version, is for mid-p value version, and is for Tocher corrected version.
| CSR independence Case 1 | |||||
|---|---|---|---|---|---|
| n |
|
|
|
|
|
| 10 | .0392 | .0424 | .0413 | .0410 | .0319 |
| 20 | .0466 | .0505 | .0480 | .0487 | .0451 |
| 30 | .0459 | .0484 | .0465 | .0466 | .0430 |
| 40 | .0457 | .0481 | .0475 | .0472 | .0434 |
| 50 | .0475 | .0490 | .0478 | .0479 | .0462 |
|
| |||||
| CSR independence Case 2 | |||||
|
|
|
|
|
|
|
|
| |||||
| 20 | .0454 | .0497 | .0462 | .0469 | .0444 |
| 30 | .0484 | .0533 | .0504 | .0512 | .0435 |
| 40 | .0479 | .0505 | .0489 | .0490 | .0460 |
| 50 | .0485 | .0524 | .0504 | .0509 | .0460 |
| 60 | .0474 | .0500 | .0489 | .0487 | .0445 |
The empirical significance levels of the tests under RL Cases 1–3 with N mc = 1000 for each of 100 background realization at α = .05. The empirical size labeling is as in Table 3. stands for the empirical size estimates of the exact tests for Pielou's second type of symmetry (the table exclusive version).
| RL Case 1 | ||||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| 10 | .00011 | .00089 | .00092 | .04368 | .04989 | .04205 |
| 20 | .00018 | .00096 | .00106 | .04797 | .05283 | .04831 |
| 30 | .00017 | .00095 | .00110 | .05037 | .05147 | .04974 |
| 40 | .00016 | .00056 | .00042 | .05156 | .05242 | .04994 |
| 50 | .00028 | .00070 | .00050 | .04981 | .05020 | .04934 |
|
| ||||||
| RL Case 2 | ||||||
|
|
|
|
|
|
|
|
|
| ||||||
| 20 | .00016 | .00089 | .00109 | .04907 | .05298 | .04987 |
| 30 | .00016 | .00090 | .00063 | .05087 | .05143 | .05258 |
| 40 | .00015 | .00069 | .00048 | .04880 | .05087 | .05222 |
| 50 | .00026 | .00099 | .00079 | .04700 | .05010 | .05271 |
| 60 | .00027 | .00097 | .00084 | .04991 | .04985 | .05104 |
|
| ||||||
| RL Case 3 | ||||||
|
|
|
|
|
|
|
|
|
| ||||||
| 2 | .00018 | .00063 | .00065 | .05158 | .05241 | .05006 |
| 4 | .00023 | .08451 | .00083 | .04848 | .05024 | .04988 |
| 6 | .00012 | .00051 | .00048 | .04953 | .05061 | .05057 |
| 8 | .00024 | .00076 | .00074 | .05075 | .05007 | .04963 |
| 10 | .00024 | .00070 | .00083 | .04939 | .05087 | .05063 |
The means (±SD) of the off-diagonal entries, N 12, N 21, and their difference N 12 − N 21 under CSR independence Case 1 and RL Case 1 with n 1 = n 2 = 40 at α = .05.
| Mean ± SD | |||
|---|---|---|---|
|
|
|
| |
| CSR-ind Case 1 | 20.2 ± 3.3 | 20.3 ± 3.4 | − .03 ± 3.60 |
| RL Case 1 | 20.3 ± 3.4 | 20.3 ± 3.4 | − .01 ± 3.57 |
The means (±SD) of the off-diagonal entries, N 12, N 21, and their difference N 12 − N 21 and the rejection rate estimates for Case II patterns in (34) with N mc = 10000, n 1 = n 2 = 40 at α = .05. Column labeling is as in Table 6.
| Mean ± SD | Rejection rate estimates for Case II patterns | ||||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| |
| II-(i) | 28.0 ± 3.9 | 27.3 ± 3.5 | .68 ± 3.80 | .0002 | .0002 | .0030 | .0198 | .3313 | .3225 |
| II-(ii) | 36.4 ± 4.4 | 33.5 ± 3.0 | 2.84 ± 4.07 | .0001 | .0009 | .0084 | .0124 | .8395 | .8326 |
| II-(iii) | 45.3 ± 4.6 | 38.1 ± 1.8 | 7.21 ± 4.43 | .0049 | .0079 | .0524 | .0434 | .9923 | .9913 |
The means (±SD) of the off-diagonal entries, N 12, N 21, and their difference N 12 − N 21 and the rejection rate estimates for Case III patterns with N mc = 10000, n 1 = n 2 = 40 at α = .05. Column labeling is as in Table 6.
| Mean ± SD | Rejection rate estimates for Case III patterns | ||||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| |
| III-(i) | 14.4 ± 3.1 | 14.4 ± 3.1 | .04 ± 3.09 | .0004 | .0009 | .0086 | .0225 | .0385 | .0368 |
| III-(ii) | 10.1 ± 2.7 | 10.1 ± 2.7 | − .01 ± 2.58 | .0000 | .0011 | .0096 | .0073 | .0341 | .0321 |
| III-(iii) | 5.9 ± 2.1 | 5.9 ± 2.1 | .03 ± 1.98 | .0002 | .0013 | .0128 | .0005 | .0336 | .0308 |
The means (±SD) of the off-diagonal entries, N 12, N 21, and their difference N 12 − N 21 and the rejection rate estimates for Case IV patterns with N mc = 10000, n 1 = n 2 = 40 at α = .05. The rejection rate labeling and superscripting for “<” and “>” are as in Table 6.
|
| Mean ± SD | Rejection rate estimates for Case IV patterns | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| ||
| IV-(i) | 1/7 | 10.5 ± 3.2 | 10.5 ± 3.2 | − .05 ± 3.12 | .0012 | .0055 | .0256 | .0361 | .0565 | .0525 |
| 1/8 | 9.3 ± 3.1 | 9.3 ± 3.1 | − .03 ± 2.99 | .0021 | .0052 | .0307 | .0318 | .0572 | .0552 | |
| 1/9 | 8.2 ± 3.0 | 8.2 ± 3.0 | .00 ± 2.86 | .0023 | .0071 | .0351 | .0295 | .0579 | .0562 | |
|
| ||||||||||
| IV-(ii) | 1/7 | 9.0 ± 3.0 | 9.0 ± 3.1 | − .01 ± 2.78 | .0014 | .0039 | .0235 | .0176 | .0526 | .0509 |
| 1/8 | 8.1 ± 3.0 | 8.1 ± 3.0 | − .02 ± 2.70 | .0015 | .0064 | .0312 | .0192 | .0609 | .0583 | |
| 1/9 | 7.2 ± 2.9 | 7.2 ± 2.9 | − .01 ± 2.63 | .0028 | .0075 | .0395 | .0172 | .0616 | .0601 | |
|
| ||||||||||
| IV-(iii) | 1/7 | 6.9 ± 2.9 | 6.9 ± 2.8 | .01 ± 2.42 | .0014 | .0039 | .0286 | .0070 | .0496 | .0470 |
| 1/8 | 6.3 ± 2.8 | 6.3 ± 2.7 | .02 ± 2.37 | .0020 | .0061 | .0345 | .0094 | .0539 | .0518 | |
| 1/9 | 5.6 ± 2.7 | 5.6 ± 2.6 | .01 ± 2.30 | .0027 | .0074 | .0392 | .0070 | .0590 | .0565 | |
The means (±SD) of the off-diagonal entries, N 12, N 21, and their difference N 12 − N 21 and the rejection rate estimates for Case V patterns with N mc = 10000, n 1 = n 2 = 40 at α = .05. Column labeling is as in Table 6.
| Mean ± SD | Rejection rate estimates for Case V patterns | ||||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| |
| V-(i) | 19.5 ± 3.3 | 21.9 ± 3.2 | − 2.44 ± 3.52 | .0004 | .0019 | .0076 | .0982 | .0805 | .0982 |
| V-(ii) | 22.9 ± 3.3 | 24.5 ± 3.2 | − 1.61 ± 3.62 | .0002 | .0007 | .0033 | .0752 | .0699 | .0676 |
| V-(iii) | 25.9 ± 3.1 | 26.5 ± 3.1 | − .60 ± 3.58 | .0001 | .0003 | .0021 | .0565 | .0503 | .0490 |
| V-(iv) | 27.8 ± 2.9 | 27.9 ± 3.0 | − .01 ± 3.52 | .0001 | .0002 | .0012 | .0513 | .0485 | .0454 |
The means (±SD) of the off-diagonal entries, N 12, N 21, and their difference N 12 − N 21 and the rejection rate estimates for Case VI patterns with N mc = 10000, m 1 = 20, m 2 = 10 (hence n 1 = n 2 = 40) at α = .05. Column labeling is as in Table 6.
| Mean ± SD | Rejection rate estimates for Case VI patterns | ||||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| |
| VI-(i) | 20.3 ± .6 | 29.0 ± 2.6 | − 8.73 ± 2.58 | .0019 | .0091 | .0629 | .8883 | .9911 | .9907 |
| VI-(ii) | 21.6 ± 1.4 | 27.7 ± 2.8 | − 6.13 ± 2.89 | .0003 | .0012 | .0073 | .5460 | .7782 | .7730 |
| VI-(iii) | 23.1 ± 2.1 | 26.4 ± 3.0 | − 3.39 ± 3.25 | .0000 | .0000 | .0009 | .1860 | .3029 | .2980 |
Figure 1The scatter plot of the locations of black oaks (circles ∘), maples (triangles ▵), and white oaks (pluses +) in the Lansing Woods, Clinton County, MI, USA.
The NNCT for the Lansing Woods data set containing black oak, maple, and white oak trees.
| NN species | Total | |||
|---|---|---|---|---|
| Black oak | Maple | White oak | ||
| Base species | ||||
| Black oak | 53 | 35 | 47 | 135 |
| Maple | 28 | 366 | 120 | 514 |
| White oak | 50 | 161 | 237 | 448 |
|
| ||||
| Total | 131 | 562 | 404 | 1097 |
The (reduced) Q-symmetry contingency table for the Lansing Woods data. The values in the parentheses are relative frequencies of the cells in each row with respect to the row sums.
| Number of times a point serving as a NN | Total | |||
|---|---|---|---|---|
| 0 | 1 | ≥2 | ||
| Classes | ||||
| Black oak | 37 (.27) | 67 (.50) | 31 (.23) | 135 |
| Maple | 113 (.22) | 259 (.50) | 142 (.28) | 514 |
| White oak | 143 (.32) | 220 (.49) | 85 (.19) | 448 |
|
| ||||
| Total | 293 (.27) | 546 (.50) | 258 (.24) | 1097 |
The test statistics and the p-values for the overall symmetry analysis for the Lansing Woods data. TS stands for the test statistic, p asy for the p-values based on asymptotic critical values (except for the exact tests), and p rand for the p-values based on Monte Carlo randomization. *The p-values for the exact tests are not the asymptotic p-values but computed as described in Section 3.
| Overall test statistics and | |||||
|---|---|---|---|---|---|
|
|
|
|
|
| |
| TS | 13.482 | 5.694 | 6.182 | 16.595 | — |
|
| .004 | .128 | .103 | .002 | .002* |
|
| .006 | .002 | .002 | .004 | .004 |
The test statistics and the p-values for the unrestricted pairwise symmetry analysis for the Lansing Woods data. Row labelings and the asterisks are in Table 15.
| Unrestricted pairwise test statistics and | |||||
|---|---|---|---|---|---|
|
|
|
|
|
| |
| Black oaks versus maples | |||||
| TS | .769 | .246 | .385 | 2.245 | — |
|
| .442 | .620 | .535 | .326 | .331* |
|
| .412 | .311 | .282 | .331 | .337 |
|
| |||||
| Black oaks versus white oaks | |||||
| TS | −.657 | .092 | .163 | 1.520 | — |
|
| .511 | .762 | .686 | .468 | .466* |
|
| .473 | .483 | .483 | .481 | .481 |
|
| |||||
| Maples versus white oaks | |||||
| TS | −3.470 | 5.356 | 5.634 | 16.554 | — |
|
| <.001 | .021 | .018 | <.001 | <.001* |
|
| <.001 | <.001 | <.001 | <.001 | <.001 |
The test statistics and the p-values for the unrestricted pairwise symmetry analysis for the Lansing Woods data. Row labelings and the asterisks are in Table 15.
| Restricted pairwise test statistics and | |||||
|---|---|---|---|---|---|
|
|
|
|
|
| |
| Black oaks versus maples | |||||
| TS | .246 | .010 | .038 | .144 | — |
|
| .806 | .922 | .845 | .930 | .937* |
|
| .819 | .778 | .773 | .932 | .945 |
|
| |||||
| Black oaks versus white oaks | |||||
| TS | −.597 | .115 | .180 | .603 | — |
|
| .551 | .734 | .671 | .740 | .759* |
|
| .598 | .497 | .455 | .764 | .778 |
|
| |||||
| Maples versus white oaks | |||||
| TS | −2.923 | 3.717 | 3.939 | 10.806 | — |
|
| .003 | .054 | .047 | .005 | .005* |
|
| <.001 | <.001 | <.001 | .004 | .004 |
The test statistics and the p-values for the unrestricted pairwise symmetry analysis for the Lansing Woods data. Row labelings and the asterisks are in Table 15.
| One-versus-rest test statistics and | |||||
|---|---|---|---|---|---|
|
|
|
|
|
| |
| Black oak versus rest | |||||
| TS | .118 | .000 | .006 | .049 | — |
|
| .906 | 1.000 | .938 | .976 | .970* |
|
| .889 | 1.000 | .866 | .968 | .968 |
|
| |||||
| Maples versus rest | |||||
| TS | −3.471 | 5.547 | 5.802 | 16.125 | — |
|
| <.001 | .019 | .016 | <.001 | <.001* |
|
| <.001 | <.001 | <.001 | <.001 | <.001 |
|
| |||||
| White oaks versus rest | |||||
| TS | 3.447 | 4.840 | 5.068 | 13.832 | — |
|
| <.001 | .028 | .024 | <.001 | <.001* |
|
| .001 | <.001 | <.001 | .002 | .002 |
(a)
| The number of times a point serving as a NN | Total | ||||||
|---|---|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | 5 | ||
| Classes | |||||||
| Class 1 |
|
|
|
|
|
|
|
| Class 2 |
|
|
|
|
|
|
|
|
| |||||||
| Total |
|
|
|
|
|
|
|
(b)
| The number of times a point serving as a NN | Total | ||||||
|---|---|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | 5 | ||
| Classes | |||||||
| Class 1 |
|
|
|
|
|
|
|
| ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ |
| Class |
|
|
|
|
|
|
|
|
| |||||||
| Total |
|
|
|
|
|
|
|
(a)
| Mean ± SD | Rejection rate estimates for Case I patterns | ||||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| |
| I-(i) | 8.5 ± 2.5 | 6.8 ± 2.7 | 1.66 ± 2.26 | .0018 | .0060 | .0721 | .0092 | .0379 | .0359 |
| I-(ii) | 4.1 ± 1.7 | 2.2 ± 1.7 | 1.94 ± 1.58 | .0118 | .0511 | .3109 | .0017 | .0345 | .0324 |
| I-(iii) | 2.9 ± 1.4 | 1.0 ± 1.2 | 1.87 ± 1.34 | .0338 | .0919 | .5345 | .0002 | .0324 | .0296 |
(b)
| Rejection rate estimates for Case I patterns based on Monte Carlo randomization | |||||
|---|---|---|---|---|---|
|
|
|
|
|
| |
| I-(i) | .641219 | .480581 | .409682 | .546873 | .546491 |
| I-(ii) | .670268 | .402391 | .271784 | .569909 | .569605 |
| I-(iii) | .696817 | .397304 | .196840 | .577398 | .576899 |
(a)
| NN species | Total | ||
|---|---|---|---|
| B.O. | Maple | ||
| Base species | |||
| B.O. | 82 | 53 | 135 |
| Maple | 49 | 465 | 514 |
|
| |||
| Total | 131 | 518 | 649 |
(b)
| NN species | Total | ||
|---|---|---|---|
| B.O. | W.O. | ||
| B.O. | 78 | 67 | 135 |
| W.O. | 72 | 376 | 448 |
|
| |||
| Total | 140 | 443 | 583 |
(c)
| NN species | Total | ||
|---|---|---|---|
| Maple | W.O. | ||
| Maple | 379 | 135 | 514 |
| W.O. | 172 | 276 | 448 |
|
| |||
| Total | 551 | 411 | 962 |
(a)
| Number of times a point serving as a NN | Total | |||
|---|---|---|---|---|
| 0 | 1 | ≥2 | ||
| Classes | ||||
| B.O. | 38 (.28) | 65 (.48) | 32 (.24) | 135 |
| M. | 142 (.28) | 242 (.47) | 130 (.25) | 514 |
|
| ||||
| Total | 180 (.28) | 307 (.47) | 162 (.25) | 649 |
(b)
| Number of times a point serving as a NN | Total | |||
|---|---|---|---|---|
| 0 | 1 | ≥2 | ||
| B.O. | 36 (.27) | 64 (.47) | 35 (.26) | 135 |
| W.O. | 135 (.30) | 203 (.45) | 110 (.25) | 448 |
|
| ||||
| Total | 171 (.29) | 267 (.46) | 145 (.25) | 583 |
(c)
| Number of times a point serving as a NN | Total | |||
|---|---|---|---|---|
| 0 | 1 | ≥2 | ||
| M. | 117 (.23) | 258 (.50) | 139 (.27) | 514 |
| W.O. | 136 (.30) | 224 (.50) | 88 (.20) | 448 |
|
| ||||
| Total | 253 (.26) | 482 (.50) | 227 (.24) | 962 |
(a)
| NN species | Total | ||
|---|---|---|---|
| B.O. | Rest | ||
| Base species | |||
| B.O. | 53 | 82 | 135 |
| Rest | 78 | 884 | 964 |
|
| |||
| Total | 131 | 966 | 1097 |
(b)
| NN species | Total | ||
|---|---|---|---|
| Maple | Rest | ||
| Maple | 352 | 150 | 514 |
| Rest | 196 | 387 | 583 |
|
| |||
| Total | 560 | 537 | 1097 |
(c)
| NN species | Total | ||
|---|---|---|---|
| W.O. | Rest | ||
| W.O. | 236 | 212 | 448 |
| Rest | 167 | 482 | 649 |
|
| |||
| Total | 403 | 694 | 1097 |
(a)
| Number of times a point serving as a NN | Total | |||
|---|---|---|---|---|
| 0 | 1 | ≥2 | ||
| Classes | ||||
| B.O. | 37 (.27) | 67 (.50) | 31 (.23) | 135 |
| R. | 256 (.27) | 479 (.50) | 227 (.24) | 962 |
|
| ||||
| Total | 293 (.27) | 546 (.50) | 258 (.24) | 1097 |
(b)
| Number of times a point serving as a NN | Total | |||
|---|---|---|---|---|
| 0 | 1 | ≥2 | ||
| M. | 112 (.22) | 259 (.50) | 148 (.29) | 514 |
| R. | 181 (.31) | 286 (.49) | 116 (.20) | 583 |
|
| ||||
| Total | 293 (.27) | 545 (.50) | 259 (.24) | 1097 |
(c)
| Number of times a point serving as a NN | Total | |||
|---|---|---|---|---|
| 0 | 1 | ≥2 | ||
| W.O. | 143 (.32) | 219 (.49) | 86 (.19) | 448 |
| R. | 150 (.23) | 327 (.50) | 172 (.27) | 649 |
|
| ||||
| Total | 293 (.27) | 546 (.50) | 258 (.24) | 1097 |