
Testing spatial symmetry using contingency tables based on nearest neighbor relations.

Abstract

We consider two types of spatial symmetry, namely, symmetry in the mixed or shared nearest neighbor (NN) structures. We use Pielou's and Dixon's symmetry tests which are defined using contingency tables based on the NN relationships between the data points. We generalize these tests to multiple classes and demonstrate that both the asymptotic and exact versions of Pielou's first type of symmetry test are extremely conservative in rejecting symmetry in the mixed NN structure and hence should be avoided or only the Monte Carlo randomized version should be used. Under RL, we derive the asymptotic distribution for Dixon's symmetry test and also observe that the usual independence test seems to be appropriate for Pielou's second type of test. Moreover, we apply variants of Fisher's exact test on the shared NN contingency table for Pielou's second test and determine the most appropriate version for our setting. We also consider pairwise and one-versus-rest type tests in post hoc analysis after a significant overall symmetry test. We investigate the asymptotic properties of the tests, prove their consistency under appropriate null hypotheses, and investigate finite sample performance of them by extensive Monte Carlo simulations. The methods are illustrated on a real-life ecological data set.



Year: 2014 PMID: 24605061 PMCID: PMC3926298 DOI: 10.1155/2014/698296

Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X

1. Introduction

The analysis of spatial point patterns in natural populations (in ℝ2 and ℝ3) has been studied extensively. In particular, spatial patterns in epidemiology, population biology, and ecology have important practical consequences. Since the early days of this research, most of the research has been on data from one class, that is, on spatial pattern of one class with respect to the ground (e.g., intensity, clustering, etc). An example of a pattern in a one-class framework is aggregation [1]. It is also of practical importance to investigate the spatial interaction between two or more classes, for example, spatial patterns of one class with respect to other classes [2]. Two frequently studied spatial patterns between multiple classes or species are segregation and association. Segregation occurs when an individual is more likely to be found near conspecifics (i.e., individuals of the same species) [3] and association occurs when an individual from one class is more likely to be found near individuals from the other class. There are many tests available in the literature for the analysis of spatial point patterns in various fields. An extensive survey is provided by Kulldorff [4] who enumerates more than 100 such tests, most of which need adjustment for some sort of inhomogeneity. However, none of the tests surveyed by Kulldorff [4] are designed for testing spatial symmetry. Most of the tests for multiple classes deal with the existence (or lack) of spatial interaction (in the form of spatial association or segregation) between the classes. In the literature, Baczkowski and Mardia [5] proposed methods for testing spatial symmetry based on the sample semivariogram. Their methods are applicable for a Gaussian doubly geometric process on a regular lattice. The latest methods for testing and detecting isotropy, symmetry, and separability in spatiotemporal models are discussed in a recent book by Sherman [6] who investigated these properties in the directional sense. 
For example, isotropy is assessed in the sense of direction-independence of the second-order properties of the spatial point pattern. Spatial symmetry is not only useful in ecological contexts (as in spatial symmetry of plant species in a region of interest), but also in socioeconomic theory to help understand spatial equilibrium configurations [7]. Axial symmetry methods based on the sample periodogram for data collected on a rectangular lattice are also considered and shown to perform well in Scaccia and Martin [8]. The methods discussed in the current paper are the spatial symmetry tests based on NN relationships. There are at least six different groups of NN methods for spatial patterns (see, e.g., [9]). These methods are based on some measure of (dis)similarity between a point and its NN, such as the distance between the point and its NN or the class types of the point and its NN. For example, Pielou [2] constructed nearest neighbor contingency tables (NNCTs) which yield tests of segregation (positive or negative), symmetry, and niche specificity, and a coefficient of segregation in a two-class setting. Additionally, Dixon devised overall, class- and cell-specific tests based on NNCTs for the two-class case in [10] and extended his methodology to multiclass case in [3]. Pielou's and Dixon's symmetry tests are designed to detect the symmetry (or lack of it) in the mixed or shared NN structure and are the only tests for detecting such symmetry structure (to the authors knowledge). Symmetry in mixed NN structure implies equality of the expected values of the number of NN pairs in which the points in the pair are from different classes, while symmetry in shared NN structure implies that the proportion—with respect to the class size—of number of times points from one class serving as NN to other classes is equal for all classes. 
Asymmetry in mixed NN structure would be suggestive of different types or levels of spatial interaction between the two classes of points, while asymmetry in shared NN structure would indicate differences in spatial distribution of points from one class with respect to all the points (from both classes) in the study region compared to that of points from the other class. Pielou has described her symmetry tests for completely mapped data in ℝ2, although her tests are not appropriate for such data [10, 11]. A data set is completely mapped, if the locations of all events in a defined space are observed. We assume that data is sparsely sampled; that is, only a (random) subset of NN pairs is observed for Pielou's first type of symmetry test. Pielou's first type of symmetry and Dixon's symmetry test are based on the NNCTs that are constructed using the NN frequencies. Both tests are defined for the two-class case only. Pielou's second type of symmetry test is based on the frequencies of number of times points in a class serve as NNs yielding a contingency table which we call Q-symmetry contingency table. So, points from each class are categorized into six groups, namely 0,1,…, 5, where a point serving as a NN to no other point is in category “0,” to one other point is in category “1,” and so on. Due to geometric constraints, in ℝ2, a point can not serve as a NN to more than six points. For data from a continuous distribution, a point can serve as a NN to at most five points almost surely. Under spatial symmetry in shared NN structure in a multiclass case, the frequencies of these six categories should have the same distributional form for each class. Pielou's symmetry tests were introduced and illustrated in [2], while Dixon's symmetry test was introduced in passing in [10]. None of tests were extensively studied nor investigated for size and/or power performance. In this paper, we investigate the underlying assumptions for these symmetry tests. 
We derive their asymptotic distributions under appropriate null hypotheses and extend these symmetry tests to the multiclass case. In particular, we demonstrate that Pielou's first type of symmetry test is extremely conservative when used as McNemar's test with its asymptotic critical value and hence should be avoided in practice (or its Monte Carlo randomized version can be used). We also show that various patterns can constitute the null case for Dixon's symmetry test and Pielou's second type of symmetry test, but we derive the asymptotic distributions of these tests under CSR independence and RL patterns only. We also investigate the use of Fisher's exact test on the Q-symmetry contingency table used for Pielou's second type of symmetry test (for shared NN structure). Moreover, the tests discussed in this paper are constructed using NN relations based on the usual Euclidean distance, so we also discuss the generalization of the tests to the case in which NN relations are defined by a dissimilarity measure. Furthermore, we discuss the extension of the methodology to high or infinite dimensional data. In a multiclass setting, the overall symmetry is tested first and, if the overall test is significant, we propose various post hoc tests such as pairwise symmetry tests or one-versus-rest type tests. The local asymptotic power of the tests is also investigated using the local approximation of the power function or Pitman asymptotic efficiency. Finite sample empirical (size and power) performance comparisons are investigated by Monte Carlo simulations. We describe and discuss tests of symmetry in NN structure, their extension to the multiclass case, and the corresponding sampling frameworks for the cell counts (i.e., entries in the contingency tables) in Section 2. 
We discuss the variants of Fisher's exact test for the Q-symmetry contingency table in Section 3 and asymptotic power analysis (i.e., consistency of the tests and their asymptotic efficiency) in Section 4 and provide an extensive empirical performance analysis by Monte Carlo simulations in Section 5. We discuss the use of one-versus-rest and pairwise tests as post hoc tests in Section 6, illustrate the methodology on an ecological data set in Section 7, discuss the extension of the methodology to the case where NN relations are defined with dissimilarity measures in Section 8, and provide some guidelines and discussion in Section 9.

2. Tests of Symmetry in the NN Structure

Two or more classes may exhibit many different forms of spatial asymmetry. Although it is not possible to list all possible asymmetry types or configurations, existence of asymmetry can be detected by an analysis of the NN relationships of the class members.

2.1. Preliminaries

The null case for asymmetry alternatives is that there is symmetry in the allocations of points with respect to each other. In particular, consider symmetry in mixed NN structure for two classes i and j. Then the null case is that the expected number of times class i points serving as NN to class j would be the same as the expected number of times class j points serving as NN to class i. On the other hand, for symmetry in shared NN structure, the vector of relative frequencies (with respect to the class size) of points from each class serving as NN to other points is the same for all classes. In general, the null hypothesis for symmetry in mixed NN structure would be implied by a more general pattern, namely, if there is randomness in the NN structure in such a way that the probability of a NN of a point being from a class is proportional to the relative frequency of that class. This assumption holds, for example, under RL or CSR independence of the points from each class. Under CSR independence, the points from each class are independent realizations of homogeneous Poisson process (HPP) with fixed class sizes. In particular, conditioned on the class sizes, the points are independently uniformly distributed in the region of interest. Under RL, class labels are independently and randomly assigned to a set of given locations, where these locations could be from HPP or some other clustered or regular pattern. The null hypothesis for symmetry in shared NN structure would be implied if there is randomness in the NN structure in such a way that the probability of a point from a class serving as NN to m other points is proportional to the relative frequency of that class. This assumption also holds under RL or CSR independence of the points from each class. Therefore, both CSR independence and RL patterns would imply symmetry in the mixed or shared NN structure. Pielou suggests two types and Dixon suggests one type of symmetry tests in the two-class case. 
Pielou's first type of symmetry test and Dixon's symmetry test are defined for the two-class case only and are based on the corresponding 2 × 2 NNCT. We provide a brief description of NNCTs; for a more detailed discussion see, for example, [12]. Suppose that there are k = 2 classes labeled as {1,2}. NNCTs are constructed using NN frequencies for each class. Let $n_i$ be the number of points from class i for i ∈ {1,2} and $n = n_1 + n_2$. If we record the class of each point and its NN, the NN relationships fall into $k^2 = 4$ categories: (1,1), (1,2); (2,1), (2,2), where in category (i, j), class i is the base class and class j is the class of the NN. Denoting $N_{ij}$ as the observed frequency of category (i, j) for i, j ∈ {1,2}, we obtain the NNCT in Table 1, where $C_j$ is the sum of column j; that is, the number of times class j points serve as NNs for j ∈ {1,2}. Note also that $n_i = \sum_{j=1}^{2} N_{ij}$, $C_j = \sum_{i=1}^{2} N_{ij}$, and $n = \sum_{i,j} N_{ij} = \sum_{i=1}^{2} n_i = \sum_{j=1}^{2} C_j$. Throughout the paper, we adopt the convention that random variables are denoted with upper case letters and fixed quantities with lower case letters. Notice that row sums (i.e., class sizes) are assumed to be fixed, while column sums (i.e., the number of times a class serves as NN) are random in our NNCTs.

Table 1

The NNCT for two classes.

	NN class
Base class	Class 1	Class 2	Total
Class 1	N₁₁	N₁₂	n₁
Class 2	N₂₁	N₂₂	n₂
Total	C₁	C₂	n
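The construction of the NNCT described above can be sketched in a few lines. This is a minimal illustration, not the authors' code: the function names `nn_indices` and `nnct`, the 0-based class labels, and the toy coordinates are all assumptions made for the example, and the NN search is a brute-force Euclidean search suitable only for small data sets.

```python
import math

def nn_indices(points):
    """Index of each point's nearest neighbor (brute force, Euclidean distance)."""
    nn = []
    for i, p in enumerate(points):
        others = (j for j in range(len(points)) if j != i)
        nn.append(min(others, key=lambda j: math.dist(p, points[j])))
    return nn

def nnct(points, labels, k):
    """k x k NNCT: rows are the class of the base point, columns the class of its NN."""
    table = [[0] * k for _ in range(k)]
    for base, neighbor in enumerate(nn_indices(points)):
        table[labels[base]][labels[neighbor]] += 1
    return table

# Hypothetical toy data: five points on a line, classes coded 0 and 1.
points = [(0.0, 0.0), (1.0, 0.0), (3.0, 0.0), (3.5, 0.0), (6.0, 0.0)]
labels = [0, 1, 0, 1, 1]
table = nnct(points, labels, k=2)
row_sums = [sum(row) for row in table]        # class sizes n_i (fixed)
col_sums = [sum(col) for col in zip(*table)]  # C_j (random)
```

By construction the row sums reproduce the class sizes, whereas the column sums depend on the NN structure, which matches the remark that only the row sums are fixed.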

2.2. Pielou's First Type of Symmetry Test

Pielou's first type of symmetry test involves testing equality of expected values of mixed NN frequencies, that is, the equality of expected values of off-diagonal entries in the NNCT. So Pielou's first type of symmetry test is used to detect symmetry in the “mixed NN structure.” In this case, if $N_{12} \approx N_{21}$, the spatial allocation of points from the two classes is symmetric with respect to the (mixed) NN structure; otherwise, the population is asymmetric. When two classes, X and Y, are of equal size, in a symmetric population, points from each class are equally likely to serve as NN to points from the other class, and, in an asymmetric population, points from one class, say class X, tend to serve as NN to points from class Y more often than class Y points serve as NN to points from class X. So the null hypothesis is
$$H_o: E[N_{12}] = E[N_{21}], \tag{1}$$
which may take various forms depending on the assumed underlying frameworks for contingency tables in general and for the NNCTs in particular. The two-sided alternative is usually a more reasonable alternative, although one-sided alternatives are also possible. Pielou [2] tests for significant differences between $N_{12}$ and $N_{21}$ with a $\chi^2$ test (with Yates' correction) with 1 df using
$$\mathcal{X}_I^2 = \frac{\left(|N_{12} - N_{21}| - 1\right)^2}{N_{12} + N_{21}}, \tag{2}$$
which is the same as McNemar's test with continuity correction [13]. This test is appropriate only for sparsely sampled data and large $N_{12} + N_{21}$ with neither $N_{12}$ nor $N_{21}$ being too small compared to the other, and it is applicable only for the two-class case. So we suggest the approach recommended in Remark 1 below. The discussion till the end of this subsection is for (properly) sparsely sampled data. Furthermore, in a population in which two classes are highly segregated or the intensities (number of points per unit area) of the classes are very different, the frequencies $N_{12}$ and $N_{21}$ can be too small, which renders the $\chi^2$ approximation inappropriate for the test in (2). 
In such a case, one can use the exact finite sample distribution of $N_{12}$, which is (conditionally) binomial. Given that $N_{12} + N_{21} = n_t$, the test statistic $N_{12}$ has a $\mathrm{BIN}(n_t, 1/2)$ distribution under $H_o$ for properly sparsely sampled data, where BIN(n, p) stands for the binomial distribution with n independent trials with probability of success p. So, for small $n_t$, the statistic $N_{12}$ can be used with the binomial critical values. For large $n_t$,
$$Z_I = \frac{N_{12} - N_{21}}{\sqrt{N_{12} + N_{21}}} \tag{3}$$
has approximately N(0,1) distribution, so $Z_I$ can be used for the one-sided alternatives. Furthermore, the test statistic $\mathcal{X}_I^2$ in (2) has approximately a $\chi^2_1$ distribution and can only be used for the two-sided alternative. Pielou's first type of symmetry test can be extended to the multiclass case (with k > 2) as
$$\mathcal{X}_I^2 = \sum_{i<j} \frac{(N_{ij} - N_{ji})^2}{N_{ij} + N_{ji}}. \tag{4}$$
Under $H_o$: “$p_{ij} = p_{ji}$ for all i ≠ j”, $\mathcal{X}_I^2$ in (4) is the same as Bowker's test of symmetry, which is an extension of McNemar's test to k × k contingency tables [14]. The test statistic in this case has a $\chi^2_{k(k-1)/2}$ distribution asymptotically.
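As a rough illustration of the three versions of Pielou's first type of test discussed in this subsection (continuity-corrected McNemar, exact binomial, and the Bowker-type multiclass statistic), the following sketch uses only the Python standard library. The function names are hypothetical, the doubling rule for the two-sided binomial p value is one common convention assumed here, and flooring the corrected difference at zero is a small guard that is not part of the textbook formula.

```python
import math

def chi2_sf_1df(x):
    """Upper tail of the chi-square distribution with 1 df (via the normal tail)."""
    return math.erfc(math.sqrt(x / 2.0))

def mcnemar_corrected(n12, n21):
    """Continuity-corrected McNemar statistic and its asymptotic p value."""
    # Guard (assumption): the corrected difference is floored at zero.
    stat = max(abs(n12 - n21) - 1, 0) ** 2 / (n12 + n21)
    return stat, chi2_sf_1df(stat)

def binomial_exact(n12, n21):
    """Two-sided exact p value using N12 | (N12 + N21) ~ BIN(n_t, 1/2)."""
    n_t = n12 + n21
    lower = sum(math.comb(n_t, x) for x in range(min(n12, n21) + 1)) / 2 ** n_t
    return min(1.0, 2.0 * lower)  # doubling the smaller tail (assumed convention)

def bowker(table):
    """Bowker-type statistic over all pairs i < j, with k(k-1)/2 df."""
    k = len(table)
    stat = 0.0
    for i in range(k):
        for j in range(i + 1, k):
            total = table[i][j] + table[j][i]
            if total > 0:  # pairs with no mixed NN counts contribute nothing
                stat += (table[i][j] - table[j][i]) ** 2 / total
    return stat, k * (k - 1) // 2

stat, p_asymptotic = mcnemar_corrected(15, 5)
p_exact = binomial_exact(0, 4)
bowker_stat, bowker_df = bowker([[5, 3, 1], [1, 4, 2], [3, 2, 6]])
```

These functions are meaningful only for properly sparsely sampled data, as the text stresses; for completely mapped data the same statistics would need a randomization reference distribution.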

2.2.1. The Row-Wise Multinomial Framework

In general, a contingency table may result from various frameworks. The first type of framework is the row-wise multinomial framework, where each row in a k × k contingency table is independent of the other rows and is from a multinomial distribution. That is, letting the entries of the contingency table be denoted as $N_{ij}$ (as in the NNCT in Table 1), the entries in row i satisfy $(N_{i1}, N_{i2}, \ldots, N_{ik}) \sim \mathcal{M}(n_i, p_{i1}, p_{i2}, \ldots, p_{ik})$, where $p_{ij}$ is the probability of an experimental unit being from row category i and column category j simultaneously and $\mathcal{M}(n, p_1, p_2, \ldots, p_k)$ stands for the multinomial distribution with n independent trials in which a trial results in category j with probability $p_j$, where $\sum_{j=1}^{k} p_j = 1$. In the 2 × 2 contingency table, the rows have two entries, so the multinomial distribution reduces to a binomial distribution. More specifically, we would have $N_{i1} \sim \mathrm{BIN}(n_i, p_{i1})$ (or $N_{i2} \sim \mathrm{BIN}(n_i, p_{i2})$) for i = 1,2. In a NNCT, k is the number of classes and $p_{ij}$ is the probability of a point from class j serving as a NN to a point from class i for i, j ∈ {1,2,…, k}. However, a NNCT is unlikely to result from a row-wise multinomial framework. In a NNCT, a trial is the categorization of a base-NN pair; that is, a trial is “determining the type of a base-NN pair.” For entry (i, j), a trial results in success if a base-NN pair belongs to category (i, j) (i.e., the base point is from class i and its NN is from class j). For example, in a 2 × 2 contingency table, in general, $(N_{11}, N_{12})$ and $(N_{21}, N_{22})$ are assumed to be independent and so are the individual trials under the row-wise multinomial framework. This assumption is invalid when the NNCT is based on completely mapped data because independence between rows is violated (see also Remark 1). If the NNCT is constructed using a random sample of base-NN pairs, then the usual contingency table assumptions under the row-wise multinomial framework would hold. 
Such a NNCT can be (approximately) obtained only if a (small) subset of all the base-NN pairs obtained from the data in the study region is randomly selected; that is, if the data are obtained by an appropriate sparse sampling. Henceforth, when the data are properly sparsely sampled, we assume that the NNCT satisfies the usual independence assumptions in the row-wise multinomial framework. In this framework, the explicit form of the null hypothesis becomes
$$H_o: n_1 p_{12} = n_2 p_{21}. \tag{5}$$
When the 2 × 2 NNCT is constructed from sparsely sampled data, the rows are assumed to be from the same multinomial distribution, so the entries in row i satisfy $N_{ij} \sim \mathrm{BIN}(n_i, \kappa_j)$ for j = 1,2, where $\kappa_j$ is the probability of a NN point being from class j. Then $H_o$ in (5) states that $E[N_{12}] = E[N_{21}]$, that is, $n_1 p_{12} = n_2 p_{21}$, which holds if and only if $n_1 \kappa_2 = n_2 \kappa_1$. Since $\kappa_1 + \kappa_2 = 1$ in a two-class setting, we have $n_1 (1 - \kappa_1) = n_2 \kappa_1$ if and only if $n_1 = (n_1 + n_2) \kappa_1 = n \kappa_1$. Letting $\nu_i$ be the proportion of points from class i in our sample, so that $n_1 = n \nu_1$, we have $n \nu_1 = n \kappa_1$ if and only if $\nu_1 = \kappa_1$. One-sided or two-sided alternatives are possible for the $H_o$ in (5).

2.2.2. The Overall Multinomial Framework

An alternative framework for a general contingency table is the overall multinomial framework. In this case, the cell counts are assumed to arise from independent multinomial trials. That is, for example, for a k × k contingency table,
$$(N_{11}, N_{12}, \ldots, N_{kk}) \sim \mathcal{M}(n, p_{11}, p_{12}, \ldots, p_{kk}). \tag{6}$$
For a NNCT, if the data are completely mapped, independence between trials is violated again. Under sparse sampling, this framework is able to model a NNCT approximately. That is, if the NNCT is based on a random sample of base-NN pairs, it will satisfy the assumptions in the overall multinomial framework (approximately, because of the inherent correlation between the entries of a multinomially distributed random vector). For example, in a two-class setting with sparsely sampled data, we have
$$(N_{11}, N_{12}, N_{21}, N_{22}) \sim \mathcal{M}(n, \nu_1 \kappa_1, \nu_1 \kappa_2, \nu_2 \kappa_1, \nu_2 \kappa_2), \tag{7}$$
where $\nu_1 + \nu_2 = 1$ and $\kappa_1 + \kappa_2 = 1$. Then the null hypothesis of symmetry becomes
$$H_o: p_{12} = p_{21}, \tag{8}$$
which is equivalent to $H_o: \nu_1 \kappa_2 = \nu_2 \kappa_1$ or, equivalently, $H_o: \nu_1 = \kappa_1$, since $\kappa_2 = 1 - \kappa_1$ and $\nu_2 = 1 - \nu_1$. Row-wise and overall multinomial frameworks are closely related. Conditional on the row sums being fixed at $n_i$, the overall multinomial framework reduces to the row-wise multinomial framework. But a NNCT for completely mapped data does not fit the overall multinomial framework either, due to the inherent spatial dependence and the row sums being fixed. McNemar's test (and hence Bowker's test) is only appropriate in the overall multinomial framework and is extremely conservative for the row-wise multinomial framework. In particular, in a NNCT for completely mapped data, the row sums (i.e., class sizes) are fixed as in the row-wise multinomial framework. Hence, Pielou's first type of symmetry test would also be extremely conservative for the NNCTs.

Remark 1

In Pielou's first type of symmetry test, both of the above multinomial frameworks assume that the trials are independent multinomial trials. However, when a trial is the base-NN relation, the assumption of independence between trials is violated. The dependence mainly originates from the fact that a point is more likely to be the NN of its own NN (i.e., more likely to form a reflexive (base, NN) pair); hence, many reflexive pairs are possible. Thus, Pielou's test is influenced not only by deviations from the null case but also by deviations from independence of the trials. The dependence due to reflexivity cannot be avoided merely by random subsampling but can be circumvented by an appropriate sparse sampling [15]. The assessment of various sparse sampling schemes for these tests is a topic of ongoing research. Furthermore, Pielou's first type of symmetry test requires the NNCT to result from an overall multinomial framework, which does not hold for a NNCT based on completely mapped data either. So Pielou's first type of symmetry test is only appropriate under the overall multinomial framework (with random row sums), which can be satisfied by an appropriate sparse sampling. Our suggestion for Pielou's first type of symmetry test is as follows. If the data are properly sparsely sampled under the overall multinomial framework, then one can employ it. But if the data are completely mapped, to remove the influence of spatial dependence on Pielou's first type of symmetry test, we suggest the usual Monte Carlo randomization: class labels are randomly reassigned to the given points a large number of times, the test statistic is computed for each relabeling, and the p value is based on the rank (scaled by the number of Monte Carlo replications) of the test statistic of the original data in the sample of test statistics obtained from the Monte Carlo randomization procedure.
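The Monte Carlo randomization suggested in the remark can be sketched as follows. This is an illustrative outline under several assumptions: the function names are hypothetical, the statistic is the uncorrected McNemar-type quantity $(N_{12} - N_{21})^2/(N_{12} + N_{21})$, classes are coded 0 and 1, and the p value uses the common "(count + 1)/(replications + 1)" convention, which is one way of scaling the rank.

```python
import math
import random

def nn_indices(points):
    """Index of each point's nearest neighbor (brute force, Euclidean distance)."""
    nn = []
    for i, p in enumerate(points):
        others = (j for j in range(len(points)) if j != i)
        nn.append(min(others, key=lambda j: math.dist(p, points[j])))
    return nn

def mixed_nn_statistic(labels, nn):
    """Uncorrected McNemar-type statistic on the mixed NN counts (two classes)."""
    n12 = sum(1 for b, v in enumerate(nn) if labels[b] == 0 and labels[v] == 1)
    n21 = sum(1 for b, v in enumerate(nn) if labels[b] == 1 and labels[v] == 0)
    total = n12 + n21
    return 0.0 if total == 0 else (n12 - n21) ** 2 / total

def randomization_p_value(points, labels, n_mc=999, seed=0):
    """Monte Carlo p value: relabel the fixed locations n_mc times."""
    rng = random.Random(seed)
    nn = nn_indices(points)      # locations are fixed, so NNs are found once
    observed = mixed_nn_statistic(labels, nn)
    shuffled = list(labels)
    exceed = 0
    for _ in range(n_mc):
        rng.shuffle(shuffled)    # random labeling keeps the class sizes fixed
        if mixed_nn_statistic(shuffled, nn) >= observed:
            exceed += 1
    return (exceed + 1) / (n_mc + 1)

# Hypothetical toy data: 30 uniform points, the first 12 in class 0.
data_rng = random.Random(42)
points = [(data_rng.random(), data_rng.random()) for _ in range(30)]
labels = [0] * 12 + [1] * 18
p_value = randomization_p_value(points, labels, n_mc=199, seed=1)
```

Because only the labels are permuted, the NN structure (and hence the amount of reflexivity and sharing) is held fixed across replications, which is what removes the influence of spatial dependence from the reference distribution.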

2.3. Dixon's Symmetry Test

Dixon [10] also suggested a symmetry test for testing the equality of the frequencies of mixed NNs (or between-class NNs), that is, the equality of the expected values of the off-diagonal entries in the 2 × 2 NNCTs. So the null hypothesis is given by
$$H_o: E[N_{12}] = E[N_{21}], \tag{9}$$
and, under RL or CSR independence, $E[N_{ij}] = n_i n_j/(n - 1)$ for i ≠ j. Notice that the null hypotheses for Dixon's symmetry test and Pielou's first type of symmetry test look identical; however, the corresponding underlying assumptions are different. Pielou's first type of symmetry test is only appropriate when the data are (properly) sparsely sampled under the overall multinomial framework, whereas Dixon's symmetry test is appropriate when the data are completely mapped. That is, Pielou's first type of symmetry test is appropriate when we have a random sample of base-NN pairs under the overall multinomial framework, so that the between-row independence assumptions are satisfied (up to the inherent correlation for multinomial entries) and, hence, the spatial information is ignored. On the other hand, Dixon's test can be used for completely mapped data and takes the spatial dependence into account. Under RL, the test statistic for Dixon's symmetry test is given by
$$Z_D = \frac{N_{12} - N_{21}}{\sqrt{Var[N_{12} - N_{21}]}}, \tag{10}$$
where $E[N_{12} - N_{21}] = E[N_{12}] - E[N_{21}] = 0$ since $E[N_{12}] = E[N_{21}] = n_1 n_2/(n - 1)$, and $Var[N_{12} - N_{21}] = Var[N_{12}] + Var[N_{21}] - 2\,Cov[N_{12}, N_{21}]$ with
$$Var[N_{ij}] = n p_{ij} + Q p_{iij} + (n^2 - 3n - Q + R)\, p_{iijj} - (n p_{ij})^2$$
for (i, j) ∈ {(1,2), (2,1)} and
$$Cov[N_{12}, N_{21}] = R p_{12} + (n - R)(p_{112} + p_{122}) + (n^2 - 3n - Q + R)\, p_{1122} - n^2 p_{12} p_{21}.$$
Here $p_{ij}$, $p_{iij}$, and $p_{iijj}$ are the probabilities that a randomly picked pair, triplet, or quartet of points, respectively, are from the indicated classes and are given by
$$p_{ij} = \frac{n_i n_j}{n(n-1)}, \qquad p_{iij} = \frac{n_i (n_i - 1)\, n_j}{n(n-1)(n-2)}, \qquad p_{iijj} = \frac{n_i (n_i - 1)\, n_j (n_j - 1)}{n(n-1)(n-2)(n-3)}.$$
Furthermore, Q is the number of points with shared NNs, which occurs when two or more points share a NN, and R is twice the number of reflexive pairs. Then $Q = 2(Q_2 + 3Q_3 + 6Q_4 + 10Q_5 + 15Q_6)$, where $Q_j$ is the number of points that serve as a NN to other points j times. For large $n_i$, $Z_D$ asymptotically has N(0,1) distribution. A two-sided alternative and one-sided alternatives are possible with the test statistic $Z_D$. 
We describe this setting in a broader context with k ≥ 2 classes. Let $\nu_i$ be the probability of an arbitrary point being from class i and $\nu_{ij}$ be the probability of a base-NN pair with the base point being from class i and its NN being from class j. Then, under RL, we have $\nu_{ij} = \nu_i \nu_j$, and the expression $\frac{n_i (n_i - 1)}{n(n-1)} I(i = j) + \frac{n_i n_j}{n(n-1)} I(i \neq j)$ can be viewed as an estimator or approximation for $\nu_{ij}$ for large $n_i$, where I(·) stands for the indicator function. Furthermore, for large $n_i$, the null hypothesis of symmetry is equivalent to
$$H_o: \nu_{ij} = \nu_{ji} \quad \text{for all } i \neq j.$$
In Dixon's framework, for large $n_i$, the row marginals satisfy $n_i/n \approx \nu_i$ and the column marginals satisfy $\kappa_j = E[C_j/n] = \sum_i \nu_i \nu_j = \nu_j$. The symmetry in mixed NN frequencies may result from various patterns. In particular, under RL or CSR independence, $H_o$ in (9) would hold. In a RL framework with a fixed allocation of points, the quantities Q and R are also fixed, but, for a CSR allocation of points, Q and R are random with E[Q/n] ≈ 0.63 and E[R/n] ≈ 0.62 for large n (estimated empirically by Monte Carlo simulations for a homogeneous planar Poisson pattern). Hence, $Z_D$ in (10) and its distribution are conditional on Q and R under CSR independence but unconditional under RL. To be more precise, under CSR independence, the expected values for $Z_D$ are as in RL, but the variances and covariances are conditional on Q and R. Given the difficulty in finding the distribution of Q and R under CSR, we use their observed values even if the null hypothesis is implied by CSR independence.
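To make the roles of Q and R concrete, the sketch below computes Dixon's two-class symmetry statistic for completely mapped data. It is a hedged illustration: the function names are hypothetical, classes are coded 0 and 1, and the RL variance and covariance expressions are the standard NNCT moment formulas of Dixon's framework as understood here, so they should be checked against [10] before any serious use.

```python
import math
import random

def nn_indices(points):
    """Index of each point's nearest neighbor (brute force, Euclidean distance)."""
    nn = []
    for i, p in enumerate(points):
        others = (j for j in range(len(points)) if j != i)
        nn.append(min(others, key=lambda j: math.dist(p, points[j])))
    return nn

def dixon_symmetry(points, labels):
    """Two-class Z statistic for symmetry in mixed NN structure under RL."""
    n = len(labels)
    if n < 4:
        raise ValueError("at least four points are needed")
    nn = nn_indices(points)
    n1 = sum(1 for c in labels if c == 0)
    n2 = n - n1
    n12 = sum(1 for b, v in enumerate(nn) if labels[b] == 0 and labels[v] == 1)
    n21 = sum(1 for b, v in enumerate(nn) if labels[b] == 1 and labels[v] == 0)
    served = [0] * n                 # number of times each point serves as a NN
    for v in nn:
        served[v] += 1
    # d(d-1) equals 2, 6, 12, 20, 30 for d = 2..6, i.e. 2(Q2+3Q3+6Q4+10Q5+15Q6)
    q = sum(d * (d - 1) for d in served)
    r = sum(1 for i in range(n) if nn[nn[i]] == i)  # twice the reflexive pairs
    p12 = n1 * n2 / (n * (n - 1))
    p112 = n1 * (n1 - 1) * n2 / (n * (n - 1) * (n - 2))
    p122 = n1 * n2 * (n2 - 1) / (n * (n - 1) * (n - 2))
    p1122 = n1 * (n1 - 1) * n2 * (n2 - 1) / (n * (n - 1) * (n - 2) * (n - 3))
    a = n * n - 3 * n - q + r
    var12 = n * p12 + q * p112 + a * p1122 - (n * p12) ** 2
    var21 = n * p12 + q * p122 + a * p1122 - (n * p12) ** 2
    cov = r * p12 + (n - r) * (p112 + p122) + a * p1122 - (n * p12) ** 2
    var_diff = var12 + var21 - 2 * cov
    if var_diff <= 0:
        raise ValueError("degenerate NN structure: no shared NNs")
    z = (n12 - n21) / math.sqrt(var_diff)
    p_two_sided = math.erfc(abs(z) / math.sqrt(2))
    return {"z": z, "p": p_two_sided, "Q": q, "R": r, "var": var_diff,
            "n1": n1, "n2": n2}

# Hypothetical toy data: 40 uniform points, the first 15 in class 0.
data_rng = random.Random(3)
points = [(data_rng.random(), data_rng.random()) for _ in range(40)]
labels = [0] * 15 + [1] * 25
result = dixon_symmetry(points, labels)
```

A useful sanity check on these formulas is that the variance of the difference collapses algebraically to $Q\, n_1 n_2 / (n(n-1))$, which also follows from writing $N_{12} - N_{21} = n_1 - C_1$ and treating $C_1$ as a sum over a simple random sample of the "times served as NN" counts.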

2.3.1. Extension of Dixon's Symmetry Test to Multiclass Case

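Dixon's symmetry test can be extended to the k > 2 case as follows. Consider the $N_{ij} - N_{ji}$ values for i < j. Combining the $N_{ij} - N_{ji}$ values for i < j, we obtain the vector
$$T_{sym} = (N_{12} - N_{21},\, N_{13} - N_{31},\, \ldots,\, N_{k-1,k} - N_{k,k-1}),$$
which has length k(k − 1)/2. Under RL, $E[N_{ij} - N_{ji}] = E[N_{ij}] - E[N_{ji}] = n_i n_j/(n - 1) - n_j n_i/(n - 1) = 0$ and $Var[N_{ij} - N_{ji}] = Var[N_{ij}] + Var[N_{ji}] - 2\,Cov[N_{ij}, N_{ji}]$, where $Cov[N_{ij}, N_{ji}] = R p_{ij} + (n - R)(p_{iij} + p_{ijj}) + (n^2 - 3n - Q + R)\, p_{iijj} - n^2 p_{ij} p_{ji}$. For large $n_i$, $(N_{ij} - N_{ji})/\sqrt{Var[N_{ij} - N_{ji}]}$ approximately has N(0,1) distribution. To combine the entries of the vector $T_{sym}$ in one overall test statistic for symmetry, we also need the covariance matrix of $T_{sym}$, denoted $\Sigma_{sym}$. The diagonal entries of $\Sigma_{sym}$ are $Var[N_{ij} - N_{ji}]$ in the order of the entries of $T_{sym}$. For the off-diagonal entries, we need the covariance terms $Cov[N_{ij} - N_{ji}, N_{kl} - N_{lk}]$. By construction, we have i < j and k < l, and there are six cases regarding these covariance terms.
Case 1 (i = k and j = l). In this case, the covariance term is just the variance term, $Var[N_{ij} - N_{ji}]$.
Case 2 (i = k and j ≠ l). $Cov[N_{ij} - N_{ji}, N_{il} - N_{li}] = Cov[N_{ij}, N_{il}] - Cov[N_{ij}, N_{li}] - Cov[N_{ji}, N_{il}] + Cov[N_{ji}, N_{li}]$.
Case 3 (i ≠ k and j = l). $Cov[N_{ij} - N_{ji}, N_{kj} - N_{jk}] = Cov[N_{ij}, N_{kj}] - Cov[N_{ij}, N_{jk}] - Cov[N_{ji}, N_{kj}] + Cov[N_{ji}, N_{jk}]$.
Case 4 (i = l and j ≠ k). $Cov[N_{ij} - N_{ji}, N_{ki} - N_{ik}] = Cov[N_{ij}, N_{ki}] - Cov[N_{ij}, N_{ik}] - Cov[N_{ji}, N_{ki}] + Cov[N_{ji}, N_{ik}]$.
Case 5 (i ≠ l and j = k). $Cov[N_{ij} - N_{ji}, N_{jl} - N_{lj}] = Cov[N_{ij}, N_{jl}] - Cov[N_{ij}, N_{lj}] - Cov[N_{ji}, N_{jl}] + Cov[N_{ji}, N_{lj}]$.
Case 6 (i ≠ k, i ≠ l and j ≠ l, j ≠ k). $Cov[N_{ij} - N_{ji}, N_{kl} - N_{lk}] = Cov[N_{ij}, N_{kl}] - Cov[N_{ij}, N_{lk}] - Cov[N_{ji}, N_{kl}] + Cov[N_{ji}, N_{lk}]$.
The covariance term in Case 6 above is zero, since, under RL, the four covariances on the right-hand side are equal: $Cov[N_{ij}, N_{kl}] = Cov[N_{ij}, N_{lk}] = Cov[N_{ji}, N_{kl}] = Cov[N_{ji}, N_{lk}]$. Notice that $\Sigma_{sym}$ is a $k' \times k'$ matrix with $k' = k(k - 1)/2$ and $E[T_{sym}] = (0, 0, \ldots, 0)$. Then the quadratic form $T_{sym}^{\top} \Sigma_{sym}^{-1} T_{sym}$ asymptotically has a $\chi^2$ distribution with $k'$ degrees of freedom, provided that $\Sigma_{sym}$ is nonsingular.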

2.4. Pielou's Second Type of Symmetry Test

In the two-class case, a more elaborate test of symmetry due to Pielou [2] is based on a 2 × 6 contingency table, called the Q-symmetry contingency table, where the class of each observation and the number of times it serves as a NN are recorded. A point can only serve as a NN to 0, 1, 2, 3, 4, or 5 other observations due to geometric constraints in ℝ², provided that the points are from a continuous distribution (as in CSR independence). For a two-class population, the observations are sorted into two sets of frequencies, namely, $Q_{i,0}, Q_{i,1}, \ldots, Q_{i,5}$ for i = 1,2, where $Q_{i,m}$ is the frequency of class i observations serving as a NN to m other points for m ∈ {0,1,2,3,4,5}. So Pielou's second type of symmetry test uses more spatial information than just the categorization of base-NN relations. Notice that $Q_{i,m}$ is also the number of class i points shared as a NN by m other points. The corresponding contingency table for the two-class case is given in Table 2(a), where $Q_m$ is the column sum, that is, the total number of points serving as a NN m times or the number of points shared as a NN by m other points for m ∈ {0,1,2,3,4,5}. Only when the allocation of the points from both populations is symmetric in terms of the frequency of the “serving as a NN” property are the expected proportions of class 1 and class 2 points serving as NNs m times the same for each value of m. Hence, this type of symmetry refers to “symmetry in shared NN structure.” Let $\pi_i = (\pi_{i,0}, \pi_{i,1}, \ldots, \pi_{i,5})$ be the vector of probabilities (or proportions) associated with row i for i = 1,2 in the Q-symmetry contingency table under the row-wise multinomial framework. In a Q-symmetry contingency table, the sum of row i equals $n_i$ (i.e., the size of class i). Hence, the Q-symmetry contingency table may not result from the overall multinomial framework, since row sums in a Q-symmetry contingency table are fixed for completely mapped data. Furthermore, under RL, the column sums $Q_m$ are fixed and hence can be denoted as $Q_m = q_m$, but, under CSR independence, the $Q_m$ are random quantities. 
Thus, the null hypothesis of symmetry in the shared NN structure is given by
$$H_o: \pi_1 = \pi_2.$$
In general, if the independence assumptions in the row-wise multinomial framework hold, we would have $E[Q_{i,m}] = n_i Q_m/n$. Then we may test the equality of proportions by using the usual Pearson's $\chi^2$ test, which has approximately a $\chi^2_5$ distribution for large $n_i$. Under RL, although it would be possible for a point to serve as NN to 6 other points with a positive probability (depending on the fixed allocation of the points), we will only consider up to 5 (and combine the categories 5 and 6 and treat them as one category). If these categories have nonnegligible counts, then the above discussion can easily be extended to the case in which shared NN frequencies have 7 levels, and the corresponding test has a $\chi^2_6$ distribution for large $n_i$. A conservative requirement for the cell frequencies in the contingency table is that no expected cell count is less than 1 and no more than 20% of the cell counts are less than 5 [16]. Otherwise, it is recommended to merge some of the categories. For the Q-symmetry contingency table, in practice, such a merging would usually be necessary for m ≥ 2, whence the dimension of the contingency table becomes 2 × 3 and df becomes 2. Large values of $\mathcal{X}_{II}^2$ indicate deviations from the null case. Hence, if the p value is significant, then the population can be assumed to be asymmetric in the shared NN structure in the sense that the distribution of the rows in the Q-symmetry contingency table would be different for the two classes; that is, there is significant asymmetry in the shared NN structure. Pielou's second type of symmetry test can immediately be extended to the multiclass case. With k > 2, we record the frequency of class i members serving as NN m times in a k × 6 contingency table (merging cells when necessary, which might be needed for m ≥ 2). Then we obtain the contingency table given in Table 2(b). 
In the k-class case, the null hypothesis is The corresponding test statistic 𝒳 II 2 = ∑ 5∑ ((Q − E[Q ])2/E[Q ]) would be approximately distributed as χ 5( 2 (and when columns are merged for m ≥ 2, we obtain a k × 3 Q-symmetry contingency table and the asymptotic distribution is χ 2( 2) for large Q and n provided that the independence assumptions in the row-wise multinomial framework hold. This test seems to arise from the row-wise multinomial framework by construction, with the test statistic, 𝒳 II 2, given in (20). Furthermore, the trials here are “base point-” “number of times the point serving as a NN” or “base point-” “number of times the point is shared as a NN.” Under RL or CSR independence, between row or column independence is violated for the Q-symmetry contingency table. For example, under RL with two classes, Q 1, and Q 2, are highly correlated; in fact, correlation between them is −1 when k = 2, since Q 2, = q − Q 1,. Furthermore, Q and Q are also highly dependent and so are Q and Q . Hence, the suggested asymptotic distribution for 𝒳 II 2 should be appropriate under sparse sampling only. However, our extensive Monte Carlo simulations suggest that the asymptotic approximation with the reduced contingency table using χ 2( 2 distribution seems to hold for completely mapped data as well. Therefore, the test seems to be appropriate for both sparsely sampled or completely mapped data. Yet, finding the exact and asymptotic distribution of 𝒳 II 2 is still open problems.
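The construction of the Q-symmetry contingency table and the Pearson statistic described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names (`nearest_neighbor_index`, `q_symmetry_table`, `pearson_statistic`), the `pool_from` parameter, the brute-force NN search, and the toy coordinates are all assumptions made for the example. Expected counts are taken as (row sum)(column sum)/n, and columns with zero total are skipped when counting df.

```python
import math
from collections import Counter


def nearest_neighbor_index(points):
    """Return, for each point, the index of its nearest neighbor (brute force)."""
    nn = []
    for i, p in enumerate(points):
        others = (j for j in range(len(points)) if j != i)
        nn.append(min(others, key=lambda j: math.dist(p, points[j])))
    return nn


def q_symmetry_table(points, labels, classes, pool_from=2):
    """Rows are classes; columns are m = 0, ..., pool_from (last pools m >= pool_from)."""
    shared = Counter(nearest_neighbor_index(points))  # shared[j] = times j serves as a NN
    table = [[0] * (pool_from + 1) for _ in classes]
    for j, label in enumerate(labels):
        table[classes.index(label)][min(shared[j], pool_from)] += 1
    return table


def pearson_statistic(table):
    """Pearson statistic with E = (row sum)(column sum)/n; empty columns are skipped."""
    rows = [sum(r) for r in table]
    cols = [sum(c) for c in zip(*table)]
    n = sum(rows)
    stat = 0.0
    for i, row in enumerate(table):
        for m, observed in enumerate(row):
            expected = rows[i] * cols[m] / n
            if expected > 0:
                stat += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (sum(c > 0 for c in cols) - 1)
    return stat, df


# Toy data (hypothetical): five points on a line, two of class 1 and three of class 2.
points = [(0.0, 0.0), (1.0, 0.0), (2.5, 0.0), (5.0, 0.0), (5.4, 0.0)]
labels = [1, 1, 2, 2, 2]
table = q_symmetry_table(points, labels, classes=[1, 2])
stat, df = pearson_statistic(table)
```

With so few points the χ 2 approximation is of course not usable; the toy data only show how the reduced 2 × 3 table is filled and how its row sums reproduce the class sizes.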

3. Fisher's Exact Test for the Q-Symmetry Contingency Table

Fisher's exact test is widely used for contingency tables with small sample sizes (see, e.g., [17]). However, it can be used neither for Pielou's first type of symmetry test nor for Dixon's symmetry test for two classes, nor for their extensions to the k > 2 case, since these tests consider only the equality of off-diagonal entries, while Fisher's exact test detects any departure from independence across all cell counts in the contingency table. An alternative exact test for small $N_{12} + N_{21}$ can be obtained by using the usual binomial test for Pielou's first type of symmetry test under the appropriate sampling framework. The use of exact tests on NNCTs for testing segregation/association is discussed in Ceyhan [18]. We can apply Fisher's exact test to the 2 × 6 Q-symmetry contingency table given in Table 2 (or the reduced 2 × 3 contingency table) for Pielou's second type of symmetry test. If calculated manually, Fisher's exact test is feasible only for small contingency tables. Furthermore, the underlying assumption of Fisher's exact test is that the total number of observations and the row and column sums are fixed, so Fisher's exact test is a test conditional on the marginals. For k × l contingency tables, when k = l = 2, Fisher's exact test can be one-sided or two-sided, whereas, when max(k, l) > 2 (hence for the Q-symmetry contingency table), it is two-sided only [17]. There are numerous ways to obtain p values for the two-sided alternatives for exact inference on contingency tables [17]. These variants of Fisher's exact test are described below. The p values based on Fisher's exact tests tend to be more conservative than most approximate (asymptotic) ones [17].

3.1. Variants of Fisher's Exact Test for Two-Sided Alternatives

To find the p values for Fisher's exact test, we find the probabilities of the contingency tables obtained from the distribution with the same row and column marginal sums. For the two-sided alternatives, a recommended method is adding up the probabilities of all contingency tables with the same margins whose probabilities are less than or equal to the probability of the current table. Alternatively, twice the one-sided p value can also be used for a 2 × 2 contingency table [17]. Let the probability of the k × l contingency table, $C$, be denoted as $f(C)$, where max(k, l) > 2, and let the sum of row $i$ be $r_i$, the sum of column $j$ be $c_j$, and entry $(i, j)$ be $N_{ij}$. Then the probability of the contingency table $C$ is [13]

$$f(C) = \frac{\prod_{i=1}^{k} r_i! \, \prod_{j=1}^{l} c_j!}{n! \, \prod_{i=1}^{k} \prod_{j=1}^{l} N_{ij}!}.$$

In particular, for the 2 × 3 reduced Q-symmetry contingency table, we get

$$f(C) = \frac{n_1! \, n_2! \, \prod_{m=0}^{2} Q_m!}{n! \, \prod_{i=1}^{2} \prod_{m=0}^{2} Q_{i,m}!},$$

where $Q_{i,m}$ is the entry in row $i$ and column $m$, $Q_m$ is the sum of column $m$, and the column $m = 2$ pools the frequencies for $m \geq 2$.

Let the probability of the current contingency table be denoted as $p_t$. Summing the probabilities of the tables that are more extreme than the current table in both directions yields the following variants of the exact test. The p value is calculated as $p = \sum_{C \in S} f(C)$ for the appropriate choice of the set of contingency tables, $S$, as follows:

(i) table-inclusive version, $p_{inc}$: take $S = \{C : f(C) \leq p_t\}$;
(ii) twice-table-inclusive version, $p_{t,inc}$: the probability of the observed table is included twice, once for each side;
(iii) table-exclusive version, $p_{exc}$: table-inclusive minus $p_t$;
(iv) mid-p version, $p_{mid}$: table-exclusive plus one-half of $p_t$;
(v) Tocher corrected version, $p_{Toc}$: obtained as described next.

Tocher's correction makes Fisher's exact test less conservative by including the probability of the current table based on a randomized test [19]. When the table-inclusive p value, $p_{inc}$, is larger than the level of the test α, but the table-exclusive p value, $p_{exc}$, is less than α, a random number, $U$, is generated from the uniform distribution on (0,1); if $U \geq (\alpha - p_{exc})/p_t$, then $p_{inc}$ is used as the p value, and otherwise $p_{exc}$ is used.

Observe that $p_{exc} = p_{inc} - p_t$ and $p_{mid} = p_{exc} + p_t/2$. Additionally, $p_{exc} \leq p_{Toc} \leq p_{inc} < p_{t,inc}$ and $p_{exc} < p_{mid} < p_{inc} < p_{t,inc}$.
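The variants above can be illustrated by enumerating every table with the observed margins. The sketch below is restricted to two-row tables (as in the two-class Q-symmetry case) and uses exact rational arithmetic so that ties in table probabilities are handled exactly. The function names, the dictionary keys, and the example table are hypothetical. Two conventions are assumptions rather than statements from the text: the twice-table-inclusive value is computed as the table-inclusive value plus the probability of the observed table, and `tocher` returns the table-inclusive value whenever α lies outside the interval between the table-exclusive and table-inclusive values.

```python
from fractions import Fraction
from math import factorial


def table_probability(table):
    """f(C) = (prod r_i! * prod c_j!) / (n! * prod N_ij!), as an exact fraction."""
    rows = [sum(r) for r in table]
    cols = [sum(c) for c in zip(*table)]
    numerator = 1
    for s in rows + cols:
        numerator *= factorial(s)
    denominator = factorial(sum(rows))
    for row in table:
        for entry in row:
            denominator *= factorial(entry)
    return Fraction(numerator, denominator)


def two_row_tables(row_sums, col_sums):
    """Yield every 2 x l table of nonnegative integers with the given margins."""
    def fill(j, remaining, first_row):
        if j == len(col_sums) - 1:
            if remaining <= col_sums[j]:
                first = first_row + [remaining]
                yield [first, [c - a for c, a in zip(col_sums, first)]]
            return
        for a in range(min(col_sums[j], remaining) + 1):
            yield from fill(j + 1, remaining - a, first_row + [a])
    yield from fill(0, row_sums[0], [])


def fisher_two_sided(table):
    """Table-inclusive, twice-inclusive, exclusive, and mid-p values for a 2 x l table."""
    rows = [sum(r) for r in table]
    cols = [sum(c) for c in zip(*table)]
    p_t = table_probability(table)
    probs = (table_probability(c) for c in two_row_tables(rows, cols))
    p_inc = sum(f for f in probs if f <= p_t)
    p_exc = p_inc - p_t
    return {"p_t": p_t, "p_inc": p_inc, "p_t_inc": p_inc + p_t,
            "p_exc": p_exc, "p_mid": p_exc + p_t / 2}


def tocher(p_inc, p_exc, p_t, alpha, u):
    """Randomized choice between p_exc and p_inc; u is a Uniform(0,1) draw."""
    if p_exc < alpha < p_inc and u < (alpha - p_exc) / p_t:
        return p_exc
    return p_inc


# Hypothetical reduced 2 x 3 Q-symmetry table (columns m = 0, 1, >= 2).
observed = [[3, 1, 0], [1, 2, 3]]
variants = fisher_two_sided(observed)
```

Calling `float(variants["p_exc"])` converts a result to a decimal. Full enumeration is feasible only for small tables, in line with the remark that manual calculation of the exact test is limited to small contingency tables.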

4. Asymptotic Power Analysis

The null hypotheses are different for the symmetry tests and so are the alternative hypotheses. This makes the comparison of the tests inappropriate even for large samples; however, under specific alternatives and assumptions, we can estimate asymptotic efficiency scores, such as those of Pitman asymptotic efficiency. A reasonable test should have more power as the sample size increases. So, we first prove the consistency of the tests in question under appropriate hypotheses.

4.1. Consistency of Tests

The consistency of Pielou's second and first types of symmetry tests is shown below.

Theorem 2

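For properly sparsely sampled data under the row-wise multinomial framework, Pielou's second type of symmetry test for the multiclass case with the k × 6 contingency table, that is, the test rejecting the null hypothesis of equal row probability vectors for all $i \in \{1, 2, \ldots, k\}$ when $\mathcal{X}_{II}^2 > \chi^2_{5(k-1)}(1 - \alpha)$ with $\mathcal{X}_{II}^2 = \sum_{m=0}^{5} \sum_{i=1}^{k} (Q_{i,m} - N_i Q_m/n)^2/(N_i Q_m/n)$, is consistent. Here $Q_{i,m}$ is the number of class $i$ points serving as a NN $m$ times, $Q_m$ is the sum of column $m$, and $N_i$ is the sum of row $i$.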

Proof

In the multiclass case with k ≥ 2, deviations from $H_o$ may have many possible forms. Under any deviation from $H_o$, that is, under $H_a$, for large n, $\mathcal{X}_{II}^2$ is approximately distributed as a χ 2 distribution with noncentrality parameter λ and 5(k − 1) df, which is denoted as $\chi^2_{5(k-1)}(\lambda)$. The noncentrality parameter is a quadratic form defined by some positive definite matrix A of rank 5(k − 1) (see, e.g., [20]); hence, λ > 0 under $H_a$. Then, for large n, the null and alternative hypotheses are equivalent to $H_o: \lambda = 0$ versus $H_a: \lambda > 0$. Then, by standard arguments for the consistency of χ 2-tests, the result follows. The consistency of Pielou's second type of symmetry test for the 2 × 3 (reduced) contingency table can be shown similarly.

Theorem 3

Let the NNCT be constructed by a random sample of base-NN pairs (i.e., the data are obtained by an appropriate sparse sampling) under an overall multinomial framework. Then, Pielou's first type of symmetry test, that is, the test rejecting $H_o: E[N_{ij}] = E[N_{ji}]$ for all i, j with i ≠ j against $H_a: E[N_{ij}] \neq E[N_{ji}]$ for some i, j with i ≠ j when $\mathcal{X}_I^2 > \chi^2_{df}(1 - \alpha)$ with $\mathcal{X}_I^2$ as in (4), is consistent. The corresponding one-sided tests using the Z statistic given in (3) are also consistent.

Proof

In the two-class case, recall that this test is the same as McNemar's test with a continuity correction. Given that $N_{12} + N_{21} = n_t$, the correction matters only for small $n_t$ and its impact vanishes as $n_t \to \infty$. So we prove the consistency for the uncorrected version (i.e., for the test without continuity correction), $\mathcal{X}_I^2 = (N_{12} - N_{21})^2/(N_{12} + N_{21})$. Let $T = N_{12}/n_t - 1/2$. Then, under $H_o$, we have $N_{12} \sim BIN(n_t, 1/2)$, so $Var[T] = 1/(4 n_t)$ and $Z_I = T/\sqrt{Var[T]} = (N_{12} - N_{21})/\sqrt{N_{12} + N_{21}}$. Hence, $Z_I$ is approximately distributed as N(0,1) for large $n_t$ under the null hypothesis and has a normal distribution under the alternative hypothesis. Notice that $Z_I^2 = \mathcal{X}_I^2$ in the uncorrected version. Under $H_o$, $E[T] = 0$ and, under $H_a$, $E[T | H_a] = ɛ > 0$ or $E[T | H_a] = ɛ < 0$. Then, by the standard arguments for the consistency of z-tests, the test using $Z_I$ is consistent. The α-level test based on $\mathcal{X}_I^2$ is equivalent to the α-level two-sided test based on $Z_I$. Hence, the consistency of $\mathcal{X}_I^2$ follows as well. For k > 2, consistency of the pairwise statistic for classes i and j follows as for $Z_I$ with (i, j) = (1,2), and consistency of $\mathcal{X}_I^2$ follows as in the proof of Theorem 2.

Theorem 4

Let the NNCT be constructed from a completely mapped data under RL. Then Dixon's symmetry test, that is, the test rejecting H : E[N ] = E[N ] for all i, j with i ≠ j against H : E[N ] ≠ E[N ] for some i, j with i ≠ j for 𝒳 2 > χ 2(1 − α) with 𝒳 2 as in (18) is consistent. The corresponding one-sided tests using Z given in (10) are also consistent. In the two-class case, let ; then T = Z . Under RL, E[Z ] = 0 since E[N 12] = E [N 21] and Z is approximately distributed as N(0,1) for large n under the null hypotheses. Under H , E[Z | H ] = ɛ > 0 or E[Z | H ] = ɛ < 0 and V a r[N /n] = p (ɛ)/n + Qp (ɛ)/n 2 + (1 − 3/n − Q/n 2)p (ɛ)−(p (ɛ))2 and C o v[N /n, N /n] = Rp (ɛ)/n 2 + (1/n − R/n 2)(p (ɛ) + p (ɛ))+(1 − 3/n − Q/n 2 + R/n 2)p (ɛ) − p (ɛ)p (ɛ). So, under H , V a r[N ] → 0 and C o v[N , N ] → 0 as n → ∞. Hence, the test using Z is consistent. The α-level test based on 𝒳 2 is consistent as in the proof of Theorem 2, since 𝒳 2 is a quadratic based on Z values; that is, 𝒳 2 ~ χ df 2(λ(ɛ)) for some λ(ɛ) > 0.

Remark 5

The consistency result for Pielou's first type of symmetry test is only for sparsely sampled data with contingency table from the overall multinomial framework. Pielou's second type of symmetry test is consistent only for sparsely sampled data with the row-wise multinomial framework. For completely mapped data, these tests do not have the appropriate size. In particular, Monte Carlo simulations suggest that Pielou's first type of symmetry test (with χ 2 approximation or exact binomial version) is extremely conservative. See also Section 5.

4.2. Asymptotic Power Comparison of the Tests

The power of a test in hypothesis testing depends on the statistic being employed, sample size, the level of the test α, and the parameter(s) under H . To be able to compare the tests, we should consider the asymptotics with only n → ∞, where the asymptotic power tends to 1 for consistent tests. Since the power depends on multiple parameters, many asymptotic efficiency methods are introduced to compare asymptotic power performance. See [21] for a brief survey of asymptotic efficiency measures. The tests with small level and high power under alternatives close to null hypothesis have practical importance. Hence, Pitman asymptotic efficiency (PAE) is widely used in practice. PAE analysis provides for an investigation of local power around H , which involves the limit as n → ∞ together with the limit of alternative parameter converging to the null parameter. See, for example [22, 23] for more details. Suppose that the distribution F under consideration can be indexed by Θ⊆ℝ and consider H : θ = θ 0 versus H : θ > θ 0. If the test statistic satisfies central limit theorem together with the Pitman's conditions [23] with μ = E[T ] and σ 2 = V a r[T ], then PAE of T is given by If a test statistic, T , converges in law to χ 2 distribution as n → ∞, then the local power approximation using asymptotic normality of T is not appropriate [24]. By suitable transformations, the corresponding test asymptotically boils down to H : λ = 0 versus H : λ > 0, where λ is the noncentrality parameter for the χ 2 distribution. Therefore, we investigate the local power around λ = 0. Let f (x, λ) and F (x, λ) be the pdf and cdf of χ 2(λ) distribution, respectively. Suppose that C 2 is a test statistic which converges in law to χ 2(λ) with λ = 0 under H and to χ 2(λ) with λ > 0 under H . Then the local power for small λ (λ around 0) is given by The proof is provided in the Appendix.

4.2.1. Asymptotic Local Power Analysis of the Tests

Pielou's first type of symmetry test is used for testing H : E[N 12] = E[N 21] versus H : E[N 12] ≠ E[N 21]. Under H , given that N 12 + N 21 = n , N 12 ~ BIN(n , 1/2), since E[N 12]/(E[N 12] + E[N 21]) = 1/2. Under H , N 12 ~ BIN(n , 1/2 + ɛ 1) for ɛ 1 ∈ (0,1/2). Let T = N 12/n − 1/2. Then T 2 = (N 12 − N 21)2/n which is equal to 𝒳 I 2 (without Yates' correction) in (2). Under H , E[T ] = 0 and V a r[T ] = 1/4n and under H , let E[T | H ] = ɛ 1. Next, let μ = E[T ] and σ 2(T ) = V a r[T ]. Then μ and σ satisfy the Pitman conditions and μ ′(ɛ 1 = 0) = 1 (see [23]). Then by Remark 6, the PAE of T (for the parameterization H : E[N 12] − E[N 21] = ɛ 1) is The asymptotic local power for Dixon's symmetry test for the two-class case can also be investigated with PAE analysis. For Dixon's symmetry test for the two-class case, consider T = (N 12 − N 21)/n. Then Let μ = E[T ], σ 2 = V a r[T ], p = E[Q/n], and p = E[R/n]. That is, p is the probability of a point being a shared NN and p is the probability of a pair being reflexive. Then, under H , E[T | H ] = 0 and where for large n for (i, j)∈{(1,2), (2,1)} and and, under H , let E[T | H ] = ɛ 2. Then, by Remark 6, PAE of Z (for the parameterization H : E[(N 12 − N 21)/2] = ɛ 2) is given by For the asymptotic relative efficiency between Pielou's first type of symmetry test and Dixon's symmetry test to make sense, the null assumptions for these tests should match and so should the alternatives and the parameterizations of the alternatives (under which PAE scores are computed). Otherwise, PAE(Z I) and PAE(Z ) would not be comparable. In particular, since the (appropriate) null and alternatives are different for these tests, we refrain from computing asymptotic relative efficiency for these tests. On the other hand, for Dixon's symmetry test with varying ν 1 and p , notice that PAE(Z ) increases as ν 1 gets closer to 0 or 1 or p gets smaller. 
For example, for fixed p , PAE of Z gets larger as the relative abundances of the classes get more and more different (which implies that ν 1 gets closer to 0 or 1). The smallest PAE(Z ) values are obtained when ν 1 = ν 2 = 1/2 for any p > 0. That is, the power of Dixon's test for spatial symmetry (in mixed NN structure) highly depends on the relative abundances of the classes. The PAE of Z in the multiclass case is similar.

5. Empirical Performance of the Tests

In this section we investigate the finite sample behavior of the tests under various patterns via Monte Carlo simulations.

5.1. Empirical Performance Analysis under RL and CSR Independence

Both CSR independence and RL patterns imply symmetry in the mixed or shared NN structure. That is, under these patterns, asymmetry would occur only at the levels expected by chance. More specifically, we expect that $E[N_{12}] = E[N_{21}] = n_1 n_2/n$ would hold for symmetry in mixed NN structure, and $H_o$ in (19) would hold for symmetry in shared NN structure. Hence, these patterns imply our null hypotheses and can be used to assess the empirical size performance of the tests. In what follows, empirical size estimates are based on the asymptotic critical values (except for the exact tests). In particular, for a test, T, with a $\chi^2_{df}$ distribution asymptotically, empirical sizes are estimated as follows. Let $T_i$ be the value of the test statistic for the sample generated at the ith Monte Carlo replication for $i = 1, 2, \ldots, N_{mc}$. Then the empirical size of T at level α = 0.05, denoted $\hat{\alpha}$, is computed as $\hat{\alpha} = \frac{1}{N_{mc}} \sum_{i=1}^{N_{mc}} \mathbf{I}(T_i > \chi^2_{df}(0.95))$, where $\chi^2_{df}(0.95)$ is the 95th percentile of the $\chi^2_{df}$ distribution and $\mathbf{I}(\cdot)$ is the indicator function. For an exact test, let $p_i$ be the p value for the ith sample generated. Then the empirical size of this test is computed as $\frac{1}{N_{mc}} \sum_{i=1}^{N_{mc}} \mathbf{I}(p_i < 0.05)$. With $N_{mc} = 10000$, an empirical size estimate larger than 0.0536 is deemed liberal, while an estimate smaller than 0.0464 is deemed conservative at the .05 level (based on binomial critical values with n = 10000 trials and probability of success 0.05).
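The size estimate and the liberal/conservative thresholds quoted above can be checked with a short calculation. The sketch below is an approximation under stated assumptions: it uses the normal approximation to the binomial with a 5% tail on each side, whereas the text says only that binomial critical values were used. The names `empirical_size` and `size_band` are hypothetical. The closed-form 95th percentile applies to the χ 2 distribution with 2 df only (the reduced 2 × 3 Q-symmetry table), whose survival function is exp(−x/2).

```python
from math import log, sqrt
from statistics import NormalDist


def empirical_size(values, critical_value):
    """Proportion of Monte Carlo test statistics exceeding the critical value."""
    return sum(t > critical_value for t in values) / len(values)


def size_band(alpha=0.05, n_mc=10000, tail=0.05):
    """Normal-approximation limits outside which an estimate is called conservative or liberal."""
    z = NormalDist().inv_cdf(1 - tail)
    half_width = z * sqrt(alpha * (1 - alpha) / n_mc)
    return alpha - half_width, alpha + half_width


# 95th percentile of the chi-square distribution with 2 df, in closed form.
chi2_df2_95 = -2 * log(0.05)
lower, upper = size_band()
```

Under these assumptions the limits round to 0.0464 and 0.0536, which agrees with the thresholds stated in the text.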

5.1.1. Empirical Size Analysis under CSR Independence

We consider the two-class case, with classes 1 and 2 (also referred to as classes X and Y, resp.) of sizes $n_1$ and $n_2$, respectively. Let $\{X_1, \ldots, X_{n_1}\}$ be the set of class 1 points and let $\{Y_1, \ldots, Y_{n_2}\}$ be the set of class 2 points. Under $H_o$, at each of $N_{mc} = 10000$ replicates, we generate X and Y points independently of each other and iid from 𝒰((0,1)×(0,1)), the uniform distribution on the unit square. We consider two cases for CSR independence.

Case 1. We generate $n_1 = n_2 = n = 10, 20, 30, 40, 50$ points iid from 𝒰((0,1)×(0,1)). In this case, the sample sizes are equal and increasing.

Case 2. To determine the influence of differences in the sample sizes (i.e., differences in relative abundances) on the empirical levels of the tests, we generate the samples from the CSR independence pattern with $n_1 = 20$ and $n_2 = 20, 30, \ldots, 60$.

The empirical significance levels (under CSR independence Cases 1 and 2) for the symmetry tests are presented in Table 3, where α^IP and α^IP′ are the (estimated) empirical significance levels for Pielou's first type of symmetry test using the χ 2 approximation with and without Yates' continuity correction, respectively; α^binP is for the exact binomial version of Pielou's first type of symmetry test conditional on the observed value of $N_{12} + N_{21}$; α^IIP is the empirical significance level for Pielou's second type of symmetry test; α^SD is for Dixon's symmetry test. Notice that Pielou's first type of symmetry tests and the exact binomial test are extremely conservative. Hence, we recommend using the Monte Carlo randomized versions of these tests, or Monte Carlo critical values, rather than the approximate asymptotic critical values. A Monte Carlo critical value is determined as the appropriately ranked value of the test statistic in a certain number of data sets generated under the null hypothesis. The other tests seem to be of the desired level for each sample size considered.
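A size simulation of this kind can be sketched as follows for Pielou's first type of symmetry test under CSR independence. This is an illustration under assumptions, not the code behind Table 3: the NNCT is built by brute force, the statistic is the McNemar-type form $(N_{12} - N_{21})^2/(N_{12} + N_{21})$ (with 1 subtracted from $|N_{12} - N_{21}|$ when the continuity correction is requested), and the function names and the fixed seed are hypothetical.

```python
import math
import random

CHI2_DF1_95 = 3.841458820694124  # 95th percentile of chi-square with 1 df


def nnct(points, labels, classes):
    """Nearest neighbor contingency table: entry (i, j) counts base class i with NN class j."""
    index = {c: k for k, c in enumerate(classes)}
    table = [[0] * len(classes) for _ in classes]
    for i, p in enumerate(points):
        others = (j for j in range(len(points)) if j != i)
        nn = min(others, key=lambda j: math.dist(p, points[j]))
        table[index[labels[i]]][index[labels[nn]]] += 1
    return table


def pielou_first_type(n12, n21, yates=False):
    """McNemar-type statistic on the off-diagonal NNCT entries."""
    total = n12 + n21
    if total == 0:
        return 0.0
    diff = abs(n12 - n21) - (1 if yates else 0)
    return diff ** 2 / total


def empirical_size_csr(n1, n2, n_mc, seed=0):
    """Fraction of CSR independence replicates in which the uncorrected test rejects."""
    rng = random.Random(seed)
    labels = [1] * n1 + [2] * n2
    rejections = 0
    for _ in range(n_mc):
        points = [(rng.random(), rng.random()) for _ in range(n1 + n2)]
        table = nnct(points, labels, [1, 2])
        if pielou_first_type(table[0][1], table[1][0]) > CHI2_DF1_95:
            rejections += 1
    return rejections / n_mc
```

For example, `empirical_size_csr(20, 20, 10000)` mimics one cell of Case 1, at the cost of a quadratic NN search per replicate. The sketch has not been validated against the values reported in Table 3.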

Table 3

The empirical significance levels of the symmetry tests under CSR independence Case 1: $n_1 = n_2 = n = 10, 20, \ldots, 50$ and Case 2: $n_1 = 20$, $n_2 = 20, 30, \ldots, 60$ with $N_{mc} = 10000$ at α = .05. α^IP and α^IP′ stand for the empirical significance levels for Pielou's first type of symmetry test using the χ 2 approximation with and without Yates' continuity correction, respectively; α^binP stands for the exact binomial version of Pielou's first type of symmetry test conditional on the observed value of $N_{12} + N_{21}$; α^IIP stands for the empirical significance level for Pielou's second type of symmetry test; α^SD stands for Dixon's symmetry test.

CSR independence Case 1
n	α^IP	α^IP′	α^binP	α^IIP	α^SD
10	.0002	.0011	.0111	.0483	.0466
20	.0001	.0006	.0080	.0533	.0480
30	.0000	.0008	.0073	.0487	.0492
40	.0001	.0006	.0061	.0501	.0514
50	.0000	.0007	.0044	.0522	.0484

CSR independence Case 2
n ₂	α^IP	α^IP′	α^binP	α^IIP	α^SD

20	.0001	.0006	.0093	.0533	.0473
30	.0002	.0001	.0088	.0527	.0500
40	.0001	.0008	.0089	.0536	.0539
50	.0002	.0008	.0092	.0491	.0483
60	.0002	.0010	.0080	.0508	.0483

The empirical significance levels for the exact tests on the Q-symmetry contingency table under CSR independence Cases 1 and 2 are presented in Table 4, where α^inc is the empirical significance level for the two-sided test with the table-inclusive version, α^exc is for the table-exclusive version, α^mid is for the mid-p value version, α^Toc is for the Tocher corrected version, and α^t,inc is for the twice-table-inclusive version. Notice that only the table-exclusive version is about the desired level, while the others are more conservative. Hence, in what follows, only the table-exclusive version will be employed for exact inference on the Q-symmetry contingency table.

Table 4

The empirical significance levels for Fisher's two-sided exact tests for the Q-symmetry contingency tables under CSR independence Cases 1 and 2 with $N_{mc} = 10000$, for some combinations of $n_1$, $n_2$ at α = .05. α^inc is the empirical significance level for the table-inclusive version of the two-sided test, α^exc is for the table-exclusive version, α^mid is for the mid-p value version, α^Toc is for the Tocher corrected version, and α^t,inc is for the twice-table-inclusive version.

CSR independence Case 1
n	α^inc	α^exc	α^mid	α^Toc	α^t,inc
10	.0392	.0424	.0413	.0410	.0319
20	.0466	.0505	.0480	.0487	.0451
30	.0459	.0484	.0465	.0466	.0430
40	.0457	.0481	.0475	.0472	.0434
50	.0475	.0490	.0478	.0479	.0462

CSR independence Case 2
n ₂	α^inc	α^exc	α^mid	α^Toc	α^t,inc

20	.0454	.0497	.0462	.0469	.0444
30	.0484	.0533	.0504	.0512	.0435
40	.0479	.0505	.0489	.0490	.0460
50	.0485	.0524	.0504	.0509	.0460
60	.0474	.0500	.0489	.0487	.0445

5.1.2. Empirical Size Analysis under RL

For the RL pattern, the locations of the points are given and the marks or class labels are assigned randomly to these points. The pattern generating these locations is referred to as the background pattern henceforth. Let $\mathcal{Z}_n = \{Z_1, Z_2, \ldots, Z_n\}$ be the given set of locations for n points from the background pattern. We consider RL of class labels of 1 and 2 (or X and Y) to these points, which are generated from homogeneous or clustered patterns. We generate 100 different realizations of the background pattern, $\mathcal{Z}_n$, to mitigate the influence of a particular background realization on the size performance of the tests. At each background realization, $n_1$ of the points are labeled as class 1 and the remaining $n_2 = n - n_1$ points are labeled as class 2.

Types of the Background Patterns

Case 1. The background points, $\mathcal{Z}_n$, are generated iid in the unit square (0,1) × (0,1). That is, $Z_i \overset{iid}{\sim} \mathcal{U}((0,1) \times (0,1))$ for $i = 1, 2, \ldots, n$. To determine the effect of increasing equal sample sizes, we consider $n_1 = n_2 = 10, 20, \ldots, 50$. The above RL scheme is repeated 1000 times for each $(n_1, n_2)$ combination at each background realization.

Case 2. The background points, $\mathcal{Z}_n$, are generated as in Case 1 above with $n_1 = 20$ and $n_2 = 20, 30, \ldots, 60$ to determine the effect of differences in the sample sizes, with the number of class 1 points fixed and the number of class 2 points increasing. The above RL scheme is repeated 1000 times for each $(n_1, n_2)$ combination at each background realization.

Case 3. We generate the background points from a Matérn cluster process. More specifically, the Z points are generated from the MatClust(κ, r, μ) process, which is the Matérn cluster process in the unit square [25]. In this process, first "parent" points are generated from a Poisson process with intensity κ and then each parent point is replaced by N new points which are generated iid inside the circle centered at the parent point with radius r. Here N is also random; N ~ Poisson(μ). At each background realization, one realization of $\mathcal{Z}_n$ is generated from MatClust(κ, r, μ). Let n be the number of points in a particular realization. Then $n_1 = \lfloor n/2 \rfloor$ of these points are labeled as class 1, where ⌊x⌋ stands for the floor of x, and the remaining $n_2 = n - n_1$ as class 2. In our simulations, we use κ = 2, 4, …, 10, μ = ⌊100/κ⌋, and r = 0.1. That is, we take (κ, μ) ∈ {(2,50), (4,25), …, (10,10)}, in order to have about 100 Z points, of which about half are class 1 and the other half are class 2 points on the average.

In RL Cases 1 and 2, the points are from HPP in the unit square (with fixed $n_1$ and $n_2$), where Case 1 is for assessing the effect of increasing but equal sample sizes on the tests, while Case 2 is for assessing the effect of increasing differences in relative abundances of the classes (with one class size being fixed, while the other is increasing). On the other hand, in Case 3, we have the background realizations with cluster centers and cluster numbers being random. On the average, with increasing κ, the number of clusters tends to increase, and cluster sizes tend to decrease (so as to have fixed class sizes on the average). Hence, in Case 3, we investigate the influence of an increasing number of clusters with randomly determined centers on the size performance of the tests. The empirical size estimates of the tests under RL Cases 1–3 are presented in Table 5. The empirical size performance of the tests under Cases 1 and 2 is similar to that under CSR independence Cases 1 and 2, respectively. Tests of Pielou's first type of symmetry are extremely conservative, while the other tests are about the desired level. The empirical size estimates of the exact test for Pielou's second type of symmetry (the table-exclusive version) are denoted as α^IIF for notational convenience. Furthermore, α^IIF is close to the nominal level for all sample sizes or κ values. Notice also that the size estimates of the tests are not influenced by the number of clusters, κ, when the class sizes are fixed.
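The Case 3 background pattern and the random labelling step can be sketched as below. Several choices here are assumptions made for the illustration and are not specified in the text: parents are generated only inside the unit square, offspring falling outside the unit square are discarded, Poisson variates are drawn with Knuth's multiplication method, and the function names are hypothetical.

```python
import math
import random


def poisson(lam, rng):
    """Poisson variate by Knuth's multiplication method (adequate for moderate lam)."""
    limit = math.exp(-lam)
    k, prod = 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k


def matern_cluster(kappa, r, mu, rng):
    """Parents: Poisson(kappa) uniform points; each gets Poisson(mu) offspring in a disc of radius r."""
    points = []
    for _ in range(poisson(kappa, rng)):
        cx, cy = rng.random(), rng.random()
        for _ in range(poisson(mu, rng)):
            radius = r * math.sqrt(rng.random())  # sqrt gives a uniform density on the disc
            angle = rng.uniform(0, 2 * math.pi)
            x, y = cx + radius * math.cos(angle), cy + radius * math.sin(angle)
            if 0 <= x <= 1 and 0 <= y <= 1:  # assumed edge handling: drop points outside
                points.append((x, y))
    return points


def random_labelling(n, n1, rng):
    """Assign label 1 to n1 randomly chosen points and label 2 to the remaining n - n1."""
    labels = [1] * n1 + [2] * (n - n1)
    rng.shuffle(labels)
    return labels


rng = random.Random(1)
background = matern_cluster(kappa=4, r=0.1, mu=25, rng=rng)
labels = random_labelling(len(background), len(background) // 2, rng)
```

Repeating `random_labelling` many times on one fixed `background` corresponds to the RL scheme at a single background realization; generating a new `background` corresponds to a new realization.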

Table 5

The empirical significance levels of the tests under RL Cases 1–3 with $N_{mc} = 1000$ for each of 100 background realizations at α = .05. The empirical size labeling is as in Table 3. α^IIF stands for the empirical size estimates of the exact test for Pielou's second type of symmetry (the table-exclusive version).

RL Case 1
n	α^IP	α^IP′	α^binP	α^SD	α^IIP	α^IIF
10	.00011	.00089	.00092	.04368	.04989	.04205
20	.00018	.00096	.00106	.04797	.05283	.04831
30	.00017	.00095	.00110	.05037	.05147	.04974
40	.00016	.00056	.00042	.05156	.05242	.04994
50	.00028	.00070	.00050	.04981	.05020	.04934

RL Case 2
n ₂	α^IP	α^IP′	α^binP	α^SD	α^IIP	α^IIF

20	.00016	.00089	.00109	.04907	.05298	.04987
30	.00016	.00090	.00063	.05087	.05143	.05258
40	.00015	.00069	.00048	.04880	.05087	.05222
50	.00026	.00099	.00079	.04700	.05010	.05271
60	.00027	.00097	.00084	.04991	.04985	.05104

RL Case 3
κ	α^IP	α^IP′	α^binP	α^SD	α^IIP	α^IIF

2	.00018	.00063	.00065	.05158	.05241	.05006
4	.00023	.08451	.00083	.04848	.05024	.04988
6	.00012	.00051	.00048	.04953	.05061	.05057
8	.00024	.00076	.00074	.05075	.05007	.04963
10	.00024	.00070	.00083	.04939	.05087	.05063

Based on the empirical size performance of the tests, we observe that variants of Pielou's first type of symmetry test are extremely conservative and hence are not reliable in practice. On the other hand, Pielou's second type of symmetry test and Dixon's symmetry test are appropriate for balanced or unbalanced sample sizes. When the relative abundances of the classes are close to one (i.e., n /n ≈ 1 for i ≠ j), we call the class sizes to be balanced, but when the relative abundances deviate substantially from one we call the class sizes to be unbalanced. For the exact tests on Q-symmetry contingency table, we recommend the table-exclusive version.

5.2. Empirical Performance of the Tests under Various Other Patterns

To assess the empirical performance of the tests, we consider six pattern cases for the NN structure. Empirical rejection rate estimates are computed as the size estimates in Section 5.1.

Case I. For the first class of patterns, we generate $X_i \overset{iid}{\sim} \mathcal{U}((0,1) \times (0,1))$ for $i = 1, \ldots, n_1$ and $Y_j \overset{iid}{\sim} BVN(1/2, 1/2, \sigma_1, \sigma_2, \rho)$ for $j = 1, \ldots, n_2$, where $BVN(\mu_1, \mu_2, \sigma_1, \sigma_2, \rho)$ is the bivariate normal distribution with mean $(\mu_1, \mu_2)$ and a covariance matrix parameterized by $\sigma_1$, $\sigma_2$, and ρ. In our simulations, we set $\sigma_1 = \sigma_2 = \sigma$ and ρ = 0. We consider three patterns, (i), (ii), and (iii), corresponding to three values of σ. The classes 1 and 2 (i.e., X and Y) have different distributions with different local intensities. In particular, X points constitute a realization of a HPP process in the unit square, while Y points are clustered around the center of the unit square, namely (1/2,1/2). In fact, the level of clustering of Y points increases as σ decreases. The means (±SD (standard deviations)) of the off-diagonal entries, $N_{12}$, $N_{21}$, and their difference $N_{12} - N_{21}$ and the empirical rejection rate estimates under the patterns (i), (ii), and (iii) with $n_1 = n_2 = 40$ are presented in Table 6(a), where β^IP and β^IP′ stand for the empirical rejection rates for Pielou's first type of symmetry test using the χ 2 approximation with and without Yates' continuity correction, respectively; β^binP is for the exact binomial version of Pielou's first type of symmetry test conditional on the observed value of $N_{12} + N_{21}$; β^SD is for Dixon's symmetry test; β^IIP is for Pielou's second type of symmetry test; β^IIF is for the exact test on the Q-symmetry contingency table. Notice that, under Case I patterns, the off-diagonal entries, $N_{12}$, $N_{21}$, in the NNCTs tend to be much smaller than expected under $H_o: E[N_{12}] = E[N_{21}] = n_1 n_2/n = 20$, and $N_{12}$ values tend to be larger than $N_{21}$ values, which suggests asymmetry in the mixed NN structure. Furthermore, $N_{12}$, $N_{21}$ tend to decrease with decreasing σ. That is, when the level of clustering of Y points in the center of the unit square increases (i.e., the level of segregation of Y points from X points increases), the off-diagonal entries tend to decrease (in a similar fashion).

The exact binomial version of Pielou's first type of symmetry test has the highest rejection rates, which increase as σ decreases. The rejection rate estimates for all other symmetry tests are significantly smaller than the nominal level of .05, indicating lack of asymmetry in the mixed and shared NN structure. However, the fact that the off-diagonal entries are small seems to render the asymptotic approximations inappropriate. Although the difference of the off-diagonal entries is larger than zero, the standard deviations of the differences are much smaller compared to those under CSR independence or RL (see also Table 7). Moreover, the exact binomial test is not appropriate either, due to the dependence between trials (hence dependence between rows of the NNCT) for spatial data. Thus, in this situation, we recommend performing Monte Carlo randomization to determine more reliable rejection rate estimates. To that end, for each of the 100 generated samples under each of the Case I patterns, 1000 Monte Carlo resamplings are performed, and the rejection rate for a test is estimated based on how many of the test statistics from the resamplings are at or above the original test statistic. The corresponding Monte Carlo randomization rejection rate estimates are presented in Table 6(b), where the binomial version of Pielou's first type of symmetry test is omitted since it is conditional on $N_{12} + N_{21}$, which is not fixed under the Monte Carlo randomization steps. The rejection rate estimates are high for all tests and much higher than the nominal rate of 0.05. Hence, Case I patterns actually exhibit significant asymmetry in the mixed and shared NN structure, which was not revealed by the asymptotic approximation of the tests. That is, this pattern is an alternative pattern for both symmetry structures, and the rejection rates are in fact power estimates under this alternative pattern. The highest power estimates are observed for the Monte Carlo randomized version of Pielou's first type of test (and the lowest estimates are for Dixon's symmetry test). Furthermore, the power estimates for the two versions of Pielou's second type of symmetry test are very similar. The power estimates for the Monte Carlo randomized version of Pielou's first type of symmetry test and the two versions of Pielou's second type of symmetry test increase, and those for the other tests decrease, as σ decreases.
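A Monte Carlo randomization p value of the kind recommended above can be sketched as follows. The text does not spell out the resampling scheme, so this sketch assumes random relabelling of the fixed point locations, and it adds 1 to both the count and the number of resamples (a common convention that is an assumption here). The function names and the example statistic $|N_{12} - N_{21}|$ are hypothetical; `nn` is the list of NN indices of the points, which stays fixed when only the labels are permuted.

```python
import random


def mixed_nn_asymmetry(nn, labels):
    """|N12 - N21| computed from fixed nearest neighbor indices and class labels 1 and 2."""
    n12 = sum(1 for i, j in enumerate(nn) if labels[i] == 1 and labels[j] == 2)
    n21 = sum(1 for i, j in enumerate(nn) if labels[i] == 2 and labels[j] == 1)
    return abs(n12 - n21)


def randomization_p_value(labels, statistic, n_resamples=1000, seed=0):
    """Share of relabelled statistics at or above the observed one (observed sample included)."""
    rng = random.Random(seed)
    observed = statistic(labels)
    shuffled = list(labels)
    at_or_above = 0
    for _ in range(n_resamples):
        rng.shuffle(shuffled)
        if statistic(shuffled) >= observed:
            at_or_above += 1
    return (at_or_above + 1) / (n_resamples + 1)


# Toy example: NN indices of five points and their class labels.
nn = [1, 0, 1, 4, 3]
labels = [1, 1, 2, 2, 2]
p_value = randomization_p_value(labels, lambda lab: mixed_nn_asymmetry(nn, lab))
```

A rejection rate is then the proportion of generated samples whose randomization p value falls below the nominal level.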

Table 7

The means (±SD) of the off-diagonal entries, N 12, N 21, and their difference N 12 − N 21 under CSR independence Case 1 and RL Case 1 with n 1 = n 2 = 40 at α = .05.

	Mean ± SD
	N ₁₂	N ₂₁	N ₁₂ − N ₂₁
CSR-ind Case 1	20.2 ± 3.3	20.3 ± 3.4	− .03 ± 3.60
RL Case 1	20.3 ± 3.4	20.3 ± 3.4	− .01 ± 3.57

Case II. For Case II, we consider the following three patterns. First, we generate $X_i \overset{iid}{\sim} \mathcal{U}((0,1) \times (0,1))$ for $i = 1, 2, \ldots, n_1$ and, for each $j = 1, 2, \ldots, n_2$, we generate $Y_j$ around a randomly picked $X_i$ with probability p in such a way that $Y_j = X_i + R_j (\cos T_j, \sin T_j)'$, where $v'$ represents the transpose of the vector v, $R_j \sim \mathcal{U}(0, \min_{i \neq l} d(X_i, X_l))$ and $T_j \sim \mathcal{U}(0, 2\pi)$, or generate $Y_j$ uniformly in the unit square with probability 1 − p. In the pattern generated, the $Y_j$ are more associated with the $X_i$. The three values of p considered constitute the patterns II-(i), II-(ii), and II-(iii). In this case, X points constitute a realization of a HPP process in the unit square, while Y points are clustered around the X points, and the level of clustering increases as the parameter p increases. The means (±SD) of the off-diagonal entries, $N_{12}$, $N_{21}$, and their difference $N_{12} - N_{21}$ and the empirical rejection rate estimates for Case II patterns with $n_1 = n_2 = 40$ are presented in Table 8. Notice that $N_{12}$ and $N_{21}$ in the NNCTs tend to be similar and larger than expected, and $N_{12}$ values tend to be slightly larger than $N_{21}$ values. Furthermore, $N_{12}$, $N_{21}$ tend to increase with increasing p. That is, when the level of clustering of Y points around X points increases (i.e., the level of association of Y points with X points increases), the off-diagonal entries tend to increase (in a similar fashion), indicating symmetry in the NN structure (but the difference between $N_{12}$ and $N_{21}$ values tends to increase with increasing p). Variants of Pielou's first type of symmetry test have virtually zero rejection rates, and, although Dixon's symmetry test has higher rejection rates than Pielou's first type, it has rates smaller than 0.05; hence there is symmetry in the mixed NN structure. In fact, under this pattern, the expected value of the difference, $N_{12} - N_{21}$, is mostly positive, with a larger variance compared to those under CSR independence and RL. However, there is severe asymmetry in the shared NN structure, since Pielou's second type of symmetry test and its exact version have rejection rate estimates much larger than 0.05, and these estimates increase as p increases. Hence, this pattern type can serve as an alternative to symmetry in the shared NN structure and perhaps as a null pattern for the tests of symmetry in the mixed NN structure for the range of p considered. However, using the asymptotic critical values based on the distribution under RL, the tests of symmetry in mixed NN structure would be extremely conservative for this null case. If the correct form of the variance and covariance terms can be determined as a function of p, then the tests for symmetry in mixed NN structure would have the desired level. Otherwise, Dixon's symmetry test and Pielou's first type of symmetry test can be used with Monte Carlo randomization.

Table 8

The means (±SD) of the off-diagonal entries, N 12, N 21, and their difference N 12 − N 21 and the rejection rate estimates for Case II patterns in (34) with N mc = 10000, n 1 = n 2 = 40 at α = .05. Column labeling is as in Table 6.

	Mean ± SD			Rejection rate estimates for Case II patterns
	N ₁₂	N ₂₁	N ₁₂ − N ₂₁	β^IP	β^IP′	β^binP	β^SD	β^IIP	β^IIF
II-(i)	28.0 ± 3.9	27.3 ± 3.5	.68 ± 3.80	.0002	.0002	.0030	.0198	.3313	.3225
II-(ii)	36.4 ± 4.4	33.5 ± 3.0	2.84 ± 4.07	.0001	.0009	.0084	.0124	.8395	.8326
II-(iii)	45.3 ± 4.6	38.1 ± 1.8	7.21 ± 4.43	.0049	.0079	.0524	.0434	.9923	.9913
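
The Case II construction can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `generate_case2` is hypothetical, the X points are taken to be uniform in the unit square, and the upper bound for R is assumed to be the smallest distance between two distinct X points. Displaced Y points are not clipped, so a few of them may fall slightly outside the unit square.

```python
import math
import random

def generate_case2(n1, n2, p, rng):
    """Hypothetical sketch of a Case II pattern.

    X: n1 points uniform in the unit square.
    Y: with probability p, a point displaced from a randomly picked X;
       otherwise a point uniform in the unit square.
    """
    xs = [(rng.random(), rng.random()) for _ in range(n1)]
    # Assumption: R is bounded by the smallest distance between two X points.
    d_min = min(math.dist(xs[i], xs[k])
                for i in range(n1) for k in range(i + 1, n1))
    ys = []
    for _ in range(n2):
        if rng.random() < p:
            cx, cy = rng.choice(xs)                   # randomly picked X point
            radius = rng.uniform(0.0, d_min)          # R ~ U(0, d_min)
            angle = rng.uniform(0.0, 2.0 * math.pi)   # T ~ U(0, 2*pi)
            ys.append((cx + radius * math.cos(angle),
                       cy + radius * math.sin(angle)))
        else:
            ys.append((rng.random(), rng.random()))
    return xs, ys, d_min
```

For instance, `generate_case2(40, 40, 0.5, random.Random(1))` returns one realization with n_1 = n_2 = 40; the value 0.5 is arbitrary and is not claimed to be one of the values of p used in the simulations.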

Case III. For the third class of patterns, we generate X_i for i = 1,…, n_1 and Y_j for j = 1,…, n_2 according to a segregation parameter s. The three values of s constitute the patterns III-(i) to III-(iii). Notice that these are the segregation patterns considered for Monte Carlo analysis in Ceyhan [12]. The means (±SD) of the off-diagonal entries, N_12 and N_21, and of their difference N_12 − N_21, together with the empirical rejection rate estimates for the segregation patterns, are presented in Table 9. The off-diagonal entries, N_12 and N_21, are very similar under these segregation patterns, are much smaller than expected under RL, and tend to decrease as s (i.e., the level of segregation) increases. Hence, the mixed NN structure seems to be symmetric under these segregation patterns. The symmetry tests and the exact tests have very small rejection rates, with Pielou's first type and Dixon's symmetry tests having virtually zero rates and the others having rates lower than .05. There seems to be symmetry in both mixed and shared NN structure, since the null hypotheses seem to be satisfied. That is, the expected difference N_12 − N_21 is zero, and the cell counts in the Q-symmetry table are as expected under RL. However, the variances seem to be much smaller than those under RL or CSR independence (see Table 7). Thus, these segregation patterns can form null patterns for both types of symmetry tests; however, the correct variance and covariance terms should be computed; otherwise, the symmetry tests would be extremely conservative when the critical values are based on the distribution under RL or CSR independence.

Table 9

The means (±SD) of the off-diagonal entries, N 12, N 21, and their difference N 12 − N 21 and the rejection rate estimates for Case III patterns with N mc = 10000, n 1 = n 2 = 40 at α = .05. Column labeling is as in Table 6.

	Mean ± SD			Rejection rate estimates for Case III patterns
	N ₁₂	N ₂₁	N ₁₂ − N ₂₁	β^IP	β^IP′	β^binP	β^SD	β^IIP	β^IIF
III-(i)	14.4 ± 3.1	14.4 ± 3.1	.04 ± 3.09	.0004	.0009	.0086	.0225	.0385	.0368
III-(ii)	10.1 ± 2.7	10.1 ± 2.7	− .01 ± 2.58	.0000	.0011	.0096	.0073	.0341	.0321
III-(iii)	5.9 ± 2.1	5.9 ± 2.1	.03 ± 1.98	.0002	.0013	.0128	.0005	.0336	.0308

Case IV. We also consider patterns in which self-reflexive pairs are more frequent than expected by construction. We generate for i = 1,…, ⌊n 1/2⌋ and for j = 1,…, ⌊n 2/2⌋. Then, for k = ⌊n 1/2⌋ + 1,…, n 1, we generate X = X + r(cos⁡T , sinT ) and, for l = ⌊n 2/2⌋ + 1,…, n 2, we generate Y = Y + r(cos⁡T , sinT ), where r ∈ (0,1) and T ~ 𝒰(0,2 π). Appropriate small choices of r will yield an abundance of self-reflexive pairs. The three values of r we consider constitute the below self-reflexivity patterns at each support pair (S 1, S 2). Then the nine pattern combinations we consider are given by the following: S 1 = S 2 = (0,1)×(0,1), (a) r = 1/7, (b) r = 1/8, and (c) r = 1/9; S 1 = (0,5/6)×(0,5/6) and S 2 = (1/6,1)×(1/6,1), (a) r = 1/7, (b) r = 1/8, and (c) r = 1/9; S 1 = (0,3/4)×(0,3/4) and S 2 = (1/4,1)×(1/4,1) (a) r = 1/7, (b) r = 1/8, and (c) r = 1/9. The means (±SD) of the off-diagonal entries, N 12, N 21, and their difference N 12 − N 21 and the empirical rejection rate estimates for Case IV patterns with n 1 = n 2 = 40 are presented in Table 10. In this case, the off-diagonal entries, N 12, N 21, tend to be very similar but smaller than expected under RL, indicating symmetry in mixed NN structure. Furthermore, as pattern changes from (i) to (iii) N 12, N 21 values tend to decrease, and, at each case IV pattern, N 12, N 21 values tend to decrease, as r (i.e., the level of self-reflexivity) decreases. Variants of Pielou's first type of symmetry test have small rejection rates (with the asymptotic versions having virtually zero rates and the exact version slightly higher rates); Dixon's symmetry test has rejection rates smaller than 0.05. Hence, we conclude that, under these self-reflexivity patterns, there is in fact symmetry in mixed NN structure, as the expected difference N 12 − N 21 is zero, but the variance of this difference is much smaller than that under RL. 
Hence, using the asymptotic distribution under RL, these tests would be extremely conservative. To get the desired level, one needs the correct form of the variances and covariances for Dixon's symmetry test under these patterns. On the other hand, Pielou's second type of symmetry tests has rejection rates about the nominal level of .05, indicating that these self-reflexivity patterns can also be viewed as the null pattern for symmetry in the shared NN structure.

Table 10

The means (±SD) of the off-diagonal entries, N 12, N 21, and their difference N 12 − N 21 and the rejection rate estimates for Case IV patterns with N mc = 10000, n 1 = n 2 = 40 at α = .05. The rejection rate labeling and superscripting for “<” and “>” are as in Table 6.

	r	Mean ± SD			Rejection rate estimates for Case IV patterns
	r	N ₁₂	N ₂₁	N ₁₂ − N ₂₁	β^IP	β^IP′	β^binP	β^SD	β^IIP	β^IIF
IV-(i)	1/7	10.5 ± 3.2	10.5 ± 3.2	− .05 ± 3.12	.0012	.0055	.0256	.0361	.0565	.0525
	1/8	9.3 ± 3.1	9.3 ± 3.1	− .03 ± 2.99	.0021	.0052	.0307	.0318	.0572	.0552
	1/9	8.2 ± 3.0	8.2 ± 3.0	.00 ± 2.86	.0023	.0071	.0351	.0295	.0579	.0562

IV-(ii)	1/7	9.0 ± 3.0	9.0 ± 3.1	− .01 ± 2.78	.0014	.0039	.0235	.0176	.0526	.0509
	1/8	8.1 ± 3.0	8.1 ± 3.0	− .02 ± 2.70	.0015	.0064	.0312	.0192	.0609	.0583
	1/9	7.2 ± 2.9	7.2 ± 2.9	− .01 ± 2.63	.0028	.0075	.0395	.0172	.0616	.0601

IV-(iii)	1/7	6.9 ± 2.9	6.9 ± 2.8	.01 ± 2.42	.0014	.0039	.0286	.0070	.0496	.0470
	1/8	6.3 ± 2.8	6.3 ± 2.7	.02 ± 2.37	.0020	.0061	.0345	.0094	.0539	.0518
	1/9	5.6 ± 2.7	5.6 ± 2.6	.01 ± 2.30	.0027	.0074	.0392	.0070	.0590	.0565
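
A minimal sketch of how one class of a Case IV pattern might be simulated is given below. The function name `self_reflexive_class` is hypothetical, the first half of the points is assumed to be uniform on the rectangular support, and the pairing of each displaced point with exactly one of the first-half points is an assumption, since the indices in the construction above are not recoverable from the text.

```python
import math
import random

def self_reflexive_class(n, r, support, rng):
    """Hypothetical sketch of one class in a Case IV pattern (n >= 2).

    Half of the points are uniform on `support` = ((x_lo, x_hi), (y_lo, y_hi));
    each remaining point lies at distance r, in a uniformly random direction,
    from one of the first-half points.
    """
    (x_lo, x_hi), (y_lo, y_hi) = support
    parents = [(rng.uniform(x_lo, x_hi), rng.uniform(y_lo, y_hi))
               for _ in range(n // 2)]
    children = []
    for k in range(n - n // 2):
        # Assumed pairing: the k-th displaced point uses the k-th parent.
        px, py = parents[k % len(parents)]
        t = rng.uniform(0.0, 2.0 * math.pi)   # T ~ U(0, 2*pi)
        children.append((px + r * math.cos(t), py + r * math.sin(t)))
    return parents + children
```

Calling the function once with support S 1 and once with support S 2, for example `self_reflexive_class(40, 1/8, ((0, 5/6), (0, 5/6)), rng)` for the X points of pattern IV-(ii)(b), gives the two classes of one realization.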

Case V. In this case, we first generate the X points and then generate each Y_i as Y_i = X_i + r (cos T_i, sin T_i)′, where r ∈ (0,1) and T_i ~ 𝒰(0, 2π). In the pattern generated, appropriate choices of r will cause the Y and X points to be more associated. That is, a Y point is more likely to be the NN of an X point and vice versa. The four values of r we consider constitute the four association patterns V-(i) to V-(iv). The patterns (i)–(iii) are also the association patterns considered for Monte Carlo analysis in Ceyhan [12]. The means (±SD) of the off-diagonal entries, N_12 and N_21, and of their difference N_12 − N_21, together with the empirical rejection rate estimates for Case V patterns with n_1 = n_2 = 40, are presented in Table 11. Notice that the off-diagonal entries, N_12 and N_21, tend to be at or above the expected value under RL and tend to increase as r decreases (i.e., as the level of association increases). Furthermore, N_12 values tend to be slightly smaller than N_21 values, and the differences between N_12 and N_21 tend to decrease as r decreases. Variants of Pielou's first type of symmetry test have virtually zero rejection rates. Under stronger association with 1/10 ≤ r ≤ 1/7, Dixon's symmetry test and the exact and asymptotic versions of Pielou's second type of symmetry test have rates around .05, and, under moderate association with 1/4 ≤ r ≤ 1/2, these tests have rates mildly above .05. Hence, stronger association with 1/10 ≤ r ≤ 1/7 could serve as the null pattern for both types of symmetry tests, while, under moderate association with 1/4 ≤ r ≤ 1/2, the expected value of N_12 − N_21 is negative rather than zero as under RL, with variances close to those under RL.

Table 11

The means (±SD) of the off-diagonal entries, N 12, N 21, and their difference N 12 − N 21 and the rejection rate estimates for Case V patterns with N mc = 10000, n 1 = n 2 = 40 at α = .05. Column labeling is as in Table 6.

	Mean ± SD			Rejection rate estimates for Case V patterns
	N ₁₂	N ₂₁	N ₁₂ − N ₂₁	β^IP	β^IP′	β^binP	β^SD	β^IIP	β^IIF
V-(i)	19.5 ± 3.3	21.9 ± 3.2	− 2.44 ± 3.52	.0004	.0019	.0076	.0982	.0805	.0982
V-(ii)	22.9 ± 3.3	24.5 ± 3.2	− 1.61 ± 3.62	.0002	.0007	.0033	.0752	.0699	.0676
V-(iii)	25.9 ± 3.1	26.5 ± 3.1	− .60 ± 3.58	.0001	.0003	.0021	.0565	.0503	.0490
V-(iv)	27.8 ± 2.9	27.9 ± 3.0	− .01 ± 3.52	.0001	.0002	.0012	.0513	.0485	.0454

Case VI. In this case, first, we generate for i = 1,2,…, m 1 + m 2 and, for each X generated, we find the distance of NN X point from X , denoted d (i.e., d = min⁡ d(X , X )). Then we generate Y points as follows. First generate R from 𝒰(0, ρd ) and θ from 𝒰(0,2π). Then set Y = X + R (cos⁡(θ ), sin(θ )) for i = 1,2,…, m 1. For j = 1,2,…, m 2, we first generate R from 𝒰(0, ρd ) and θ from 𝒰(0,2π). Then set X ′ = X + R (cos⁡(θ ), sin(θ )) for i = m 1 + 1, m 1 + 2,…, m 1 + m 2 and j = 1,2,…, m 2. Then we merge the X 's and X ′'s to form the X points (which would have n 1 = m 1 + 2m 2 many points). Moreover, we generate for j = 1,2,…, m 2. Let d be the distance of NN Y′ point to Y ′ among the above generated Y′ points. For k = 1,2,…, m 2, we first generate R from 𝒰(0, ρd ) and θ from 𝒰(0,2π). Then set Y ′′ = Y ′ + R (cos⁡(θ ), sin(θ )) for k = 1,2,…, m 2. Then we merge the Y 's, Y ′'s, and Y ′′'s to form the Y points (which would also have n 2 = m 1 + 2m 2 many points). In the pattern generated, appropriate choices of ρ will cause m 1 of the X points to have NNs more from Y points and m 2 of the X points to have NNs more from X points; additionally, m 2 of Y points would have NNs more from Y points. Hence, in this way, the off-diagonal entries (i.e., N 12 and N 21) would tend to be different, indicating asymmetry in mixed NN structure. The three values of ρ we consider constitute the following patterns: The means (±SD) of the off-diagonal entries, N 12, N 21, and their difference N 12 − N 21 and the empirical rejection rate estimates for these patterns are presented in Table 12. The off-diagonal entries, N 12, N 21, tend to be different at or above the expected value under RL and they tend to increase as ρ increases. However, N 12 values tend to be much smaller than N 21 values, and their difference tends to decrease as ρ increases. 
The asymptotic versions of Pielou's first type of symmetry tests virtually have zero rejection rate, and the exact version has small rates which are slightly larger than .05 for ρ = 1/3. On the other hand, Dixon's test and versions of Pielou's second type of symmetry test have high rejection rates (much higher than 0.05), which decrease as ρ increases. Hence, there is strong asymmetry in mixed and shared NN structure, and the level of asymmetry is increasing with decreasing ρ. Thus, these patterns can serve as alternative patterns for both types of symmetry tests and the rejection rates are power estimates. Notice also that the asymmetry in the shared NN structure is stronger than the asymmetry in the mixed NN structure.

Table 12

The means (±SD) of the off-diagonal entries, N 12, N 21, and their difference N 12 − N 21 and the rejection rate estimates for Case VI patterns with N mc = 10000, m 1 = 20, m 2 = 10 (hence n 1 = n 2 = 40) at α = .05. Column labeling is as in Table 6.

	Mean ± SD			Rejection rate estimates for Case VI patterns
	N ₁₂	N ₂₁	N ₁₂ − N ₂₁	β^IP	β^IP′	β^binP	β^SD	β^IIP	β^IIF
VI-(i)	20.3 ± .6	29.0 ± 2.6	− 8.73 ± 2.58	.0019	.0091	.0629	.8883	.9911	.9907
VI-(ii)	21.6 ± 1.4	27.7 ± 2.8	− 6.13 ± 2.89	.0003	.0012	.0073	.5460	.7782	.7730
VI-(iii)	23.1 ± 2.1	26.4 ± 3.0	− 3.39 ± 3.25	.0000	.0000	.0009	.1860	.3029	.2980

6. Pairwise versus One-versus-Rest Tests

In the multiclass case with k > 2, we first perform an overall omnibus test (as in the ANOVA F-test for multigroup comparisons) and, if the omnibus test is significant, we perform post hoc tests to determine the specifics of the differences. These post hoc tests could be pairwise tests (as in pairwise t-tests) or one-versus-rest tests, where one class is compared with all other classes combined. More specifically, with k > 2 classes, in the pairwise comparison, we only consider classes i and j with i ≠ j. The pairwise tests for Dixon's symmetry test and Pielou's second type of symmetry test can be defined in two different ways: (i) unrestricted pairwise symmetry tests and (ii) restricted pairwise symmetry tests. In the unrestricted version, for the pairwise test for classes i, j, i ≠ j, we keep all the points in consideration. That is, for Dixon's symmetry test, we extract N_ij and N_ji from the overall k × k NNCT and, in computing Q and R values, we use all the classes. In the restricted version, we restrict our attention to the two classes i and j, with i ≠ j, and treat them as in the two-class case. That is, we ignore the remaining classes, obtain a 2 × 2 NNCT just for classes i and j, extract N_12 and N_21 from this NNCT, and compute Q and R for the data consisting of classes i and j only. The unrestricted version of Pielou's second type of symmetry test is based on the contingency table obtained by extracting only rows i and j of the Q-symmetry contingency table. On the other hand, in the restricted version, we compute a 2 × 3 Q-symmetry contingency table based on data consisting of classes i and j only. In the one-versus-rest type of test for class i, we pool the remaining classes and treat them as the other class in a two-class setting, hence the name one-versus-rest test. In a multiclass setting with k classes, there are k one-versus-rest type tests and k(k − 1)/2 pairwise tests.
As k increases, performing one-versus-rest analysis is computationally less intensive and easier to interpret. Although Pielou's first type of symmetry test and Dixon's symmetry test were designed only for the two-class case, we have extended them to the multiclass case. Hence, if we have more than 2 classes, for Pielou's first type of symmetry, we can perform Bowker's test of symmetry in (4) (under the appropriate sampling distribution framework) as the overall test and use the test in (2) as the post hoc test. For Dixon's symmetry test, the overall test is performed with 𝒳_D² in (18) and the post hoc tests are performed with Z_D in (10) for the restricted pairwise and one-versus-rest tests or Z_D^ij in (16) for the unrestricted pairwise tests. For Pielou's second type of symmetry test, the overall test can be performed with 𝒳_II² for the k × 3 Q-symmetry contingency table and post hoc tests with 𝒳_II² for the 2 × 3 Q-symmetry contingency table. In the mixed or shared NN structure, significant overall tests indicate some form of deviation from symmetry for all classes combined, while the post hoc tests suggest which classes deviate significantly from symmetry. In particular, pairwise tests indicate which pairs are asymmetric in the NN structure, while one-versus-rest tests indicate which class is asymmetric with respect to the remaining classes. In all the above cases, the post hoc tests can give different and seemingly conflicting results (e.g., one class can be symmetric with respect to the rest and at the same time asymmetric with respect to one of the other classes). Even if the pairwise symmetry tests are used, the restricted and unrestricted versions might yield different results. So extra care should be exercised in choosing the post hoc test and in interpreting it.
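
The table manipulations behind two of these post hoc tests can be sketched as follows. Pooling classes only relabels points and leaves every base-NN pair unchanged, so the one-versus-rest 2 × 2 NNCT is a collapse of the k × k NNCT, and the unrestricted pairwise test reads N_ij and N_ji directly from it. The restricted pairwise NNCT cannot be obtained this way, because removing classes changes the NNs. The function names and the toy 3 × 3 table below are hypothetical.

```python
def one_versus_rest(nnct, i):
    """Collapse a k x k NNCT into the 2 x 2 NNCT of class i versus the rest."""
    k = len(nnct)
    total = sum(sum(row) for row in nnct)
    n_ii = nnct[i][i]
    n_i_rest = sum(nnct[i]) - n_ii                          # base i, NN in rest
    n_rest_i = sum(nnct[r][i] for r in range(k)) - n_ii     # base in rest, NN i
    n_rest_rest = total - n_ii - n_i_rest - n_rest_i
    return [[n_ii, n_i_rest], [n_rest_i, n_rest_rest]]

def unrestricted_pair(nnct, i, j):
    """Off-diagonal counts (N_ij, N_ji) used by the unrestricted pairwise test."""
    return nnct[i][j], nnct[j][i]

def number_of_pairwise_tests(k):
    """Number of distinct class pairs, k(k - 1)/2."""
    return k * (k - 1) // 2

# Hypothetical 3-class NNCT (rows: base class, columns: NN class).
toy = [[10, 4, 6],
       [5, 20, 5],
       [3, 7, 15]]
print(one_versus_rest(toy, 0))       # [[10, 10], [8, 47]]
print(unrestricted_pair(toy, 1, 2))  # (5, 7)
```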

7. Example Data: Lansing Woods Data

To illustrate the methodology, we use the Lansing Woods data, which contain the locations of trees (in feet (ft)) and the botanical classification of the trees (according to their species) in a 924 ft × 924 ft (19.6 acre) plot in Lansing Woods, Clinton County, MI, USA [26]. The data set is available in the spatstat package in R [25] and comprises 2251 trees together with their species: hickories, maples, red oaks, white oaks, black oaks, and miscellaneous trees. In our analysis, we only consider the black oaks, maples, and white oaks, which constitute a total of 1097 trees. The scatter plot of these tree locations is presented in Figure 1.

Figure 1

The scatter plot of the locations of black oaks (circles ∘), maples (triangles ▵), and white oaks (pluses +) in the Lansing Woods, Clinton County, MI, USA.

7.1. Overall Symmetry Analysis

The NNCT for this data set is presented in Table 13. Notice that the off-diagonal entries (i.e., N_ij and N_ji values with i ≠ j) are very similar for i = 1, j = 2 and for i = 1, j = 3, indicating symmetry in the mixed NN structure for black oaks versus maples and for black oaks versus white oaks. But the N_23 and N_32 values seem to be very different, suggesting strong asymmetry in the mixed NN structure for maples versus white oaks. We formally test symmetry and attach significance to these observations later in this section.

Table 13

The NNCT for the Lansing Woods data set containing black oak, maple, and white oak trees.

	NN species			Total
	Black oak	Maple	White oak	Total
Base species
Black oak	53	35	47	135
Maple	28	366	120	514
White oak	50	161	237	448

Total	131	562	404	1097
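
For readers who want to build such a table from mapped data, a brute-force sketch is shown below. The function name `nnct` and the five toy points are hypothetical; the sketch uses Euclidean distance and breaks ties by taking the first point at minimum distance, which is an assumption rather than a rule stated in the text.

```python
import math

def nnct(points, labels, classes):
    """Nearest neighbor contingency table: rows are base classes, columns NN classes."""
    index = {c: a for a, c in enumerate(classes)}
    table = [[0] * len(classes) for _ in classes]
    n = len(points)
    for a in range(n):
        # Nearest neighbor of point a among all other points (brute force).
        nn = min((b for b in range(n) if b != a),
                 key=lambda b: math.dist(points[a], points[b]))
        table[index[labels[a]]][index[labels[nn]]] += 1
    return table

# Hypothetical toy data: two classes, five points.
points = [(0.0, 0.0), (0.1, 0.0), (1.0, 1.0), (1.2, 1.0), (1.0, 1.1)]
labels = ["a", "b", "a", "b", "a"]
print(nnct(points, labels, ["a", "b"]))  # [[2, 1], [2, 0]]
```

The row sums of the result equal the class sizes (3 and 2 in the toy data), just as the row totals 135, 514, and 448 in Table 13 are the numbers of black oaks, maples, and white oaks.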

The (reduced) Q-symmetry contingency table is presented in Table 14, where the relative frequencies with respect to row sums are provided in parentheses. Observe that the column relative frequencies (i.e., column sums divided by the grand sum or the overall ratios of shared NNs for 0, 1, and ≥2 shared NNs) are 0.27, 0.50, and 0.24. The ratios of shared NNs for black oaks (i.e., the row entries for black oaks divided by the row sum for black oaks) are 0.27, 0.50, and 0.23, the ratios for maple trees are 0.22, 0.50, and 0.28, and the ratios for white oaks are 0.32, 0.49, and 0.19. Hence, the relative frequencies for black oaks are very similar to the overall frequencies, but those for other species (especially for white oaks) are very different from the overall frequencies. This suggests that there are differences in the shared NN structure for the three species, suggesting asymmetry in the shared NN structure, especially for white oaks compared to the other species.

Table 14

The (reduced) Q-symmetry contingency table for the Lansing Woods data. The values in the parentheses are relative frequencies of the cells in each row with respect to the row sums.

	Number of times a point serving as a NN			Total
	0	1	≥2	Total
Classes
Black oak	37 (.27)	67 (.50)	31 (.23)	135
Maple	113 (.22)	259 (.50)	142 (.28)	514
White oak	143 (.32)	220 (.49)	85 (.19)	448

Total	293 (.27)	546 (.50)	258 (.24)	1097
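
A reduced Q-symmetry contingency table of this kind can be tabulated from mapped data along the following lines. This is a sketch with a hypothetical function name and toy data; ties in NN distance are broken by taking the first candidate, which is an assumption.

```python
import math

def q_symmetry_table(points, labels, classes):
    """Rows: classes; columns: points serving as a NN 0, 1, or >= 2 times."""
    n = len(points)
    times_as_nn = [0] * n
    for a in range(n):
        nn = min((b for b in range(n) if b != a),
                 key=lambda b: math.dist(points[a], points[b]))
        times_as_nn[nn] += 1              # point nn serves as the NN of point a
    index = {c: a for a, c in enumerate(classes)}
    table = [[0, 0, 0] for _ in classes]
    for a in range(n):
        table[index[labels[a]]][min(times_as_nn[a], 2)] += 1
    return table

# Hypothetical toy data: two classes, five points.
points = [(0.0, 0.0), (0.1, 0.0), (1.0, 1.0), (1.2, 1.0), (1.0, 1.1)]
labels = ["a", "b", "a", "b", "a"]
print(q_symmetry_table(points, labels, ["a", "b"]))  # [[0, 2, 1], [1, 1, 0]]
```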

We present the test statistics and the associated p values for the overall symmetry analysis in Table 15, where 𝒳_D², 𝒳_I², and 𝒳_II² are as defined in the text, and the superscript u stands for “uncorrected for continuity” or “no Yates correction.” Furthermore, T_II^F stands for the table exclusive version of the two-sided Fisher's exact test on the Q-symmetry contingency table (which by definition yields only a p value and not a test statistic). In this table, p_asy stands for the p value based on the asymptotic approximation (i.e., asymptotic critical value) except for the exact test, and p_rand is based on Monte Carlo randomization of the labels on the given locations of the trees 1000 times. For the exact tests, the p value written in the p_asy row is computed as in Section 3. Notice that p_asy and p_rand are very different for Pielou's first type of symmetry test with and without Yates correction. This is in agreement with the fact that Pielou's first type of symmetry tests are not appropriate for NNCTs based on completely mapped spatial data (yielding very conservative tests under the null hypotheses). For the tests with the correct asymptotic sampling distributions, p_asy and p_rand are very similar.

Table 15

The test statistics and the p-values for the overall symmetry analysis for the Lansing Woods data. TS stands for the test statistic, p asy for the p-values based on asymptotic critical values (except for the exact tests), and p rand for the p-values based on Monte Carlo randomization. *The p-values for the exact tests are not the asymptotic p-values but computed as described in Section 3.

Overall test statistics and p-values
	𝒳 _D ²	𝒳 _I ²	𝒳 _I ^2,u	𝒳 _II ²	T _II ^F
TS	13.482	5.694	6.182	16.595	—
p _asy	.004	.128	.103	.002	.002*
p _rand	.006	.002	.002	.004	.004
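
As a check on the 𝒳_II² column, the sketch below applies the usual Pearson chi-square test of independence to the counts in Table 14, on the assumption that this is how 𝒳_II² is computed (the abstract states that the usual independence test seems appropriate for Pielou's second type of test). The function names are hypothetical. For a k × 3 table the degrees of freedom 2(k − 1) are even, so the upper tail probability has a closed form and no statistics library is needed.

```python
import math

def pearson_chi_square(table):
    """Pearson chi-square statistic and degrees of freedom for an r x c table."""
    row_sums = [sum(row) for row in table]
    col_sums = [sum(col) for col in zip(*table)]
    total = sum(row_sums)
    stat = 0.0
    for row, rs in zip(table, row_sums):
        for observed, cs in zip(row, col_sums):
            expected = rs * cs / total
            stat += (observed - expected) ** 2 / expected
    return stat, (len(row_sums) - 1) * (len(col_sums) - 1)

def chi2_upper_tail_even_df(x, df):
    """P(chi-square with even df > x), using the closed form for even df."""
    half = x / 2.0
    return math.exp(-half) * sum(half ** j / math.factorial(j)
                                 for j in range(df // 2))

# Counts from Table 14 (rows: black oak, maple, white oak; columns: 0, 1, >= 2).
table14 = [[37, 67, 31],
           [113, 259, 142],
           [143, 220, 85]]
stat, df = pearson_chi_square(table14)
p_value = chi2_upper_tail_even_df(stat, df)
print(round(stat, 3), df, round(p_value, 3))  # 16.595 4 0.002
```

The statistic and p value agree with the 𝒳_II² entries of Table 15.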

Notice that the test statistics and the corresponding p values imply that the allocations of the tree species are asymmetric in mixed and shared NN structure (as was suggested in the NNCT and Q-symmetry contingency table), since the corresponding p values for Dixon's symmetry test and Pielou's second type of symmetry test and Fisher's exact test are significant (p values based on Monte Carlo randomization are significant for all tests). Hence, we will perform post hoc symmetry tests to determine which pair(s) of species or which species when compared to the rest exhibit significant asymmetry in the NN structure.
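
The Monte Carlo randomization behind p_rand can be sketched as follows. Because the locations stay fixed under random labeling, the NN of each point is computed once and only the labels are permuted. The function names are hypothetical; the statistic used here is the raw difference |N_12 − N_21| as a simple stand-in for the test statistics in Table 15, and counting the observed labeling as one of the draws is an assumption about the p value convention.

```python
import math
import random

def nearest_neighbors(points):
    """Index of the nearest neighbor of every point (brute force, Euclidean)."""
    n = len(points)
    return [min((b for b in range(n) if b != a),
                key=lambda b: math.dist(points[a], points[b]))
            for a in range(n)]

def n12_minus_n21(labels, nn, c1, c2):
    """N_12 - N_21 for classes c1 and c2, given fixed NN indices."""
    diff = 0
    for base, neighbor in enumerate(nn):
        pair = (labels[base], labels[neighbor])
        if pair == (c1, c2):
            diff += 1
        elif pair == (c2, c1):
            diff -= 1
    return diff

def randomization_p_value(points, labels, c1, c2, n_rand, rng):
    """Two-sided randomization p value for |N_12 - N_21| under random labeling."""
    nn = nearest_neighbors(points)        # locations are fixed under RL
    observed = abs(n12_minus_n21(labels, nn, c1, c2))
    shuffled = list(labels)
    hits = 0
    for _ in range(n_rand):
        rng.shuffle(shuffled)             # random relabeling of the fixed points
        if abs(n12_minus_n21(shuffled, nn, c1, c2)) >= observed:
            hits += 1
    return (hits + 1) / (n_rand + 1)      # observed labeling counted as one draw
```

With `n_rand = 1000` and the tree locations and species as input, this mirrors the number of relabelings used for p_rand, up to the choice of test statistic.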

7.2. Post Hoc Symmetry Analysis

7.2.1. Unrestricted Pairwise Symmetry Analysis

For the unrestricted pairwise analysis, we use (parts of) the contingency tables in the overall symmetry analysis. For example, for the unrestricted pairwise tests for Dixon's symmetry test, we use the off-diagonal entries N_ij and N_ji in the NNCT in Table 13 and the test statistic Z_D^ij in (16). For the unrestricted pairwise tests for Pielou's second type of symmetry test for species i and j, we use the 2 × 3 Q-symmetry contingency table obtained from rows i and j of Table 14. We present the test statistics and the associated p values for the unrestricted pairwise symmetry tests in Table 16. Notice that p_asy and p_rand are very different for Pielou's first type of symmetry test with and without Yates correction and very similar for the other tests. The test statistics and the corresponding p values imply that there is symmetry in the NN structure for black oaks versus maples and for black oaks versus white oaks. However, maples versus white oaks exhibit significant asymmetry in the NN structure.

Table 16

The test statistics and the p-values for the unrestricted pairwise symmetry analysis for the Lansing Woods data. Row labelings and the asterisks are as in Table 15.

Unrestricted pairwise test statistics and p-values
	Z _D ^ij	𝒳 _I ²	𝒳 _I ^2,u	𝒳 _II ²	T _II ^F
Black oaks versus maples
TS	.769	.246	.385	2.245	—
p _asy	.442	.620	.535	.326	.331*
p _rand	.412	.311	.282	.331	.337

Black oaks versus white oaks
TS	−.657	.092	.163	1.520	—
p _asy	.511	.762	.686	.468	.466*
p _rand	.473	.483	.483	.481	.481

Maples versus white oaks
TS	−3.470	5.356	5.634	16.554	—
p _asy	<.001	.021	.018	<.001	<.001*
p _rand	<.001	<.001	<.001	<.001	<.001
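
The 𝒳_II² column of Table 16 can be recomputed from pairs of rows of Table 14, again assuming the usual Pearson chi-square statistic. The sketch below uses a hypothetical function name; for a 2 × 3 table there are 2 degrees of freedom, so the asymptotic p value is simply exp(−𝒳²/2).

```python
import math

def pairwise_q_chi_square(q_table, i, j):
    """Pearson chi-square and asymptotic p value for rows i and j of a k x 3 table."""
    rows = [q_table[i], q_table[j]]
    row_sums = [sum(row) for row in rows]
    col_sums = [sum(col) for col in zip(*rows)]
    total = sum(row_sums)
    stat = sum((observed - rs * cs / total) ** 2 / (rs * cs / total)
               for row, rs in zip(rows, row_sums)
               for observed, cs in zip(row, col_sums))
    return stat, math.exp(-stat / 2.0)    # chi-square upper tail with 2 df

# Counts from Table 14 (rows: black oak, maple, white oak).
table14 = [[37, 67, 31],
           [113, 259, 142],
           [143, 220, 85]]
for name, (i, j) in [("black oaks vs maples", (0, 1)),
                     ("black oaks vs white oaks", (0, 2)),
                     ("maples vs white oaks", (1, 2))]:
    stat, p_value = pairwise_q_chi_square(table14, i, j)
    print(name, round(stat, 2), round(p_value, 3))
```

The three statistics come out at about 2.245, 1.520, and 16.554, in line with Table 16.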

7.2.2. Restricted Pairwise Symmetry Analysis

For the restricted pairwise symmetry tests, we construct the contingency tables for the two species in question (ignoring the other species). The NNCTs for the three pairs of species are presented in Table 17. Notice that the off-diagonal entries are very similar for black oaks versus maples and black oaks versus white oaks, indicating symmetry in the mixed NN structure for these pairs of species. The off-diagonal entries are very different for maples versus white oaks indicating strong asymmetry in mixed NN structure for these species. The (reduced) Q-symmetry contingency tables for each pair of species in the restricted sense are presented in Table 18, where relative frequencies of cell counts with respect to row sums are presented in parentheses. Relative frequencies of black oaks and maples seem to be very similar to the overall frequencies for the column sums, and the same holds for black oaks and white oaks, indicating symmetry in the shared NN structure. On the other hand, the relative frequencies for the maples and white oaks seem to be different from the overall frequencies, indicating asymmetry in the shared NN structure. We present the test statistics and the associated p values for the restricted pairwise symmetry analysis in Table 19. Black oaks versus maples exhibit symmetry in the NN structures and likewise for black oaks versus white oaks. However, maples versus white oaks exhibit significant asymmetry in the NN structures.

Table 19

The test statistics and the p-values for the restricted pairwise symmetry analysis for the Lansing Woods data. Row labelings and the asterisks are as in Table 15.

Restricted pairwise test statistics and p-values
	Z _D ^ij	𝒳 _I ²	𝒳 _I ^2,u	𝒳 _II ²	T _II ^F
Black oaks versus maples
TS	.246	.010	.038	.144	—
p _asy	.806	.922	.845	.930	.937*
p _rand	.819	.778	.773	.932	.945

Black oaks versus white oaks
TS	−.597	.115	.180	.603	—
p _asy	.551	.734	.671	.740	.759*
p _rand	.598	.497	.455	.764	.778

Maples versus white oaks
TS	−2.923	3.717	3.939	10.806	—
p _asy	.003	.054	.047	.005	.005*
p _rand	<.001	<.001	<.001	.004	.004

7.2.3. One-versus-Rest Symmetry Analysis

For the one-versus-rest type symmetry tests, we construct the contingency tables for each species in question by pooling the other species into one class. The NNCTs for the three species are presented in Table 20. Notice that the off-diagonal entries are very similar for black oaks versus rest, indicating symmetry in the mixed NN structure. The off-diagonal entries for maples versus rest and white oaks versus rest are very different, suggesting asymmetry in the mixed NN structure. The (reduced) Q-symmetry contingency tables for each species in the one-versus-rest sense are presented in Table 21, which also contains the relative frequencies with respect to row sums in parentheses. The relative frequencies for black oaks versus rest are similar to the overall frequencies, indicating symmetry in the shared NN structure, while they are different for maples versus rest and white oaks versus rest, indicating asymmetry in the shared NN structure. We present the test statistics and the associated p values for the one-versus-rest symmetry analysis in Table 22. There is symmetry in the NN structures for black oaks versus rest, while there is significant asymmetry in the NN structures for maples versus rest and white oaks versus rest.

Table 22

The test statistics and the p-values for the one-versus-rest symmetry analysis for the Lansing Woods data. Row labelings and the asterisks are as in Table 15.

One-versus-rest test statistics and p-values
	Z _D ^ij	𝒳 _I ²	𝒳 _I ^2,u	𝒳 _II ²	T _II ^F
Black oak versus rest
TS	.118	.000	.006	.049	—
p _asy	.906	1.000	.938	.976	.970*
p _rand	.889	1.000	.866	.968	.968

Maples versus rest
TS	−3.471	5.547	5.802	16.125	—
p _asy	<.001	.019	.016	<.001	<.001*
p _rand	<.001	<.001	<.001	<.001	<.001

White oaks versus rest
TS	3.447	4.840	5.068	13.832	—
p _asy	<.001	.028	.024	<.001	<.001*
p _rand	.001	<.001	<.001	.002	.002

8. Interpoint Dissimilarity Measures

A dissimilarity measure, ρ, on a set of objects E is an ℝ-valued function on E × E such that ρ*_x = ρ(x, x) ≤ ρ(x, y) = ρ(y, x) < ∞ for all x, y ∈ E. A similarity measure, s, on E is an ℝ-valued function on E × E such that s*_x = s(x, x) ≥ s(x, y) = s(y, x) ≥ 0 for all x, y ∈ E. Generally, ρ*_x = ρ* and s*_x = s* for all x ∈ E. In particular, if s* = 0, then ρ* = 1. We focus on dissimilarity measures only, since any similarity measure can easily be converted to a dissimilarity measure [27]. Any distance metric is by definition a dissimilarity measure. In practice, the term distance is often used to describe precisely the differences of actual measurements, while “dissimilarity” might be an estimate of a distance we cannot measure physically. Among the widely used distances are Euclidean, Minkowski, Mahalanobis, and taxi-cab distances; among the nonmetric dissimilarity measures are maximum coordinate difference, minimum coordinate difference, dot product, Pearson's linear dissimilarity, and Spearman's rank dissimilarity. In the literature, NN relationships are usually defined with distance metrics. In particular, Euclidean distance in ℝ2 is the only metric used in this paper. The use of distances for obtaining the NN relations can be generalized to dissimilarity measures in such a way that the NN of an object, x, refers to the object with the minimum dissimilarity to x. We assume that the objects (events) lie in a finite or infinite dimensional space satisfying the symmetry conditions. Under RL, the objects are fixed in the sense that they yield fixed interpoint dissimilarity measures, but the labels are assigned randomly. The spatial patterns have broader interpretations in this extension. Symmetry occurs when the classes have similar NN structures with respect to each other. The extension of Pielou's first type of symmetry test is straightforward.
However, Pielou's second type of symmetry test and Dixon's tests are constructed assuming that data are in ℝ2 in the literature. In Dixon's tests, the term Q which is the number of points with shared NNs needs to be updated for higher dimensional data. The general form of Q is defined as . In practice, usually . One may check the appropriateness of this assumption by using the interpoint dissimilarity matrix in the classical multidimensional scaling of the data to ℝ2. If the NN relations do not change considerably, it might be more practical to just use Q instead of for computational reasons. Furthermore, with non-Euclidean distances or dissimilarity measures, a point can serve as a NN to more than 6 points, so the Q-symmetry contingency table should be updated accordingly. Here is a possible example for which we have dissimilarity measures between objects that lie in a high or infinite dimensional space. In medical image analysis the differences in morphometry (shape and size) of tissues are measured by a distance metric called LDDMM (see, e.g., [28]). Based on the distances measured between certain brain tissues (like hippocampus), one is interested, say, in the symmetry of the shapes of the tissues with respect to NN relationships. This aspect of spatial dependence in the (abstract) morphometric space is a topic of prospective research.
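
The generalization to dissimilarity measures only requires a symmetric matrix of pairwise dissimilarities. The sketch below uses a hypothetical function name and a hypothetical star-shaped dissimilarity on eight objects; it finds each object's NN and shows an object serving as the NN of seven others, which illustrates why the columns of the Q-symmetry contingency table may need to be extended beyond the Euclidean ℝ2 case.

```python
def nn_from_dissimilarity(d):
    """NN index of each object and the number of times each object serves as a NN.

    `d` is a symmetric matrix (list of lists) of pairwise dissimilarities;
    ties are broken by taking the first object at minimum dissimilarity.
    """
    n = len(d)
    nn = [min((b for b in range(n) if b != a), key=lambda b: d[a][b])
          for a in range(n)]
    times_as_nn = [nn.count(a) for a in range(n)]
    return nn, times_as_nn

# Hypothetical dissimilarities: object 0 is at dissimilarity 1 from every other
# object, and all other pairs are at dissimilarity 2.
n = 8
d = [[0 if a == b else (1 if 0 in (a, b) else 2) for b in range(n)]
     for a in range(n)]
nn, times_as_nn = nn_from_dissimilarity(d)
print(times_as_nn[0])  # 7
```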

9. Discussion and Conclusions

In this paper, we investigate tests of symmetry in mixed and shared nearest neighbor (NN) structures using contingency tables based on the NN relations between classes. We consider Pielou's two types of symmetry tests and Dixon's symmetry test and determine their appropriate null hypotheses and the underlying assumptions. Pielou's first type of symmetry test and Dixon's symmetry tests are for symmetry in mixed NN structure and are based on the nearest neighbor contingency table (NNCT), while Pielou's second type of symmetry test is for symmetry in shared NN structure and is based on the Q-symmetry contingency table. We derive the asymptotic distribution of Dixon's symmetry test under RL, which is also valid under CSR independence conditional on spatial allocation of the points in the study region. We extend Pielou's and Dixon's symmetry tests to multiclass case and prove the consistency of these tests under their appropriate null hypotheses. In particular, we prove consistency for Pielou's first type of symmetry test under the appropriate sparse sampling in the overall multinomial framework, for Pielou's second type of symmetry test under the appropriate sparse sampling in the row-wise multinomial framework and for Dixon's symmetry test under RL patterns with completely mapped data. Among the symmetry tests, we demonstrate that versions of Pielou's first type of symmetry test are extremely conservative when used with the asymptotic critical value for the McNemar's test, due to dependence between base-NN pairs and the underlying framework for the NNCT. Hence, these tests should be avoided in practice with the asymptotic critical values but can be used with Monte Carlo randomization. On the other hand, Pielou's second type of symmetry test and Dixon's symmetry test are about the desired level under complete spatial randomness (CSR) independence and random labeling (RL). We also consider the use of Fisher's exact test for the Q-symmetry contingency table. 
In particular, we demonstrate that the table-exclusive version of the two-sided exact test has the desired level under CSR independence and RL. It is desirable for a test to be not only consistent but also powerful; hence, determining appropriate alternatives for these tests is an important task. We consider various patterns for assessing the finite sample performance of the symmetry tests and discover other patterns under which the null hypotheses for these types of symmetry tests are satisfied. However, the variances and covariances (and hence the asymptotic distributions) should be adjusted to attain the desired level for these patterns, because the asymptotic distribution of Dixon's symmetry test is derived only under CSR independence and RL. With critical values based on the asymptotic distribution under CSR independence or RL, the tests are either extremely conservative or liberal for these patterns, even though the null hypotheses are satisfied. We also find that some of the patterns can serve as alternatives for symmetry in shared NN structure or for symmetry in mixed NN structure. Under these alternatives, we observe that Pielou's second type of symmetry test has higher power than Dixon's symmetry test. Furthermore, Pielou's second type of symmetry test is shown to be appropriate under CSR independence and RL only empirically, by Monte Carlo simulations. Finding the distribution of Dixon's symmetry test under the general null hypothesis of symmetry in mixed NN structure (CSR independence and RL being only two special cases in this setting) and finding the distribution of Pielou's second type of symmetry test under the null hypothesis of symmetry in shared NN structure (even under CSR independence or RL) are still open problems. In a multiclass setting, an overall symmetry test can first be conducted as an omnibus test. If it is significant, then either one-versus-rest or pairwise type post hoc tests can be applied.
If the interest is in the symmetry of one class with respect to the remaining classes, then a one-versus-rest type analysis should be performed. On the other hand, if the interest is in determining which pair(s) deviate significantly from symmetry, then pairwise symmetry tests can be employed. When pairwise tests follow an overall symmetry test, we recommend the unrestricted pairwise version, which takes all the data into account (indeed, the significant overall test was based on all the data). But if the interest is in only two of the classes, then a restricted pairwise test (considering only the classes in question) can be employed. For symmetry in shared NN structure (with the Q-symmetry contingency table), we recommend the use of one-versus-rest type post hoc tests, as they are more consistent with the overall symmetry test. Throughout the paper, we assumed that the total sample size n is a fixed quantity. To make it a random variable, one may consider the data to be from a Poisson point process over the (bounded) region of interest. The generalizations of the tests to high-dimensional data and to NNCTs based on general dissimilarity measures make this methodology useful for other fields as well. Finally, the tests in this paper are not adjusted for the influence of the edges or boundary of the support, which usually causes the tests to be slightly liberal or conservative. Such an adjustment is necessary only when the spatial allocation of the points is not fixed but results from a stochastic process whose support contains the study region (as in the CSR independence case). In that case, adjusting the tests for boundary effects so that they attain the appropriate size is a topic of prospective research.
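The recommendation to use Monte Carlo randomization instead of asymptotic critical values can be illustrated with a minimal sketch. It assumes a two-class setting, a brute-force Euclidean NN search, and a random-labeling permutation scheme in which the point locations are kept fixed and only the class labels are shuffled. The function names, the statistic |N₁₂ − N₂₁|, and the simulated points are illustrative assumptions, not the procedure used in the paper.

```python
import random

def nearest_neighbor(points):
    """Return, for each point, the index of its nearest neighbor (brute force)."""
    nn = []
    for i, (xi, yi) in enumerate(points):
        best, best_d = None, float("inf")
        for j, (xj, yj) in enumerate(points):
            if i == j:
                continue
            d = (xi - xj) ** 2 + (yi - yj) ** 2
            if d < best_d:
                best, best_d = j, d
        nn.append(best)
    return nn

def nnct(labels, nn, k=2):
    """Build the k x k NNCT: rows are base classes, columns are NN classes."""
    table = [[0] * k for _ in range(k)]
    for i, j in enumerate(nn):
        table[labels[i]][labels[j]] += 1
    return table

def mc_symmetry_pvalue(points, labels, n_perm=199, seed=0):
    """Monte Carlo p-value for |N12 - N21| under random relabeling (assumed scheme)."""
    rng = random.Random(seed)
    nn = nearest_neighbor(points)          # locations are fixed, so NN pairs are fixed
    observed_table = nnct(labels, nn)
    observed = abs(observed_table[0][1] - observed_table[1][0])
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)              # random labeling: permute class labels
        t = nnct(shuffled, nn)
        if abs(t[0][1] - t[1][0]) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)

# Hypothetical data: 60 uniform points in the unit square, 30 per class.
rng = random.Random(42)
points = [(rng.random(), rng.random()) for _ in range(60)]
labels = [0] * 30 + [1] * 30
table = nnct(labels, nearest_neighbor(points))
p_value = mc_symmetry_pvalue(points, labels)
```

Because the labels are permuted over fixed locations, the dependence between base-NN pairs is carried into the reference distribution, which is the reason a randomization approach is attractive when the asymptotic critical value is unreliable.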

(a)

	The number of times a point serving as a NN						Total
	0	1	2	3	4	5	Total
Classes
Class 1	Q _1,0	Q _1,1	Q _1,2	Q _1,3	Q _1,4	Q _1,5	n ₁
Class 2	Q _2,0	Q _2,1	Q _2,2	Q _2,3	Q _2,4	Q _2,5	n ₂

Total	Q ₀	Q ₁	Q ₂	Q ₃	Q ₄	Q ₅	n

(b)

	The number of times a point serving as a NN						Total
	0	1	2	3	4	5	Total
Classes
Class 1	Q _1,0	Q _1,1	Q _1,2	Q _1,3	Q _1,4	Q _1,5	n ₁
⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮
Class k	Q _k,0	Q _k,1	Q _k,2	Q _k,3	Q _k,4	Q _k,5	n _k

Total	Q ₀	Q ₁	Q ₂	Q ₃	Q ₄	Q ₅	n

(a)

	Mean ± SD			Rejection rate estimates for Case I patterns
	N ₁₂	N ₂₁	N ₁₂ − N ₂₁	β^IP	β^IP′	β^binP	β^SD	β^IIP	β^IIF
I-(i)	8.5 ± 2.5	6.8 ± 2.7	1.66 ± 2.26	.0018	.0060	.0721	.0092	.0379	.0359
I-(ii)	4.1 ± 1.7	2.2 ± 1.7	1.94 ± 1.58	.0118	.0511	.3109	.0017	.0345	.0324
I-(iii)	2.9 ± 1.4	1.0 ± 1.2	1.87 ± 1.34	.0338	.0919	.5345	.0002	.0324	.0296

(b)

	Rejection rate estimates for Case I patterns based on Monte Carlo randomization
	β^IP	β^IP′	β^SD	β^IIP	β^IIF
I-(i)	.641219	.480581	.409682	.546873	.546491
I-(ii)	.670268	.402391	.271784	.569909	.569605
I-(iii)	.696817	.397304	.196840	.577398	.576899

(a)

	NN species		Total
	B.O.	Maple	Total
Base species
B.O.	82	53	135
Maple	49	465	514

Total	131	518	649

(b)

	NN species		Total
	B.O.	W.O.	Total
B.O.	78	67	135
W.O.	72	376	448

Total	140	443	583

(c)

	NN species		Total
	Maple	W.O.	Total
Maple	379	135	514
W.O.	172	276	448

Total	551	411	962
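As a worked illustration of the McNemar-type statistic that underlies Pielou's first type of symmetry test, the sketch below computes X² = (N₁₂ − N₂₁)² / (N₁₂ + N₂₁) from the off-diagonal entries of the pairwise NNCTs in panels (a) and (c) above, together with the asymptotic chi-square p-value on 1 degree of freedom. The function name is an assumption made for illustration. The discussion above reports that this asymptotic version is extremely conservative for NNCT data, so the p-values serve only to show the arithmetic.

```python
import math

def mcnemar_symmetry(n12, n21):
    """McNemar-type statistic and asymptotic p-value (chi-square, 1 df)."""
    stat = (n12 - n21) ** 2 / (n12 + n21)
    # For 1 df, P(chi2 > x) = erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value

# Panel (a): base B.O. with NN Maple = 53, base Maple with NN B.O. = 49.
stat_a, p_a = mcnemar_symmetry(53, 49)

# Panel (c): base Maple with NN W.O. = 135, base W.O. with NN Maple = 172.
stat_c, p_c = mcnemar_symmetry(135, 172)

print(f"(a) X2 = {stat_a:.4f}, asymptotic p = {p_a:.4f}")
print(f"(c) X2 = {stat_c:.4f}, asymptotic p = {p_c:.4f}")
```

Only the off-diagonal cells enter the statistic, so the diagonal counts and the margins do not affect the result.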

(a)

	Number of times a point serving as a NN			Total
	0	1	≥2	Total
Classes
B.O.	38 (.28)	65 (.48)	32 (.24)	135
M.	142 (.28)	242 (.47)	130 (.25)	514

Total	180 (.28)	307 (.47)	162 (.25)	649

(b)

	Number of times a point serving as a NN			Total
	0	1	≥2	Total
B.O.	36 (.27)	64 (.47)	35 (.26)	135
W.O.	135 (.30)	203 (.45)	110 (.25)	448

Total	171 (.29)	267 (.46)	145 (.25)	583

(c)

	Number of times a point serving as a NN			Total
	0	1	≥2	Total
M.	117 (.23)	258 (.50)	139 (.27)	514
W.O.	136 (.30)	224 (.50)	88 (.20)	448

Total	253 (.26)	482 (.50)	227 (.24)	962
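For symmetry in shared NN structure, the usual chi-square test of independence applied to the Q-symmetry contingency table can be sketched as follows, using the counts from panels (a) and (c) above. The function name is an assumption made for illustration, and the closed-form p-value exp(−X²/2) holds only because these tables have 2 degrees of freedom. Since the appropriateness of this test is established only empirically in the paper, the output should be read as a sketch of the computation rather than a definitive analysis.

```python
import math

def chi_square_independence(table):
    """Pearson chi-square statistic and degrees of freedom for an r x c table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    df = (len(row_totals) - 1) * (len(col_totals) - 1)
    return stat, df

# Columns: number of times a point serves as a NN (0, 1, >=2).
q_a = [[38, 65, 32], [142, 242, 130]]     # panel (a): B.O. vs Maple
q_c = [[117, 258, 139], [136, 224, 88]]   # panel (c): Maple vs W.O.

stat_a, df_a = chi_square_independence(q_a)
stat_c, df_c = chi_square_independence(q_c)

# With 2 degrees of freedom the chi-square survival function is exp(-x / 2).
p_a = math.exp(-stat_a / 2.0)
p_c = math.exp(-stat_c / 2.0)

print(f"(a) X2 = {stat_a:.3f}, df = {df_a}, p = {p_a:.4f}")
print(f"(c) X2 = {stat_c:.3f}, df = {df_c}, p = {p_c:.4f}")
```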

(a)

	NN species		Total
	B.O.	Rest	Total
Base species
B.O.	53	82	135
Rest	78	884	964

Total	131	966	1097

(b)

	NN species		Total
	Maple	Rest	Total
Maple	352	150	514
Rest	196	387	583

Total	560	537	1097

(c)

	NN species		Total
	W.O.	Rest	Total
W.O.	236	212	448
Rest	167	482	649

Total	403	694	1097

(a)

	Number of times a point serving as a NN			Total
	0	1	≥2	Total
Classes
B.O.	37 (.27)	67 (.50)	31 (.23)	135
R.	256 (.27)	479 (.50)	227 (.24)	962

Total	293 (.27)	546 (.50)	258 (.24)	1097

(b)

	Number of times a point serving as a NN			Total
	0	1	≥2	Total
M.	112 (.22)	259 (.50)	148 (.29)	514
R.	181 (.31)	286 (.49)	116 (.20)	583

Total	293 (.27)	545 (.50)	259 (.24)	1097

(c)

	Number of times a point serving as a NN			Total
	0	1	≥2	Total
W.O.	143 (.32)	219 (.49)	86 (.19)	448
R.	150 (.23)	327 (.50)	172 (.27)	649

Total	293 (.27)	546 (.50)	258 (.24)	1097
