Literature DB >> 32160247

A permutation test and spatial cross-validation approach to assess models of interspecific competition between trees.

Abstract

Measuring species-specific competitive interactions is key to understanding plant communities. Repeat censused large forest dynamics plots offer an ideal setting to measure these interactions by estimating the species-specific competitive effect on neighboring tree growth. Estimating these interaction values can be difficult, however, because the number of them grows with the square of the number of species. Furthermore, confidence in the estimates can be overestimated if any spatial structure of model errors is not considered. Here we measured these interactions in a forest dynamics plot in a transitional oak-hickory forest. We analytically fit Bayesian linear regression models of annual tree radial growth as a function of that tree's species, its size, and its neighboring trees. We then compared these models to test whether the identity of a tree's neighbors matters and if so at what level: based on trait grouping, based on phylogenetic family, or based on species. We used a spatial cross-validation scheme to better estimate model errors while avoiding potentially over-fitting our models. Since our model is analytically solvable we can rapidly evaluate it, which allows our proposed cross-validation scheme to be computationally feasible. We found that the identity of the focal and competitor trees mattered for competitive interactions, but surprisingly, identity mattered at the family rather than species-level.

Entities: Chemical Disease Species

Year: 2020 PMID： 32160247 PMCID： PMC7065802 DOI： 10.1371/journal.pone.0229930

Source DB: PubMed Journal: PLoS One ISSN： 1932-6203 Impact factor: 3.240

Introduction

Competition is a key biotic interaction which structures communities. To better understand the role it plays, we need ways to understand species-specific competitive interactions. Attempts have been made to do this through direct measurement of species-specific competition coefficients, or to generalize competitive interactions based on trait or phylogeny differences between competitors. Repeat censused forest inventory plots are good places to measure species-specific competition. In such forests trees are identified, mapped, and have their diameter measured at regular time intervals [1, 2]. Diameter growth between censuses of individual trees is modeled as a function of neighborhood tree species identity, size, and distance to focal tree [3, 4]. This gives a way to estimate the species-specific effect of competition on tree growth. The first question is whether species identity of the competitor matters at all [5], or is competition neutral (sensu [6]). Assuming that the identity of competitors does matter, it can then be hard to measure all of the interaction coefficients, which increase proportionally with the square of species number in the community. Furthermore, some species pairs might not coexist nearby one another enough to get sufficient data to measure their effects on one another. A number of approaches have been taken to deal with large number of parameters to estimate and with missing species pairs. In low-diversity forests, attempts have been made to measure species-specific competition coefficients of the effect of competition on radial tree growth [7-10]. In a western US, evergreen forest, Das [8] was able to estimate the 16 competition coefficients among the four dominant tree species. Canham et al. [7] measured the competition coefficients between the 14 most common tree species in Northern New England. While they were able to estimate most coefficients, they could not for species-pairs that rarely occurred together. A hierarchical Bayesian approach can be taken to address this problem [11]. All species-pairs are considered, no data are thrown out, and competition coefficients are “regressed back” to an overall average. The amount of regression decreases with the number of interacting pairs of individuals of that particular species pair. This approach was taken by Tatsumi et al. [9] when looking at 38 tree species in a cool-temperate mixed conifer-broadleaf forest. They found support for the hierarchical model over non-hierarchical version, but because of relatively small sample size, competition coefficients of only a few species pairs had 95% credible intervals that did not overlap with the hyperparameter value. A final approach is to estimate competition coefficients based on species attributes, rather than measure each one. A common approach is to correlate competition coefficients with either trait or phylogenetic distance between the species [12]. Uriarte et al. [12] found that trait distance did a better job of explaining competitive interactions than phylogenetic distance. Kunstler et al. [13] also found that traits can explain competitive interactions. For any of these approaches, a method for comparing model performance is needed. Previous studies have used information criteria to compare models [7–9, 12]. Such information criteria are asymptotic approximations for out-of-sample-prediction error [14]. Another commonly used measure of out-of-sample-prediction error is the more easily interpretable (root) mean squared error (RMSE) [15]. Here we present a method to answer two questions from repeat censused forest inventory plots: (1) “Does the species identity of neighboring competitor trees matter?” and (2) “If so, how can you measure species-specific competition coefficients?” For both questions, we analytically derived posterior estimates of coefficients of a linear Bayesian neighborhood competition model. The chief advantage of this analytic approach is that the aforementioned coefficients are much less computationally expensive to estimate than other approaches, such as the Markov chain Monte Carlo based estimates necessitated by other more complicated models [7, 9]. To answer question (1) from the introduction, we use a permutation test based approach. We first fit the model and obtain an observed test statistic assessing the quality of the model fit, in our case the observed RMSE. We then permute the competitor species identities and compute a permuted test statistic a large number of times, thereby empirically generating a “null” distribution of the test statistic. We then compare the observed test statistic to this null distribution to obtain significance measures. The rapid analytic solution is required for this permutation test to run in a reasonable amount of time. We find that identity of the competitor species does matter. For question (2) from the introduction, we can further ask whether species identity, family identity, or trait values best explains competitive interactions. Here we provide a method to compare spatially cross-validated RMSE’s of competing models. The rapid analytic solution is once again required to fit the model repeatedly in the cross-validation scheme. Surprisingly we find that grouping species by family best explains competitive interactions.

Materials and methods

Study site and sampling

The Big Woods plot is a 23 ha forest dynamics plot in Pickney, MI (42.462902 N, 84.006093 W). The plot is within a transitional oak-hickory forest. The canopy is dominated by black oak (Quercus velutina), northern red oak (Quercus rubra), white oak (Quercus alba), bitternut hickory (Carya cordiformis), and shagbark hickory (Carya ovata). However there are relatively few oaks in the mid and understory, instead these strata are dominated by red maple (Acer rubrum) and black cherry (Prunus serotina). A full list of species found in the plot can be found in S2 Appendix. The elevation in the plot ranges from 270 m to 305 m. Above 275 m the mineral soils are largely Boyer–Oshtemo sandy loam, and below 275 m the soils are mainly histosols dominated by Carlisle and Rifle muck. The rugged topography within the plot is the result of glacial scouring with hills and knobs separated by kettle holes and basins. For more information on the plot see Allen et al. [16]. The original plot was established in 2003 and was only 12 ha. All free-standing woody stems in the plot larger than 3.2 cm diameter at breast height (DBH) were censused, had their DBH measured, mapped, and identified. Between 2007 and 2010 this plot was expanded to 23 ha using the same censusing technique and the original 12 ha were re-censused. In 2014 the entire 23 ha plot was re-censused to determine the diameter growth of each individual, tag individuals recruited into the greater than 3.2 cm DBH size class, and identify which individuals died. For this analysis we considered the average annual diameter growth between the 2007–2010 and 2014 censuses for all stems alive in both censuses ignoring re-sprouts. We defined stems alive in the 2007–2010 census as the competitor stems. DA collected these data with the help of others listed in the Acknowledgements section. The Big Woods plot is located within the Edwin S. George Reserve. Permission to sample in the Reserve was granted by the University of Michigan Department of Ecology and Evolutionary Biology. The plot is part of the Smithsonian Institution’s ForestGEO global network of forest research plots (Smithsonian Institution, Washington DC, USA) [1]. Data from the censuses are available at Allen et al. [17].

Species grouping methods

We tested which grouping of trees best explained competitive interactions. We grouped species in three ways: (1) by species, (2) by family, and (3) according to traits assumed to correlate with competitive interactions. To form this third grouping, we collected species trait values from the TRY database of plant traits [18]. From this database we picked plant height, wood density, and specific leaf area as our traits of interest because they have been identified as important to tree competition [13]. We clustered these species based on their values for these three traits using the cluster package in R [19, 20]. The distance matrix was formed by the euclidean distance between species’ three trait values. We then formed a hierarchical clustering of this distance matrix using the agnes command. We chose to cut the resulting hierarchical tree into six groups.

Neighborhood-effect growth model

Let i = 1, …, n index all n trees of “focal” species group j; let j = 1, …, J index all J focal species groups; and let k = 1, …, K index all K “competitor” species groups. We modeled the growth in diameter per year y (in centimeters per year) of the i tree of focal species group j as a linear model f of the following covariates where β0, is the diameter-independent growth rate for group j; DBH is the diameter at breast height (in centimeters) of the focal tree at the earlier census; βDBH, is the amount of the growth rate changed depending on diameter for group j; BA is the sum of the basal area of all trees of competitor species group k within a neighborhood of 7.5 meters of the focal tree; λ is the change in growth for individuals of group j from nearby competitors of group k; and ϵ is a random error term distributed Normal(0, σ2). We chose a distance of 7.5 meters as the competitor neighborhood of a focal tree. Other studies have estimated this distance, we used 7.5 meters as an average of estimated values [3, 5, 7, 11]. For focal trees we considered only those alive in both the 2007–2010 and 2014 censuses. Thus this model only considered the effect of competition on growth, not on mortality. Growth between the two censuses was almost always positive, with the few negative values probably reflecting measurement error. We considered models where the focal species grouping j = 1, …, J and the competitor species grouping k = 1, …, K reflected the three notions of species grouping introduced earlier: trait group with 6 groups, phylogenetic family with 20 groups, and actual species with 36 groups. While our model specification is flexible enough where our notion of focal tree grouping does not necessarily have to match our notion of competitor tree grouping, for simplicity in this paper we only considered models where both notions match and hence J = K. Furthermore, our models incorporated a specific notion of competition reflecting a particular assumption on the nature of competition between trees: species grouping-specific effects of competition. For a given focal species group j, all competitor species groups exert different competitive effects. Such models not only assumed competition between trees exists, but also that different focal versus competitor species group pairs have different competitive relationships. More specifically, using the above three notions of species grouping, we compared three different models for growth y, each with different numbers of (0, DBH, λ, σ2) parameters estimated. Trait group with J = K = 6, thus (0, DBH, σ2) consisted of 6 + 6 + 1 = 13 parameters. Furthermore, there were 6 × 6 unique values of λ, thus λ is a 6 × 6 matrix, totaling 13 + 6 × 6 = 49 parameters. Phylogenetic family with J = K = 20, thus (0, DBH, σ2) consisted of 20 + 20 + 1 = 41 parameters. Furthermore, there were 20 × 20 unique values of λ, thus λ is a 20 × 20 matrix, totaling 41 + 20 × 20 = 441 parameters. Actual species with J = K = 36, thus (0, DBH, σ2) consisted of 36 + 36 + 1 = 73 parameters. Furthermore, there were 36 × 36 unique values of λ, thus λ is a 36 × 36 matrix, totaling 73 + 36 × 36 = 1369 parameters. For each of these three models, all (0, DBH, λ, σ2) parameters were estimated via Bayesian linear regression. We favored a Bayesian approach since it allowed us to incorporate prior information about all the parameters in Eq (1) [21]. This in turn served us when particular species grouping pairs were rare, leading to posterior distributions that are more weighted towards the prior distribution than the likelihood [21]. While the linear regression model in Eq (1) is much less complex than the model formulations considered by [7], both the posterior distribution and posterior predictive distribution of all parameters have analytic and closed-form solutions. Thus, we are saved from the computational expense of using methods to approximate all posterior distributions such as Markov chain Monte Carlo [22, 23]. These savings in computational expense were important given the large number of times we fit the model in Eq (1), as we outline in the upcoming sections on the permutation test and spatial cross-validation scheme we used. We present a brief summary of the closed form solutions to Bayesian linear regression here, leaving fuller detail in S1 Appendix. For simplicity of notation, let represent the parameters 0, DBH, λ. The likelihood function p(|, σ2) of our observed growths resulting from Eq (1) is Multivariate Normal (X , σ2 I) where I is the n × n identity matrix. Bayesian linear regression exploits the fact that the Multivariate Normal-inverse-Gamma distribution is a conjugate prior of the Multivariate Normal distribution. So given our Multivariate Normal likelihood p(|, σ2), by assuming that the joint prior distribution π(, σ2) of , is NIG(0, V0, a0, b0) with the following hyperparameters: 1) a mean vector 0 for , 2) a shape matrix V0 for , 3) shape a0 > 0 for σ2, and scale b0 > 0 for σ2, the joint posterior distribution π(, σ2|) is also NIG(*, V*, a*, b*) with: It can be shown that the marginal prior distribution π(σ) is Inverse-Gamma(a0, b0) while π() is Multivariate-t with location vector 0, shape matrix , and degrees of freedom ν0 = 2a0. Given the aforementioned prior conjugacy, the marginal posterior distributions π(σ|) is also Inverse-Gamma while π(|) is also Multivariate-t, both with updated *, V*, a*, b* hyperparameter values in place of 0, V0, a0, b0. Furthermore, it can also be shown that the posterior predictive distribution for a model matrix corresponding to a new set of observed covariates is Multivariate-t with location vector , shape matrix , and degrees of freedom ν* = 2a*. The means of the posterior predictive distributions will be used to obtain fitted/predicted values .

Permutation test

Recall from the previous section our model assumes species grouping-specific effects of competition, whereby for a given focal species group j, all competitor species groups exert different competitive effects. We evaluated the validity of this hypothesis with the following hypothesis test: where the null hypothesis H0 reflects a hypothesis of no species grouping-specific effects of competition while the alternative hypothesis H reflects a hypothesis of species grouping-specific effects of competition of our three models. We could therefore answer question (1) from the introduction of whether the species identity of neighboring competitor trees matters using a permutation test (also called an “exact test”). We generated the null distribution of a test statistic of interest by randomly permuting the competitor species group labels k for all trees of focal species group j for a large number of iterations. Such permutations of the competitor species group labels (while holding all other variables constant) were permissible under the assumed null hypothesis above. After computing the test statistic for each iteration, we then compared this null distribution to the observed value of the test statistic to obtain measures of statistical significance. Given the large number of permutations and the corresponding large number of model fits this required, having the computationally inexpensive parameter estimates discussed above was all the more important. Our test statistic was a commonly-used and relatively simple measure of a model’s predictive accuracy: the root mean-squared error (RMSE) between all observed values and all fitted/predicted values . Specifically, we compared all observed growths y with their corresponding fitted/predicted growths obtained from the posterior predictive distributions:

Spatial cross-validation

Among the most common methods for estimating out-of-sample predictive error is cross-validation, whereby independent “training” and “test” sets are created by resampling from the original sample of data. The model is first fit to the training set and then the model’s predictive performance is evaluated on the test data [15]. However given the spatial structure of forest census data, there most likely exists spatial-auto-correlation between the individual trees and thus using individual trees as the resampling unit would violate the independence assumption inherent to cross-validation. One must instead resample spatial “blocks” of trees when creating training and test data, thereby preserving within block spatial-auto-correlations. Roberts et al. [24] demonstrated that ignoring such spatial structure can lead to model error estimates that are overly optimistic and thus betray the true performance of any model’s predictive ability on new out-of-sample data. To study the magnitude of this optimism, we report two sets or RMSE’s: one where both the model was fit and the RMSE evaluated on the same entire dataset and another where cross-validation was performed. On top of the large number of permutations, given the large number of iterations of model fits cross-validation required, the importance of computationally inexpensive parameter estimates discussed earlier was again critical. In Fig 1 we display the spatial distribution of the 27,192 trees in the Big Woods study region and illustrate one iteration of the spatial cross-validation algorithm. We superimposed a 100 × 100 meter grid onto the study region and then assigned each of the 27,192 trees to one of 23 arbitrarily numbered spatial blocks. In this particular iteration of the algorithm, we first fit our model to all trees in the 14 unnumbered “training” blocks. We then applied the fitted model to all trees in the “test” block labeled 10 to obtain predicted growths , which we then compared to observed growths to obtain the estimated RMSE for this block. The remaining 8 labeled blocks 1, 2, 3, 9, 11, 13, 14, and 15 acted as “buffer” blocks isolating the test block from the training blocks, thereby ensuring that they are spatially independent. We iterated through this process with all 23 blocks acting as the test block once and then averaged the resulting 23 estimated values of the RMSE to obtain a single estimated RMSE of the fitted model ’s predictive error.

Fig 1

Big Woods study region with one iteration of spatial cross-validation algorithm displayed.

Big Woods study region with one iteration of spatial cross-validation algorithm displayed.

For this particular iteration of the cross-validation algorithm, we train our models on all 14 unnumbered blocks of trees. We then apply the fitted model to make predictions on block 10. Blocks 1, 2, 3, 9, 11, 13, 14, and 15 act as “buffer” blocks isolating the test block from the training blocks. While other implementations of cross-validation exist, in particular implementations that incorporate more principled approaches to determining grid sizes [25], we favored the above blocked approach for its ease of implementation and understanding. We wrote and fit the model in R and used the tidyverse and ggrdiges packages [20, 26, 27].

Results

Permutation test and cross-validation results

We used a permutation test to evaluate whether the identity of the competitor matters for competitive interactions, and used spatial cross-validation to evaluate which of the three identity groupings (trait, family, or species) yielded the best model. For all three groupings the identity of the competitor did matter; the RMSE with actual competitor identity was less than the RSME when the competitor identity was permuted (Fig 2 compare the dotted horizontal lines to the histograms). To address question (2) from the introduction, which species-grouping does the best job of describing competition, we compare subpanels in Fig 2. The trait-grouping model performed much worse than the two phylogenetic groups models. Without spatial cross-validation the species-level grouping greatly out performs the other two models, but with cross-validation the species and family models perform just as well, suggestive of over-fitting of the species-level model. We will use the family-level model for the remainder of the paper, as it performs just as well as the species-level model but with many fewer parameters.

Fig 2

Comparison of the RMSE for all three species-groupings models.

We calculated the RMSE with and without spatial cross-validation. The dotted lines indicate the RMSE value when the competitor species’ identities were not permuted. The histograms indicate the distribution of the RMSE values resulting from 99 permutations of competitor species’ identities.

Comparison of the RMSE for all three species-groupings models.

Posterior distributions of parameters of best model

We found that the family-level model performed nearly as well as the species-level one (compare the dashed yellow lines in panels 2 and 3 of Fig 2). It did so with many fewer parameters, so here we will report the posterior distributions of relevant (0, DBH, λ) parameters for the family-level model. In Fig 3, we plot the posterior distribution of all β0, baselines for all families, j. These values range from 0.05 to 0.4 cm y-1. We generally have better estimates of parameters for families with larger sample sizes. In Fig 4, we plot the posterior distribution of all βDBH,; this is the effect of tree DBH on growth rate. For most families these values are between 0 and 0.05. These positive values indicate that larger trees grow faster. For the few families with estimates of negative βDBH,, smaller trees grow faster.

Fig 3

Posterior distribution of β0, for the family-level model.

These distributions display the estimated baseline (diameter-independent) growth (cm y-1) for each family.

Fig 4

Posterior distribution of βDBH, for the family-level model.

Posterior distribution of β0, for the family-level model.

These distributions display the estimated baseline (diameter-independent) growth (cm y-1) for each family.

Posterior distribution of βDBH, for the family-level model.

These distributions display the estimated change in annual growth (cm y-1) per cm of DBH for each family. Positive values indicate that larger individuals grow faster, while negative values indicate that larger individuals growth slower. In Fig 5, we plot the posterior distribution of all λ relating to inter-family competition. For clarity this figure shows the λ values for the eight families with at least 200 individuals in the plot (for the full 20-by-20 λ matrix see S1 Fig). Fig 5 shows differences in family-level effect and response to competition. Some families, such as Fagaceae and Juglandaceae, have a strong negative effect on the growth of all other families. In other words, for focal trees of nearly all families having many Fagaceae and Juglandaceae (oaks and hickories) neighbors is associated with slower growth. Other families, such as Rosaceace, have a positive effect on the growth of most other families. In other words, for focal trees of nearly all families having many Rosaceace neighbors is associated with faster growth. Generally there is a strong negative intra-family effect even for families which have little negative or even a positive effect on other families, for example Ulmaceae and Sapindaceae. In other words, most individuals tend to grow slower when they have more neighbors of the same family.

Fig 5

Posterior distribution of λ family-specific competition coefficients.

Read across rows for that family’s competitive effect on other families and down columns for that family’s response to competition from other families. Positive values of λ indicate that trees of the focal group tend to grow faster if they have more neighbors of that competitor group, while negative values of λ indicate that trees of the focal group then to grow slower. For example, almost all groups tend to have slower growth in the presence of more Fagaceae neighbors, but tend to have faster growth in the presence of more Ulmaceae neighbors. Here we display just the 8 families for which there are at least 200 individuals in the plot. For a high resolution version of this image for the full 20-by-20 lambda matrix see S1 Fig. A full list of all species and families in the plot can be found in S2 Appendix. See S2 Fig for a phylogeny of these families and S3 Fig for counts of focal and competitor family pairs.

Posterior distribution of λ family-specific competition coefficients.

Spatial patterns in residuals

We calculated the residuals for all individuals based on the family-level model and plotted them spatially across the plot (Fig 6). There is a clear spatial pattern to these residuals, with spatial patches of trees growing faster than predicted by the model, for example around (0, 150). This is a relatively wet portion of the plot, so soil moisture may be an important factor not considered in the model. In other areas the trees are growing slower than predicted by the model, for example around (450, 50). This suggests that some spatially correlated factor beyond tree species, diameter, or competitors is important to determining tree growth.

Fig 6

Spatial pattern of residuals for the family-level model.

This shows the actual growth y of each tree minus its predicted growth . Blue individuals grew faster than expected by the model and red individuals grew slower. The residuals show clear spatial patterning.

Spatial pattern of residuals for the family-level model.

Discussion

Here we tested whether species identity matters for competitive interactions among forest trees. We found a strong signal that identity matters for both the effect focal trees feel from competition and the effect competitors exerts on a focal tree. Models which included competitor identity outperformed those with randomized identity. This importance of competitor identity has been observed before [3, 8, 13]. Surprisingly, we found that the model which predicted the effect of competition best was at the family-level rather than species or trait grouping-level. Previous work has found that species traits perform well at predicting competitive ability [12, 13]. We suspect that this family-level grouping for competition is not a general phenomenon, but a consequence of the particular community in this forest. The forest is undergoing rapid successional change as the oak-hickory overstory is being replaced by more mesic, shade-tolerant species [16, 28]. Oaks and hickories here fill a specific functional role, and we suspect the family-level grouping fit best as a result since it groups many species into these two families. Here we used a Bayesian framework to measure the full λ matrix of family-level competitive interactions. This matrix showed clear family-level differences in both the effect and response to competition. Interestingly, some families had a positive effect on the growth of neighbors, such as Ulmaceae. This is probably not a direct positive effect, but a reflection of the spatial pattern of Ulmaceae individuals. These individuals could inhabit more productive soils, thus nearby individuals grow faster. This illustrates a limitation of neighborhood-based methods for measuring competition [4]. The best way to address this and truly measure competitive interactions would be with a manipulative experiment. This would be largely infeasible for competing forest trees. However, one way to potentially address this issue still in an observational, neighborhood competitor framework would be to control for soil or other covariates potentially important to growth in the model. That way those spatially varying covariates could be included and thus the signal from competition could potentially be more clear. One clear pattern seen is that oaks and hickories are particularly strong competitors. Their neighbors had consistently lower growth rates. This is interesting in light of the successional change going on in the forest. Allen et al. [28] show that canopy oaks are rapidly being replaced by red maples and black cherries. Even though we found here that oaks had very strong negative effect on neighbors’ growth, that is not enough for them to maintain canopy dominance [28]. We fit Bayesian linear regression models of the growth of a focal tree as a function of its species, its size, and the basal area of its competitors. While more complicated models of growth exist than the one proposed in Eq (1), this particular model has the benefit of having closed-form analytic solutions and thus we can avoid computationally expensive methods for posterior estimation such as Markov chain Monte Carlo. This is of particular importance given the large number of times we fit models when performing our permutation test as well as our spatial cross-validation algorithm. We highlight the importance of cross-validation. It initially appeared in our non cross-validated error estimates in Fig 2 that the species-level model out performed the family-level model. However, when using cross-validation to generate our estimates of model error, both models roughly performed the same. This was due to the over-fitting induced by the large number of parameters of the more complex species-level model. Furthermore, had we not incorporated the inherent spatial structure of our data to our cross-validation algorithm, our model error estimates would have been overly-optimistic. This is a point that been demonstrated in other ecological settings [24, 29, 30]. Here we provide a flexible method to estimate species-specific competitive effects between forest trees. This method could be used for other ForestGEO plots with two or more censuses or with USFS FIA data. Comparisons across forest plots would be particularly powerful to assess whether species-specific interactions are general or site-specific. In the future we hope to produce an R package that includes functions to perform the method presented here.

Posterior distribution of λ values, family-specific competition coefficients.

Read across rows for that family’s competitive effect on other families and down columns for how a family responses to the competition of other families. (PDF) Click here for additional data file.

Phylogenetic relationship of families.

The phylogenetic relationship of families pulled from the Open Tree of Life [31] using the R package rotl [32]. A) The phylogeny for the most common families, this corresponds to the families shown in Fig 5. B) The phylogeny for all families in the plot, this corresponds to families shown in S1 Fig. (PDF) Click here for additional data file.

Counts of focal and competitor family pairs.

The total width of each horizontal bar represents the total number of neighbors (or competitors) of focal trees of a particular family in the study region. Within each bar, the width of each color represents the total number of competitor trees of a particular family within a neighborhood of 7.5 meters of trees of the focal family. For clarity this figure shows counts for the eight families with at least 200 individuals in the plot. This figure provides sample sizes for Fig 5. (PDF) Click here for additional data file.

Closed-form solutions for Bayesian linear regression.

(PDF) Click here for additional data file.

Species list.

(PDF) Click here for additional data file. 9 Dec 2019 PONE-D-19-24831 A permutation test and spatial cross-validation approach to assess models of interspecies competition between trees PLOS ONE Dear Dr Allen, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. We would appreciate receiving your revised manuscript by Jan 23 2020 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols Please include the following items when submitting your revised manuscript: A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. We look forward to receiving your revised manuscript. Kind regards, Jian Yang Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1) Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at http://www.journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and http://www.journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf 2) Please upload a copy of Figure 7, to which you refer in your text on page 9. If the figure is no longer to be included as part of the submission please remove all reference to it within the text. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: No Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: In community ecology, one of the most challenging questions in is to clarify the mechanisms of community assembly. Two theories have been widely implemented to explain species coexistence, namely niche theory and neutral theory. The most common modern metaphor of niche theory describes sub pool of species passing through environmental filters and biotic interactions (mainly competition) and then form local communities. So measuring species-specific competitive interactions is key to understanding how communities are assembled. The interspecific competitive interactions in regulating natural plant communities has been investigated in a multitude of studies that have used different methods for answering the question. Well, most studies related to species competition are manipulated experiments where density and/or the proportion of different species are varied and the biomass or fecundity of the competing species are measured. Such competition experiments are often conducted in artificial environmental conditions with a limited number of individuals in small plots. However, there has been an increasing awareness that the interspecific interactions critically depend on the abiotic and biotic setting, it is now more common to conduct ecological manipulation experiments in natural plant communities, where the density of either the neighbors (removal experiments) or the target species has been manipulated. This manuscript provides a method to measure interspecies competition between trees using repeat censured forest inventory plots’ data. Overall, I think this study is easy to read and is also timely and important within the field. However the manuscript still needs reliable English editing, especially English tense. Additionally, there are some specific suggestions as following: Comment 1: Line 26: “take” or “taken”? Comment 2: Line 38: “find” or “found”? Comment 3: The English tense problems existed in the Materials and methods section. Please confirm to keep the tenses consistent. Comment 4: Please describe the meanings or definitions of β0,j , βdbh,j and λj,k when present the Neighborhood-effect growth model. Comment 5: From the legend of Figure 3, I got that the β0,j represented the estimated baseline growth per year. Furthermore, I also got that the βdbh,j represented the estimated increase in annual growth (cm) per DBH (cm) from the legend of Figure 4. In my opinion, it would be better to change the unit in Figure 3 to be (cm/year or cm·y-1) . And what about the unit for βdbh,j ? Comment 6: Indeed, the Figure 3 and Figure 4 represented the tree growth with positive values. Whether the negative values represented negative growth due to the trees’ mortality induced by competition, disturbance (e.g. insect, wind) or senescence? Further, how to distinguish these reasons induced tree mortality? Comment 7: The Figure 5 displayed the inter-family and intra-family competitive coefficients (λ). Could the authors provide the detailed species and family information in the supporting information? Comment 8: Whether the authors could presented the phylogenetic relationships among the species occurred in the field with phylogenetic tree. If the phylogenetic tree could be provided, the authors could distinguish the families with different colors. It would be useful for authors to understand the results shown in Figure 5 from the aspect of phylogeny Comment 9: line 259-260 The result said that “There is clear spatial patterning to these residuals, with clusters of trees growing faster than the model predicts and other clusters growing slower”. I failed to understand this result shown in Figure 6. How the result shown that the clusters of trees growing faster than the model predicts and other clusters growing slower? Furthermore, what the other clusters represented for? Comment 10: The figure 7 was missed in the manuscript. Comment 11: Line 268: “found” or “find”? Comment 12: Change “illustrations” into “illustrates”. Comment 13: In the discussion section, the authors stated a limitation of neighborhood-based methods for measuring competition. So how to tackle this limitation? Reviewer #2: I enjoyed reviewing this manuscript. It is well written, concise and deals with an important topic in forest sciences. Please find attached file for especific comments, questions, and suggested edits, which I ask authors to address. ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step. Submitted filename: PONE-D-19-24831_reviewer.pdf Click here for additional data file. 6 Jan 2020 Dear PLOS One Editor: Thank you for the opportunity to revise and resubmit our manuscript (PONE- D-19-24831). We also thank the reviewers for their thoughtful comments. We have addressed these comments and think that the manuscript is much stronger as a result. • The data used in this manuscript are now available on the University of Michigan data repository Deep Blue Data. They have a DOI which is linked in the article. https://doi.org/10.7302/wx55-kt18 • We have addressed the grammar and wording suggests of the reviewers. This includes making sure that the manuscript is written in the past tense. • There was no Figure 7, this was a mistake in the latex code which re- ferred to a figure that didn’t exist. We have removed this reference to the nonexistent figure. • As suggested by reviewer 1, we have added a species list (Appendix 2) and phylogenetic tree of the families in the plot (Figure S2). • We have explicitly defined the parameters — β0,j , βDBH,j , λj,k — when we introduce them in the methods section. We made interpretation of them clearer by explaining the fit values in figure legends and the results section. We clearly stated the unit of each parameter (e.g, β0,j is in cm y-1). For λj,k we discussed what it means for this parameter to be positive versus negative. • As suggested by both reviewers, we make more clear what we mean by the spatial pattern of residuals in Figure 6. • As suggested by reviewer 1, we have added some discussion on how to overcome the limitation of neighborhood-based approaches for measuring competition. • Reviewer 2 suggested that we remove the last two paragraphs from the introduction. The second to last paragraph summarized our methods and the last paragraph summarized our results. We feel that this summary of methods and results in the introduction helps the reader understand the manuscript as a whole. By giving a summary of the methods and results the reader is better able to understand as they go through the manuscript. We prefer including this end-of-introduction summary, but if the reviewers and editor feels strongly that it should be removed we will. • Reviewer 2 wanted a link to Allen et al. (2019) (line 80 of the revised manuscript). That article is currently in revisions. We think that it will December 26, 2019 be published by the time this current manuscript is published, so can be updated with final citation. If not we will provide some other access to the manuscript draft. We are happy to share a copy with the reviewer before that if requested. • We have provided justification for our use of 7.5 m as the distance for the competitive neighborhood. See lines 115–116 of the revised manuscript. • We have included the sample sizes in Figures 3 and 4. Furthermore, we have included an additional figure of the sample sizes of the focal and competitor families pairs (Figure S3); this will facilitate the interpretation of Figure 5. Thanks again for the opportunity to resubmit this manuscript. If you have any questions, do not hesitate to ask me. Submitted filename: Response to Reviewers.pdf Click here for additional data file. 19 Feb 2020 A permutation test and spatial cross-validation approach to assess models of interspecific competition between trees PONE-D-19-24831R1 Dear Dr. Allen, We are pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it complies with all outstanding technical requirements. Within one week, you will receive an e-mail containing information on the amendments required prior to publication. When all required modifications have been addressed, you will receive a formal acceptance letter and your manuscript will proceed to our production department and be scheduled for publication. Shortly after the formal acceptance letter is sent, an invoice for payment will follow. To ensure an efficient production and billing process, please log into Editorial Manager at https://www.editorialmanager.com/pone/, click the "Update My Information" link at the top of the page, and update your user information. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, you must inform our press team as soon as possible and no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. With kind regards, Jian Yang Academic Editor PLOS ONE Additional Editor Comments: L114: Missing "of" between the word "amount" and "the". 24 Feb 2020 PONE-D-19-24831R1 A permutation test and spatial cross-validation approach to assess models of interspecific competition between trees Dear Dr. Allen: I am pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. For any other questions or concerns, please email plosone@plos.org. Thank you for submitting your work to PLOS ONE. With kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Jian Yang Academic Editor PLOS ONE

7 in total

1. Trait similarity, shared ancestry and the structure of neighbourhood interactions in a subtropical wet forest: implications for community assembly.

Authors: María Uriarte; Nathan G Swenson; Robin L Chazdon; Liza S Comita; W John Kress; David Erickson; Jimena Forero-Montaña; Jess K Zimmerman; Jill Thompson
Journal: Ecol Lett Date: 2010-11-05 Impact factor: 9.492

2. Synthesis of phylogeny and taxonomy into a comprehensive tree of life.

Authors: Cody E Hinchliff; Stephen A Smith; James F Allman; J Gordon Burleigh; Ruchi Chaudhary; Lyndon M Coghill; Keith A Crandall; Jiabin Deng; Bryan T Drew; Romina Gazis; Karl Gude; David S Hibbett; Laura A Katz; H Dail Laughinghouse; Emily Jane McTavish; Peter E Midford; Christopher L Owen; Richard H Ree; Jonathan A Rees; Douglas E Soltis; Tiffani Williams; Karen A Cranston
Journal: Proc Natl Acad Sci U S A Date: 2015-09-18 Impact factor: 11.205

3. Plant functional traits have globally consistent effects on competition.

Authors: Georges Kunstler; Daniel Falster; David A Coomes; Francis Hui; Robert M Kooyman; Daniel C Laughlin; Lourens Poorter; Mark Vanderwel; Ghislain Vieilledent; S Joseph Wright; Masahiro Aiba; Christopher Baraloto; John Caspersen; J Hans C Cornelissen; Sylvie Gourlet-Fleury; Marc Hanewinkel; Bruno Herault; Jens Kattge; Hiroko Kurokawa; Yusuke Onoda; Josep Peñuelas; Hendrik Poorter; Maria Uriarte; Sarah Richardson; Paloma Ruiz-Benito; I-Fang Sun; Göran Ståhl; Nathan G Swenson; Jill Thompson; Bertil Westerlund; Christian Wirth; Miguel A Zavala; Hongcheng Zeng; Jess K Zimmerman; Niklaus E Zimmermann; Mark Westoby
Journal: Nature Date: 2015-12-23 Impact factor: 49.962

4. Scale and strength oak-mesophyte interactions in a transitional oak-hickory forest.

Authors: David Allen; Christopher W Dick; Ethan Strayer; Ivette Perfecto; John Vandermeer
Journal: Can J For Res Date: 2018-08-29 Impact factor: 1.991

5. Analysis of neighborhood dynamics of forest ecosystems using likelihood methods and modeling.

Authors: Charles D Canham; María Uriarte
Journal: Ecol Appl Date: 2006-02 Impact factor: 4.657

6. Neighborhood analyses of canopy tree competition along environmental gradients in New England forests.

Authors: Charles D Canham; Michael J Papaik; María Uriarte; William H McWilliams; Jennifer C Jenkins; Mark J Twery
Journal: Ecol Appl Date: 2006-04 Impact factor: 4.657

Review 7. CTFS-ForestGEO: a worldwide network monitoring forests in an era of global change.

Authors: Kristina J Anderson-Teixeira; Stuart J Davies; Amy C Bennett; Erika B Gonzalez-Akre; Helene C Muller-Landau; S Joseph Wright; Kamariah Abu Salim; Angélica M Almeyda Zambrano; Alfonso Alonso; Jennifer L Baltzer; Yves Basset; Norman A Bourg; Eben N Broadbent; Warren Y Brockelman; Sarayudh Bunyavejchewin; David F R P Burslem; Nathalie Butt; Min Cao; Dairon Cardenas; George B Chuyong; Keith Clay; Susan Cordell; Handanakere S Dattaraja; Xiaobao Deng; Matteo Detto; Xiaojun Du; Alvaro Duque; David L Erikson; Corneille E N Ewango; Gunter A Fischer; Christine Fletcher; Robin B Foster; Christian P Giardina; Gregory S Gilbert; Nimal Gunatilleke; Savitri Gunatilleke; Zhanqing Hao; William W Hargrove; Terese B Hart; Billy C H Hau; Fangliang He; Forrest M Hoffman; Robert W Howe; Stephen P Hubbell; Faith M Inman-Narahari; Patrick A Jansen; Mingxi Jiang; Daniel J Johnson; Mamoru Kanzaki; Abdul Rahman Kassim; David Kenfack; Staline Kibet; Margaret F Kinnaird; Lisa Korte; Kamil Kral; Jitendra Kumar; Andrew J Larson; Yide Li; Xiankun Li; Shirong Liu; Shawn K Y Lum; James A Lutz; Keping Ma; Damian M Maddalena; Jean-Remy Makana; Yadvinder Malhi; Toby Marthews; Rafizah Mat Serudin; Sean M McMahon; William J McShea; Hervé R Memiaghe; Xiangcheng Mi; Takashi Mizuno; Michael Morecroft; Jonathan A Myers; Vojtech Novotny; Alexandre A de Oliveira; Perry S Ong; David A Orwig; Rebecca Ostertag; Jan den Ouden; Geoffrey G Parker; Richard P Phillips; Lawren Sack; Moses N Sainge; Weiguo Sang; Kriangsak Sri-Ngernyuang; Raman Sukumar; I-Fang Sun; Witchaphart Sungpalee; Hebbalalu Sathyanarayana Suresh; Sylvester Tan; Sean C Thomas; Duncan W Thomas; Jill Thompson; Benjamin L Turner; Maria Uriarte; Renato Valencia; Marta I Vallejo; Alberto Vicentini; Tomáš Vrška; Xihua Wang; Xugao Wang; George Weiblen; Amy Wolf; Han Xu; Sandra Yap; Jess Zimmerman
Journal: Glob Chang Biol Date: 2014-09-25 Impact factor: 10.863

7 in total