| Literature DB >> 22132077 |
James A Fordyce1, Zachariah Gompert, Matthew L Forister, Chris C Nice.
Abstract
Many ecological studies use the analysis of count data to arrive at biologically meaningful inferences. Here, we introduce a hierarchical bayesian approach to count data. This approach has the advantage over traditional approaches in that it directly estimates the parameters of interest at both the individual-level and population-level, appropriately models uncertainty, and allows for comparisons among models, including those that exceed the complexity of many traditional approaches, such as ANOVA or non-parametric analogs. As an example, we apply this method to oviposition preference data for butterflies in the genus Lycaeides. Using this method, we estimate the parameters that describe preference for each population, compare the preference hierarchies among populations, and explore various models that group populations that share the same preference hierarchy.Entities:
Mesh:
Year: 2011 PMID: 22132077 PMCID: PMC3221656 DOI: 10.1371/journal.pone.0026785
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Schematic of hierarchical Bayesian model for count data.
Individual count data inform the parameters for each individual's multinomial parameters. The multinomial parameters for all individuals inform the population level preference modeled as a Dirichlet. This population-level preference is shown as a ternary diagram (a triangle plot). The population-level preference, in turn, informs the most likely individual multinomial parameters given the population preference. Thus, at each MCMC step information is passed from the individual preferences to the population preference, and vice versa. Note that the 's and are not fixed for the analysis. The role of the hyperpriors on w and q are not depicted in the figure.
Figure 2Simulations examining performance.
A) Performance of conventional methods for analyzing count/preference data. Red hatched line indicates the 0.05 quantile of p-values for 1000 data sets simulated under the null model of no preference. Numbers are the p-values of 0.05 quantile. Methods examined were the (i) Friedman test, (ii) the Quade test, and (iii) ANOVA on arcsin square root transformed proportions. B) Distribution of DIC values for models with equal preference versus models with different preferences for each item. Red hatched line indicates the 0.95 quantile of DIC values.
Figure 3Host plant preferences for focal populations.
Colored curves indicate posterior density for population-level preference for four host plant species. Posterior densities estimated from 40000 MCMC steps following a burnin of 10000 generations.
Populations and DIC values for constrained and unconstrained preference.
| Population | Natal host plant | N | Constrained DIC | Unconstrained DIC |
| Carson Pass, CA |
| 12 | −240.85 | −5162.92 |
| Mt. Rose, NV |
| 13 | −218.15 | −929.68 |
| Gardnerville, NV |
| 15 | 67.71 | 63.03 |
| Verdi, NV |
| 14 | 97.06 | 79.01 |
| Leek Springs, CA |
| 8 | 107.74 | 93.15 |
| Trap Creek, CA |
| 14 | 6.65 | −17.59 |
| Yuba Gap, CA |
| 13 | 101.83 | 87.41 |
Constrained refers to models where preference for each plant is equal. Unconstrained refers to models where preferences are permitted to vary among host plants. N is the number of replicates for each population.
Figure 4Population and individual preferences.
Population-level preferences (solid lines) and individual-level preferences (dotted lines) for each of the four host plants. Colors for each plant as in figure 3. Populations presented are A) Carson Pass and B) Gardnerville. Posterior densities estimated from 40000 MCMC steps following a burnin of 10000 generations.
DIC comparisons among grouping schemes for host plant preferences.
| Grouping | DIC |
| (CP,MR,GV,VE,LS,TC,YG) | 137.61 |
| (CP)(MR)(GV)(VE)(LS)(TC)(YG) | −3382.07 |
| (CP,MR)(GV,VE)(LS)(TC)(YG) | −5416.32 |
| (CP,MR)(GV)(VE)(LS)(TC)(YG) | −5530.88 |
| (CP,MR)(GV,VE)(TC)(LS,YG) | −5597.75 |
| (CP,MR)(GV,VE)(LS,TC,YG) | −6085.99 |
Parenthetical groups constrained to have same preference parameters in the model. DIC values based on 40000 MCMC steps following a burnin of 10000 generations. Abbreviations are as follows: CP, Carson Pass; GV, Gardnerville; LS, Leek Springs; MR, Mt. Rose; TC, Trap Creek; VE, Verdi; YG, Yuba Gap.
Non-focal population summary of preference for Astragalus and natal host plant, and DIC scores for constrained and non-constrained models.
| Population | Test plants | N | Preference: | Preference: Natal | Constrained DIC | Unconstrained DIC |
| Big Pine, CA |
| 11 | 0.57 (0.37, 0.73) | 0.17 (0.07, 0.32) | −4.04 | −11.60 |
| Cave Lake, CA |
| 6 | 0.62 (0.44, 0.78) | 5.34 | −44.36 | |
| Eagle Peak, CA |
| 10 | 0.72 (0.56, 0.87) | −22.31 | −100.63 | |
| White Mts., CA |
| 15 | 0.64 (0.44, 0.78) | −13.06 | −53.16 |
Host plant abbreviations are as follows: A.l., Astragalus letiginosus; A.w., A. whitneyi; G.l., Glycyrrhiza lepidota; M.s., Medicago sativa; V.a.,Vicia americana. Natal plant for Cave Lake population is not definitively known, however, it is not A. whitneyi and is most likely L. polyphyllus. Constrained model is one where preference for all plants is equal, whereas the unconstrained is one where preference is permitted to vary across host plants. DIC scores based on 40000 MCMC generations following a 10000 generation burnin.