| Literature DB >> 24834325 |
Paul W Goedhart1, Hilko van der Voet1, Ferdinando Baldacchino2, Salvatore Arpaia2.
Abstract
Genetic modification of plants may result in unintended effects causing potentially adverse effects on the environment. A comparative safety assessment is therefore required by authorities, such as the European Food Safety Authority, in which the genetically modified plant is compared with its conventional counterpart. Part of the environmental risk assessment is a comparative field experiment in which the effect on non-target organisms is compared. Statistical analysis of such trials come in two flavors: difference testing and equivalence testing. It is important to know the statistical properties of these, for example, the power to detect environmental change of a given magnitude, before the start of an experiment. Such prospective power analysis can best be studied by means of a statistical simulation model. This paper describes a general framework for simulating data typically encountered in environmental risk assessment of genetically modified plants. The simulation model, available as Supplementary Material, can be used to generate count data having different statistical distributions possibly with excess-zeros. In addition the model employs completely randomized or randomized block experiments, can be used to simulate single or multiple trials across environments, enables genotype by environment interaction by adding random variety effects, and finally includes repeated measures in time following a constant, linear or quadratic pattern in time possibly with some form of autocorrelation. The model also allows to add a set of reference varieties to the GM plants and its comparator to assess the natural variation which can then be used to set limits of concern for equivalence testing. The different count distributions are described in some detail and some examples of how to use the simulation model to study various aspects, including a prospective power analysis, are provided.Entities:
Keywords: Difference testing; environmental risk assessment; equivalence testing; field trials; simulation model; statistical distributions; statistical power
Year: 2014 PMID: 24834325 PMCID: PMC4020688 DOI: 10.1002/ece3.1019
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Main characteristics of fields experiment 1992–2005 using GM crops.
| Authors and Journal | Functional group | Crop | Measurement endpoint | Dimensions | Experimental design | Statistical method |
|---|---|---|---|---|---|---|
| Johnson & Gould ( | Parasitoids | Tobacco | Parasitism rate | 9 replications, 2 years | Randomized blocks | Chi-square |
| Johnson ( | Parasitoids | Tobacco | Parasitism rate | 3 years, 15 sites | Randomized blocks | ANOVA |
| Mascarenhas and Luttrell ( | Parasitoids | Cotton | Host survival | 4 replications | Completely randomized | ANOVA |
| Orr & Landis ( | Parasitoids,Predators | Maize | Egg fate, Parasitism rate, Visual counts | 3 replications (50 plants), 3 sampling dates | Completely randomized | ANOVA |
| Pilcher et al. ( | Predators | Maize | Abundance, Visual counts | 2 years, 3 replications (6 plants in each), 3 sampling dates | Randomized blocks | ANOVA |
| Riddick et al. ( | Predators | Potato | Abundance, Visual counts, Sweep nets, Pitfall traps | 2 years, 3 sites | Completely randomized | ANOVA |
| Buckelew et al. ( | Predators | Soybean | Abundance, Sweep nets | 2 sites, 2 years, weekly samplings | Randomized blocks | ANOVA |
| Al–Deeb et al. ( | Predators | Maize | Abundance, Visual counts | 40 plants, 2 locations | Completely randomized | ANOVA mixed model |
| Reed et al. ( | Predators | Potato | Abundance, Visual counts | 2 years, 6 replications | Latin square | ANOVA |
| Wold et al. ( | Predators | Maize | Abundance, Visual counts | 2 years, 4 replications, 6 sampling dates | Completely randomized | ANOVA |
| Bourguet et al. ( | Predators, Parasitoids | Maize | Abundance,Parasitization | 2 sites, 4 replications, weekly samplings | Split-plot | ANOVA |
| Manachini & Lozzia ( | Soil organisms | Maize | Abundance, Diversity | 2 separate fields, 8 locations50 soil samples | n.a. | ANOVA |
| Al–Deeb and Wilde ( | Predators | Maize | Abundance, Visual counts, Pitfall traps | 2 years, 8 locations | Completely randomized | ANOVA mixed model |
| Jasinski et al. ( | Predators | Soybean,Maize | Abundance, Sweep nets, Sticky traps, Soil samples | 24 commercial fields | n.a. | ANOVA |
| Men et al. ( | Herbivores,Predators,Parasitoids | Cotton | Abundance, Sweep nets, Visual counts | 3 years, 3 replications, 5 sampling dates | Completely randomized | ANOVA, Diversity indices |
| Musser & Shelton ( | Predators | Maize | Abundance, Egg predation | 2 years 2–10 plants/replication | Randomized block | ANOVA |
| Volkmar et al. ( | Predators | Sugar beet | Abundance, Pitfall Traps | 4 replications | Randomised block | ANOVA |
| Wu & Guo ( | Predators | Cotton | Abundance, Visual counts | 3 replications | Completely randomized | ANOVA |
| Candolfi et al. ( | Predators,HerbivoresSoil org., | Maize | Abundance, Pitfall traps, Yellow traps | 3 replications, field size | Completely randomized | Principal response curves, Diversity indices |
| Duan et al. ( | Predators | Potato | Abundance, Pitfall traps | 2 years, 6 replications | Latin square | ANOVA |
| Manachini et al. ( | Soil organisms | Canola | Extraction from soil | 3 replications | Completely randomized | Multivariate |
| Wade French et al. ( | Predators | Maize | Abundance, Pitfall traps | 2 years, commercial fields | n.a. | Canonical correspondence |
| Wei-Di et al. ( | Herbivores, Predators,Parasitoids | Cotton | Abundance, Diversity | 2 years, 3 replications | Completely randomized | ANOVA, Diversity indices |
| Bhatti et al. ( | Predators,Detritivores,Soil herbivore | Maize | Abundance | 3 years | Split-plot | ANOVA mixed model |
| Bhatti et al. ( | Predators,Herbivores,Parasitoids | Maize | Abundance | 3 years two-weekly samplings | Split-plot | ANOVA mixed model |
| Daly & Buntin ( | Predators,Herbivores | Maize | Abundance | 2 locations 2 years, 4 replications, weekly samplings | Completely randomized | ANOVA mixed model |
| De La Poza et al. ( | Predators | Maize | Abundance, Visual counts, Pitfall traps | 2 locations, 3 years, 3–4 replicates | Completely randomized(split for year and location) | ANOVA |
| Hagerty et al. ( | Predators,Herbivores | Cotton | Abundance, Damage | 2 years, 4 replications | Completely randomized | ANOVA |
| Head et al. ( | Predators,Herbivores | Cotton | Abundance,Predation rates | 3 years, 3–4 replications,6–16 sampling dates | Completely randomized | ANOVA mixed model with repeated measures |
| Naranjo ( | Predators | Cotton | Diversity | 6 years, 3–4 replications | Completely randomized | ANOVA, PCA |
| Pons et al. ( | Herbivores | Maize | Pest incidence | 3 years, 4 replications, various sampling dates | Completely randomized(year as factorial element) | ANOVA |
| Torres and Ruberson ( | Predators | Cotton | Abundance | 3 years, 3 replicates, weekly samplings | Completely randomized | ANOVA mixed model with repeated measures |
| Whitehouse et al. ( | Different guilds | Cotton | Diversity Index | 3 years, 2–3 replicates, weekly samplings | Completely randomized | ANOVA mixed model with repeated measures |
Figure 1Examples of probabilities of statistical distributions for counts for means μ = 1, 4, and 10. The variance of the overdispersed Poisson distribution equals ϕμ. The variance of the negative binomial and Poisson-lognormal equals μ + ωμ2.
Figure 2Examples of probabilities of statistical distributions for presence/absence data for n = 16 with mean nπ and variance ωnπ (1 −π). For the binomial distribution, ω = 1.
Elements of the statistical simulation model.
| Element | Possible choices |
|---|---|
| Distribution of counts | Poisson, overdispersed Poisson, negative binomial, Poisson-lognormal, binomial, betabinomial, binomial-logitnormal |
| Excess-zero counts | No/yes |
| Design | Completely randomized, randomized blocks, number of replications |
| Additional varieties | Number of additional varieties or treatments in addition to the GM plant and its comparator |
| Reference varieties | Number of reference varieties which represent a population |
| Trial | Single trial, multiple trials, site × year trials |
| Measurement | Single time point, repeated measures (constant, linear or quadratic in time, autocorrelation) |
| Parameters | Parameter values for all the count distributions, for example, a mean and an excess-zero probability for each variety |
Figure 3Power of a likelihood ratio difference test with α = 0.05 for the negative binomial distribution with dispersion parameter ω, a mean μ for the comparator and a mean θμ for the GM plant for replication levels N = 6 (black), 10 (red), 20 (green), and 40 (blue).
Number of replications needed to obtain a significant difference test with probability 80% when the quotient of the mean of the GMO and the comparator equals Θ = 2 for data which have a negative binomial distribution with mean μ for the comparator, mean Θμ for the GM plant, and dispersion parameter ω.
|
| ||||||
|---|---|---|---|---|---|---|
| 0.25 | 29 | 21 | 13 | 10 | 9 | 9 |
| 0.50 | ≥40 | 27 | 21 | 19 | 17 | 16 |
| 1.00 | ≥40 | ≥40 | 37 | 35 | 33 | 32 |
Figure 4Power of a likelihood ratio difference test with α = 0.05 for negative binomial data with dispersion parameter ω, a mean μ for the comparator, and a mean θμ for the GM plant for replication level N = 40 when analyzed employing a negative binomial model (black), a quasi-Poisson model (red), and a log transformation (green).
Figure 595% likelihood ratio confidence intervals for the ratio of the Poisson means of the GM plant and the comparator when the underlying mean of both is μ = 5 and various numbers of replication N. The red vertical lines denote the artificial equivalence limits set at 1/2 and 2.
Number of replications needed to obtain a significant difference test or to reject the hypothesis of non-equivalence with limits ½ and 2, with probability 80% when the quotient of the mean of the GMO and the comparator equals Θ for data which have a Poisson distribution with mean μ for the comparator and mean Θμ for the GM plant.
|
| ||||||
|---|---|---|---|---|---|---|
| Replications for Difference test, | ||||||
| 1.0 | – | – | – | – | – | – |
| 1.2 | ≥40 | ≥40 | ≥40 | ≥40 | 22 | 12 |
| 1.4 | ≥40 | ≥40 | 24 | 13 | 6 | ≤4 |
| 1.6 | ≥40 | 28 | 12 | 6 | ≤4 | ≤4 |
| 1.8 | 36 | 18 | 7 | ≤4 | ≤4 | ≤4 |
| 2.0 | 24 | 12 | 5 | ≤4 | ≤4 | ≤4 |
| Replications for Equivalence test, | ||||||
| 1.0 | ≥40 | 20 | 8 | 5 | ≤4 | ≤4 |
| 1.2 | ≥40 | 26 | 11 | 6 | ≤4 | ≤4 |
| 1.4 | ≥40 | ≥40 | 20 | 10 | 5 | ≤4 |
| 1.6 | ≥40 | ≥40 | ≥40 | 23 | 12 | 6 |
| 1.8 | ≥40 | ≥40 | ≥40 | ≥40 | ≥40 | 23 |
| 2.0 | – | – | – | – | – | – |
Figure 6Power of a likelihood ratio difference test with α = 0.05 for negative binomial data with overdispersion parameter ω = 0.25 and additional excess-zeros with probability δ = 0 (black), 0.1 (red), 0.2 (blue), and 0.5 (green). The comparator has mean μ(1 −δ), and the GM plant has a mean of 2μ(1 − δ).
Figure 7Power of a likelihood ratio difference test for Poisson-lognormal data with overdispersion parameter ω = 0.25 and a single observation (black), the sum of 5 independent observations (red), and the sum of 5 dependent observations (blue). The mean of both the comparator and the GM plant follows a quadratic polynomial on the log scale with a maximum mean count of μ for the comparator and 2μ for the GM plant (see text).
Number of replications needed to reject the hypothesis of non-equivalence with limits 1/3 and 3, with probability 80% using a two-sided test for the repeated measurements simulation, see text.
| Maximum mean | Single observations | Multiple dependent | Multiple independent |
|---|---|---|---|
| 1 | ≥40 | ≥40 | 39 |
| 2 | ≥40 | 33 | 23 |
| 5 | 39 | 25 | 14 |
| 10 | 30 | 20 | 10 |
| 20 | 26 | 20 | 9 |
| 40 | 24 | 19 | 8 |
| Variety | Effect count | Variance count | Effect zero | Variance zero |
|---|---|---|---|---|
| GM plant | 0.4 | – | −0.3 | – |
| Comparator | 0.5 | – | −0.2 | – |
| Reference varieties | 1 | 1 | −0.8 | 0.5 |
| Variety | ||||
|---|---|---|---|---|
| GM plant | exp( | exp( | logit−1 | logit−1 |
| Comparator | exp( | exp( | logit−1 | logit−1 |
| Reference 1 | exp(0.8–0.4) | exp(0.8 + 0.1) | logit−1(−1.2–0.1) | logit−1(−1.2 + 0.2) |
| Reference 2 | exp(0.9–0.4) | exp(0.9 + 0.1) | logit−1(−0.9–0.1) | logit−1(−0.9 + 0.2) |
| Reference 3 | exp(1.2–0.4) | exp(1.2 + 0.1) | logit−1(−0.6–0.1) | logit−1(−0.6 + 0.2) |