| Literature DB >> 25893088 |
Paul C D Johnson1, Sarah J E Barry2, Heather M Ferguson3, Pie Müller4.
Abstract
'Will my study answer my research question?' is the most fundamental question a researcher can ask when designing a study, yet when phrased in statistical terms - 'What is the power of my study?' or 'How precise will my parameter estimate be?' - few researchers in ecology and evolution (EE) try to answer it, despite the detrimental consequences of performing under- or over-powered research. We suggest that this reluctance is due in large part to the unsuitability of simple methods of power analysis (broadly defined as any attempt to quantify prospectively the 'informativeness' of a study) for the complex models commonly used in EE research. With the aim of encouraging the use of power analysis, we present simulation from generalized linear mixed models (GLMMs) as a flexible and accessible approach to power analysis that can account for random effects, overdispersion and diverse response distributions.We illustrate the benefits of simulation-based power analysis in two research scenarios: estimating the precision of a survey to estimate tick burdens on grouse chicks and estimating the power of a trial to compare the efficacy of insecticide-treated nets in malaria mosquito control. We provide a freely available R function, sim.glmm, for simulating from GLMMs.Analysis of simulated data revealed that the effects of accounting for realistic levels of random effects and overdispersion on power and precision estimates were substantial, with correspondingly severe implications for study design in the form of up to fivefold increases in sampling effort. We also show the utility of simulations for identifying scenarios where GLMM-fitting methods can perform poorly.These results illustrate the inadequacy of standard analytical power analysis methods and the flexibility of simulation-based power analysis for GLMMs. The wider use of these methods should contribute to improving the quality of study design in EE.Entities:
Keywords: experimental design; generalized linear mixed model; long-lasting insecticidal net; overdispersion; precision; random effects; sample size; simulation
Year: 2014 PMID: 25893088 PMCID: PMC4394709 DOI: 10.1111/2041-210X.12306
Source DB: PubMed Journal: Methods Ecol Evol Impact factor: 7.781
Study design choices and effect parameter assumptions for the two example studies
| Example study | Study variable | Simulated values |
|---|---|---|
| Tick burden survey | No of locations | 10, 20, 50, 100, 200 |
| No of broods per location | 2 | |
| No of chicks per brood | 3 | |
| Mean tick burden per chick, | 1, 5, 10 | |
| Location-level variance, | 0, 1 | |
| Brood-level variance, | 0, 0·7 | |
| Chick-level variance, | 0, 0·3 | |
| LLIN trial | No of rotations of the Latin square | 1, 2, 3, 4, 5 |
| No of huts, No of nets, No of weeks per rotation | 6 | |
| No of | 5, 25 | |
| Mortality using the control net, | 70% | |
| Minimum acceptable mortality using LLIN type E1, | 80% (1·7) | |
| Mortality assumed using the four secondary LLINs (odds ratio) | 80% (1·7) | |
| Between-hut and -week variances, | 0, 0·5 | |
| Observation-level (overdispersion) variance, | 0, 0·5, 1 |
Latin square design for trialling one control (C) and five experimental (E1 to E5) types of long-lasting insecticidal net, rotated through six huts over six weeks according to a design balanced against carry-over effects (Williams 1949)
| Week | Hut 1 | Hut 2 | Hut 3 | Hut 4 | Hut 5 | Hut 6 |
|---|---|---|---|---|---|---|
| 1 | E1 | E2 | E3 | E4 | E5 | C |
| 2 | E2 | E3 | E4 | E5 | C | E1 |
| 3 | C | E1 | E2 | E3 | E4 | E5 |
| 4 | E3 | E4 | E5 | C | E1 | E2 |
| 5 | E5 | C | E1 | E2 | E3 | E4 |
| 6 | E4 | E5 | C | E1 | E2 | E3 |
Figure 1The relationship between margin of error in tick burden estimates and number of locations sampled. Margin of error was averaged over 1000 data sets simulated under scenarios that varied in mean tick burden and the degree of variation in mean tick burden at the location, brood and individual chick levels. The grey line shows the target margin of error of ± 25%.
Figure 2The relationship between the power of the long-lasting insecticidal net trial to detect a difference between nets causing 70% and 80% mortality and its duration in number of 6-week rotations of the Latin square. Each power estimate was derived from 1000 simulated data sets, generated under scenarios that varied in Anopheles gambiae abundance, degree of variation in mortality between huts and weeks, and strength of overdispersion. The horizontal grey line shows the target power of 80%.