| Literature DB >> 20074371 |
Abstract
BACKGROUND: Pseudoreplication occurs when observations are not statistically independent, but treated as if they are. This can occur when there are multiple observations on the same subjects, when samples are nested or hierarchically organised, or when measurements are correlated in time or space. Analysis of such data without taking these dependencies into account can lead to meaningless results, and examples can easily be found in the neuroscience literature.Entities:
Mesh:
Year: 2010 PMID: 20074371 PMCID: PMC2817684 DOI: 10.1186/1471-2202-11-5
Source DB: PubMed Journal: BMC Neurosci ISSN: 1471-2202 Impact factor: 3.288
Figure 1An example of pseudoreplication. Two rats are sampled from a population with a mean (μ) of 50 and a standard deviation (σ) of 10, and ten measurements of an arbitrary outcome variable are made on each rat. The first (incorrect) 90% CI uses all 20 data points and does not account for the hierarchical nature of the data. For the second 90% CI, the mean of the ten values for each rat are calculated first, and then only these two averaged values are used for the calculation of the CI. The error bar on the left is incorrect because each of the 20 data points are not a random sample from the whole population, but rather samples within two rats. This is evident from the fact that the 10 points are normally distributed around the mean of their respective rats, but not normally distributed around the population mean (horizontal grey line), as would be expected when independent samples are randomly drawn from a population. Increasing the number of observations on each rat does not lead to a more precise estimate of μ, which requires more rats. Note that 90% CI are plotted for clarity because the graph needs to be greatly compressed to display the 95% CI.
Four situations in which pseudoreplication can arise.
| Situation | Example | Solutions |
|---|---|---|
| Repeated measures | Growth curve | 1. Include subject as a random effect |
| 2. Repeated measures ANOVA | ||
| 3. Summary-measure analysis | ||
| Hierarchical/nested | Multiple brain sections | 1. Include random effects |
| Multiple coverslips/wells | 2. Average over observations | |
| Litter effects | ||
| Correlated in time | Time of day testing occurs | 1. Include time as covariate |
| Circadian effects | 2. Include sample number as a covariate | |
| Correlated in space | Multiple incubators | 1. Include random effects |
| Cage effects | 2. Average over observations |
Degrees of freedom associated with common statistical tests.
| Test | Degrees of Freedom |
|---|---|
| Independent | |
| Paired | |
| Main effect of A | |
| Main effect of B | |
| A × B interaction | ( |
| Error | |
| Between subjects | |
| Error | ( |
| | |
| Groups | |
| Error | |
| | |
| Obs | |
| Group × Obs Interaction | ( |
| Error | |
| 1 and | |
| ( |
n1 = sample size of group one; n2 = sample size of group two; n = total sample size; N = total number of observations (equal to n × Obs); G = number of groups; G= number of groups for Factor A; G= number of groups for Factor B; Obs = number of repeated observations on the same subject; R = number of rows; C = number or columns. † "Mixed" refers to the presence of both between and within subjects effects, and should not be confused with mixed effects models described in the text.