| Literature DB >> 29319501 |
Nick R Parsons1, M Dawn Teare2, Alice J Sitch3.
Abstract
Many studies in the biomedical research literature report analyses that fail to recognise important data dependencies from multilevel or complex experimental designs. Statistical inferences resulting from such analyses are unlikely to be valid and are often potentially highly misleading. Failure to recognise this as a problem is often referred to in the statistical literature as a unit of analysis (UoA) issue. Here, by analysing two example datasets in a simulation study, we demonstrate the impact of UoA issues on study efficiency and estimation bias, and highlight where errors in analysis can occur. We also provide code (written in R) as a resource to help researchers undertake their own statistical analyses.Entities:
Keywords: Science Forum; epidemiology; experimental design; global health; mixed-effects models; statistics
Mesh:
Year: 2018 PMID: 29319501 PMCID: PMC5762161 DOI: 10.7554/eLife.32486
Source DB: PubMed Journal: Elife ISSN: 2050-084X Impact factor: 8.140
Variance inflation factors for cluster sizes () 2, 5, 10 and 20, and intraclass correlation coefficients (ICC) 0.01, 0.05, 0.1 and 0.5.
| ICC | ||||
|---|---|---|---|---|
| 0.01 | 0.05 | 0.1 | 0.5 | |
| 2 | 1.01 | 1.05 | 1.10 | 1.50 |
| 5 | 1.04 | 1.20 | 1.40 | 3.00 |
| 10 | 1.09 | 1.45 | 1.90 | 5.50 |
| 20 | 1.19 | 1.95 | 2.90 | 10.50 |
Lymph node sizes (mm), by sample slice and subject, by radiotherapy (RT) group, subjects 1 to 6 no RT and subjects 7 to 12 short RT; highlighted cells are those removed to unbalance the design.
| None | Short RT | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Subject | Sample | Slice | Subject | Sample | Slice | ||||
| 1 | 2 | 3 | 1 | 2 | 3 | ||||
| 1 | 1 | 1.71 | 1.98 | 1.88 | 7 | 1 | 2.37 | 2.36 | 2.20 |
| 2 | 1.72 | 1.98 | 1.85 | 2 | 2.36 | 2.62 | 2.60 | ||
| 2 | 1 | 2.51 | 2.55 | 2.65 | 8 | 1 | 1.33 | 1.35 | 1.15 |
| 2 | 2.98 | 3.20 | 2.80 | 2 | 1.90 | 1.87 | 1.85 | ||
| 3 | 1 | 1.69 | 1.72 | 1.80 | 9 | 1 | 1.70 | 1.78 | 1.78 |
| 2 | 1.82 | 1.97 | 1.73 | 2 | 2.07 | 1.76 | 1.85 | ||
| 4 | 1 | 1.72 | 1.78 | 2.04 | 10 | 1 | 2.23 | 2.14 | 2.21 |
| 2 | 2.50 | 2.65 | 2.77 | 2 | 2.50 | 2.33 | 2.16 | ||
| 5 | 1 | 3.32 | 3.27 | 3.07 | 11 | 1 | 2.10 | 1.89 | 1.75 |
| 2 | 3.11 | 3.03 | 3.11 | 2 | 2.11 | 2.16 | 2.12 | ||
| 6 | 1 | 2.33 | 2.48 | 2.53 | 12 | 1 | 2.58 | 2.54 | 2.59 |
| 2 | 2.86 | 2.87 | 2.52 | 2 | 2.77 | 2.65 | 2.60 | ||
Figure 1.A strip plot showing observed lymph node size data by subject (1-12) and sample, after none and a short course of radiotherapy (Short RT).
Figure 2.Boxplots of residuals (observed values - fitted values) for each subject; symbols () are medians, boxes are interquartile ranges (IQR), whiskers extend to 1.5IQR and symbols () outside these are suspected outliers (a).
Quantile-quantile (Q–Q) plot of the model residuals () on the horizontal axis against theoretical residuals from a Normal distribution on the vertical axis (b).
Number of five selected lymph nodes with maximum diameters 2mm, for up to five tissue samples per subject (1-12), after either none or a short course of radiotherapy (Short RT).
| None | Short RT | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Subject | Sample | Subject | Sample | ||||||||
| 1 | 2 | 3 | 4 | 5 | 1 | 2 | 3 | 4 | 5 | ||
| 1 | 4 | 4 | - | - | - | 7 | 1 | 0 | 0 | 0 | 0 |
| 2 | 3 | 4 | 5 | 2 | - | 8 | 1 | 2 | - | - | - |
| 3 | 2 | 3 | 3 | 2 | 9 | 1 | 0 | 1 | 0 | 2 | |
| 4 | 2 | 4 | 1 | 2 | 1 | 10 | 2 | 1 | 4 | 0 | 2 |
| 5 | 3 | 4 | 4 | 3 | 5 | 11 | 4 | 2 | 4 | 3 | 3 |
| 6 | 2 | 5 | 5 | 3 | 3 | 12 | 3 | 4 | 3 | - | - |