| Literature DB >> 24530839 |
Anderson M Winkler1, Gerard R Ridgway2, Matthew A Webster3, Stephen M Smith3, Thomas E Nichols4.
Abstract
Permutation methods can provide exact control of false positives and allow the use of non-standard statistics, making only weak assumptions about the data. With the availability of fast and inexpensive computing, their main limitation would be some lack of flexibility to work with arbitrary experimental designs. In this paper we report on results on approximate permutation methods that are more flexible with respect to the experimental design and nuisance variables, and conduct detailed simulations to identify the best method for settings that are typical for imaging research scenarios. We present a generic framework for permutation inference for complex general linear models (GLMS) when the errors are exchangeable and/or have a symmetric distribution, and show that, even in the presence of nuisance effects, these permutation inferences are powerful while providing excellent control of false positives in a wide range of common and relevant imaging research scenarios. We also demonstrate how the inference on GLM parameters, originally intended for independent data, can be used in certain special but useful cases in which independence is violated. Detailed examples of common neuroimaging applications are provided, as well as a complete algorithm - the "randomise" algorithm - for permutation inference with the GLM.Entities:
Keywords: General linear model; Multiple regression; Permutation inference; Randomise
Mesh:
Year: 2014 PMID: 24530839 PMCID: PMC4010955 DOI: 10.1016/j.neuroimage.2014.01.060
Source DB: PubMed Journal: Neuroimage ISSN: 1053-8119 Impact factor: 6.556
Fig. 1Examples of a permutation matrix (a), of a sign flipping matrix (b), and of a matrix that does permutation and sign flipping (c). Pre-multiplication by a permutation matrix shuffles the order of the data, whereas by a sign flipping matrix changes the sign of a random subset of data points.
Compared with parametric methods, permutation tests relax a number of assumptions and can be used in a wider variety of situations. Some of these assumptions can be further relaxed with the definition of exchangeability blocks.
| Assumptions | Parametric | ||
|---|---|---|---|
| Independent | ✓ | ✓ | ✓ |
| Non-independent, exchangeable | ✓ | ✗ | ✗ |
| Non-independent, non-exchangeable | ✗ | ✗ | ✗ |
| Normal, identical | ✓ | ✓ | ✓ |
| Symmetrical, identical | ✓ | ✓ | ✗ |
| Symmetrical, non-identical | ✗ | ✓ | ✗ |
| Skewed, identical | ✓ | ✗ | ✗ |
| Skewed, non-identical | ✗ | ✗ | ✗ |
✓Can be used directly if the assumptions regarding dependence structure and distribution of the error terms are both met.
✗Cannot be used directly, or can be used in particular cases.
A number of methods are available to obtain parameter estimates and construct a reference distribution in the presence of nuisance variables.
| Method | Model |
|---|---|
| Draper–Stoneman | |
| Still–White | |
| Freedman–Lane | ( |
| Manly | |
| ter Braak | ( |
| Kennedy | |
| Huh–Jhun | |
| Smith | |
| Parametric |
Draper and Stoneman (1966). This method was called “Shuffle Z” by (Kennedy, 1995), and using the same notation adopted here, it would be called “Shuffle X”.
Gail et al. (1988); Levin and Robbins (1983); Still and White (1981). Still and White considered the special anova case in which Z are the main effects and X the interaction.
Freedman and Lane (1983).
Manly (1986); Manly (2007).
ter Braak (1992). The null distribution for this method considers , i.e., the permutation happens under the alternative hypothesis, rather than the null.
Kennedy (1995); Kennedy and Cade (1996). This method was referred to as “Residualize both Y and Z” in the original publication, and using the same notation adopted here, it would be called “Residualize both Y and X”.
Huh and Jhun (2001); Jung et al. (2006); Kherad-Pajouh and Renaud (2010). Q is a N′ × N′ matrix, where N′ is the rank of R. Q is computed through Schur decomposition of R, such that R = QQ′ and . For this method, P is N′ × N′. From the methods in the table, this is the only that cannot be used directly under restricted exchangeability, as the block structure is not preserved.
The Smith method consists of orthogonalization of X with respect to Z. In the permutation and multiple regression literature, this method was suggested by a referee of O'Gorman (2005), and later presented by Nichols et al. (2008) and discussed by Ridgway (2009).
The parametric method does not use permutations, being instead based on distributional assumptions. For all the methods, the left side of the equations contains the data (regressand), the right side the regressors and error terms. The unpermuted models can be obtained by replacing P for I. Even for the unpermuted models, and even if X and Z are orthogonal, not all these methods produce the same error terms ϵ. This is the case, for instance, of the Kennedy and Huh–Jhun methods. Under orthogonality between X and Z, some regression methods are equivalent to each other.
Fig. 2Left: Example of a permutation matrix that shuffles data within block only. The blocks are not required to be of the same size. The elements outside the diagonal blocks are always equal to zero, such that data cannot be swapped across blocks. Right: Example of a sign flipping matrix. Differently than within-block permutation matrices, here sign flipping matrices are transparent to the definitions of the blocks, such that the block definitions do not need to be taken into account, albeit their corresponding variance groups are considered when computing the statistic.
Fig. 3(a) Example of a permutation matrix that shuffles whole blocks of data. The blocks need to be of the same size. (b) Example of a sign flipping matrix that changes the signs of the blocks as a whole. Both matrices can be constructed by the Kronecker product (represented by the symbol ⊗) of a permutation or a sign flipping matrix (with size determined by the number of blocks) and an identity matrix (with size determined by the number of observations per block).
The statistic G provides a generalisation for a number of well known statistical tests.
| rank( | rank( | |
|---|---|---|
| Homoscedastic errors, unrestricted exchangeability | Square of Student's | |
| Homoscedastic within | Square of Aspin–Welch | Welch's |
Maximum number of unique permutations considering exchangeability blocks.
| Exchangeability | ||
|---|---|---|
| Unrestricted | 2 | |
| Unrestricted, repeated rows | 2 | |
| Within-block | 2 | |
| Within-block, repeated rows | 2 | |
| Whole-block | 2 | |
| Whole-block, repeated blocks | 2 |
B Number of exchangeability blocks (eb).
M Number of distinct rows in X.
M|b Number of distinct rows in X within the b-th block.
Number of distinct blocks of rows in X.
N Number of observations.
N Number of observations in the b-th block.
N Number of times each of the M distinct rows occurs in X.
N Number of times each of the m-th unique row occurs within the b-th block.
Number of times each of the distinct blocks occurs in X.
The eight different simulation scenarios, each with its own same sample sizes and different variances. The distributions of the statistic (F or G) for each pair of variance configuration within scenario were compared using the KS test. The letters in the last column (marked with a star, ⋆) indicate the variance configurations represented in the pairwise comparisons shown in Fig. 4 and results shown in Table 6.
| Simulation scenario | Sample sizes for each | Variances for each | ⋆ |
|---|---|---|---|
| 1 | 8, 4 | 5, 1 | ( |
| 1.2, 1 | ( | ||
| 1, 1 | ( | ||
| 1, 1.2 | ( | ||
| 1, 5 | ( | ||
| 2 | 20, 5 | 5, 1 | ( |
| 1.2, 1 | ( | ||
| 1, 1 | ( | ||
| 1, 1.2 | ( | ||
| 1, 5 | ( | ||
| 3 | 80, 30 | 5, 1 | ( |
| 1.2, 1 | ( | ||
| 1, 1 | ( | ||
| 1, 1.2 | ( | ||
| 1, 5 | ( | ||
| 4 | 40, 30, 20, 10 | 15, 10, 5, 1 | ( |
| 3.6, 2.4, 1.2, 1 | ( | ||
| 1, 1, 1, 1 | ( | ||
| 1, 1.2, 2.4, 3.6 | ( | ||
| 1, 5, 10, 15 | ( | ||
| 5 | 4, 4 | 1, 1 | ( |
| 1, 1.2 | ( | ||
| 1, 5 | ( | ||
| 6 | 20, 20 | 1, 1 | ( |
| 1, 1.2 | ( | ||
| 1, 5 | ( | ||
| 7 | 4, 4, 4, 4 | 1, 1, 1, 1 | ( |
| 1, 1.2, 2.4, 3.6 | ( | ||
| 1, 5, 10, 15 | ( | ||
| 8 | 20, 20, 20, 20 | 1, 1, 1, 1 | ( |
| 1, 1.2, 2.4, 3.6 | ( | ||
| 1, 5, 10, 15 | ( |
Fig. 4Heatmaps for the comparison of the distributions obtained under different variance settings for identical sample sizes. In each map, the cells below the main diagonal contain the results for the pairwise F statistic, and above, for the G statistic. The percentages refer to the fraction of the 1000 tests in which the distribution of the statistic for one variance setting was found different than for another in the same simulation scenario. Each variance setting is indicated by letters (a–e), corresponding to the same letters in Table 5. Smaller percentages indicate robustness of the statistic to heteroscedasticity. Confidence intervals (95%) are shown in parenthesis.
Proportion of error type I and power (%) for the statistics F and G in the various simulation scenarios and variance configurations shown in Table 5. Confidence intervals (95%) are shown in parenthesis.
| Simulation scenario | ⋆ | Proportion of error type | Power | ||
|---|---|---|---|---|---|
| 1 | ( | 5.9 (4.6–7.5) | 6.1 (4.8–7.8) | 20.1 (17.7–22.7) | 23.8 (21.3–26.5) |
| ( | 4.9 (3.7–6.4) | 5.3 (4.1–6.9) | 28.3 (25.6–31.2) | 31.9 (29.1–34.9) | |
| ( | 4.7 (3.6–6.2) | 4.5 (3.4–6.0) | 29.3 (26.6–32.2) | 32.6 (29.8–35.6) | |
| ( | 4.9 (3.7–6.4) | 4.6 (3.5–6.1) | 29.9 (27.1–32.8) | 32.0 (29.2–35.0) | |
| ( | 3.9 (2.9–5.3) | 4.1 (3.0–5.5) | 14.0 (12.0–16.3) | 14.1 (12.1–16.4) | |
| 2 | ( | 6.7 (5.3–8.4) | 6.6 (5.2–8.3) | 29.1 (26.4–32.0) | 38.3 (35.3–41.4) |
| ( | 5.0 (3.8–6.5) | 4.6 (3.5–6.1) | 42.4 (39.4–45.5) | 48.8 (45.7–51.9) | |
| ( | 5.0 (3.8–6.5) | 5.8 (4.5–7.4) | 44.6 (41.6–47.7) | 48.9 (45.8–52.0) | |
| ( | 6.1 (4.8–7.8) | 6.2 (4.9–7.9) | 42.3 (39.3–45.4) | 46.7 (43.6–49.8) | |
| ( | 5.9 (4.6–7.5) | 6.2 (4.9–7.9) | 19.5 (17.2–22.1) | 19.0 (16.7–21.6) | |
| 3 | ( | 5.2 (4.0–6.8) | 5.0 (3.8–6.5) | 90.4 (88.4–92.1) | 92.3 (90.5–93.8) |
| ( | 4.9 (3.7–6.4) | 5.1 (3.9–6.6) | 99.7 (99.1–99.9) | 99.8 (99.3–100) | |
| ( | 6.3 (5.0–8.0) | 6.2 (4.9–7.9) | 99.8 (99.3–100) | 99.8 (99.3–100) | |
| ( | 4.4 (3.3–5.9) | 4.4 (3.3–5.9) | 99.6 (99.0–99.8) | 99.6 (99.0–99.8) | |
| ( | 4.4 (3.3–5.9) | 4.4 (3.3–5.9) | 72.9 (70.1–75.6) | 72.9 (70.1–75.6) | |
| 4 | ( | 6.4 (5.0–8.1) | 5.7 (4.4–7.3) | 10.2 (8.5–12.2) | 19.4 (17.1–22.0) |
| ( | 5.3 (4.1–6.9) | 5.6 (4.3–7.2) | 37.8 (34.9–40.9) | 45.6 (42.5–48.7) | |
| ( | 5.7 (4.4–7.3) | 4.9 (3.7–6.4) | 72.2 (69.3–74.9) | 74.9 (72.1–77.5) | |
| ( | 3.1 (2.2–4.4) | 3.7 (2.7–5.1) | 34.6 (31.7–37.6) | 44.6 (41.6–47.7) | |
| ( | 4.5 (3.4–6.0) | 4.2 (3.1–5.6) | 9.7 (8.0–11.7) | 15.7 (13.6–18.1) | |
| 5 | ( | 4.3 (3.2–5.7) | 4.3 (3.2–5.7) | 29.9 (27.1–32.8) | 29.9 (27.1–32.8) |
| ( | 4.3 (3.2–5.7) | 4.3 (3.2–5.7) | 30.6 (27.8–33.5) | 30.6 (27.8–33.5) | |
| ( | 6.9 (5.5–8.6) | 6.9 (5.5–8.6) | 14.5 (12.5–16.8) | 14.5 (12.5–16.8) | |
| 6 | ( | 3.3 (2.4–4.6) | 3.3 (2.4–4.6) | 92.6 (90.8–94.1) | 92.6 (90.8–94.1) |
| ( | 4.4 (3.3–5.9) | 4.4 (3.3–5.9) | 90.5 (88.5–92.2) | 90.5 (88.5–92.2) | |
| ( | 4.4 (3.3–5.9) | 4.4 (3.3–5.9) | 53.7 (50.6–56.8) | 53.7 (50.6–56.8) | |
| 7 | ( | 5.6 (4.3–7.2) | 5.5 (4.3–7.1) | 11.0 (9.2–13.1) | 8.8 (7.2–10.7) |
| ( | 5.2 (4.0–6.8) | 4.4 (3.3–5.9) | 6.5 (5.1–8.2) | 7.8 (6.3–9.6) | |
| ( | 5.7 (4.4–7.3) | 4.8 (3.6–6.3) | 5.8 (4.5–7.4) | 6.9 (5.5–8.6) | |
| 8 | ( | 4.6 (3.5–6.1) | 4.5 (3.4–6.0) | 78.7 (76.1–81.1) | 78.1 (75.4–80.6) |
| ( | 4.6 (3.5–6.1) | 5.6 (4.3–7.2) | 40.7 (37.7–43.8) | 45.5 (42.4–48.6) | |
| ( | 4.7 (3.6–6.2) | 4.8 (3.6–6.3) | 11.6 (9.8–13.7) | 19.3 (17.0–21.9) | |
A summary of the results for the 1536 simulations with different parameters. The amount of error type I is calculated for the 768 simulations without signal (β1 = 0). Confidence intervals (CI) at 95% were computed around the nominal level α = 0.05, and the observed amount of errors for each regression scenario and for each method was compared with this interval. Methods that mostly remain within the CI are the most appropriate. Methods that frequently produce results below the interval are conservative; those above are invalid. Power was calculated for the remaining 768 simulations, which contained signal (β1 = 0.5).
| Method | Proportion of error type | Average power | ||
|---|---|---|---|---|
| Within | Below | Above | ||
| Draper–Stoneman | 86.33% | 8.20% | 5.47% | 72.96% |
| Still–White | 67.84% | 14.58% | 17.58% | 71.82% |
| Freedman–Lane | 88.67% | 8.46% | 2.86% | 73.09% |
| ter Braak | 83.59% | 11.07% | 5.34% | 73.38% |
| Kennedy | 77.60% | 1.04% | 21.35% | 74.81% |
| Manly | 73.31% | 15.89% | 10.81% | 73.38% |
| Smith | 89.32% | 7.81% | 2.86% | 72.90% |
| Huh–Jhun | 85.81% | 9.24% | 4.95% | 71.62% |
| Parametric | 77.47% | 14.84% | 7.68% | 72.73% |
Proportion of error type I (for a = 0.05), for some representative of the 768 simulation scenarios that did not have signal, using the different permutation methods, and with G as the statistic in the absence of EB (so, equivalent to the F statistic). Confidence intervals (95%) are shown in parenthesis.
| Simulation parameters | Proportion of error type | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| D–S | S–W | F–L | tB | K | M | S | H–J | P | ||||||||
| 12 | 0 | ✗ | ✓ | ✗ | 4.9 (3.7–6.4) | 5.3 (4.1–6.9) | 5.1 (3.9–6.6) | 5.3 (4.1–6.9) | 5.3 (4.1–6.9) | 5.0 (3.8–6.5) | 4.9 (3.7–6.4) | 4.7 (3.6–6.2) | 4.4 (3.3–5.9) | |||
| 12 | 0 | ✗ | ✓ | ✓ | 5.3 (4.1–6.9) | 6.9 (5.5–8.6) | 5.1 (3.9–6.6) | 5.2 (4.0–6.8) | 6.9 (5.5–8.6) | 5.8 (4.5–7.4) | 5.3 (4.1–6.9) | 5.2 (4.0–6.8) | 4.6 (3.5–6.1) | |||
| 12 | 0 | ✗ | ✓ | ✗ | 5.9 (4.6–7.5) | 6.5 (5.1–8.2) | 5.2 (4.0–6.8) | 5.4 (4.2–7.0) | 6.5 (5.1–8.2) | 5.0 (3.8–6.5) | 5.9 (4.6–7.5) | 5.4 (4.2–7.0) | 8.3 (6.7–10.2) | |||
| 12 | 0 | ✗ | ✓ | ✓ | 5.3 (4.1–6.9) | 6.9 (5.5–8.6) | 5.1 (3.9–6.6) | 4.7 (3.6–6.2) | 6.9 (5.5–8.6) | 5.0 (3.8–6.5) | 5.3 (4.1–6.9) | 4.8 (3.6–6.3) | 5.7 (4.4–7.3) | |||
| 12 | 0.8 | ✗ | ✓ | ✗ | 4.4 (3.3–5.9) | 3.6 (2.6–4.9) | 5.1 (3.9–6.6) | 5.2 (4.0–6.8) | 5.8 (4.5–7.4) | 4.8 (3.6–6.3) | 5.1 (3.9–6.6) | 4.4 (3.3–5.9) | 4.4 (3.3–5.9) | |||
| 12 | 0.8 | ✗ | ✓ | ✗ | 1.5 (0.9–2.5) | 1.2 (0.7–2.1) | 4.8 (3.6–6.3) | 5.2 (4.0–6.8) | 6.5 (5.1–8.2) | 4.9 (3.7–6.4) | 5.8 (4.5–7.4) | 5.8 (4.5–7.4) | 8.5 (6.9–10.4) | |||
| 12 | 0.8 | ✗ | ✓ | ✓ | 5.5 (4.2–7.1) | 5.4 (4.2–7.0) | 4.9 (3.7–6.4) | 5.4 (4.2–7.0) | 7.5 (6.0–9.3) | 4.8 (3.6–6.3) | 4.8 (3.6–6.3) | 5.8 (4.5–7.4) | 4.6 (3.5–6.1) | |||
| 12 | 0.8 | ✓ | ✓ | ✓ | 5.1 (3.9–6.6) | 7.2 (5.8–9.0) | 5.4 (4.2–7.0) | 4.3 (3.2–5.7) | 7.2 (5.8–9.0) | 5.2 (4.0–6.8) | 5.1 (3.9–6.6) | 4.6 (3.5–6.1) | 4.6 (3.5–6.1) | |||
| 12 | 0 | ✗ | ✓ | ✗ | 5.6 (4.3–7.2) | 6.8 (5.4–8.5) | 5.4 (4.2–7.0) | 4.7 (3.6–6.2) | 6.8 (5.4–8.5) | 4.0 (3.0–5.4) | 5.6 (4.3–7.2) | 3.7 (2.7–5.1) | 8.9 (7.3–10.8) | |||
| 12 | 0 | ✗ | ✓ | ✗ | 3.9 (2.9–5.3) | 4.9 (3.7–6.4) | 3.9 (2.9–5.3) | 4.0 (3.0–5.4) | 4.9 (3.7–6.4) | 4.3 (3.2–5.7) | 3.9 (2.9–5.3) | 4.2 (3.1–5.6) | 3.7 (2.7–5.1) | |||
| 12 | 0 | ✗ | ✗ | ✓ | 2.9 (2.0–4.1) | 4.3 (3.2–5.7) | 2.6 (1.8–3.8) | 2.8 (1.9–4.0) | 4.3 (3.2–5.7) | 14.1 (12.1–16.4) | 2.9 (2.0–4.1) | 16.4 (14.2–18.8) | 9.0 (7.4–10.9) | |||
| 12 | 0 | ✗ | ✓ | ✗ | 3.2 (2.3–4.5) | 4.6 (3.5–6.1) | 2.2 (1.5–3.3) | 2.0 (1.3–3.1) | 4.6 (3.5–6.1) | 3.8 (2.8–5.2) | 3.2 (2.3–4.5) | 2.6 (1.8–3.8) | 0.5 (0.2–1.2) | |||
| 24 | 0.8 | ✗ | ✓ | ✗ | 4.4 (3.3–5.9) | 3.5 (2.5–4.8) | 4.3 (3.2–5.7) | 4.4 (3.3–5.9) | 4.9 (3.7–6.4) | 4.4 (3.3–5.9) | 4.3 (3.2–5.7) | 4.5 (3.4–6.0) | 4.4 (3.3–5.9) | |||
| 24 | 0 | ✗ | ✓ | ✗ | 5.0 (3.8–6.5) | 5.4 (4.2–7.0) | 5.1 (3.9–6.6) | 5.1 (3.9–6.6) | 5.4 (4.2–7.0) | 4.9 (3.7–6.4) | 5.0 (3.8–6.5) | 4.5 (3.4–6.0) | 5.0 (3.8–6.5) | |||
| 24 | 0 | ✗ | ✓ | ✗ | 6.2 (4.9–7.9) | 6.6 (5.2–8.3) | 6.3 (5.0–8.0) | 5.9 (4.6–7.5) | 6.6 (5.2–8.3) | 5.5 (4.2–7.1) | 6.2 (4.9–7.9) | 5.9 (4.6–7.5) | 5.8 (4.5–7.4) | |||
| 24 | 0.8 | ✗ | ✓ | ✗ | 4.9 (3.7–6.4) | 1.8 (1.1–2.8) | 5.1 (3.9–6.6) | 4.8 (3.6–6.3) | 5.4 (4.2–7.0) | 5.1 (3.9–6.6) | 5.2 (4.0–6.8) | 5.7 (4.4–7.3) | 5.4 (4.2–7.0) | |||
| 48 | 0 | ✗ | ✗ | ✓ | 4.9 (3.7–6.4) | 5.4 (4.2–7.0) | 5.0 (3.8–6.5) | 5.6 (4.3–7.2) | 5.4 (4.2–7.0) | 3.8 (2.8–5.2) | 4.9 (3.7–6.4) | 6.0 (4.7–7.6) | 5.0 (3.8–6.5) | |||
| 48 | 0.8 | ✓ | ✓ | ✗ | 5.1 (3.9–6.6) | 5.4 (4.2–7.0) | 5.0 (3.8–6.5) | 5.7 (4.4–7.3) | 5.4 (4.2–7.0) | 5.2 (4.0–6.8) | 5.1 (3.9–6.6) | 5.6 (4.3–7.2) | 5.6 (4.3–7.2) | |||
| 48 | 0.8 | ✓ | ✓ | ✗ | 4.6 (3.5–6.1) | 4.8 (3.6–6.3) | 4.7 (3.6–6.2) | 4.7 (3.6–6.2) | 4.8 (3.6–6.3) | 4.6 (3.5–6.1) | 4.6 (3.5–6.1) | 4.4 (3.3–5.9) | 4.5 (3.4–6.0) | |||
| 48 | 0 | ✗ | ✗ | ✓ | 5.4 (4.2–7.0) | 5.7 (4.4–7.3) | 5.1 (3.9–6.6) | 5.5 (4.2–7.1) | 5.7 (4.4–7.3) | 9.2 (7.6–11.2) | 5.4 (4.2–7.0) | 4.3 (3.2–5.7) | 5.1 (3.9–6.6) | |||
| 48 | 0.8 | ✗ | ✓ | ✗ | 5.5 (4.2–7.1) | 0.3 (0.1–0.9) | 5.0 (3.8–6.5) | 5.0 (3.8–6.5) | 5.0 (3.8–6.5) | 4.9 (3.7–6.4) | 5.0 (3.8–6.5) | 5.0 (3.8–6.5) | 4.9 (3.7–6.4) | |||
| 96 | 0 | ✗ | ✓ | ✓ | 5.1 (3.9–6.6) | 5.3 (4.1–6.9) | 5.1 (3.9–6.6) | 4.9 (3.7–6.4) | 5.3 (4.1–6.9) | 4.6 (3.5–6.1) | 5.1 (3.9–6.6) | 5.3 (4.1–6.9) | 4.9 (3.7–6.4) | |||
| 96 | 0.8 | ✗ | ✗ | ✓ | 5.0 (3.8–6.5) | 3.6 (2.6–4.9) | 5.0 (3.8–6.5) | 4.8 (3.6–6.3) | 5.2 (4.0–6.8) | 4.4 (3.3–5.9) | 5.1 (3.9–6.6) | 5.2 (4.0–6.8) | 4.9 (3.7–6.4) | |||
| 96 | 0 | ✗ | ✓ | ✗ | 4.9 (3.7–6.4) | 5.2 (4.0–6.8) | 4.7 (3.6–6.2) | 4.8 (3.6–6.3) | 5.2 (4.0–6.8) | 4.5 (3.4–6.0) | 4.9 (3.7–6.4) | 3.9 (2.9–5.3) | 3.6 (2.6–4.9) | |||
N: number of observations; x1 and z1: regressors of interest and of no interest, respectively, being either continuous (c) or discrete (d). ρ: correlation between x1 and z1; : model partitioned or not (using the scheme of Beckmann et al. (2001), shown in Appendix A"); ϵ: distribution of the simulated errors, which can be normal (), uniform (), exponential () or Weibull (); ee: errors treated as exchangeable; ise: errors treated as independent and symmetric. The methods are the same shown in Table 2: Draper–Stoneman (D–S), Still–White (S–W), Freedman–Lane (F–L), ter Braak (tB), Kennedy (K), Manly (M), Huh–Jhun (H–J), Smith (S) and parametric (P), the last not using permutations.
Coding of the design matrix, exchangeability blocks and variance groups for Example 1. Under unrestricted exchangeability, all subjects are assigned to a single block, and with identical variances, all to a single variance group. The regressor m codes for the overall mean, whereas m codes for handedness.
| Coded data ( | Model ( | |||
|---|---|---|---|---|
| Subject 1 | 1 | 1 | 1 | |
| Subject 2 | 1 | 1 | 1 | |
| Subject 3 | 1 | 1 | 1 | |
| Subject 4 | 1 | 1 | 1 | |
| Subject 5 | 1 | 1 | 1 | |
| Subject 6 | 1 | 1 | 1 | |
| Subject 7 | 1 | 1 | 1 | |
| Subject 8 | 1 | 1 | 1 | |
| Subject 9 | 1 | 1 | 1 | |
| Subject 10 | 1 | 1 | 1 | |
| Subject 11 | 1 | 1 | 1 | |
| Subject 12 | 1 | 1 | 1 | |
| Contrast 1 ( | + 1 | 0 | ||
| Contrast 2 ( | − 1 | 0 | ||
Coding for Example 2. Under unrestricted exchangeability, all subjects are assigned to a single block. The regressors m and m code for the experimental groups, m and m for age and sex.
| Coded data ( | Model ( | |||||
|---|---|---|---|---|---|---|
| Subject 1 | 1 | 1 | 1 | 0 | ||
| Subject 2 | 1 | 1 | 1 | 0 | ||
| Subject 3 | 1 | 1 | 1 | 0 | ||
| Subject 4 | 1 | 1 | 1 | 0 | ||
| Subject 5 | 1 | 1 | 1 | 0 | ||
| Subject 6 | 1 | 1 | 1 | 0 | ||
| Subject 7 | 1 | 1 | 0 | 1 | ||
| Subject 8 | 1 | 1 | 0 | 1 | ||
| Subject 9 | 1 | 1 | 0 | 1 | ||
| Subject 10 | 1 | 1 | 0 | 1 | ||
| Subject 11 | 1 | 1 | 0 | 1 | ||
| Subject 12 | 1 | 1 | 0 | 1 | ||
| Contrast 1 ( | + 1 | − 1 | 0 | 0 | ||
| Contrast 2 ( | − 1 | + 1 | 0 | 0 | ||
Coding of the design matrix exchangeability blocks and variance groups for Example 3. Observations are exchangeable only within subject, and variance can be estimated considering all observations as a single group. The regressor m codes for treatment, whereas m to m code for subject-specific mean.
| Coded data ( | Model ( | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Subj. 1, obs. 1 | 1 | 1 | + 1 | 1 | 0 | 0 | 0 | 0 | 0 |
| Subj. 2, obs. 1 | 2 | 1 | + 1 | 0 | 1 | 0 | 0 | 0 | 0 |
| Subj. 3, obs. 1 | 3 | 1 | + 1 | 0 | 0 | 1 | 0 | 0 | 0 |
| Subj. 4, obs. 1 | 4 | 1 | + 1 | 0 | 0 | 0 | 1 | 0 | 0 |
| Subj. 5, obs. 1 | 5 | 1 | + 1 | 0 | 0 | 0 | 0 | 1 | 0 |
| Subj. 6, obs. 1 | 6 | 1 | + 1 | 0 | 0 | 0 | 0 | 0 | 1 |
| Subj. 1, obs. 2 | 1 | 1 | − 1 | 1 | 0 | 0 | 0 | 0 | 0 |
| Subj. 2, obs. 2 | 2 | 1 | − 1 | 0 | 1 | 0 | 0 | 0 | 0 |
| Subj. 3, obs. 2 | 3 | 1 | − 1 | 0 | 0 | 1 | 0 | 0 | 0 |
| Subj. 4, obs. 2 | 4 | 1 | − 1 | 0 | 0 | 0 | 1 | 0 | 0 |
| Subj. 5, obs. 2 | 5 | 1 | − 1 | 0 | 0 | 0 | 0 | 1 | 0 |
| Subj. 6, obs. 2 | 6 | 1 | − 1 | 0 | 0 | 0 | 0 | 0 | 1 |
| Contrast 1 ( | + 1 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| Contrast 2 ( | − 1 | 0 | 0 | 0 | 0 | 0 | 0 | ||
Coding of the design matrix and exchangeability blocks for Example 4. As the group variances cannot be assumed to be the same, each group constitutes an EB and VG; sign flippings happen within block. The regressors m and m code for the experimental groups, m and m for age and sex.
| Coded data ( | Model ( | |||||
|---|---|---|---|---|---|---|
| Subject 1 | 1 | 1 | 1 | 0 | ||
| Subject 2 | 1 | 1 | 1 | 0 | ||
| Subject 3 | 1 | 1 | 1 | 0 | ||
| Subject 4 | 1 | 1 | 1 | 0 | ||
| Subject 5 | 1 | 1 | 1 | 0 | ||
| Subject 6 | 1 | 1 | 1 | 0 | ||
| Subject 7 | 2 | 2 | 0 | 1 | ||
| Subject 8 | 2 | 2 | 0 | 1 | ||
| Subject 9 | 2 | 2 | 0 | 1 | ||
| Subject 10 | 2 | 2 | 0 | 1 | ||
| Subject 11 | 2 | 2 | 0 | 1 | ||
| Subject 12 | 2 | 2 | 0 | 1 | ||
| Contrast 1 ( | + 1 | − 1 | 0 | 0 | ||
| Contrast 2 ( | − 1 | + 1 | 0 | 0 | ||
Coding for Example 5. The different variances restrict exchangeability for within same sex only, and two exchangeability blocks are defined, for shuffling within block. The regressors m and m code for group (patients and controls), whereas m codes for sex.
| Coded data ( | Model ( | ||||
|---|---|---|---|---|---|
| Subject 1 | 1 | 1 | 1 | 0 | 1 |
| Subject 2 | 1 | 1 | 1 | 0 | 1 |
| Subject 3 | 1 | 1 | 1 | 0 | 1 |
| Subject 4 | 2 | 2 | 1 | 0 | − 1 |
| Subject 5 | 2 | 2 | 1 | 0 | − 1 |
| Subject 6 | 2 | 2 | 1 | 0 | − 1 |
| Subject 7 | 1 | 1 | 0 | 1 | 1 |
| Subject 8 | 1 | 1 | 0 | 1 | 1 |
| Subject 9 | 1 | 1 | 0 | 1 | 1 |
| Subject 10 | 2 | 2 | 0 | 1 | − 1 |
| Subject 11 | 2 | 2 | 0 | 1 | − 1 |
| Subject 12 | 2 | 2 | 0 | 1 | − 1 |
| Contrast 1 ( | 1 | − 1 | 0 | ||
| Contrast 2 ( | − 1 | 1 | 0 | ||
Coding of the design matrix, exchangeability blocks and variance groups for Example 6. Shufflings happen for the blocks as a whole, and variances are not assumed to be the same across all timepoints.
| Coded data (Y) | Model ( | |||||||
|---|---|---|---|---|---|---|---|---|
| Subject 1, Timepoint 1 | 1 | 1 | 0 | 1 | 0 | 0 | 0 | |
| Subject 1, Timepoint 2 | 1 | 2 | 0 | 1 | 0 | 0 | 0 | |
| Subject 1, Timepoint 3 | 1 | 3 | 0 | 1 | 0 | 0 | 0 | |
| Subject 2, Timepoint 1 | 2 | 1 | 0 | 0 | 1 | 0 | 0 | |
| Subject 2, Timepoint 2 | 2 | 2 | 0 | 0 | 1 | 0 | 0 | |
| Subject 2, Timepoint 3 | 2 | 3 | 0 | 0 | 1 | 0 | 0 | |
| Subject 3, Timepoint 1 | 3 | 1 | 0 | 0 | 0 | 1 | 0 | |
| Subject 3, Timepoint 2 | 3 | 2 | 0 | 0 | 0 | 1 | 0 | |
| Subject 3, Timepoint 3 | 3 | 3 | 0 | 0 | 0 | 1 | 0 | |
| Subject 4, Timepoint 1 | 4 | 1 | 0 | 0 | 0 | 0 | 1 | |
| Subject 4, Timepoint 2 | 4 | 2 | 0 | 0 | 0 | 0 | 1 | |
| Subject 4, Timepoint 3 | 4 | 3 | 0 | 0 | 0 | 0 | 1 | |
| Contrast 1 ( | 1 | − 1 | 0 | 0 | 0 | 0 | ||
| Contrast 2 ( | − 1 | 1 | 0 | 0 | 0 | 0 | ||