| Literature DB >> 18584033 |
Abstract
Experimental designs that efficiently embed a fixed effects treatment structure within a random effects design structure typically require a mixed-model approach to data analyses. Although mixed model software tailored for the analysis of two-color microarray data is increasingly available, much of this software is generally not capable of correctly analyzing the elaborate incomplete block designs that are being increasingly proposed and used for factorial treatment structures. That is, optimized designs are generally unbalanced as it pertains to various treatment comparisons, with different specifications of experimental variability often required for different treatment factors. This paper uses a publicly available microarray dataset, as based upon an efficient experimental design, to demonstrate a proper mixed model analysis of a typical unbalanced factorial design characterized by incomplete blocks and hierarchical levels of variability.Entities:
Year: 2008 PMID: 18584033 PMCID: PMC2431090 DOI: 10.1155/2008/584360
Source DB: PubMed Journal: Int J Plant Genomics ISSN: 1687-5389
Figure 1Optimized interwoven loop design for 9 treatments using R package SMIDA (Wit et al., 2005). Each circle represents a different treatment group. Each arrow represents a single array hybridization with circle base representing the Cy3 labeled sample and tail end representing the Cy5 labeled sample.
Figure 2Experimental design for one replicate from Zou et al. (2005). Treatments included a full 3 × 3 factorial of inoculate and time effects plus a 10th null control group at time 2 (N2). Samples indicated by circles with letters indicating inoculate assignment: bacteria resistant (R), a bacteria susceptible (S), and MgCl2 (M) control inoculate and numbers indicating time (2, 8, or 24 hours) after inoculation. Each arrow represents a single array hybridization with circle base representing the Cy3 labeled sample and tail end representing the Cy5 labeled sample. Solid arrows refer to the A-loop design of Landgrebe et al. (2006).
Dataset for ID_REF #30 for all hybridizations (14 arrays/loop x2 loops) in Figure 1 for each of two replicates per 10 inoculate by time groups, fluorescence intensities provided as y, log(base 2) intensities provided as ly.
| Obs | array | inoculate | time | rep | dye | y | ly |
|---|---|---|---|---|---|---|---|
| 1 | 1 | R | 2 | 1R2 | Cy3 | 16322.67 | 13.9946 |
| 2 | 1 | M | 2 | 1M2 | Cy5 | 20612.48 | 14.3312 |
| 3 | 2 | M | 2 | 1M2 | Cy3 | 10552.21 | 13.3653 |
| 4 | 2 | S | 2 | 1S2 | Cy5 | 10640.89 | 13.3773 |
| 5 | 3 | S | 2 | 1S2 | Cy3 | 24852.98 | 14.6011 |
| 6 | 3 | R | 2 | 1R2 | Cy5 | 21975.92 | 14.4236 |
| 7 | 4 | R | 8 | 1R8 | Cy3 | 30961.96 | 14.9182 |
| 8 | 4 | M | 8 | 1M8 | Cy5 | 13405.08 | 13.7105 |
| 9 | 5 | M | 8 | 1M8 | Cy3 | 13103.51 | 13.6777 |
| 10 | 5 | S | 8 | 1S8 | Cy5 | 15659.44 | 13.9347 |
| 11 | 6 | S | 8 | 1S8 | Cy3 | 20424.47 | 14.3180 |
| 12 | 6 | R | 8 | 1R8 | Cy5 | 34244.92 | 15.0636 |
| 13 | 7 | R | 24 | 1R24 | Cy3 | 15824.29 | 13.9499 |
| 14 | 7 | M | 24 | 1M24 | Cy5 | 13014.05 | 13.6678 |
| 15 | 8 | M | 24 | 1M24 | Cy3 | 17503.11 | 14.0953 |
| 16 | 8 | S | 24 | 1S24 | Cy5 | 27418.99 | 14.7429 |
| 17 | 9 | S | 24 | 1S24 | Cy3 | 37689.16 | 15.2019 |
| 18 | 9 | R | 24 | 1R24 | Cy5 | 55821.64 | 15.7685 |
| 19 | 10 | S | 2 | 1S2 | Cy3 | 28963.28 | 14.8219 |
| 20 | 10 | S | 8 | 1S8 | Cy5 | 38659.44 | 15.2385 |
| 21 | 11 | S | 8 | 1S8 | Cy3 | 41608.78 | 15.3446 |
| 22 | 11 | S | 24 | 1S24 | Cy5 | 41844.79 | 15.3528 |
| 23 | 12 | R | 2 | 1R2 | Cy3 | 12132.41 | 13.5666 |
| 24 | 12 | R | 8 | 1R8 | Cy5 | 19131.53 | 14.2237 |
| 25 | 13 | R | 8 | 1R8 | Cy3 | 31067.04 | 14.9231 |
| 26 | 13 | R | 24 | 1R24 | Cy5 | 26197.03 | 14.6771 |
| 27 | 14 | N | 2 | 1N2 | Cy3 | 18540.91 | 14.1784 |
| 28 | 14 | M | 2 | 1M2 | Cy5 | 24971.88 | 14.6080 |
| 29 | 15 | R | 2 | 2R2 | Cy3 | 9612.25 | 13.2307 |
| 30 | 15 | M | 2 | 2M2 | Cy5 | 9212.11 | 13.1693 |
| 31 | 16 | M | 2 | 2M2 | Cy3 | 10322.23 | 13.3335 |
| 32 | 16 | S | 2 | 2S2 | Cy5 | 10979.19 | 13.4225 |
| 33 | 17 | S | 2 | 2S2 | Cy3 | 8061.40 | 12.9768 |
| 34 | 17 | R | 2 | 2R2 | Cy5 | 6737.37 | 12.7180 |
| 35 | 18 | R | 8 | 2R8 | Cy3 | 8807.09 | 13.1044 |
| 36 | 18 | M | 8 | 2M8 | Cy5 | 8696.95 | 13.0863 |
| 37 | 19 | M | 8 | 2M8 | Cy3 | 15186.20 | 13.8905 |
| 38 | 19 | S | 8 | 2S8 | Cy5 | 23477.49 | 14.5190 |
| 39 | 20 | S | 8 | 2S8 | Cy3 | 19424.30 | 14.2456 |
| 40 | 20 | R | 8 | 2R8 | Cy5 | 18198.99 | 14.1516 |
| 41 | 21 | R | 24 | 2R24 | Cy3 | 19630.00 | 14.2608 |
| 42 | 21 | M | 24 | 2M24 | Cy5 | 15629.14 | 13.9320 |
| 43 | 22 | M | 24 | 2M24 | Cy3 | 10875.49 | 13.4088 |
| 44 | 22 | S | 24 | 2S24 | Cy5 | 20816.21 | 14.3454 |
| 45 | 23 | S | 24 | 2S24 | Cy3 | 24647.70 | 14.5892 |
| 46 | 23 | R | 24 | 2R24 | Cy5 | 22148.96 | 14.4350 |
| 47 | 24 | S | 2 | 2S2 | Cy3 | 17795.09 | 14.1192 |
| 48 | 24 | S | 8 | 2S8 | Cy5 | 34569.11 | 15.0772 |
| 49 | 25 | S | 8 | 2S8 | Cy3 | 44175.28 | 15.4310 |
| 50 | 25 | S | 24 | 2S24 | Cy5 | 38020.46 | 15.2145 |
| 51 | 26 | R | 2 | 2R2 | Cy3 | 34689.07 | 15.0822 |
| 52 | 26 | R | 8 | 2R8 | Cy5 | 62219.10 | 15.9251 |
| 53 | 27 | R | 8 | 2R8 | Cy3 | 22724.21 | 14.4719 |
| 54 | 27 | R | 24 | 2R24 | Cy5 | 19594.71 | 14.2582 |
| 55 | 28 | N | 2 | 2N2 | Cy3 | 11755.32 | 13.5210 |
| 56 | 28 | M | 2 | 2M2 | Cy5 | 12599.55 | 13.6211 |
Classical ANOVA of log intensities for duplicated A-loop design component of Figure 2 for any particular gene using (1).
| Source | SS* |
| Mean square | Expected mean square |
|---|---|---|---|---|
| Inoculate | SS |
| MS |
|
| Time | SS |
| MS |
|
| Inoculate*time | SS |
| MS |
|
| Dye | SS |
| MS |
|
| Rep(inoculate*time) | SS |
| MS |
|
| Array(time) | SS |
| MS |
|
| Error | SS |
| MS |
|
*Sums of squares.
†Degrees of freedom.
‡ φ is the noncentrality parameter for factor X. For example, when , there are no overall mean inoculate differences such that inoculate and Rep(inoculate*time) have the same expected mean square and F = MS/MS is a random draw from an F distribution with v numerator and v denominator degrees of freedom.
Classical ANOVA of log intensities for duplicated A-loop design component of Figure 2 on ID_REF #30 from Zou et al. (2005) using output from SAS PROC MIXED (code in Figure 3).
| Type 3 analysis of variance | ||||||||
|---|---|---|---|---|---|---|---|---|
| Source | DF† | Sum of squares | Mean square | Expected mean square | Error term | Error DF |
| Pr > F‡ |
| Trt | 2 | 0.7123 | 0.3561 | Var(Residual) + 1.5 Var(sample(inoc*time)) + Q(inoc,inoc*time) | MS(sample(inoc*time)) | 6 | 3.13 | 0.1172 |
| Time | 2 | 3.7737 | 1.8868 | Var(Residual) + 2 Var(sample(inoc*time)) + 2Var(array(time)) + Q(time,inoc*time) | 1.3333 MS(array(time)) + 1.3333 MS(sample(inoc*time)) − 1.6667 MS(Residual) | 13.969 | 3.27 | 0.0683 |
| Inoc*time | 4 | 0.6294 | 0.1573 | Var(Residual) + 1.5 Var(sample(inoc*time)) + Q(inoc*time) | MS(sample(inoc*time)) | 6 | 1.38 | 0.3435 |
| Dye | 1 | 0.0744 | 0.0744 | Var(Residual) + Q(dye) | MS(Residual) | 5 | 2.19 | 0.1989 |
| Rep(inoc*time) | 6 | 0.6826 | 0.1137 | Var(Residual) + 1.5 Var(sample(inoc*time)) | MS(Residual) | 5 | 3.35 | 0.1030 |
| Array(time) | 12 | 4.3330 | 0.3610 | Var(Residual) + 1.5 Var(array(time)) | MS(Residual) | 5 | 10.63 | 0.0085 |
| Residual | 5 | 0.1699 | 0.0339 | Var(Residual) | . | . | . | . |
†Degrees of freedom.
‡ P-value.
Figure 3SAS code for classical ANOVA and EGLS inference. Comments describing purpose immediately provided after corresponding code between /* and*/ as with a regular SAS program. EGLS based on REML would simply involve substituting method = reml for method = type3 in the third line of the code.
EGLS inference on overall importance of fixed effects for ID_REF #30 based on REML versus ANOVA (type III quadratic forms) for estimation of variance components using output from SAS PROC MIXED (code in Figure 3).
| Type 3 tests of fixed effects using REML | Type 3 tests of fixed effects using ANOVA | ||||||
|---|---|---|---|---|---|---|---|
| Effect | Num DF* | Den DF* | F value | Pr > F† | Den DF* | F value | Pr > F† |
| Inoc | 2 | 5.28 | 3.12 | 0.1273 | 6.36 | 3.48 | 0.0954 |
| Time | 2 | 17.8 | 2.81 | 0.0870 | 22.8 | 3.27 | 0.0563 |
| Inoc*time | 4 | 5.28 | 1.26 | 0.3893 | 6.36 | 1.38 | 0.3392 |
| Dye | 1 | 5.43 | 2.27 | 0.1879 | 5.15 | 2.19 | 0.1973 |
*Num Df = numerator degrees of freedom; Den DF = denominator degrees of freedom.
† P-value.