| Literature DB >> 15626348 |
Hao Li1, Constance L Wood, Thomas V Getchell, Marilyn L Getchell, Arnold J Stromberg.
Abstract
BACKGROUND: Two or more factor mixed factorial experiments are becoming increasingly common in microarray data analysis. In this case study, the two factors are presence (Patients with Alzheimer's disease) or absence (Control) of the disease, and brain regions including olfactory bulb (OB) or cerebellum (CER). In the design considered in this manuscript, OB and CER are repeated measurements from the same subject and, hence, are correlated. It is critical to identify sources of variability in the analysis of oligonucleotide array experiments with repeated measures and correlations among data points have to be considered. In addition, multiple testing problems are more complicated in experiments with multi-level treatments or treatment combinations.Entities:
Mesh:
Year: 2004 PMID: 15626348 PMCID: PMC544885 DOI: 10.1186/1471-2105-5-209
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Summary of genes with main effects
| Direction | ||||
| # of genes | 32 | 17 | 331 | 228 |
| Fold change | 1.1~2.9 | 1.2~2.8 | 1.1~104.3 | 1.1~121.7 |
I: significant upregulation of gene expression;
D: significant downregulation of gene expression.
Figure 1Simple diagrams to illustrate significant interaction, main effects of Disease and Region. The average log transformed intensities under control and AD conditions for both OB and CER were plotted together for each gene with either significant interaction or main effects. The two points from each region were connected using a straight line and the non-parallel lines imply interaction. Two examples genes with interaction effect were shown in A. A1, represents a directional interaction and A2 indicates an interaction in magnitude. Two genes with only main effect of disease were illustrated in B, one of which showed down-regulation in AD (B1), while the other genes were upregulated in AD for both OB and CER. In the bottom panel, two genes with only regional differences were shown. The gene in C1 has high expression level in CER, and the gene in C2 has an opposite situation. See also Table 4, 5.
Example of genes with significant interaction effects
| LOC91614 | Cathepsin H | ||||||||||||
| OB | CER | OB | CER | ||||||||||
| Con | 7.02 | 6.61 | 6.58 | 6.78 | 7.09 | 6.83 | 10.53 | 10.63 | 10.49 | 9.46 | 9.71 | 9.42 | |
| 6.74 ± 0.25 | 6.90 ± 0.17 | 10.55 ± 0.07 | 9.53 ± 0.16 | ||||||||||
| AD | 7.58 | 7.59 | 8.21 | 5.30 | 5.03 | 5.58 | 12.30 | 12.41 | 12.37 | 9.54 | 9.82 | 9.79 | |
| 7.80 ± 0.36 | 5.30 ± 0.28 | 12.36 ± 0.06 | 9.72 ± 0.15 | ||||||||||
| Overall P | 0.00058 | 2.12e-06 | |||||||||||
| Interaction P | 0.00036 | 5.86e-06 | |||||||||||
| ConOB | Re | + | + | ||||||||||
| fold | 2.08 | 3.51 | |||||||||||
| dir | |||||||||||||
| ConCER | Re | + | + | ||||||||||
| fold | 0.33 | 1.14 | |||||||||||
| dir | N | ||||||||||||
| ConOB | Re | - | + | ||||||||||
| fold | 0.90 | 2.03 | |||||||||||
| dir | N | ||||||||||||
| ADOB | Re | + | + | ||||||||||
| fold | 5.66 | 6.23 | |||||||||||
| dir | |||||||||||||
ConOB_ADOB: control vs AD for OB; ConCER_ADCER: control vs AD for CER; ConOB_ConCER: OB vs CER for control subjects; ADOB_ADCER: OB vs CER for AD patients. Overall P: the P value of the generalized F test. Interaction P: the P value of the interaction term. Re: the results of the protected Fisher's LSD procedure, where "+" indicates a significant difference and "-" implies a non significant difference ; dir: the direction of alteration in gene expression levels; D for decrease, I for increase and N for no change in AD when comparing control vs AD, or in OB when comparing CER vs OB; fold: fold change of each pairwise comparison calculated from the inverse transformed log2 based data. The fold change of LOC91614 gene expression between AD and control in OB was calculated as: 2(7.58+7.59+8.21)/3/2(7.02+6.61+6.58)/3 = 2.08. The fold changes in other situations were calculated in a similar way. Similar notations were also used in Table 3. See also Figure 1.
Example of genes with significant main effects
| Log2 based data | Overall P-value | Interaction P-value | Main effect of Disease | Main effect of Region | |||||||||||
| OB | CER | P value | fold | dir | P value | fold | dir | ||||||||
| Con | 12.68 | 12.73 | 12.83 | 12.37 | 12.34 | 12.53 | |||||||||
| 12.75 ± 0.08 | 12.41 ± 0.10 | ||||||||||||||
| 0.0003 | 0.6462 | 0.0009* | 0.63 | 0.011† | 1.24 | N | |||||||||
| AD | 12.01 | 11.94 | 12.16 | 11.53 | 12.04 | 11.81 | |||||||||
| 12.04 ± 0.11 | 11.79 ± 0.26 | ||||||||||||||
| Con | 10.44 | 10.22 | 10.26 | 10.09 | 10.16 | 9.80 | |||||||||
| 10.31 ± 0.12 | 10.02 ± 0.19 | ||||||||||||||
| 0.0002 | 0.0726 | 0.0021* | 1.51 | 0.056 | 1.42 | N | |||||||||
| AD | 11.20 | 11.01 | 10.96 | 10.30 | 10.59 | 10.34 | |||||||||
| 11.06 ± 0.13 | 10.41 ± 0.16 | ||||||||||||||
| Con | 11.49 | 10.91 | 10.83 | 12.56 | 12.47 | 12.62 | |||||||||
| 11.08 ± 0.36 | 12.55 ± 0.08 | ||||||||||||||
| 0.0025 | 0.8328 | 0.5004 | 0.92 | N | 0.0005* | 0.37 | |||||||||
| AD | 11.33 | 11.00 | 10.66 | 12.45 | 12.35 | 12.41 | |||||||||
| 11.00 ± 0.34 | 12.41 ± 0.06 | ||||||||||||||
| Con | 14.01 | 14.54 | 13.89 | 13.21 | 13.34 | 12.74 | |||||||||
| 14.15 ± 0.35 | 13.10 ± 0.32 | ||||||||||||||
| 0.0011 | 0.3767 | 0.5986 | 0.94 | N | 0.0002* | 2.20 | |||||||||
| AD | 14.03 | 13.84 | 14.47 | 12.70 | 12.88 | 13.06 | |||||||||
| 14.11 ± 0.32 | 12.88 ± 0.18 | ||||||||||||||
* indicates the P values pass the FDR 5% criteria. † indicates the P values are smaller than 0.05 but larger than the critical value 0.0033 determined by the FDR procedure. Overall P-value and interaction P-value are the same with the Overall P and Interaction P in Table 4. P values of main effect of disease and region are the P values of Type III ANOVA test using proc mixed procedure in SAS. Fold of main effect of disease: the ratio of the average intensities of AD (average over OB and CER) over Control (average over OB and CER). Fold of main effect of region: the ratio of the average intensities of OB (average over Control and AD) over CER (average over Control and AD). See also Figure 1.
The arrangement for the 2 × 2 factorial design with repeated measures
, , , and are the true means of measurements in OB of controls, CER of controls, OB of AD patients and CER of AD patients respectively.
Set up hypothesis using linear contrasts
| ( | ( | ( | ( | ||||||
| (DR) | ( | 0 | 0 | 0 | 0 | -1 | 1 | 1 | -1 |
| D | ( | 2 | -2 | 0 | 0 | 1 | 1 | -1 | -1 |
| R | ( | 0 | 0 | 2 | -2 | 1 | -1 | 1 | -1 |
The hypothesis in terms of model parameters and means were listed. The coefficients for the model parameters of the linear contrasts were determined for the corresponding hypotheses.
Figure 2Histograms of Simulated F statistics. The histograms of the F statistics from F[3, 4] (grey in A, B), simulated data with same covariance structure among individuals (cyan, case I in A) or unequal variance for subjects from controls and AD patients (blue, case II in B). Case I has slightly larger tail than random generated F values, and the right tail of case II were thicker than both of cases above.