| Literature DB >> 21722407 |
Ke Zhang1, Haiyan Wang, Arne C Bathke, Solomon W Harrar, Hans-Peter Piepho, Youping Deng.
Abstract
BACKGROUND: Gene set analysis (GSA) has become a successful tool to interpret gene expression profiles in terms of biological functions, molecular pathways, or genomic locations. GSA performs statistical tests for independent microarray samples at the level of gene sets rather than individual genes. Nowadays, an increasing number of microarray studies are conducted to explore the dynamic changes of gene expression in a variety of species and biological scenarios. In these longitudinal studies, gene expression is repeatedly measured over time such that a GSA needs to take into account the within-gene correlations in addition to possible between-gene correlations.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21722407 PMCID: PMC3142525 DOI: 10.1186/1471-2105-12-273
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Estimated Type I errors for the test of no treatment effect based on asymptotic distribution
| #time.points | #genes | normal | exponential | Poisson | Cauchy |
|---|---|---|---|---|---|
| 2 | 5 | 0.060 | 0.053 | 0.063 | 0.021 |
| 10 | 0.047 | 0.052 | 0.048 | 0.024 | |
| 20 | 0.053 | 0.046 | 0.054 | 0.026 | |
| 30 | 0.048 | 0.063 | 0.058 | 0.019 | |
| 40 | 0.044 | 0.052 | 0.053 | 0.021 | |
| 50 | 0.043 | 0.052 | 0.057 | 0.020 | |
| 100 | 0.040 | 0.050 | 0.042 | 0.020 | |
| 5 | 5 | 0.056 | 0.052 | 0.059 | 0.032 |
| 10 | 0.053 | 0.055 | 0.057 | 0.025 | |
| 20 | 0.047 | 0.045 | 0.066 | 0.020 | |
| 30 | 0.060 | 0.058 | 0.050 | 0.014 | |
| 40 | 0.050 | 0.049 | 0.047 | 0.018 | |
| 50 | 0.044 | 0.041 | 0.041 | 0.016 | |
| 100 | 0.062 | 0.047 | 0.050 | 0.023 | |
The data from the same gene have unstructured correlation.
Estimated type I error of the test of no treatment by time interaction at 0
| #genes | normal | exponential | Poisson | Cauchy |
|---|---|---|---|---|
| 5 | 0.087 | 0.103 | 0.099 | 0.046 |
| 10 | 0.074 | 0.082 | 0.064 | 0.035 |
| 20 | 0.061 | 0.063 | 0.050 | 0.024 |
| 30 | 0.070 | 0.071 | 0.063 | 0.019 |
| 40 | 0.071 | 0.060 | 0.065 | 0.019 |
| 50 | 0.064 | 0.052 | 0.056 | 0.011 |
| 100 | 0.037 | 0.051 | 0.048 | 0.012 |
| 200 | 0.043 | 0.050 | 0.052 | 0.018 |
| 500 | 0.048 | 0.040 | 0.051 | 0.022 |
| 1000 | 0.057 | 0.046 | 0.048 | 0.013 |
The data from the same gene followed unstructured correlation. For each simulation, there are two time points.
Figure 1The power curve of NP statistic based on the asymptotic distribution compared to LME and GEE. The empirical powers of the NP statistics for testing of no treatment effect based on the asymptotic distribution compared to LME and GEE are given here. The powers were estimated at level 0.05. Δ is the log-scale mean difference between successive treatment groups.
Estimated type I errors for the permutation test of no treatment effect compared to GEE
| distribution | n1 | n2 | G | permutation NP | GEE |
|---|---|---|---|---|---|
| Normal | 5 | 6 | 20 | 0.041 | 0.097 |
| 25 | 25 | 20 | 0.055 | 0.058 | |
| 45 | 50 | 20 | 0.047 | 0.056 | |
| 5 | 6 | 50 | 0.045 | 0.105 | |
| 25 | 25 | 50 | 0.058 | 0.053 | |
| 45 | 50 | 50 | 0.046 | 0.044 | |
| 5 | 6 | 100 | 0.033 | 0.087 | |
| 25 | 25 | 100 | 0.053 | 0.058 | |
| 45 | 50 | 100 | 0.047 | 0.045 | |
| Poisson | 5 | 6 | 20 | 0.040 | 0.096 |
| 25 | 25 | 20 | 0.058 | 0.055 | |
| 45 | 50 | 20 | 0.058 | 0.049 | |
| 5 | 6 | 50 | 0.041 | 0.109 | |
| 25 | 25 | 50 | 0.058 | 0.061 | |
| 45 | 50 | 50 | 0.052 | 0.062 | |
| 5 | 6 | 100 | 0.028 | 0.075 | |
| 25 | 25 | 100 | 0.053 | 0.063 | |
| 45 | 50 | 100 | 0.050 | 0.048 | |
| Exponential | 5 | 6 | 20 | 0.040 | 0.101 |
| 25 | 25 | 20 | 0.046 | 0.070 | |
| 45 | 50 | 20 | 0.047 | 0.062 | |
| 5 | 6 | 50 | 0.041 | 0.083 | |
| 25 | 25 | 50 | 0.048 | 0.056 | |
| 45 | 50 | 50 | 0.052 | 0.051 | |
| 5 | 6 | 100 | 0.041 | 0.087 | |
| 25 | 25 | 100 | 0.046 | 0.053 | |
| 45 | 50 | 100 | 0.044 | 0.059 | |
The data from different genes and repeated measurements from the same gene have AR(1) correlation with correlation coefficient 0.5. The n1 and n2 are the sample sizes for treatment groups 1 and 2, respectively. G is the number of genes in the gene set. The estimate is at 0.05 level.
Figure 2Power comparisons for the permutation test of no treatment effect compared with GEE. The power curves for using permutation tests for treatment effect are given here. The powers were estimated at level 0.05. G is the number of genes, n is the number of replicates in the two treatment groups, and Δ is the mean difference between the treatment groups.
Figure 3The distribution of the gene set sizes. The histogram showed the distribution of the size of the 548 gene sets used for data analysis.
The IL-2 regulated gene sets.
| Gene Set | FDR |
|---|---|
| Ross cbf | 0.020 |
| Peart histone up | 0.047 |
| Rome insulin 2f up | 0.038 |
| Hivnefpathway | 0.025 |
| Cell adhesion | 0.041 |
| Haddad hsc cd7 up | 0.010 |
| Flechner kidney transplant rejection pbl up | 0.009 |
| Shepard pos reg of cell proliferation | 0.029 |
| Haddad cd45cd7 plus vs minus up | 0.010 |
| Hsiao liver specific genes | 0.031 |
| Takeda nup8 hoxa9 3d up | 0.030 |
| Cromer hypopharyngeal met vs non dn | 0.028 |
| Vanasse bcl2 targets | 0.006 |
| Gamma unique fibro dn | 0.018 |
| Tnfalpha adip dn | 0.026 |
| Gn camp granulosa dn | 0.041 |
| Aged mouse neocortex up | 0.026 |
| Adip diff up | 0.006 |
| Hsa04370 vegf signaling pathway | 0.016 |
| Hsa04520 adherens junction | 0.008 |