| Literature DB >> 18986519 |
Nathan L Tintle1, Aaron A Best, Matthew DeJongh, Dirk Van Bruggen, Fred Heffron, Steffen Porwollik, Ronald C Taylor.
Abstract
BACKGROUND: Despite the widespread usage of DNA microarrays, questions remain about how best to interpret the wealth of gene-by-gene transcriptional levels that they measure. Recently, methods have been proposed which use biologically defined sets of genes in interpretation, instead of examining results gene-by-gene. Despite a serious limitation, a method based on Fisher's exact test remains one of the few plausible options for gene set analysis when an experiment has few replicates, as is typically the case for prokaryotes.Entities:
Mesh:
Year: 2008 PMID: 18986519 PMCID: PMC2587482 DOI: 10.1186/1471-2105-9-469
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Average Type I error rate (empirical α) when nominal α is 0.05
| Set Size | True mean log-ratio of genes in the set | Standard deviation of log-ratios of genes in the set | GSEA-NR | ABSSUM-NR | SUMSQ-NR | SUM-NR | MAX-MEAN-NR | FET 1/2 | FET 1 | FET 2 | FET 3 |
| 2–5 genes | 0 | 0.01–0.05 | 0.0005 | 0 | 0 | 0.056 | 0 | 0 | 0 | 0 | 0 |
| 0.10–0.20 | 0.005 | 0.001 | 0.0006 | 0.058 | 0.002 | 0.0004 | 0 | 0 | 0 | ||
| ± 0.05 | 0.01–0.05 | 0.02 | 0 | 0 | 0.056 | 0 | 0 | 0 | 0 | 0 | |
| 0.10–0.20 | 0.01 | 0.002 | 0.0008 | 0.06 | 0.003 | 0.0005 | 0 | 0 | 0 | ||
| 10 or more genes | 0 | 0.01–0.05 | 0.29 | 0 | 0 | 0.13 | 0 | 0 | 0 | 0 | 0 |
| 0.10–0.20 | 0.03 | 0.004 | 0 | 0.14 | 0.0005 | 0.0001 | 0 | 0 | 0 | ||
| ± 0.05 | 0.01–0.05 | 0.45 | 0 | 0 | 0.15 | 0 | 0 | 0 | 0 | 0 | |
| 0.10–0.20 | 0.08 | 0.006 | 0 | 0.15 | 0.004 | 0.0002 | 0 | 0 | 0 |
Percentage of time method is within 5% of the maximum observed power
| 2–5 | 0.02–0.24 | < 1.0 | 72 | 33 | 15 |
| 1.0+ | 98 | 12 | 16 | ||
| 0.50–0.99 | < 1.0 | 60 | 76 | 58 | |
| 1.0+ | 97 | 26 | 50 | ||
| 1.0+ | < 1.0 | 89 | 95 | 93 | |
| 1.0+ | 100 | 96 | 99 | ||
| 10–20 | 0.02–0.24 | < 1.0 | 60 | 56 | 45 |
| 1.0+ | 100 | 26 | 86 | ||
| 0.50–0.99 | < 1.0 | 67 | 95 | 93 | |
| 1.0+ | 100 | 75 | 87 | ||
| 1.0+ | < 1.0 | 99 | 99 | 100 | |
| 1.0+ | 100 | 100 | 100 | ||
| 50 | 0.02–0.24 | < 1.0 | 57 | 78 | 69 |
| 1.0+ | 100 | 66 | 99 | ||
| 0.50–0.99 | < 1.0 | 88 | 98 | 99 | |
| 1.0+ | 100 | 92 | 93 | ||
| 1.0+ | < 1.0 | 100 | 100 | 100 | |
| 1.0+ | 100 | 100 | 100 |
Percentage of time method is within 5% of the maximum observed power
| 2–5 | 0.02–0.24 | < 1.0 | 31 | 32 | 55 |
| 1.0+ | 63 | 53 | 20 | ||
| 0.50–0.99 | < 1.0 | 25 | 19 | 83 | |
| 1.0+ | 66 | 70 | 71 | ||
| 1.0+ | < 1.0 | 58 | 66 | 99 | |
| 1.0+ | 100 | 76 | 99 | ||
| 10–20 | 0.02–0.24 | < 1.0 | 20 | 51 | 69 |
| 1.0+ | 100 | 100 | 86 | ||
| 0.50–0.99 | < 1.0 | 41 | 56 | 97 | |
| 1.0+ | 97 | 90 | 88 | ||
| 1.0+ | < 1.0 | 93 | 98 | 100 | |
| 1.0+ | 100 | 100 | 100 | ||
| 50 | 0.02–0.24 | < 1.0 | 24 | 59 | 82 |
| 1.0+ | 100 | 100 | 99 | ||
| 0.50–0.99 | < 1.0 | 62 | 87 | 99 | |
| 1.0+ | 100 | 100 | 93 | ||
| 1.0+ | < 1.0 | 100 | 100 | 100 | |
| 1.0+ | 100 | 100 | 100 |
Number of sets found as significantly regulated across the 18 experiments
| FET 3 | 32 | 6 | 3 | 3 |
| FET 2 | 78 | 23 | 11 | 11 |
| FET 1 | 196 | 85 | 58 | 50 |
| FET 1/2 | 338 | 109 | 54 | 45 |
| GSEA-NR | 522 | 137 | 61 | 50 |
| SUMSQ-NR | 311 | 64 | 34 | 31 |
| ABSSUM-NR | 444 | 124 | 66 | 57 |
| SUM-NR | 573 | 142 | 79 | 67 |
| MAXMEAN-NR | 613 | 181 | 91 | 74 |
Simulation settings
| π | 1.0, 0.9, 0.8, 0.5 |
| μ1 | +/-2.0, +/- 1.0, +/- 0.5, +/- 0.25 |
| μ2 | +/- 1.5, +/- 0.5 |
| σ1 | 0.25, 0.5 |
| σ2 | 0.25, 0.5 |
| Gene set sizes (n) | 2, 5, 10, 20, 50 |