| Literature DB >> 17419876 |
Abstract
BACKGROUND: Currently, most tests of differential gene expression using Affymetrix expression array data are performed using expression summary values representing each probe set on a microarray. Recently testing methods have been proposed which incorporate probe level information. We propose a new approach that uses Fisher's method of combining evidence from multiple sources of information. Specifically, we combine p-values from probe level tests of significance.Entities:
Mesh:
Year: 2007 PMID: 17419876 PMCID: PMC1854896 DOI: 10.1186/1471-2164-8-96
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1ROC and FDR plots for each of the spike-in datasets. (a) ROC curves for the Golden Spike data. (b) ROC curves for the Gene Logic Tonsil data. (c) ROC curves for the Affymetrix Latin Square data. (d) FDR plot for the Golden Spike data. (e) FDR plot for the Gene Logic Tonsil data. (f) FDR plot for the Affymetrix Latin Square data.
IQR of Ranks for True Positives.
| Data | Method | Q1 | Median | Q3 |
| Golden Spike | ANOVA | 345.00 | 1624.00 | 3986.50 |
| Combined P | 337.50 | 1173.00 | 3527.50 | |
| Cyber-T | 356.50 | 1490.00 | 4640.00 | |
| Median t | 339.50 | 1320.00 | 3719.00 | |
| Moderated t | 366.50 | 1543.00 | 5403.00 | |
| Original t | 445.50 | 1652.00 | 5545.50 | |
| Gene Logic Tonsil | ANOVA | 182.25 | 363.50 | 1455.50 |
| Combined P | 182.25 | 363.50 | 1031.00 | |
| Cyber-T | 182.25 | 363.50 | 3749.00 | |
| Median t | 182.25 | 363.50 | 2638.50 | |
| Moderated t | 182.25 | 365.50 | 27707.00 | |
| Original t | 182.25 | 390.50 | 39284.00 | |
| Affymetrix Latin Square | ANOVA | 956.25 | 1951.50 | 3023.75 |
| Combined P | 956.25 | 1946.50 | 2984.50 | |
| Cyber-T | 956.25 | 1957.50 | 2995.75 | |
| Median t | 956.25 | 1949.50 | 2998.75 | |
| Moderated t | 965.25 | 1961.50 | 3002.75 | |
| Original t | 965.25 | 1961.50 | 3002.75 | |
This table shows the interquartile range (IQR) of ranks by method for true positives for each of the datasets. For the Golden Spike data there are a total of 1331 true positives. For the Gene Logic Tonsil data there are 66 comparisons on 11 differentially spiked transcripts for a total of 726 true positives. For the Affymetrix Latin Square data there are 91 comparisons on 42 differentially spiked transcripts for a total of 3822 true positives.
Power by Intensity Range.
| Power by Intensity Quartile | ||||||
| Data | Method | Q1 | Q2 | Q3 | Q4 | Overall Power |
| Golden Spike | ANOVA | 0.096 | 0.192 | 0.295 | 0.465 | 0.262 |
| Combined P | 0.144 | 0.315 | 0.413 | 0.568 | 0.360 | |
| Cyber-T | 0.042 | 0.141 | 0.259 | 0.477 | 0.230 | |
| Median t | 0.141 | 0.306 | 0.404 | 0.556 | 0.352 | |
| Moderated t | 0.060 | 0.159 | 0.265 | 0.396 | 0.220 | |
| Original t | 0.006 | 0.009 | 0.021 | 0.033 | 0.017 | |
| Gene Logic Tonsil | ANOVA | 0.341 | 0.619 | 0.829 | 0.577 | 0.591 |
| Combined P | 0.390 | 0.646 | 0.867 | 0.615 | 0.629 | |
| Cyber-T | 0.374 | 0.624 | 0.790 | 0.566 | 0.588 | |
| Median t | 0.368 | 0.630 | 0.851 | 0.599 | 0.612 | |
| Moderated t | 0.258 | 0.597 | 0.746 | 0.549 | 0.537 | |
| Original t | 0.203 | 0.547 | 0.713 | 0.473 | 0.483 | |
| Affymetrix Latin Square | ANOVA | 0.418 | 0.819 | 0.915 | 0.813 | 0.741 |
| Combined P | 0.518 | 0.887 | 0.945 | 0.895 | 0.811 | |
| Cyber-T | 0.646 | 0.925 | 0.986 | 0.972 | 0.882 | |
| Median t | 0.591 | 0.916 | 0.982 | 0.967 | 0.864 | |
| Moderated t | 0.584 | 0.888 | 0.968 | 0.927 | 0.841 | |
| Original t | 0.518 | 0.847 | 0.941 | 0.872 | 0.795 | |
This table shows the power for each of the four intensity quartiles as well as the overall power for each of the datasets and methods. The power is calculated as the proportion of true positives that were detected while maintaining an overall false discovery rate of 0.05 or less.
False Positive Rates.
| Method | Golden Spike | Gene Logic Tonsil | Affymetrix Latin Square |
| ANOVA | 0.286 (0.135) | 0.146 (0.007) | 0.096 (0.018) |
| Combined P | 0.374 (0.110) | 0.146 (0.003) | 0.155 (0.007) |
| Cyber-T | 0.190 (0.096) | 0.037 (0.005) | 0.047 (0.017) |
| Median t | 0.004 (0.131) | 2.8 × 10-4 (0.008) | 2.8 × 10-4 (0.023) |
| Moderated t | 0.160 (0.079) | 0.027 (0.006) | 0.034 (0.014) |
| Original t | 0.123 (0.051) | 0.024 (0.007) | 0.035 (0.016) |
This table shows the observed false positive rate calculated using a p-value cutoff of 0.01. The false positive rate based on a calibrated p-value of 0.01 is shown in parentheses. The calibrated p-value was calculated using probe sets called Absent on all arrays for a given experiment. For the combined p method, a p-value cutoff of 0.005 was used because we use the minimum of the two one-sided combined p-values.
Figure 2Selection Curves. The selection curves show the proportion of genes selected in common with each of the other methods when a given number of top ranked genes is selected as differentially expressed by combined p. (a) Selection curves for the Golden Spike data. (b) Selection curves for the Gene Logic Tonsil data. (c) Selection curves for the Affymetrix Latin Square data. (d) Selection curves for the YMCAT versus YWT comparison for the MCAT data. (e) Selection curves for the OMCAT versus OWT comparison for the MCAT data. (f) Selection curves for the YWT versus OWT comparison for the MCAT data.
Proportion of the MCAT qRT-PCR assayed genes selected by method.
| YMCAT vs YWT | OMCAT vs OWT | OWT vs YWT | |||||||
| Method | Top 100 | Top 150 | Top 200 | Top 100 | Top 150 | Top 200 | Top 100 | Top 150 | Top 200 |
| ANOVA | 0.277 | 0.362 | 0.489 | 0.085 | 0.106 | 0.106 | 0.128 | 0.128 | 0.128 |
| Combined P | 0.277 | 0.340 | 0.426 | 0.064 | 0.106 | 0.106 | 0.085 | 0.128 | 0.128 |
| Cyber-T | 0.213 | 0.255 | 0.362 | 0.064 | 0.106 | 0.149 | 0.149 | 0.149 | 0.149 |
| Median t | 0.255 | 0.362 | 0.426 | 0.043 | 0.064 | 0.085 | 0.106 | 0.128 | 0.128 |
| Moderated t | 0.085 | 0.106 | 0.128 | 0.021 | 0.043 | 0.043 | 0.128 | 0.128 | 0.128 |
| Original t | 0.021 | 0.043 | 0.064 | 0.000 | 0.000 | 0.000 | 0.064 | 0.085 | 0.106 |
This table shows the proportion of the 47 qRT-PCR assayed genes ranked in the top 100, 150 and 200 by each of the testing methods applied to the corresponding array data.
Golden Spike Probe Set Examples.
| Rank by Method | |||||||||||
| Probe Set | Ordered t-statistics | ANOVA | Combined P | Cyber-T | Median t | Moderated t | Original t | ||||
| 154940-at | -17.72 | -13.75 | -11.15 | -5.34 | -2.30 | 3513 | 1292 | 8596 | 1172 | 8515 | 7815 |
| -0.38 | 2.15 | 4.14 | 4.68 | 7.2 | |||||||
| 8.34 | 9.16 | 12.57 | 15.11 | ||||||||
| 146788-at | -3.58 | -2.44 | -2.43 | -2.22 | -1.31 | 8410 | 6012 | 8172 | 8337 | 8679 | 7217 |
| -1.06 | -0.82 | -0.11 | 0.14 | 0.44 | |||||||
| 1.14 | 1.44 | 2.67 | 48.25 | ||||||||
| 151862-at | -0.79 | -0.21 | -0.20 | -0.12 | 0.03 | 2680 | 1173 | 2556 | 698 | 4030 | 5631 |
| 0.15 | 3.47 | 4.63 | 5.74 | 7.02 | |||||||
| 7.89 | 7.98 | 9.08 | 10.61 | ||||||||
| 153041-at | -0.48 | -0.32 | 0.66 | 0.76 | 0.87 | 1664 | 603 | 1155 | 1822 | 1815 | 2968 |
| 1.37 | 1.69 | 3.08 | 3.42 | 10.25 | |||||||
| 18.85 | 19.24 | 19.57 | 29.19 | ||||||||
This table shows the ordered probe level t-statistics and rankings by method for selected probe sets from the Golden Spike data.