| Literature DB >> 12354329 |
Yuanyuan Xiao1, Mark R Segal, Douglas Rabert, Andrew H Ahn, Praveen Anand, Lakshmi Sangameswaran, Donglei Hu, C Anthony Hunt.
Abstract
BACKGROUND: Microarray technology is a powerful methodology for identifying differentially expressed genes. However, when thousands of genes in a microarray data set are evaluated simultaneously by fold changes and significance tests, the probability of detecting false positives rises sharply. In this first microarray study of brachial plexus injury, we applied and compared the performance of two recently proposed algorithms for tackling this multiple testing problem, Significance Analysis of Microarrays (SAM) and Westfall and Young step down adjusted p values, as well as t-statistics and Welch statistics, in specifying differential gene expression under different biological states.Entities:
Year: 2002 PMID: 12354329 PMCID: PMC137578 DOI: 10.1186/1471-2164-3-28
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Global comparison of the differences in gene expression levels between a) a patient (P15) and a normal subject (C1); b) a pair of patients (P15 and P12); c) a pair of normal subjects (C1 and C2).
Figure 2Histogram and QQ plot of t-statistics calculated from the normalized gene expression levels. Large t-statistics are labeled in red.
Figure 3Plots of t-statistics, t-numerators, t-denominators and average intensities against each other. Large t-statistics are labeled in red.
Figure 4The relationship between FDR and in SAM. t-SAM is represented with three different lines with different s0 values: s0 = 0, S0 = 0.034 (the minimum of the pooled standard deviations) and S0 = 2.761 (5th percentile of the pooled standard deviations). Welch-SAM is graphed with S0 = 2.000 (5th percentile of the pooled standard deviations).
Comparison of t-statistics and Welch statistics in identifying significant genes under different values of Δs
| Δ | Significant genes | False positives | FDR |
| 2 | 622 | 312 | 50% |
| 4 | 223 | 77.3 | 35% |
| 6 | 131 | 35.1 | 27% |
| 8 | 88 | 21.2 | 24% |
| 12 | 73 | 13.3 | 18% |
| 16 | 62 | 9.8 | 16% |
| 20 | 57 | 7.7 | 14% |
| 25 | 46 | 5.8 | 13% |
| 40 | 30 | 3.4 | 11% |
| 60 | 27 | 2.4 | 9% |
| 90 | 19 | 1.5 | 8% |
| Welch statistics | |||
| 2 | 1348 | 648.3 | 48% |
| 4 | 525 | 155.9 | 30% |
| 6 | 264 | 64.5 | 24% |
| 8 | 97 | 24.0 | 25% |
| 12 | 50 | 10.4 | 21% |
| 16 | 20 | 4.0 | 20% |
| 20 | 12 | 2.5 | 21% |
| 25 | 7 | 1.0 | 14% |
| 40 | 7 | 1.0 | 14% |
| 60 | 6 | 0.4 | 7% |
| 90 | 6 | 0.4 | 7% |
Figure 5The relationship between adjusted p values and number of genes tested. a) t-statistics; b) Welch statistics. Colored lines are genes ordered by their unadjusted p values, e.g. gene1 has the smallest unadjusted p value and gene600 has the 600th smallest unadjusted p value. c) A comparison of panel a) and b). Red lines represent the results of Welch statistics and black lines represent the results of t-statistics.