| Literature DB >> 16042779 |
Nitin Jain1, HyungJun Cho, Michael O'Connell, Jae K Lee.
Abstract
BACKGROUND: The evaluation of statistical significance has become a critical process in identifying differentially expressed genes in microarray studies. Classical p-value adjustment methods for multiple comparisons such as family-wise error rate (FWER) have been found to be too conservative in analyzing large-screening microarray data, and the False Discovery Rate (FDR), the expected proportion of false positives among all positives, has been recently suggested as an alternative for controlling false positives. Several statistical approaches have been used to estimate and control FDR, but these may not provide reliable FDR estimation when applied to microarray data sets with a small number of replicates.Entities:
Mesh:
Year: 2005 PMID: 16042779 PMCID: PMC1187876 DOI: 10.1186/1471-2105-6-187
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Scatter plots of null data. (a) null data within the same condition from the resampling method; (b) null data between the different conditions from the resampling method; (c) null data within the same condition from the Mix-all method; (d) null data between the different conditions from the mix-all method;
Figure 2M vs A plot of simulated data. The simulated data contains 10% significant genes (indicated by 'x'), and 90% insignificant genes.
Figure 3Comparison of four FDR estimation methods. (a), (b), (c), and (d) are the plots between true and estimated FDR for simulated data with 5%, 10%, 20%, and 50% differentially expressed genes, respectively.
Numbers of differentially expressed genes discovered by five methods
| FDR cutoff | BY | BH | SPLOSH | Mix-all | RIR |
| 0.0001 | 1397 | 1730 | 2876 | 2542 | 2074 |
| 0.001 | 1730 | 2162 | 3134 | 2958 | 2485 |
| 0.01 | 2160 | 2849 | 3467 | 3694 | 3382 |
| 0.05 | 2670 | 3661 | 5654 | 4594 | 4548 |
Minimum FDR estimates of well-known genes found to be differentially regulated genes
| Gene Symbol | Gene Title | BY | BH | SPLOSH | Mix-all | RIR |
| CD97 | CD97 antigen | 0.0230 | 0.0023 | 0.0489 | <0.0001 | 0.0006 |
| GATA3 | GATA-binding protein-3 | 0.0208 | 0.0021 | 0.0489 | <0.0001 | 0.0006 |
| Clast3-pending | CD40 ligand-activated specific transcript | 0.1005 | 0.0103 | <0.0001 | 0.0007 | 0.0034 |
| GZMK | Granzyme K | 0.2768 | 0.0277 | 0.0524 | 0.0037 | 0.0091 |
| FAF1 | Fas-associated factor-1 | 1.0000 | 0.1100 | <0.0001 | 0.0335 | 0.0038 |