| Literature DB >> 17612399 |
Irina Dinu1, John D Potter, Thomas Mueller, Qi Liu, Adeniyi J Adewale, Gian S Jhangri, Gunilla Einecke, Konrad S Famulski, Philip Halloran, Yutaka Yasui.
Abstract
BACKGROUND: Gene-set analysis evaluates the expression of biological pathways, or a priori defined gene sets, rather than that of individual genes, in association with a binary phenotype, and is of great biologic interest in many DNA microarray studies. Gene Set Enrichment Analysis (GSEA) has been applied widely as a tool for gene-set analyses. We describe here some critical problems with GSEA and propose an alternative method by extending the individual-gene analysis method, Significance Analysis of Microarray (SAM), to gene-set analyses (SAM-GS).Entities:
Mesh:
Substances:
Year: 2007 PMID: 17612399 PMCID: PMC1931607 DOI: 10.1186/1471-2105-8-242
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Performance of GSEA and SAM-GS on Test 1. Proportions of randomly generated null gene sets that are identified by each method to be associated with the phenotype (p-value ≤ 0.05) in a mouse-microarray study.
| | | GSEA | 100% | 100% | 100% | 100% |
| SAM-GS | 0% | 0% | 0% | 0% | |
| | | GSEA | 100% | 100% | 100% | 100% |
| SAM-GS | 0% | 0% | 0% | 0% | |
| | | GSEA | 100% | 100% | 100% | 100% |
| SAM-GS | 0% | 0% | 0% | 0% | |
| | | GSEA | 100% | 100% | 100% | 100% |
| SAM-GS | 0% | 0% | 0% | 0% | |
| | | GSEA | 96% | 100% | 100% | 100% |
| SAM-GS | 0% | 0% | 0% | 0% | |
| | | GSEA | 13% | 100% | 100% | 100% |
| SAM-GS | 0% | 0% | 0% | 0% | |
| | | GSEA | 8% | 100% | 100% | 100% |
| SAM-GS | 0% | 0% | 0% | 0% | |
| | | GSEA | 0% | 24% | 100% | 100% |
| SAM-GS | 0% | 0% | 0% | 0% | |
Performance of GSEA and SAM-GS on Test 2. Proportions of randomly generated non-null gene sets that are identified by each method to be associated with the phenotype (p-value ≤ 0.05) in a mouse-microarray study.
| Half of genes with | | GSEA | 1% | 3% | 0% | 1% |
| SAM-GS | 93% | 100% | 100% | 100% | |
| Half of genes with | | GSEA | 3% | 4% | 3% | 1% |
| SAM-GS | 100% | 100% | 100% | 100% | |
| Half of genes with | | GSEA | 6% | 7% | 7% | 18% |
| SAM-GS | 100% | 100% | 100% | 100% | |
| Half of genes with | | GSEA | 12% | 18% | 31% | 66% |
| SAM-GS | 100% | 100% | 100% | 100% | |
| Half of genes with | | GSEA | 20% | 64% | 88% | 100% |
| SAM-GS | 100% | 100% | 100% | 100% | |
| Half of genes with | | GSEA | 69% | 100% | 100% | 100% |
| SAM-GS | 100% | 100% | 100% | 100% | |
Figure 1A statistically significant GSEA result. An illustration of a statistically-significant GSEA result with 100 genes selected at random from genes with no or weak correlation of expression with the phenotype (|r| < 0.4).
Results of the analyses of three datasets by GSEA and SAM-GS.
| 0.1% | 4 | 5 | 6 | 6 | 0.78/0.98 (0.94) | |
| 0.3% | 3 | 36 | 6 | 308 | 0.21/0.94 (0.68) | |
| 79.9% | 0 | 182 | 5 | 182 | 0.06/NA§ (NA§) | |
* FDR = False discovery rate estimate
† AUC = Area under the ROC curve
‡ Taking SAM-GS p ≤ 0.05 as the target to be predicted
§ All gene sets in the leukemia dataset had SAM-GS p ≥ 0.05
The 31 gene sets for which SAM-GS and GSEA strongly disagreed (SAM-GS FDR ≤ 0.01, GSEA FDR ≥ 0.49) in the p53 analysis.
| 0.87 | 0.21 | ≤ 0.01 | < 0.001 | Pathway member | |
| 0.57 | 0.04 | ≤ 0.01 | < 0.001 | Apoptosis | |
| 0.84 | 0.13 | ≤ 0.01 | < 0.001 | ||
| 0.90 | 0.29 | ≤ 0.01 | < 0.001 | Cell cycle | |
| 0.88 | 0.32 | ≤ 0.01 | < 0.001 | Apoptosis | |
| 0.51 | 0.01 | ≤ 0.01 | < 0.001 | Pathway member | |
| 0.83 | 0.56 | ≤ 0.01 | < 0.001 | Cell cycle | |
| 0.83 | 0.34 | ≤ 0.01 | < 0.001 | Integrated negative feedback loop between Akt and | |
| 0.83 | 0.42 | ≤ 0.01 | 0.001 | Apoptosis | |
| 0.98 | 0.49 | ≤ 0.01 | 0.001 | Cell cycle | |
| 0.88 | 0.30 | ≤ 0.01 | 0.001 | Apoptosis | |
| 0.85 | 0.23 | ≤ 0.01 | 0.002 | Pathway member | |
| 0.93 | 0.27 | ≤ 0.01 | 0.002 | Cytokines; JAK/STAT signaling | |
| 0.89 | 0.72 | ≤ 0.01 | 0.003 | Pathway member | |
| 0.81 | 0.50 | ≤ 0.01 | 0.003 | Pathway member | |
| 0.53 | 0.04 | ≤ 0.01 | 0.005 | Pathway member | |
| 0.86 | 0.08 | ≤ 0.01 | 0.005 | Pathway member | |
| 0.81 | 0.37 | ≤ 0.01 | 0.005 | Pathway member | |
| 1.00 | 0.85 | ≤ 0.01 | 0.006 | Pathway member | |
| 0.78 | 0.08 | ≤ 0.01 | 0.007 | Apoptosis (and cytokines) | |
| 0.53 | 0.05 | ≤ 0.01 | 0.007 | Cytokines | |
| 0.84 | 0.07 | ≤ 0.01 | 0.007 | Cytokines; JAK/STAT signaling | |
| 0.86 | 0.31 | ≤ 0.01 | 0.008 | Pathway member | |
| 0.92 | 0.29 | ≤ 0.01 | 0.010 | CPO regulated by | |
| 0.49 | 0.02 | ≤ 0.01 | 0.011 | Cdk5 phosphorylates | |
| 0.95 | 0.48 | ≤ 0.01 | 0.011 | Apoptosis | |
| 0.79 | 0.45 | ≤ 0.01 | 0.012 | Ets1 required for | |
| 0.80 | 0.13 | ≤ 0.01 | 0.012 | At least one known link between wnt and | |
| 0.84 | 0.42 | ≤ 0.01 | 0.013 | unknown | |
| 0.60 | 0.04 | ≤ 0.01 | 0.013 | unknown | |
| 0.80 | 0.52 | ≤ 0.01 | 0.013 | Pathway member | |