| Literature DB >> 23413432 |
Abstract
Many methods of gene set analysis developed in recent years have been compared empirically in a number of comprehensive review articles. Although it is recognized that different methods tend to identify different gene sets as significant, no consensus has been worked out as to which method is preferable, as the recommendations are often contradictory. In this article, we want to group and compare different methods in terms of the methodological assumptions pertaining to definition of a sample and formulation of the actual null hypothesis. We discuss four models of statistical experiment explicitly or implicitly assumed by most if not all currently available methods of gene set analysis. We analyse validity of the models in the context of the actual biological experiment. Based on this, we recommend a group of methods that provide biologically interpretable results in statistically sound way. Finally, we demonstrate how correlated or low signal-to-noise data affects performance of different methods, observed in terms of the false-positive rate and power.Entities:
Mesh:
Year: 2014 PMID: 23413432 PMCID: PMC4103537 DOI: 10.1093/bib/bbt002
Source DB: PubMed Journal: Brief Bioinform ISSN: 1467-5463 Impact factor: 11.622
Examples of competitive, self-contained and parametric methods
| Method | Statistic | Significance assessment |
|---|---|---|
| Q1 (Tian | Gene randomization | |
| FCS—Functional Class Score (Pavlidis | Gene randomization | |
| GSEA (Subramanian | Kolmogorov–Smirnov statistic comparing ranks of | Sample randomization |
| GSA (Efron and Tibshirani [ | The | Sample randomization for standardized test statistics |
| SAFE (Barry | Kolmogorov–Smirnov or Wilcoxon rank-sum statistic comparing | Sample randomization |
| Globaltest (Goeman | Asymptotic normal distribution, or for small sample approximation by scaled | |
| Q2 (Tian | Same as Q1 | Sample randomization |
| FCS.SC—self-contained version of FCS (Pavlidis | Same as FCS | Sample randomization |
| ES.SC—self-contained version of the enrichment score (Subramanian | Kolmogorov–Smirnov statistic comparing | Analytical Kolmogorov–Smirnov distribution or randomization of samples |
| PAGE (Kim and Volksy [ | Null distribution of | |
| where | ||
| CATEGORY (Jiang | Null distribution of | |
| Other parametric tests proposed by Irizarry | Wilcoxon rank-sum statistic or | Corresponding analytical distributions |
False-positive rates for the self-contained hypothesis for varying level of correlation in the gene set observed for different methods of gene set analysis
| Method | Correlation | ||||
|---|---|---|---|---|---|
| 0 | 0.2 | 0.4 | 0.6 | 0.8 | |
| Q1 | 0.05 | 0.096 | 0.178 | 0.22 | 0.282 |
| GSEA | 0.034 | 0.07 | 0.066 | 0.062 | 0.06 |
| GSA | 0.056 | 0.072 | 0.07 | 0.086 | 0.096 |
| GSA2 | 0.05 | 0.057 | 0.047 | 0.06 | 0.05 |
| SAFE | 0.053 | 0.043 | 0.06 | 0.04 | 0.057 |
| Globaltest | 0.05 | 0.036 | 0.038 | 0.062 | 0.066 |
| Q2 | 0.054 | 0.034 | 0.046 | 0.044 | 0.05 |
| ES.SC | 0.048 | 0.052 | 0.054 | 0.052 | 0.054 |
| ES.SC Analytic | 0.044 | 0.13 | 0.318 | 0.556 | 0.822 |
| PAGE | 0.03 | 0.148 | 0.32 | 0.606 | 0.702 |
| CATEGORY | 0.066 | 0.482 | 0.632 | 0.682 | 0.772 |
| PAR Wilcoxon | 0.056 | 0.516 | 0.644 | 0.716 | 0.802 |
Methods are denoted as in Table 1. The following specific settings of the methods were used: SAFE uses the Wilcoxon statistic; Globaltest uses the asymptotic null distribution; ES.SC uses null distribution based on permutation of samples; ES.SC Analytic uses the Kolmogorov–Smirnov null distribution; GSA uses all genes in the data set for re-standardization; GSA2 is the modified version of GSA with the P-value based on Equation (6). PAR Wilcoxon is the parametric method suggested by Irizarry et al. [16], see last row in Table 1.
Figure 3:Power of selected methods as a function of correlation and the number of differentially expressed genes (n.DE) in the gene set. Strong effect, Δ = 1.5. Number of genes d = 1000.
Figure 1:Power of selected methods as a function of correlation and the number of differentially expressed genes (n.DE) in the gene set. Small effect, Δ = 0.5. Number of genes d = 1000.
Figure 2:Power of selected methods as a function of correlation and the number of differentially expressed genes (n.DE) in the gene set. Medium effect, Δ = 1. Number of genes d = 1000.
Figure 4:Power of sample randomization, competitive methods for different number of genes in the study d = 100 or d = 1000, as a function of correlation and the number of differentially expressed genes (n.DE) in the gene set. Medium effect, Δ = 1.
False-positive rates for the competitive hypothesis as a function of the average number of differentially expressed genes in all the gene sets (n.DE) and correlation in GS1
| Method | Correlation | |||||
|---|---|---|---|---|---|---|
| 0 | 0.2 | 0.4 | 0.6 | 0.8 | ||
| Q1 | 6 | 0.005 | 0.01 | 0.055 | 0.035 | 0.095 |
| GSEA | 0.15 | 0.205 | 0.155 | 0.175 | 0.16 | |
| GSA | 0.01 | 0.01 | 0.005 | 0 | 0.005 | |
| GSA2 | 0.01 | 0 | 0 | 0.005 | 0 | |
| SAFE | 0.01 | 0.01 | 0 | 0 | 0 | |
| PAGE | 0 | 0.005 | 0.035 | 0.045 | 0.09 | |
| PAR Wilcoxon | 0.005 | 0 | 0.02 | 0.01 | 0.01 | |
| Q1 | 12 | 0.01 | 0.065 | 0.09 | 0.115 | 0.175 |
| GSEA | 0.04 | 0.085 | 0.075 | 0.09 | 0.075 | |
| GSA | 0.02 | 0.02 | 0.005 | 0.03 | 0.005 | |
| GSA2 | 0 | 0.02 | 0.025 | 0.02 | 0.01 | |
| SAFE | 0.005 | 0.01 | 0.01 | 0.025 | 0.035 | |
| PAGE | 0.005 | 0.07 | 0.15 | 0.26 | 0.285 | |
| PAR Wilcoxon | 0.005 | 0.02 | 0.115 | 0.175 | 0.245 | |
Figure 5:Power of methods comparing expression in GS1 with expression in the remaining gene sets as a function of the number of differentially expressed genes in GS1 (n.DE) and their correlation.