| Literature DB >> 20942918 |
Fabrice Berger1, Bertrand De Meulder, Anthoula Gaigneaux, Sophie Depiereux, Eric Bareke, Michael Pierre, Benoît De Hertogh, Mauro Delorenzi, Eric Depiereux.
Abstract
BACKGROUND: Microarray data is frequently used to characterize the expression profile of a whole genome and to compare the characteristics of that genome under several conditions. Geneset analysis methods have been described previously to analyze the expression values of several genes related by known biological criteria (metabolic pathway, pathology signature, co-regulation by a common factor, etc.) at the same time and the cost of these methods allows for the use of more values to help discover the underlying biological mechanisms.Entities:
Mesh:
Year: 2010 PMID: 20942918 PMCID: PMC2964684 DOI: 10.1186/1471-2105-11-510
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Formulation of the different questions asked by the different genesets analysis methods
| Question | Formulation |
|---|---|
| Q0 | Which known genesets are associated with different expression profiles under the two conditions compared? |
| Qcomp | Which geneset definitions are associated with the biggest difference in expression profiles observed under each condition? |
| Qself | Which genesets are associated with diverging expression profiles between conditions compared with random definitions of phenotype? |
| Qcor | Which genesets are defined by members associated with correlated expression changes? |
| Quni | Which genesets are associated with an increase or decrease of all member expression values? |
| Qbidir | Which genesets are defined by differentially expressed genes, regardless of the direction of the regulation? |
| Qint | Which genesets are associated with variable individual expression changes? |
| Quc | Which unidirectional groups have members that are associated with a correlated answer? |
Question Q0 covers all the others; therefore, a method answering this question answers all the others.
Comparison of the properties of several geneset differential expression analysis methods
| Author | Year | Hypothesis | Data used | Group statistic | Significance | Name | Properties | Individual statistic |
|---|---|---|---|---|---|---|---|---|
| * | ||||||||
| * | ||||||||
| Dinu et al | 2007 | Self-contained | Expression data | Sum (d2) | Sample permutations | SAM-GS | Determination of S0 | SAM d statistic |
| Goeman et al | 2004 | Self-contained | Expression data | Q(g) = mean(Q(i)) | Permutation/Gamma/Asymptotic | GlobalTest | P (Y|X) | Q(i) |
| Mansmann & Meister | 2005 | Self-contained | Expression data | F | Sample permutations | GlobalAncova | P (X|Y) | |
The methods are grouped into categories. The upper and lower parts of the table list respectively the competitive and self-contained methods. The methods highlighted in bold rely on a two-step procedure. The methods in plain writing rely on a global analysis, which uses the expression data to compute the geneset statistic in one single step, based on multivariate models. Finally, ANOVA-2 and FAERI are shown in italics.
Figure 1Illustration of the effect of Z-value data reduction on a variance analysis with two classification criteria. The p-values associated with the effect of the condition studied and the intersection are compared before and after Z reduction. After this step, the effect associated with the probeset is null (data not shown). Prior to Z reduction, the probesets are expressed at variable levels. After reduction, the expression level is standardized for all of the probesets and their individual contributions are balanced during the variance analysis. The genesets analyzed are distributed differently and reveal a more pronounced effect of the condition and/or of the interaction between the condition and the probesets. Put differently, both the strength and variability of the individual answer are revealed by this step.
Figure 2Illustration of the distribution of the F statistic evaluated compared with geneset size using the ANOVA-2 procedure on the initial expression data (left panel), on the standardized data (center) and on standardized and unidirectional data (right, FAERI procedure). The graphs in the upper part are generated from random data and show that the directional reduction step induces dependence on the number of members in the geneset. The graphs in the lower part show results obtained from real data (E-GEOD-7479), and illustrate the impact of the standardization of data relative to each probeset as well as dependence on the number of members following the directional reduction step.
Figure 3Illustration of the logarithm of the p-values obtained by ANOVA-2 (left), FAERI based on random data (center) or permutations (right), versus the number of members in the geneset (real dataset E-GEOD-7479). The graphs presented in the center and on the right show that the two procedures to evaluate the significance of the FAERI test give p-values dependant on geneset size.
Figure 4Comparison of the logarithm of the p-values obtained by FAERI based on random data or permutations. The left graph shows the comparison of the p-values obtained during analysis of simulated data. The right graph shows results obtained when analyzing real data (E-GEOD-7479), illustrating that the null distribution evaluated by the two procedures is different in the case of real data, but, nonetheless, that part of the genesets present a similar p-value (diagonally).
Definition of the series of measurements used to evaluate the performances of the geneset differential expression analysis methods
| Difference of expression | Correlation | Diff. Expressed | Over-Expressed | Under-Expressed | Design | |
|---|---|---|---|---|---|---|
| Set 1 | 0.75 | 0.6 | 20 | 20 | 0 | Unidirectional |
| Set 2 | 0.75 | 0 | 20 | 20 | 0 | Unidirectional |
| Set 3 | 0 | 0 | 0 | 0 | 0 | H0 |
| Set 4 | 0.75 | 0.6 | 10 | 10 | 0 | Unidirectional |
| Set 5 | 0.75 | 0 | 10 | 10 | 0 | Unidirectional |
| Set 6 | 1 | 0.6 | 20 | 10 | 10 | Bidirectional |
| Set 7 | 1 | 0 | 20 | 10 | 10 | Bidirectional |
| Set 8 | 1 | 0.6 | 10 | 5 | 5 | Bidirectional |
| Set 9 | 1 | 0 | 10 | 5 | 5 | Bidirectional |
Comparison of geneset differential expression analysis method performances, based on the simulation model proposed by M Ackerman
| Cut-Off | a2.fixed | faeri. | faeri. | GSA. | GSA. | GSA. | Globaltest. | Globaltest. | Globaltest. | Gsea. | Gsea. | Samgs. | Samgs. | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | ||||
| 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | ||||
| 3 | 3 | 3 | 12 | 0 | 0 | 2 | 5 | 3 | 2 | 1 | 2 | 27 | ||||
| 100 | 96 | 97 | 15 | 25 | 12 | 54 | 96 | 95 | 69 | 20 | 91 | 94 | ||||
| 100% DE | 100 | 99 | 99 | 15 | 25 | 12 | 95 | 99 | 97 | 99 | 77 | 98 | 99 | |||
| 100 | 100 | 100 | 78 | 72 | 64 | 99 | 99 | 99 | 100 | 100 | 99 | 100 | ||||
| 72 | 40 | 36 | 0 | 2 | 0 | 3 | 64 | 49 | 14 | 2 | 39 | 56 | ||||
| 50% DE | 89 | 75 | 78 | 0 | 2 | 0 | 42 | 89 | 84 | 42 | 25 | 81 | 84 | |||
| 100 | 94 | 94 | 7 | 17 | 3 | 93 | 99 | 98 | 93 | 84 | 96 | 99 | ||||
| 86 | 62 | 60 | 1 | 2 | 2 | 1 | 10 | 8 | 2 | 0 | 7 | 13 | ||||
| 100% DE | 89 | 69 | 70 | 1 | 2 | 2 | 20 | 35 | 25 | 16 | 3 | 23 | 41 | |||
| 95 | 80 | 80 | 11 | 23 | 11 | 58 | 59 | 58 | 46 | 47 | 56 | 73 | ||||
| 55 | 43 | 39 | 0 | 0 | 0 | 0 | 15 | 7 | 4 | 3 | 4 | 11 | ||||
| 50% DE | 65 | 52 | 54 | 0 | 0 | 0 | 18 | 30 | 24 | 13 | 14 | 23 | 43 | |||
| 81 | 65 | 65 | 1 | 13 | 1 | 56 | 60 | 57 | 43 | 47 | 56 | 73 | ||||
| 0 | 100 | 100 | 1 | 67 | 40 | 100 | 100 | 100 | 1 | 0 | 100 | 100 | ||||
| 100% DE | 0 | 100 | 100 | 1 | 67 | 40 | 100 | 100 | 100 | 10 | 1 | 100 | 100 | |||
| 6 | 100 | 100 | 22 | 98 | 82 | 100 | 100 | 100 | 69 | 32 | 100 | 100 | ||||
| 0 | 84 | 82 | 0 | 7 | 2 | 32 | 95 | 93 | 0 | 0 | 86 | 93 | ||||
| 50% DE | 0 | 95 | 95 | 0 | 7 | 2 | 93 | 100 | 100 | 1 | 0 | 100 | 100 | |||
| 9 | 100 | 100 | 16 | 48 | 15 | 100 | 100 | 100 | 27 | 8 | 100 | 100 | ||||
| 0 | 84 | 84 | 0 | 11 | 8 | 5 | 28 | 17 | 6 | 1 | 15 | 25 | ||||
| 100% DE | 0 | 88 | 89 | 0 | 11 | 8 | 39 | 50 | 43 | 17 | 8 | 43 | 61 | |||
| 2 | 93 | 93 | 35 | 49 | 54 | 79 | 79 | 78 | 57 | 42 | 78 | 88 | ||||
| 0 | 63 | 62 | 0 | 2 | 1 | 3 | 30 | 14 | 0 | 0 | 13 | 23 | ||||
| 50% DE | 0 | 75 | 76 | 0 | 2 | 1 | 36 | 50 | 42 | 3 | 0 | 40 | 58 | |||
| 3 | 86 | 86 | 28 | 27 | 12 | 75 | 78 | 75 | 31 | 15 | 76 | 85 | ||||
Each set of measurements presented was generated 100 times. The measurements in the table show, for each method, the number of detections of the different sets of measurements defined. The upper part of the table presents the results for unidirectional genesets only (all members over expressed) and the lower part concerns groups with a simulated mixed answer (over and under expression). The data related to H0 shows the number of genesets detected by chance (false positives).
Characterization of each method's performances
| a2.fixed | faeri. | faeri. | GSA. | GSA. | GSA. | Globaltest. | Globaltest. | Globaltest. | Gsea. | Gsea. | Samgs. | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 62% | 56% | |||||||||||
| 51% | 50% | 71% | 9'% | 92% | 71% | 62% | ||||||
| 51% | 51% | 60% | 67% | 62% | 58% | 51% | 61% | |||||
| 50% | 50% | 59% | 65% | 62% | 56% | 57% | 61% | |||||
| 50% | 70% | 55% | 50% | |||||||||
| 50% | 53% | 51% | 50% | 50% | ||||||||
| 50% | 55% | 54% | 69% | 75% | 71% | 58% | 54% | |||||
| 50% | 51% | 50% | 68% | 75% | 71% | 51% | 50% | |||||
| 71% | 57% | 54% | 62% | 58% | ||||||||
| 7 | 2 | 1 | 12 | 10 | 11 | 6 | 3 | 4 | 8 | 9 | 5 |
Comparison of the number of true positives and true negatives to the total number of tests (VP+VN)/(P+N). For information, we also computed the mean value for the 8 types of groups studied. The significance threshold to select groups was set at 0.01. The highlighted (in bold) values are relative to methods obtaining a score of more than 75%. The four global methods are more appropriate for geneset analysis and the two-step methods show poor performance. Among the global methods, only ANOVA-2 and FAERI happen to be suited for unidirectional correlated genesets. FAERI and SAMGS are the only methods adapted for the study of bidirectional correlated genesets. Across all genesets tested, the methods giving the best results were: FAERI (perms), FAERI (null), SAMGS (q-value), GlobaTest (gamma), GlobaTest (perms), SAMGS (p-value), GlobalTest (asymptotic). FAERI is the only method suited for the study of all types of genesets tested.
List of genesets in the C2.KEGG category detected significantly by the FAERI.perms method (detection threshold of 0.05 for the p-values)
| faeri. | faeri. | a2.fixed | GSA. | GSA. | GSA. | Globaltest. | Globaltest. | Globaltest. | Gsea. | Gsea. | Samgs. | Samgs. | Gse1056 | Gse4086 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| hsa00010_glycolysis_and_gluconeogenesis | 0.64 | 0.96 | 0.95 | 0.98 | 0.10 | 0.11 | 0.17 | 0.10 | 0.08 | ||||||
| hsa00020_citrate_circle | 0.38 | 0.19 | 0.65 | 0.10 | 0.08 | 0.10 | 0.22 | 0.40 | 0.19 | 0.08 | |||||
| hsa00030_pentose_phosphate_pathway | 0.41 | 0.95 | 0.90 | 0.97 | 0.10 | 0.23 | 0.28 | 0.10 | 0.08 | ||||||
| hsa00051_fructose_and_mannose_metabolism | 0.97 | 0.92 | 0.68 | 0.97 | 0.10 | 0.24 | 0.38 | 0.10 | 0.08 | ||||||
| hsa00100_biosynthesis_of_steroids | 0.94 | 0.15 | 0.75 | 0.14 | 0.10 | 0.30 | 0.11 | 0.32 | 0.10 | 0.08 | |||||
| hsa00190_oxidative_phosphorylation | 0.99 | 0.23 | 0.48 | 0.27 | 0.30 | 0.35 | 0.50 | 0.40 | 0.53 | 0.49 | 0.08 | ||||
| hsa00230_purine_metabolism | 1.00 | 0.12 | 0.69 | 0.43 | 0.63 | 0.13 | 0.10 | 0.30 | 0.23 | 0.51 | 0.49 | 0.08 | |||
| hsa00480_glutathione_metabolsim | 1.00 | 0.19 | 0.52 | 0.10 | 0.27 | 0.30 | 0.50 | 0.22 | 0.37 | 0.30 | 0.08 | ||||
| hsa00511_n_glycan_degradation | 0.90 | 0.19 | 0.59 | 0.08 | 0.15 | 0.14 | 0.30 | 0.29 | 0.48 | 0.30 | 0.08 | 0.06 | 0.21 | ||
| Hsa00530_aminosugars_metabolism | 0.19 | 0.19 | 0.55 | 0.07 | 0.10 | 0.06 | 0.30 | 0.18 | 0.48 | 0.30 | 0.08 | 0.12 | |||
| Hsa00620_pyruvate_metabolism | 1.00 | 0.09 | 0.68 | 0.06 | 0.10 | 0.11 | 0.19 | 0.10 | 0.08 | ||||||
| Hsa00710_carbon_fixation | 1.00 | 0.95 | 0.90 | 0.97 | 0.10 | 0.23 | 0.35 | 0.10 | 0.08 | ||||||
| Hsa00720_reductive_caboxylate_cycle | 1.00 | 0.10 | 0.75 | 0.07 | 0.10 | 0.11 | 0.30 | 0.10 | 0.08 | ||||||
| Hsa00900_terpenoid_biosynthesis | 0.25 | 0.77 | 0.06 | 0.10 | 0.48 | 0.63 | 0.10 | 0.08 | 0.07 | ||||||
| Hsa01032_glycan_structures_degradation | 0.83 | 0.19 | 0.55 | 0.13 | 0.14 | 0.13 | 0.30 | 0.10 | 0.43 | 0.40 | 0.08 | 0.05 | 0.08 | ||
| Hsa01510_neurodegenerative_diseases | 1.00 | 0.21 | 0.92 | 0.92 | 0.97 | 0.09 | 0.10 | 0.71 | 0.72 | 0.10 | 0.08 | 0.08 | |||
| Hsa03010_ribosome | 0.91 | 0.78 | 0.57 | 0.86 | 0.24 | 0.26 | 0.40 | 0.11 | 0.22 | 0.29 | 0.08 | 0.27 | |||
| Hsa03320_ppar_signaling_pathway | 1.00 | 0.17 | 0.55 | 0.06 | 0.10 | 0.11 | 0.27 | 0.40 | 0.08 | ||||||
| Hsa04010_mapk_signaling_pathway | 0.90 | 0.57 | 0.59 | 0.30 | 0.33 | 0.22 | 0.22 | 0.50 | 0.30 | 0.54 | 0.49 | 0.08 | |||
| Hsa04130_snare_interactions_in_vesicular_transport | 0.97 | 0.11 | 0.18 | 0.56 | 0.11 | 0.16 | 0.10 | 0.40 | 0.61 | 0.71 | 0.40 | 0.08 | 0.48 | ||
| Hsa4150_mtor_signaling_pathway | 0.23 | 0.81 | 0.47 | 0.75 | 0.10 | 0.12 | 0.32 | 0.38 | 0.08 | ||||||
| Hsa04210_apoptosis | 0.05 | 0.39 | 0.41 | 0.31 | 0.41 | 0.20 | 0.16 | 0.40 | 0.81 | 0.76 | 0.49 | 0.08 | |||
| Hsa04370_vegf_signaling_pathway | 0.08 | 0.87 | 0.29 | 0.83 | 0.10 | 0.11 | 0.23 | 0.10 | 0.08 | ||||||
| Hsa04510_focal_adhesion | 0.19 | 0.73 | 0.26 | 0.48 | 0.07 | 0.20 | 0.11 | 0.24 | 0.19 | 0.08 | |||||
| Hsa04620_toll_like_receptor_signaling_pathway | 0.61 | 0.47 | 0.66 | 0.10 | 0.11 | 0.44 | 0.10 | 0.08 | |||||||
| Hsa04650_natural_killer_cell_mediated_cytotoxicity | 0.10 | 0.58 | 0.29 | 0.43 | 0.17 | 0.20 | 0.17 | 0.50 | 0.60 | 0.72 | 0.72 | 0.08 | |||
| Hsa04660_t_cell_receptor_signaling_pathway | 0.10 | 0.65 | 0.95 | 0.55 | 0.97 | 0.07 | 0.10 | 0.11 | 0.41 | 0.41 | 0.08 | ||||
| Hsa04662_b_cell_receptor_signaling_pathway | 0.97 | 0.78 | 0.40 | 0.48 | 0.13 | 0.06 | 0.10 | 0.53 | 0.63 | 0.63 | 0.08 | ||||
| Hsa04664_fc_epsilon_ri_signaling_pathway | 0.97 | 0.05 | 0.91 | 0.55 | 0.97 | 0.10 | 0.11 | 0.25 | 0.25 | 0.08 | |||||
| Hsa04810_regulation_of_actin_cytoskeleton | 0.52 | 0.30 | 0.40 | 0.38 | 0.29 | 0.32 | 0.50 | 0.42 | 0.52 | 0.52 | 0.08 | ||||
| Hsa05040_huntingtons_disease | 0.50 | 0.78 | 0.95 | 0.98 | 0.13 | 0.06 | 0.10 | 0.29 | 0.45 | 0.45 | 0.08 | 0.83 | |||
| Hsa05150_cholera_infection | 1.00 | 0.37 | 0.73 | 0.11 | 0.10 | 0.37 | 0.51 | 0.51 | 0.08 | 0.12 | |||||
| Hsa05120_epithelial_cell_signaling_in_helicobacter_pylorii_infection | 1.00 | 0.11 | 0.55 | 0.15 | 0.09 | 0.30 | 0.21 | 0.48 | 0.48 | 0.08 | |||||
| Hsa05130_pathogenic_escherishia_coli_infection_ehec | 1.00 | 0.26 | 0.47 | 0.18 | 0.20 | 0.17 | 0.40 | 0.18 | 0.29 | 0.29 | 0.08 | ||||
| Hsa05131_pathogenic_escherishia_coli_infection_epec | 1.00 | 0.26 | 0.47 | 0.18 | 0.20 | 0.17 | 0.40 | 0.18 | 0.29 | 0.29 | 0.08 | ||||
| Hsa05210_colorectal_cancer | 0.36 | 0.67 | 0.29 | 0.41 | 0.16 | 0.09 | 0.40 | 0.22 | 0.35 | 0.35 | 0.08 | ||||
| Hsa05211_renal_cell_carcinoma | 0.91 | 0.55 | 0.97 | 0.10 | 0.12 | 0.24 | 0.24 | 0.08 | |||||||
| Hsa05212_pancreatic_cancer | 0.44 | 0.47 | 0.29 | 0.34 | 0.09 | 0.20 | 0.43 | 0.54 | 0.54 | 0.08 | |||||
| Hsa05216_thyroid_cancer | 0.86 | 0.91 | 0.53 | 0.55 | 0.20 | 0.17 | 0.15 | 0.30 | 0.30 | 0.54 | 0.54 | 0.08 | 0.10 | ||
| Hsa05219_bladder_cancer | 0.47 | 0.60 | 0.57 | 0.55 | 0.61 | 0.11 | 0.08 | 0.30 | 0.91 | 0.80 | 0.80 | 0.08 | 0.08 | ||
| Hsa05220_chronic_myeloid_leukemia | 1.00 | 0.73 | 0.37 | 0.31 | 0.41 | 0.15 | 0.09 | 0.40 | 0.61 | 0.65 | 0.65 | 0.08 | |||
| Hsa05221_acute_myeloid_leukemia | 1.00 | 0.24 | 0.24 | 0.31 | 0.26 | 0.15 | 0.11 | 0.30 | 0.60 | 0.72 | 0.72 | 0.08 |
For each geneset, the p-values obtained by each method are listed. The values in plain text are not significant. The p-values of the significant (0.05) and highly significant (scientific notation) genesets are shown in bold. The two last columns give the p-values obtained for datasets GSE-1056 and GSE-4086 by FAERI.perms, to illustrate the coherence of the results presented.
List of the genesets of the C2.KEGG category detected with a highly significant threshold (0.01) by the other methods
| Faeri. | Faeri. | GSA. | GSA. | GSA. | Globaltest. | Globaltest. | Globaltest. | Gsea. | Gsea. | Samgs. | Samgs. | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Hsa00010_glycolysis_and_neoglucogenesis | 0.64 | 0.96 | 0.95 | 0.98 | 0.1 | 0.11 | 0.17 | 0.1 | 0.08 | |||
| Hsa00030_pentose_phosphate_pathway | 0.41 | 0.95 | 0.9 | 0.97 | 0.1 | 0.23 | 0.28 | 0.1 | 0.08 | |||
| Hsa00052_galactose_metabolism | 0.73 | 0.08 | 0.95 | 0.77 | 0.97 | 0.1 | 0.11 | 0.13 | 0.1 | 0.08 | ||
| Hsa00053_ascorbate_and_aldarate_metabolism | 0.97 | 0.18 | 0.72 | 0.06 | 0.16 | 0.16 | 0.3 | 0.11 | 0.42 | 0.19 | 0.08 | |
| Hsa00071_fatty_acid_metabolism | 0.71 | 0.15 | 0.52 | 0.2 | 0.18 | 0.5 | 0.11 | 0.3 | 0.49 | 0.08 | ||
| Hsa00100_biosynthesis_of_steriods | 0.94 | 0.15 | 0.75 | 0.14 | 0.1 | 0.3 | 0.11 | 0.32 | 0.1 | 0.08 | ||
| Hsa00272_cysteine_metabolism | 0.94 | 0.25 | 0.34 | 0.62 | 0.1 | 0.1 | 0.21 | 0.46 | 0.19 | 0.08 | ||
| Hsa00340_histidine_metabolism | 1 | 0.17 | 0.36 | 0.11 | 0.35 | 0.41 | 0.5 | 0.11 | 0.21 | 0.59 | 0.9 | |
| Hsa00500_starch_and_sucrose_metabolism | 1 | 0.08 | 0.95 | 0.6 | 0.97 | 0.1 | 0.11 | 0.18 | 0.1 | 0.08 | ||
| Hsa00512_o_glycan_biosynthesis | 0.2 | 0.22 | 0.44 | 0.36 | 0.42 | 0.5 | 0.64 | 0.71 | 0.1 | 0.08 | ||
| Hsa00521_streptomycin_biosynthesis | 0.94 | 0.1 | 0.95 | 0.96 | 0.97 | 0.1 | 0.11 | 0.17 | 0.1 | 0.08 | ||
| Hsa00640_propanoate_metabolism | 0.92 | 0.06 | 0.15 | 0.6 | 0.1 | 0.07 | 0.3 | 0.11 | 0.31 | 0.19 | 0.08 | |
| Hsa00720_reductive_carboxylate_pathway | 1 | 0.1 | 0.75 | 0.7 | 0.1 | 0.11 | 0.3 | 0.1 | 0.08 | |||
| Hsa04664_fc_epsilon_ri_signaling_pathway | 0.97 | 0.91 | 0.55 | 0.97 | 0.1 | 0.11 | 0.25 | 0.1 | 0.08 | |||
| Hsa0521_renal_cell_carcinoma | 7.1E-03 | 0.91 | 0.55 | 0.97 | 0.1 | 0.12 | 0.24 | 0.1 | 0.08 |
For each geneset, the p-values are given for each method we used.
Comparison of the number of groups detected in the C2.KEGG category and of the number of common detections in three datasets
| 0.010 | A2. | Faeri. | Faeri. | GSA. | GSA. | GSA. | Globaltest. | Globaltest. | Globaltest. | Gsea. | Gsea. | Samgs. | Samgs. |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Emexp445 | 62 | 7 | 14 | 14 | 5 | 19 | 0 | 8 | 0 | 0 | 0 | 0 | 0 |
| GSE-1056 | 36 | 9 | 51 | 5 | 0 | 8 | 1 | 15 | 0 | 0 | 0 | 0 | 0 |
| GSE-4086 | 93 | 89 | 94 | 49 | 38 | 62 | 0 | 0 | 0 | 0 | 0 | NA | NA |
| Emexp445-GSE-1056 | 5 | 0 | 6 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| Emexp445-GSE-4086 | 24 | 5 | 9 | 8 | 0 | 11 | 0 | 0 | 0 | 0 | 0 | NA | NA |
| GSE-1056-GSE-4086 | 21 | 7 | 37 | 4 | 0 | 4 | 0 | 0 | 0 | 0 | 0 | NA | NA |
| Emexp445-GSE-1056-GSE-4086 | 4 | 0 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NA | NA |
| 0.050 | A2. | Faeri. | Faeri. | GSA. | GSA. | GSA. | Globaltest. | Globaltest. | Globaltest. | Gsea. | Gsea. | Samgs. | Samgs. |
| Emexp445 | 62 | 18 | 42 | 14 | 5 | 23 | 26 | 45 | 0 | 0 | 0 | 0 | 0 |
| GSE-1056 | 51 | 27 | 80 | 8 | 2 | 12 | 33 | 70 | 40 | 11 | 0 | 15 | 69 |
| GSE-4086 | 119 | 119 | 142 | 49 | 38 | 62 | 32 | 185 | 0 | 0 | 0 | NA | NA |
| Emexp445-GSE-1056 | 15 | 6 | 34 | 1 | 0 | 0 | 4 | 20 | 0 | 0 | 0 | 0 | 0 |
| Emexp445-GSE-4086 | 48 | 14 | 37 | 8 | 0 | 13 | 13 | 44 | 0 | 0 | 0 | NA | NA |
| GSE-1056-GSE-4086 | 31 | 23 | 71 | 4 | 1 | 6 | 7 | 69 | 0 | 0 | 0 | NA | NA |
| Emexp445-GSE-1056-GSE-4086 | 13 | 4 | 31 | 1 | 0 | 0 | 2 | 20 | 0 | 0 | 0 | NA | NA |
The results are shown for each method, for significance thresholds of 0.05 and 0.01. The three datasets analyzed are E-MEXP-445, GSE-1056 and GSE-4086, and concern the same condition (oxygen deprivation). The first column contains the datasets analyzed. The first 3 rows of each table contain the top list obtained by each geneset analysis method. When several datasets are mentioned at the beginning of the row, the number of genesets in the other columns is the intersection of the results from different datasets