| Literature DB >> 23046413 |
Hui Huang1, Xiaogang Wu, Madhankumar Sonachalam, Sammed N Mandape, Ragini Pandey, Karl F MacDorman, Ping Wan, Jake Y Chen.
Abstract
BACKGROUND: Over the past decade, pathway and gene-set enrichment analysis has evolved into the study of high-throughput functional genomics. Owing to poorly annotated and incomplete pathway data, researchers have begun to combine pathway and gene-set enrichment analysis as well as network module-based approaches to identify crucial relationships between different molecular mechanisms.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23046413 PMCID: PMC3439733 DOI: 10.1186/1471-2105-13-S15-S2
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1An overview of the gene-set data integration process. The figure shows the whole process of gene-set data integration and the basic statistics applied to gene-set data sources.
Number of overlapping genes between different data sources
| OMIM | GAD | MSigDB | GeneSigDB | miRecords | HPD | HAPPI* | |
|---|---|---|---|---|---|---|---|
| OMIM | 9012 | 1862 | 3489 | 2792 | 231 | 2559 | 3849 |
| GAD | 7293 | 6821 | 6450 | 432 | 3202 | 4922 | |
| MSigDB | 30525 | 17209 | 759 | 6229 | 10677 | ||
| GeneSigDB | 36791 | 900 | 5904 | 10395 | |||
| miRecords | 1395 | 443 | 725 | ||||
| HPD | 12228 | 10512 | |||||
| HAPPI | 21955 |
* Only PPIs of over 3-star quality are considered here; to calculate the overlap, protein IDs from HAPPI have been first converted to gene symbols.
Figure 2Gene-set scale distributions for PAGED molecule data. A gene-set scale refers to the number of molecules (i.e., genes) involved in a given gene set. The frequency on the y-axis refers to the count of all gene sets falling into the category of a particular gene-set scale size on the x-axis. The distributions are plotted under log scale for both the x-axis and y-axis. The linear trend line in red represents linear regression of PAGED distribution and the linear equation and its R-Square are listed.
Figure 3An overview for the core functionality of the online PAGED website. (A) The PAGED home page providing search by either disease name or gene list; (B) a webpage containing the list of gene sets retrieved as a result of a disease query; (C) a webpage containing the list of gene sets retrieved as a result of a gene list query; (D) an advanced search page in which the user can either search disease name or upload a gene-list to search; (E) a browse page listing the gene sets, their data source and number of genes.
Top 10 search results by querying colorectal cancer at the home page
| Gene-set Name | # of Genes | Data Source |
|---|---|---|
| colorectal cancer | 433 | GAD |
| Colorectal cancer | 134 | KEGG |
| Colorectal cancer | 14 | OMIM |
| Colorectal cancer, somatic | 12 | OMIM |
| Colorectal cancer, hereditary non-polyposis, type 8 | 7 | OMIM |
| Colorectal cancer, susceptibility to | 7 | OMIM |
| Colorectal cancer, hereditary non-polyposis, type 6 | 6 | OMIM |
| Breast and colorectal cancer, susceptibility to | 5 | OMIM |
| Colorectal Cancer | 5 | GAD |
Top search results of colorectal cancer advanced search
| Gene-set Name | Hits | P value | FDR | Data Source |
|---|---|---|---|---|
| Molecular Mechanisms of Cancer | 38 | 2.48E-17 | 7.04E-10 | Protein Lounge |
| PI3K Signaling | 33 | 2.01E-13 | 7.04E-10 | Protein Lounge |
| Akt Signaling | 27 | 9.6E-13 | 7.04E-10 | Protein Lounge |
| ERK Signaling | 24 | 1.53E-10 | 7.04E-10 | Protein Lounge |
| GSK3 Signaling | 23 | 1.32E-13 | 7.04E-10 | Protein Lounge |
| inactivation of gsk3 by akt causes accumulation of b-catenin in alveolar macrophages | 9 | 3.7E-11 | 7.04E-10 | BioCarta |
| atm signaling pathway | 8 | 6.28E-11 | 7.04E-10 | BioCarta |
| wnt signaling pathway | 7 | 7.7E-09 | 7.04E-10 | BioCarta |
| cell cycle: g2/m checkpoint | 7 | 2.14E-08 | 7.04E-10 | BioCarta |
| cell cycle: g1/s check point | 7 | 2.14E-08 | 7.04E-10 | BioCarta |
| Canonical Wnt signaling pathway | 8 | 9.24E-10 | 7.04E-10 | NCI-Nature |
| Presenilin action in Notch and Wnt signaling | 8 | 3.16E-08 | 7.04E-10 | NCI-Nature |
| Plasma membrane estrogen receptor signaling | 7 | 1.41E-08 | 7.04E-10 | NCI-Nature |
| FOXM1 transcription factor network | 7 | 2.48E-07 | 7.04E-10 | NCI-Nature |
| LPA receptor mediated events | 7 | 1.45E-06 | 7.04E-10 | NCI-Nature |
| Metabolism of xenobiotics by cytochrome P450 | 20 | 3.3E-25 | 7.04E-10 | KEGG |
| Drug metabolism - cytochrome P450 | 17 | 4.96E-21 | 7.04E-10 | KEGG |
| Bladder cancer | 15 | 3.29E-18 | 7.04E-10 | KEGG |
| Cytokine-cytokine receptor interaction | 15 | 1.39E-06 | 7.04E-10 | KEGG |
| Colorectal cancer | 14 | 4.43E-14 | 7.04E-10 | KEGG |
| p53 signaling pathway | 14 | 4.92E-14 | 7.04E-10 | KEGG |
| Prostate cancer | 14 | 1.66E-12 | 7.04E-10 | KEGG |
| Xenobiotics | 5 | 3.32E-08 | 7.04E-10 | Reactome |
| Formation of incision complex in GG-NER | 5 | 5.75E-06 | 7.04E-10 | Reactome |
| Global Genomic NER (GG-NER) | 5 | 5.75E-06 | 7.04E-10 | Reactome |
| Dual incision reaction in GG-NER | 5 | 5.75E-06 | 7.04E-10 | Reactome |
| Exocytosis of Alpha granule | 5 | 0.000217 | 1.95E-08 | Reactome |
| SIGNAL_TRANSDUCTION | 55 | 8.36E-28 | 7.04E-10 | MsigDB |
| BIOPOLYMER_METABOLIC_PROCESS | 49 | 4.16E-22 | 7.04E-10 | MsigDB |
| KEGG_PATHWAYS_IN_CANCER | 43 | 9.9E-46 | 7.04E-10 | MsigDB |
| NUCLEOBASENUCLEOSIDENUCLEOTIDE_AND_NUCLEIC_ACID_METABOLIC_PROCESS | 41 | 2.16E-20 | 7.04E-10 | MsigDB |
| NUCLEUS | 41 | 1.8E-18 | 7.04E-10 | MsigDB |
| Immune_Kong10_5640genes_ImmPort_ComprehensiveListofImmune-RelatedGenes | 114 | 3.61E-49 | 7.04E-10 | GeneSigDB |
| Lymphoma_Melendez05_4229genes | 81 | 1.57E-39 | 7.04E-10 | GeneSigDB |
| Breast_Farmer05_3198genes_basal_apocrine_luminal | 66 | 1.08E-21 | 7.04E-10 | GeneSigDB |
| Ovarian_Crijns09_2394Genes_17PathwayPredictor | 57 | 7.94E-30 | 7.04E-10 | GeneSigDB |
| StemCell_Nilsson07_3742genes | 45 | 4.86E-07 | 7.04E-10 | GeneSigDB |
| hsa-miR-19a | 3 | 1.49E-05 | 8.43E-09 | miRecords |
| [hsa-miR-21] | 3 | 0.000116 | 8.43E-09 | miRecords |
| hsa-miR-204 | 3 | 0.000164 | 1.95E-08 | miRecords |
| hsa-miR-21 | 3 | 0.000953 | 2.72E-07 | miRecords |
| hsa-miR-125b | 3 | 0.003089 | 2.72E-07 | miRecords |
Top search results of gene-based search from microarray datasets
| Gene-set Name | Hits | P value | FDR | Data Source |
|---|---|---|---|---|
| SABATES_COLORECTAL_ADENOMA_DN | 58 | 4.57E-96 | 2.76E-10 | MsigDB |
| Breast_Farmer05_3198genes_basal_apocrine_luminal | 35 | 2.91E-13 | 2.76E-10 | GeneSigDB |
| SABATES_COLORECTAL_ADENOMA_UP | 34 | 1.62E-57 | 2.76E-10 | MsigDB |
| Immune_Kong10_5640genes_ImmPort_ComprehensiveListofImmune-RelatedGenes | 34 | 3.56E-08 | 2.76E-10 | GeneSigDB |
| Leukemia_Pellegrini08_2692genes | 32 | 1.28E-15 | 2.76E-10 | GeneSigDB |
| Intestine_Vecchi07_1024genes | 28 | 3.91E-23 | 2.76E-10 | GeneSigDB |
| Viral_Buonomo11_5307genes | 25 | 6.45E-05 | 0.000109 | GeneSigDB |
| SMID_BREAST_CANCER_LUMINAL_B_DN | 23 | 4.16E-19 | 2.76E-10 | MsigDB |
| Lymphoma_Melendez05_4229genes | 22 | 2.03E-06 | 2.76E-10 | GeneSigDB |
| Colon_Kim04_235genes | 21 | 5.18E-30 | 2.76E-10 | GeneSigDB |
| Breast_Parker09_1918genes_IntrinsicGenes_Compilation | 21 | 1.18E-08 | 2.76E-10 | GeneSigDB |
Figure 4CRC-specific gene-set association network (GSAN) on the top gene sets from colorectal cancer study. Node size: Gene-set scale (Counting genes in each gene set); Node color: Gene-set data source; Edge width: Similarity score (≥ 0.1). All gene sets are highly connected to each other, suggesting their collaborative functions in colorectal cancer.
Figure 5CRC-specific gene-set association network (GSAN) with differential gene-set expressions. The differential gene expressions are from the differential analysis based on the microarray data, GSE8671. Node size: Gene-set scale (Counting genes in each gene set); Node color: Differential gene-set expression (NORM_ABS_FC); Node line color: Gene-set data source; and Edge width: Similarity score (≥ 0.1).
Top 20 gene sets ranked by differential gene-set expressions in the CRC-specific gene-set association network (GSAN)
| Gene-set name | Scale | Data Source | NORM_ABS_FC |
|---|---|---|---|
| Colon_Kim04_235genes | 151 | GeneSigDB | 48.58225017 |
| SABATES_COLORECTAL_ADENOMA_DN | 292 | MsigDB | 43.9233159 |
| SIGNAL_TRANSDUCTION | 1598 | MsigDB | 32.5957784 |
| Leukemia_Pellegrini08_2692genes | 2122 | GeneSigDB | 31.65148925 |
| SABATES_COLORECTAL_ADENOMA_UP | 142 | MsigDB | 31.65000681 |
| Breast_Parker09_1918genes_IntrinsicGenes _Compilation | 1734 | GeneSigDB | 20.85621131 |
| Lymphoma_Melendez05_4229genes | 2570 | GeneSigDB | 19.38449282 |
| Breast_Farmer05_3198genes_basal _apocrine _luminal | 3125 | GeneSigDB | 18.93820407 |
| SMID_BREAST_CANCER_LUMINAL_B_DN | 648 | MsigDB | 18.13762096 |
| Intestine_Vecchi07_1024genes | 796 | GeneSigDB | 16.68882931 |
| Ovarian_Crijns09_2394Genes _17PathwayPredictor | 1586 | GeneSigDB | 15.29529767 |
| hsa-miR-204 | 19 | miRecords | 14.37015815 |
| StemCell_Nilsson07_3742genes | 3624 | GeneSigDB | 12.47045771 |
| Immune_Kong10_5640genes_ImmPort _ComprehensiveListofImmune-RelatedGenes | 4549 | GeneSigDB | 11.91186233 |
| cell cycle: g1/s check point | 53 | BioCarta | 9.84279867 |
| Bladder cancer | 89 | KEGG | 7.885181064 |
| Drug metabolism - cytochrome P450 | 94 | KEGG | 7.837851592 |
| Metabolism of xenobiotics by cytochrome P450 | 103 | KEGG | 7.837805455 |
| hsa-miR-21 | 34 | miRecords | 7.001844224 |
| KEGG_PATHWAYS_IN_CANCER | 328 | MsigDB | 6.792625895 |