| Literature DB >> 23281744 |
Peilin Jia1, Yang Liu, Zhongming Zhao.
Abstract
BACKGROUND: Pathway analysis of large-scale omics data assists us with the examination of the cumulative effects of multiple functionally related genes, which are difficult to detect using the traditional single gene/marker analysis. So far, most of the genomic studies have been conducted in a single domain, e.g., by genome-wide association studies (GWAS) or microarray gene expression investigation. A combined analysis of disease susceptibility genes across multiple platforms at the pathway level is an urgent need because it can reveal more reliable and more biologically important information.Entities:
Mesh:
Year: 2012 PMID: 23281744 PMCID: PMC3524313 DOI: 10.1186/1752-0509-6-S3-S13
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Summary of genotyping (GWAS) data and microarray gene expression data.
| GWAS | Microarray gene expression | |
|---|---|---|
| Source | CGEMS [ | GEO: GDS2547 [ |
| # of features | 506,216 SNPs | 10,595 genes |
| # of samples | Cases:1146 | Cases: 64 |
| Phenotype | Prostate cancer | Subset 1 (17, normal prostate tissue) |
| Platform | Illumina HumanHap300 and HumanHap240 | Affymetrix Human Genome U95C array |
CGEMS: The Cancer Genetic Markers of Susceptibility (CGEMS) project. GEO: Gene Expression Omnibus.
Description of additional gene sets.
| Gene set | Size | Description |
|---|---|---|
| PGDB | 129 | Genes extracted from the PGDB database [ |
| GWAS_Top30 | 30 | Top 30 genes with the smallest association |
| GWAS_TopP-4 | 69 | Top 69 genes with association |
| DEG_LR_1 | 165 | DEGs in gene expression data with |
| DEG_LR_1.5 | 130 | DEGs in gene expression data with |
| DEG_LR_2 | 13 | DEGs in gene expression data with |
DEG: differentially expressed gene (DEG). FDR value was based on t-test. More details of gene sets are provided in the text.
Figure 1Workflow of the integrative pathway analysis. PGDB: Human Prostate Gene Database; DE: differentially expressed; ALIGATOR: Association List Go AnnoTatOR; SRT: SNP Ratio Test; GSEA: Gene Set Enrichment Analysis.
Parameters and summary of the significant pathways by each pathway analysis method.
| Method | Parameter(s) | # pathways ( | # pathways ( | The PGDB gene set | Other external gene sets |
|---|---|---|---|---|---|
| GenGen | 4 | 3 | N | N | |
| ALIGATOR | 0 | 0 | N | N | |
| SRT | 3 | 0 | N | N | |
| Plink set-based test | 15 | 15 | Y | N | |
| GSEA | 5 | 7 | N | N |
This table lists the parameters used in GenGen, ALIGATOR, SRT, the Plink set-based test, and the Gene Set Enrichment Analysis (GSEA). The third and forth columns contain the number of pathways selected by tier one criterion (nominal P < 0.01) and tier two criterion (FDR < 0.2), respectively. The fifth and sixth columns indicate whether the method could identify the PGDB gene set or other additional gene sets (more details available in main text).
π: number of permutations.
Significant pathways (FDR < 0.01) detected by the Plink set-based test.
| Pathway (KEGG ID) | ||
|---|---|---|
| Jak-STAT signaling pathway (hsa04630)* | 0.001 | 0.043 |
| Chronic myeloid leukemia (hsa05220) | 0.001 | 0.043 |
| Small cell lung cancer (hsa05222) | 0.001 | 0.043 |
| TGF-beta signaling pathway (hsa04350) | 0.002 | 0.043 |
| Endometrial cancer (hsa05213) | 0.002 | 0.043 |
| Thyroid cancer (hsa05216)* | 0.002 | 0.043 |
| Bladder cancer (hsa05219) | 0.002 | 0.043 |
| Acute myeloid leukemia (hsa05221) | 0.002 | 0.043 |
| Cell cycle (hsa04110) | 0.003 | 0.043 |
| Wnt signaling pathway (hsa04310) | 0.003 | 0.043 |
| Fc gamma R-mediated phagocytosis (hsa04666) | 0.003 | 0.043 |
| Regulation of actin cytoskeleton (hsa04810) | 0.003 | 0.043 |
| Colorectal cancer (hsa05210) | 0.003 | 0.043 |
| 0.004 | 0.053 | |
| ErbB signaling pathway (hsa04012) | 0.008 | 0.099 |
*Overlap with SRT.
†Overlap with GSEA.
List of "Jak-STAT signaling pathway (hsa04630)" genes with gene-wise association P < 0.001 in CGEMS prostate cancer GWAS data.
| Gene symbol | Most significant SNP | Position (bp) | Genomic region | ||
|---|---|---|---|---|---|
| rs7837688 | 8 | 128539360 | Intron | 4.96 × 10-7 | |
| rs909486 | 22 | 37323988 | Intron | 1.85 × 10-4 | |
| rs11071981 | 15 | 68416575 | Intron | 2.63 × 10-4 | |
| rs3118470 | 10 | 6101713 | Intron | 3.29 × 10-4 | |
| rs1999494 | 13 | 81000505 | Intron | 4.01 × 10-4 | |
| rs12538332 | 7 | 127839654 | Intron | 5.24 × 10-4 |
Chr.: chromosome. bp: base pair.
Significant pathways in prostate cancer microarray gene expression data detected by the GSEA method.
| Pathway (KEGG ID) | ||||
|---|---|---|---|---|
| Fc gamma R-mediated phagocytosis (hsa04666)* | 0.645 | 1.809 | < 0.001 | 0.131 |
| Focal adhesion (hsa04510) | 0.482 | 1.658 | 0.004 | 0.176 |
| Dilated cardiomyopathy (hsa05414) | 0.672 | 1.905 | 0.004 | 0.101 |
| Hypertrophic cardiomyopathy (HCM) (hsa05410) | 0.619 | 1.740 | 0.009 | 0.122 |
| Regulation of actin cytoskeleton (hsa04810)* | 0.464 | 1.714 | 0.009 | 0.128 |
| Leukocyte transendothelial migration (hsa04670) | 0.594 | 1.760 | 0.011 | 0.163 |
| Arrhythmogenic right ventricular cardiomyopathy (ARVC) (hsa05412) | 0.663 | 1.760 | 0.012 | 0.123 |
ES: enrichment score. NES: normalized enrichment score.
*Overlap with the Plink set-based results.
Overlap with the GenGen results.
Figure 2Comparison of the significant pathways found in genome-wide association studies and microarray gene expression datasets by different methods. This Venn diagram shows the comparison of significant pathways detected using the prostate cancer GWAS dataset using three methods: GenGen, Plink set-based test (PlinkSet), and the SNP Ratio Test (SRT), and the significant pathways found in microarray gene expression data by the Gene Set Enrichment Analysis (GSEA) method. Note that ALIGATOR identified no significant pathways and was not included in the comparison.
Significant pathways (P < 0.01) by a combined analysis using the Plink set-based test and GSEA.
| Pathway (KEGG ID) | GSEA | Plink | ||
|---|---|---|---|---|
| Fc gamma R-mediated phagocytosis (hsa04666) | < 0.001 | 0.003 | 6.18 × 10-8 | 9.15 × 10-6 |
| Regulation of actin cytoskeleton (hsa04810) | 0.009 | 0.003 | 3.34 × 10-4 | 2.47 × 10-2 |
| Jak-STAT signaling pathway (hsa04630) | 0.084 | 0.001 | 8.79 × 10-4 | 3.56 × 10-2 |
| Dilated cardiomyopathy (hsa05414) | 0.003 | 0.024 | 9.63 × 10-4 | 3.56 × 10-2 |
| Small cell lung cancer (hsa05222) | 0.266 | 0.001 | 2.45 × 10-3 | 7.27 × 10-2 |
| Hypertrophic cardiomyopathy (HCM) (hsa05410) | 0.009 | 0.051 | 4.20 × 10-3 | 9.62 × 10-2 |
| Cell cycle (hsa04110) | 0.251 | 0.003 | 6.16 × 10-3 | 9.62 × 10-2 |
| Arrhythmogenic right ventricular cardiomyopathy (ARVC) (hsa05412) | 0.012 | 0.068 | 6.46 × 10-3 | 9.62 × 10-2 |
| Chronic myeloid leukemia (hsa05220) | 0.843 | 0.001 | 6.80 × 10-3 | 9.62 × 10-2 |
| Bladder cancer (hsa05219) | 0.422 | 0.002 | 6.80 × 10-3 | 9.62 × 10-2 |
| Wnt signaling pathway (hsa04310) | 0.297 | 0.003 | 7.14 × 10-3 | 9.62 × 10-2 |
| TGF-beta signaling pathway (hsa04350) | 0.508 | 0.002 | 8.01 × 10-3 | 9.66 × 10-2 |
| Axon guidance (hsa04360) | 0.016 | 0.074 | 9.18 × 10-3 | 9.66 × 10-2 |