| Literature DB >> 23566118 |
Chiyong Kang1, Hyeji Yu, Gwan-Su Yi.
Abstract
BACKGROUND: Due to the low statistical power of individual markers from a genome-wide association study (GWAS), detecting causal single nucleotide polymorphisms (SNPs) for complex diseases is a challenge. SNP combinations are suggested to compensate for the low statistical power of individual markers, but SNP combinations from GWAS generate high computational complexity.Entities:
Mesh:
Year: 2013 PMID: 23566118 PMCID: PMC3618247 DOI: 10.1186/1472-6947-13-S1-S3
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Databases used for enrichment analysis of expanded functional module.
| Database | Gene Set Category | Web address |
|---|---|---|
| BioCarta | Pathway | |
| COFECO | Complex | |
| GO | Function | |
| KEGG | Pathway | |
| KEGG Modules | Pathway | |
| MetaCyc | Pathway | |
| MSigDB | TF-target, miRNA-target | |
| NCI-Nature | Pathway | |
| PANTHER | Pathway | |
| Reactome | Pathway | |
| UniPathway | Pathway |
Error rates of SNP combinations from GWAS dataset with Bonferroni threshold based cutoff criteria.
| Bonferroni threshold | Number of SNPs | No. of SNPs in SNP combination | Error rate |
|---|---|---|---|
| r | 82 | 82 | 0.117553 |
| 2r | 164 | 67 | 0.116953 |
| 5r | 410 | 69 | 0.114954 |
| 10r | 820 | 88 | 0.114154 |
Figure 1Error rates of RF analysis with/without variable selection.
Error rates of SNP combinations from a GWAS dataset with p-value range-based cutoff criteria.
| p-value range | No. of SNPs in p-value range | No. of selected SNPs in SNP combination | Error rate |
|---|---|---|---|
| <0.01 | 854 | 114 | 0.116953 |
| <0.05 | 2960 | 83 | 0.114754 |
| <0.1 | 5297 | 95 | 0.115131 |
| <0.2 | 9831 | 91 | 0.114731 |
| <0.3 | 14192 | 104 | 0.102938 |
| <0.4 | 18407 | 87 | 0.104937 |
| <0.5 | 22612 | 134 | 0.106136 |
| <0.6 | 26743 | 101 | 0.102538 |
| <0.7 | 30797 | 59 | 0.109934 |
| <0.8 | 34789 | 85 | 0.104138 |
| <0.9 | 38815 | 60 | 0.113332 |
| <1 | 42798 | 83 | 0.114731 |
Figure 2P-values and variable importance values of SNPs from the SNP combination.
Pathway functional modules from SNP combination.
| Functional Module Name | Category | # of genes | # of genes in functional module | p-value | FDR |
|---|---|---|---|---|---|
| Focal adhesion | KEGG | 6 | 175 | 3.144E-03 | 7.982E-03 |
| Hemostasis | Reactome | 6 | 207 | 7.080E-03 | 1.091E-02 |
| ErbB1 downstream signaling | NCI-Nature | 4 | 101 | 9.465E-03 | 1.662E-02 |
| Endometrial cancer | KEGG | 3 | 48 | 7.073E-03 | 1.755E-02 |
| Formation of Platelet plug | Reactome | 4 | 106 | 1.116E-02 | 1.840E-02 |
| PDGF signaling pathway | PANTHER | 4 | 108 | 1.190E-02 | 2.053E-02 |
| Viral myocarditis | KEGG | 3 | 54 | 9.792E-03 | 2.506E-02 |
| BCR signaling pathway | NCI-Nature | 3 | 63 | 1.487E-02 | 2.703E-02 |
| amine and polyamine degradation | UniPathway | 1 | 4 | 3.236E-02 | 3.236E-02 |
| Rho GTPase cycle | Reactome | 3 | 73 | 2.196E-02 | 3.420E-02 |
| Regulation of RAC1 activity | NCI-Nature | 2 | 26 | 1.900E-02 | 3.498E-02 |
| Signaling by Rho GTPases | Reactome | 3 | 73 | 2.196E-02 | 3.633E-02 |
| Thrombin-mediated activation of PARs | Reactome | 1 | 3 | 2.437E-02 | 3.841E-02 |
| Fc epsilon RI signaling pathway | KEGGM | 3 | 63 | 1.487E-02 | 4.051E-02 |
| EGF receptor (ErbB1) signaling pathway | NCI-Nature | 4 | 132 | 2.313E-02 | 4.096E-02 |
TF-target functional modules and miRNA-target functional modules from SNP combinations.
| Functional Module Name | Category | # of genes | # of genes in functional module | p-value | FDR |
|---|---|---|---|---|---|
| TGACAGNY_V$MEIS1_01 | TF-target | 15 | 620 | 1.553E-04 | 1.901E-04 |
| V$STAT4_01 | TF-target | 8 | 200 | 2.348E-04 | 2.786E-04 |
| YTATTTTNR_V$MEF2_02 | TF-target | 13 | 545 | 5.034E-04 | 6.157E-04 |
| V$CDP_02 | TF-target | 5 | 89 | 8.079E-04 | 9.780E-04 |
| V$RORA1_01 | TF-target | 7 | 193 | 1.037E-03 | 1.249E-03 |
| V$YY1_01 | TF-target | 7 | 195 | 1.101E-03 | 1.262E-03 |
| TGCCAAR_V$NF1_Q6 | TF-target | 12 | 545 | 1.658E-03 | 1.731E-03 |
| TTGTTT_V$FOXO4_01 | TF-target | 24 | 1549 | 1.435E-03 | 1.873E-03 |
| V$OCT1_06 | TF-target | 7 | 218 | 2.086E-03 | 2.121E-03 |
| V$NKX25_01 | TF-target | 5 | 102 | 1.491E-03 | 2.128E-03 |
| V$CDC5_01 | TF-target | 7 | 217 | 2.032E-03 | 2.720E-03 |
| V$AR_Q6 | TF-target | 6 | 178 | 3.419E-03 | 3.618E-03 |
| V$POU6F1_01 | TF-target | 6 | 185 | 4.129E-03 | 4.403E-03 |
| V$POU3F2_01 | TF-target | 4 | 83 | 4.764E-03 | 5.687E-03 |
| V$STAT5A_03 | TF-target | 6 | 198 | 5.732E-03 | 5.794E-03 |
| V$STAT6_02 | TF-target | 6 | 192 | 4.944E-03 | 5.855E-03 |
| V$EVI1_03 | TF-target | 3 | 44 | 5.546E-03 | 7.165E-03 |
| V$PBX1_01 | TF-target | 6 | 194 | 5.197E-03 | 7.461E-03 |
| V$SRY_02 | TF-target | 6 | 196 | 5.460E-03 | 7.484E-03 |
| V$AFP1_Q6 | TF-target | 6 | 204 | 6.607E-03 | 7.991E-03 |
| ACTTTAT,MIR-142-5P | miRNA-target | 7 | 254 | 4.854E-03 | 9.791E-03 |
Selected pathways from expanded GSEA with a p-value < 0.05 in the WTCCC T2D dataset
| Database | Pathway | Size | p-value | FDR |
|---|---|---|---|---|
| PANTHER | WNT signaling pathway | 130 | 0.001 | 0.056 |
| KEGG | Calcium signaling pathway | 105 | 0.002 | 0.073 |
| KEGG | Pathways in cancer | 193 | 0.002 | 0.081 |
| KEGG | Focal adhesion | 132 | 0.004 | 0.090 |
| KEGG | Neuroactive ligand-receptor interaction | 127 | 0.01 | 0.153 |
| KEGG | MAPK signaling pathway | 146 | 0.012 | 0.168 |
| KEGG | Cell adhesion molecules (CAMs) | 61 | 0.016 | 0.208 |
| NCI-Nature | Regulation of RhoA activity | 73 | 0.03 | 0.287 |
| PANTHER | Huntington disease | 58 | 0.03 | 0.277 |
| Reactome | Signalling by NGF | 107 | 0.03 | 0.359 |
| KEGG | Non-small cell lung cancer | 38 | 0.031 | 0.303 |
| KEGG | Natural killer cell mediated cytotoxicity | 56 | 0.035 | 0.232 |
| PANTHER | Muscarinic acetylcholine receptor 1 and 3 signaling pathway | 30 | 0.035 | 0.325 |
| Reactome | Integrin cell surface interactions | 57 | 0.036 | 0.330 |
| KEGG | Axon guidance | 74 | 0.036 | 0.419 |
| Reactome | Cell Cycle, Mitotic | 120 | 0.039 | 0.254 |
| NCI-Nature | Notch signaling pathway | 49 | 0.04 | 0.259 |
| NCI-Nature | Signaling events mediated by focal adhesion kinase | 42 | 0.041 | 0.260 |
| NCI-Nature | EGF receptor (ErbB1) signaling pathway | 94 | 0.043 | 0.379 |
| Reactome | Gene Expression | 146 | 0.043 | 0.379 |
| KEGG | ErbB signaling pathway | 55 | 0.043 | 0.379 |
| NCI-Nature | Neurotrophic factor-mediated Trk receptor signaling | 67 | 0.044 | 0.384 |
| PANTHER | Beta1 adrenergic receptor signaling pathway | 24 | 0.044 | 0.379 |
| NCI-Nature | Hypoxic and oxygen homeostasis regulation of HIF-1-alpha | 46 | 0.044 | 0.274 |
| PANTHER | Heterotrimeric G-protein signaling pathway-Gq alpha and Go alpha mediated pathway | 68 | 0.045 | 0.492 |
| KEGG | Cytokine-cytokine receptor interaction | 105 | 0.046 | 0.393 |
| Reactome | Signaling in Immune system | 118 | 0.046 | 0.399 |
| Reactome | G-protein mediated events | 27 | 0.047 | 0.346 |
| NCI-Nature | Thromboxane A2 receptor signaling | 40 | 0.047 | 0.346 |
| PANTHER | FGF signaling pathway | 60 | 0.048 | 0.403 |
| KEGG | Adherens junction | 52 | 0.048 | 0.406 |
| KEGG | Leukocyte transendothelial migration | 62 | 0.049 | 0.356 |
| Reactome | Integration of energy metabolism | 79 | 0.049 | 0.363 |
| Reactome | Hemostasis | 145 | 0.049 | 0.526 |
RF-based prediction error rates of SNP sets from functional module-based filtration and SNP combinations with various thresholds from the WTCCC T2D dataset.
| Dataset | Functional Module Description | Number of SNPs | Error Rate | Number of Selected SNPs | Error Rate with Variable Selection |
|---|---|---|---|---|---|
| MIR | CACTGCC,MIR-34A, MIR-34C,MIR-449 | 1876 | 39.02% | 6 | 32.12% |
| TF | V$NKX25_02 | 1678 | 38.56% | 59 | 33.06% |
| MIR | ACTTTAT,MIR-142-5P | 1572 | 38.32% | 35 | 33.42% |
| Average | Average of results from 66 functional modules | 1614.86 | 39.18% | 32.24 | 36.98% |
| Random | Average of results from 66 SNP sets consisting of randomly selected 1615 SNPs | 1615 | 38.77% | 113.17 | 36.57% |
| Combination | Top 5 functional modules | 7590 | 38.96% | 36 | 32.78% |
| Combination | Top 66 functional modules | 25663 | 38.66% | 62 | 17.43% |
| p-value based | Top 1 SNP | 1 | 33.93% | 1 | 33.93% |
| p-value based | Top 10 SNPs | 10 | 27.19% | 10 | 27.19% |
| p-value based | SNPs with Bonferroni threshold | 82 | 11.76% | 82 | 11.76% |
| p-value based | SNPs with p-value < 0.01 | 854 | 14.79% | 114 | 11.70% |
| p-value based | All SNPs | 42798 | 37.02% | 83 | 11.47% |