| Literature DB >> 23967329 |
Quan Wang1, Peilin Jia, Karen T Cuenco, Zhen Zeng, Eleanor Feingold, Mary L Marazita, Lily Wang, Zhongming Zhao.
Abstract
Gene set-based analysis of genome-wide association study (GWAS) data has recently emerged as a useful approach to examine the joint effects of multiple risk loci in complex human diseases or phenotypes. Dental caries is a common, chronic, and complex disease leading to a decrease in quality of life worldwide. In this study, we applied the approaches of gene set enrichment analysis to a major dental caries GWAS dataset, which consists of 537 cases and 605 controls. Using four complementary gene set analysis methods, we analyzed 1331 Gene Ontology (GO) terms collected from the Molecular Signatures Database (MSigDB). Setting false discovery rate (FDR) threshold as 0.05, we identified 13 significantly associated GO terms. Additionally, 17 terms were further included as marginally associated because they were top ranked by each method, although their FDR is higher than 0.05. In total, we identified 30 promising GO terms, including 'Sphingoid metabolic process,' 'Ubiquitin protein ligase activity,' 'Regulation of cytokine secretion,' and 'Ceramide metabolic process.' These GO terms encompass broad functions that potentially interact and contribute to the oral immune response related to caries development, which have not been reported in the standard single marker based analysis. Collectively, our gene set enrichment analysis provided complementary insights into the molecular mechanisms and polygenic interactions in dental caries, revealing promising association signals that could not be detected through single marker analysis of GWAS data.Entities:
Mesh:
Year: 2013 PMID: 23967329 PMCID: PMC3743773 DOI: 10.1371/journal.pone.0072653
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Gene Ontology (GO) terms significantly associated with dental caries (FDR < 0.05).
| GO term | # genes in term | Ratio of significant SNPs | Contributing genes | Method (FDR |
| Protein secretion | 32 | 39/739 |
| GenGen (<0.001) |
| Cytokine secretion | 18 | 23/468 |
| GenGen (<0.001) |
| Regulation of protein secretion | 22 | 27/391 |
| GenGen (<0.001) |
| Regulation of cytokine secretion | 16 | 21/276 |
| GenGen (<0.001) |
| Regulation of axonogenesis | 10 | 45/575 |
| Mixed model (<0.001) |
| Regulation of neurogenesis | 14 | 45/651 |
| Mixed model (0.003) |
| Central nervous system development | 122 | 372/6137 |
| Mixed model (0.005) |
| Ligase activity forming carbon nitrogen bonds | 68 | 111/1485 |
| Mixed model (0.009) |
| Ubiquitin protein ligase activity | 49 | 84/1175 |
| Mixed model (0.013) |
| Small conjugating protein ligase activity | 51 | 84/1209 |
| Mixed model (0.014) |
| Glycoprotein catabolic process | 12 | 21/267 |
| Mixed model (0.031) |
| Axonogenesis | 43 | 171/2782 |
| Mixed model (0.035) |
| Cell matrix junction | 16 | 45/447 |
| Mixed model (0.035) |
The numerator is the number of SNPs with a p-value < 0.05 from the dental caries GWAS, and the denominator is the total number of SNPs mapped to the genes in each GO term.
Genes containing at least one SNP with a p-value < 0.05 from the dental caries GWAS are regarded as contributing genes.
FDR adjustment is based on Benjamini-Hochberg method [31].
Top 9 Gene Ontology (GO) terms identified by each of the four enrichment methods.
| GenGen | ALIGATOR | SNP Ratio Test | Mixed model | ||||
| GO term | p-value (FDR) | GO term | p-value (FDR) | GO term | p-value (FDR) | GO term | p-value (FDR) |
|
|
|
| 0.004 (1.000) |
| 0.001 (0.266) |
|
|
|
|
| Transcription corepressor activity (93) | 0.008 (1.000) | Neuropeptide receptor activity (21) | 0.001 (0.266) |
|
|
|
|
| DNA helicase activity (24) | 0.009 (1.000) | Neuropeptide binding (22) | 0.001 (0.266) |
|
|
|
|
|
| 0.011 (1.000) | Translation initiation factor activity (23) | 0.001 (0.266) |
|
|
|
| 0.001 (0.222) |
| 0.014 (1.000) | Small protein conjugating enzyme activity (52) | 0.002 (0.266) |
|
|
| Positive regulation of protein secretion (12) | 0.001 (0.222) | Sensory perception (188) | 0.014 (1.000) |
| 0.002 (0.266) |
|
|
| Rho guanyl nucleotide exchange factor activity (11) | 0.002 (0.380) |
| 0.015 (1.000) | Peptide receptor activity (49) | 0.002 (0.266) |
|
|
| Acid amino acid ligase activity (57) | 0.003 (0.499) | ADP binding (11) | 0.015 (1.000) | Translation regulator activity (38) | 0.002 (0.266) |
|
|
| Monocarboxylic acid transmembrane transporter activity (11) | 0.005 (0.666) |
| 0.017 (1.000) | Transferase activity transferring hexosyl groups (75) | 0.002 (0.266) |
|
|
GO terms in bold are those significant terms after adjustment for multiple testing (Benjamini-Hochberg method [31], FDR < 0.05). Underlined GO terms are those identified by more than one method.
The number in parentheses denotes the number of genes in the corresponding GO term.
Enriched genes in the 30 top Gene Ontology (GO) terms.
| Gene | # terms involved | Ratio of significant SNPs | Most significant SNP | p-value |
|
| 6 | 28/512 | rs574165 | 0.001 |
|
| 6 | 2/36 | rs16857407 | 0.020 |
|
| 5 | 1/8 | rs1455137 | 0.035 |
|
| 5 | 1/11 | rs10744774 | 0.019 |
|
| 5 | 2/72 | rs6589485 | 0.006 |
|
| 5 | 5/23 | rs10416565 | 0.008 |
|
| 5 | 2/13 | rs2249466 | 0.027 |
|
| 5 | 2/22 | rs3107606 | 0.009 |
|
| 5 | 2/7 | rs3781619 | 0.013 |
|
| 5 | 3/8 | rs3817681 | 0.016 |
|
| 5 | 3/5 | rs335675 | 0.011 |
|
| 5 | 1/28 | rs831627 | 0.004 |
|
| 5 | 8/29 | rs738263 | 0.007 |
|
| 5 | 1/3 | rs3103174 | 0.017 |
|
| 5 | 3/9 | rs11042978 | 0.002 |
|
| 5 | 3/22 | rs9783885 | 0.021 |
|
| 5 | 6/18 | rs11969250 | 0.004 |
|
| 5 | 3/40 | rs9988572 | 0.001 |
|
| 5 | 2/42 | rs552705 | 0.029 |
|
| 5 | 2/16 | rs12672417 | 0.020 |
|
| 5 | 2/7 | rs769052 | 0.033 |
|
| 5 | 4/20 | rs12629302 | 0.008 |
|
| 5 | 1/24 | rs10246707 | 0.024 |
|
| 5 | 1/10 | rs13054355 | 0.006 |
|
| 5 | 3/8 | rs7249714 | 0.006 |
|
| 5 | 4/27 | rs10102559 | 0.003 |
|
| 5 | 1/24 | rs1547258 | 0.030 |
|
| 5 | 3/28 | rs7200005 | 0.011 |
|
| 5 | 1/4 | rs10988111 | 0.017 |
|
| 4 | 1/6 | rs10047459 | 0.015 |
|
| 4 | 2/19 | rs2880829 | 0.004 |
|
| 4 | 1/16 | rs8084404 | 0.040 |
|
| 4 | 1/7 | rs5906761 | 0.016 |
|
| 4 | 2/19 | rs7259148 | 0.005 |
|
| 4 | 15/171 | rs9836971 | 0.005 |
|
| 4 | 6/106 | rs12503652 | 0.003 |
Genes that have at least one SNP with a p-value < 0.05 and are involved in at least 4 gene sets were listed.
The numerator is the number of SNPs with a p-value < 0.05 in a gene and the denominator is the total number of SNPs mapped to the gene.
The p-value of the most significant SNP in the gene.