| Literature DB >> 24146904 |
Quan Wang1, Peilin Jia, Karen T Cuenco, Eleanor Feingold, Mary L Marazita, Lily Wang, Zhongming Zhao.
Abstract
A number of genetic studies have suggested numerous susceptibility genes for dental caries over the past decade with few definite conclusions. The rapid accumulation of relevant information, along with the complex architecture of the disease, provides a challenging but also unique opportunity to review and integrate the heterogeneous data for follow-up validation and exploration. In this study, we collected and curated candidate genes from four major categories: association studies, linkage scans, gene expression analyses, and literature mining. Candidate genes were prioritized according to the magnitude of evidence related to dental caries. We then searched for dense modules enriched with the prioritized candidate genes through their protein-protein interactions (PPIs). We identified 23 modules comprising of 53 genes. Functional analyses of these 53 genes revealed three major clusters: cytokine network relevant genes, matrix metalloproteinases (MMPs) family, and transforming growth factor-beta (TGF-β) family, all of which have been previously implicated to play important roles in tooth development and carious lesions. Through our extensive data collection and an integrative application of gene prioritization and PPI network analyses, we built a dental caries-specific sub-network for the first time. Our study provided insights into the molecular mechanisms underlying dental caries. The framework we proposed in this work can be applied to other complex diseases.Entities:
Mesh:
Year: 2013 PMID: 24146904 PMCID: PMC3795720 DOI: 10.1371/journal.pone.0076666
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The workflow of this study.
First, the biological knowledgebase BioGraph [34] was explored to identify training gene set, and candidate genes were collected from previous studies and publications. We obtained 11 training genes and 1214 candidate genes in this data collection step. Second, a computational method ENDEAVOUR [13] was utilized to prioritize the candidate genes. In this step, a ranked list of 960 candidate genes that could be recognized by ENDEAVOUR was generated. Third, dmGWAS [33] was employed to search for the dense modules upon human protein-protein interaction (PPI) network collected by Protein Interaction Network Analysis (PINA) platform [53]. This resulted in 469 dense modules. Finally, the 469 modules were evaluated and the top 23 ones were selected as promising modules.
Dental caries candidate genes in four categories.
| Category | # candidate genes | Reference |
| Association | 12 |
|
| 8 |
| |
| 12 |
| |
| 6 |
| |
| 20 |
| |
| Linkage | 349 |
|
| Expression | 13 |
|
| 324 |
| |
| 8 |
| |
| Literature | 570 | – |
| Total | 1214 | – |
The total number is smaller than the sum of the four categories due to redundancy.
Figure 2Overlap of candidate genes between four categories.
The top ranked genes have a higher probability of belonging to multiple categories.
| # source categories | p-value | |||
| 1 | 2 | 3 | ||
| Full prioritized list | 862 | 92 | 6 | – |
| Top 50 genes | 30 | 18 | 2 | 1.79×10−7 |
| Top 100 genes | 67 | 29 | 4 | 1.08×10−8 |
| Top 200 genes | 154 | 40 | 6 | 4.37×10−6 |
p-values computed by Fisher’s exact test.
The top 50 genes in the prioritized candidate gene list by ENDEAVOUR [13].
Figure 3The sub-network containing 53 DCgenes from the selected 23 modules (top 5% of all modules generated by dmGWAS).
Three gene clusters with plausible functions were included: cytokine network relevant genes (CCL2, CCL5, CCL8, CCL3, CXCL1, CXCL5, CCL7, CCL4, CCR5, and CCR10), matrix metalloproteinases (MMPs) family genes (MMP2, MMP3, MMP1, and MMP9), and transforming growth factor-beta (TGF-β) family genes (TGFB1, TGFBR2, TGFB2, TGFBR3, and TGFBR1), all of which have been previously implicated to play important roles in tooth development and carious lesions.
The 53 DCgenes residing in the top 23 dense modules.
| Gene symbol | Training gene | Source | ENDEAVOUR rank | Degree |
|
| Yes | Expression, Literature | 1 | 4 |
|
| Yes | Association, Literature | 3 | 3 |
|
| Yes | Association, Literature | 4 | 3 |
|
| Yes | Literature | 5 | 4 |
|
| Yes | Literature | 6 | 1 |
|
| No | Expression, Literature | 7 | 5 |
|
| No | Expression, Literature | 11 | 4 |
|
| No | Expression, Literature | 12 | 2 |
|
| No | Expression, Literature | 13 | 3 |
|
| No | Literature | 14 | 2 |
|
| No | Expression, Literature | 15 | 1 |
|
| No | Expression | 16 | 1 |
|
| No | Literature | 19 | 5 |
|
| No | Expression | 20 | 1 |
|
| No | Association, Literature | 25 | 5 |
|
| No | Literature | 26 | 2 |
|
| No | Literature | 27 | 2 |
|
| No | Expression, Literature | 30 | 7 |
|
| No | Expression | 33 | 4 |
|
| No | Expression, Literature | 36 | 5 |
|
| No | Expression, Literature | 42 | 6 |
|
| No | Literature | 45 | 3 |
|
| No | Expression | 71 | 3 |
|
| No | Literature | 80 | 2 |
|
| No | Expression | 82 | 2 |
|
| No | Literature | 85 | 1 |
|
| No | Linkage | 89 | 1 |
|
| No | Literature | 93 | 2 |
|
| No | Literature | 95 | 1 |
|
| No | Association, Expression, Literature | 97 | 3 |
|
| No | Expression | 100 | 1 |
|
| No | Literature | 108 | 3 |
|
| No | Literature | 111 | 2 |
|
| No | Expression | 116 | 2 |
|
| No | Expression | 122 | 5 |
|
| No | Expression | 126 | 2 |
|
| No | Literature | 127 | 3 |
|
| No | Literature | 132 | 2 |
|
| No | Association, Literature | 136 | 5 |
|
| No | Expression | 155 | 3 |
|
| No | Literature | 176 | 8 |
|
| No | Expression | 189 | 4 |
|
| No | Expression | 219 | 3 |
|
| No | Literature | 238 | 7 |
|
| No | Literature | 304 | 4 |
|
| No | Expression | 319 | 1 |
|
| No | Expression | 340 | 2 |
|
| No | Expression | 377 | 7 |
|
| No | Literature | 394 | 2 |
|
| No | Expression | 398 | 1 |
|
| No | Literature | 756 | 3 |
|
| No | Association, Literature | 871 | 1 |
|
| No | Linkage, Literature | 880 | 1 |
The degree of a node is the number of its neighbors in the sub-network.