| Literature DB >> 22759569 |
Min Wu1, Chee-Keong Kwoh, Teresa M Przytycka, Jing Li, Jie Zheng.
Abstract
The regulatory mechanism of recombination is a fundamental problem in genomics, with wide applications in genome-wide association studies, birth-defect diseases, molecular evolution, cancer research, etc. In mammalian genomes, recombination events cluster into short genomic regions called "recombination hotspots". Recently, a 13-mer motif enriched in hotspots is identified as a candidate cis-regulatory element of human recombination hotspots; moreover, a zinc finger protein, PRDM9, binds to this motif and is associated with variation of recombination phenotype in human and mouse genomes, thus is a trans-acting regulator of recombination hotspots. However, this pair of cis and trans-regulators covers only a fraction of hotspots, thus other regulators of recombination hotspots remain to be discovered. In this paper, we propose an approach to predicting additional trans-regulators from DNA-binding proteins by comparing their enrichment of binding sites in hotspots. Applying this approach on newly mapped mouse hotspots genome-wide, we confirmed that PRDM9 is a major trans-regulator of hotspots. In addition, a list of top candidate trans-regulators of mouse hotspots is reported. Using GO analysis we observed that the top genes are enriched with function of histone modification, highlighting the epigenetic regulatory mechanisms of recombination hotspots.Entities:
Year: 2012 PMID: 22759569 PMCID: PMC3380740 DOI: 10.1186/1477-5956-10-S1-S11
Source DB: PubMed Journal: Proteome Sci ISSN: 1477-5956 Impact factor: 2.480
Figure 1The flowchart of our method for predicting tran-regulators of recombination hotspots. Figure 1 shows the framework for predicting tran-regulators of recombination hotspots.
Figure 2The distribution of hotspots in each chromosome. Figure 2 shows the number of hotspots for each chromosome.
Figure 3The odds ratio scores of mouse PRDM9 with different p-value thresholds for FIMO search. When we used different p-value thresholds for FIMO search, we will get different odds ratio scores. Figure 3 shows the impact of the p-value threshold on the odds ratio scores of mouse PRDM9.
Number of occurrences of mouse PRDM9 motif (FIMO, q-value <0.05) on hotspots and coldspots
| # hits | # regions with hit(s) | # regions without hit | |
|---|---|---|---|
| Hotspots | 4954 | 1405 | 8469 |
| Coldspots | 4598.8 | 1120.35 | 8753.65 |
The numbers of FIMO hits of mouse TFs from JASPAR database with top 12 O, ranked by the p-values of the odds ratios (PRDM9's results are also shown in this table)
| p-value | ||||||
|---|---|---|---|---|---|---|
| KLF4 | 886 | 8988 | 643.15 | 9230.85 | 1.415 | |
| ZFX | 437 | 9437 | 329 | 9545 | 1.343 | |
| CTCF | 1002 | 8872 | 769.9 | 9104.1 | 1.336 | |
| PRDM9 | 1405 | 8469 | 1120.35 | 8753.65 | 1.30 | |
| RXRA | 1015 | 8859 | 819.55 | 9054.45 | 1.266 | |
| ESRRB | 792 | 9082 | 663.4 | 9210.6 | 1.211 | 0.0002 |
| GABPA | 110 | 9764 | 67.05 | 9806.95 | 1.648 | 0.0006 |
| MYCN | 197 | 9677 | 137.45 | 9736.55 | 1.440 | 0.0006 |
| SPZ1 | 322 | 9552 | 249.55 | 9624.45 | 1.300 | 0.0013 |
| MYC | 129 | 9745 | 91 | 9783 | 1.423 | 0.0061 |
| PAX5 | 194 | 9680 | 151.35 | 9722.65 | 1.287 | 0.0113 |
| EGR1 | 85 | 9789 | 61.55 | 9812.45 | 1.384 | 0.0343 |
| T | 189 | 9685 | 155.95 | 9718.05 | 1.216 | 0.0411 |
The numbers of FIMO hits of mouse TFs from TRANSFAC database with top 10 O, ranked by the p-values of the odds ratios (PRDM9's results are also shown in this table)
| p-value | ||||||
|---|---|---|---|---|---|---|
| MYOD1 | 483 | 9391 | 341.05 | 9532.95 | 1.434 | |
| PRDM9 | 1405 | 8469 | 1120.35 | 8753.65 | 1.30 | |
| MYC/MAX | 98 | 9776 | 58.25 | 9815.75 | 1.689 | 0.0009 |
| USF2 | 111 | 9763 | 70.05 | 9803.95 | 1.591 | 0.0014 |
| ATF4 | 79 | 9795 | 44.55 | 9829.45 | 1.780 | 0.0015 |
| USF1 | 101 | 9773 | 63.45 | 9810.55 | 1.598 | 0.0019 |
| AHR | 97 | 9777 | 62.3 | 9811.7 | 1.563 | 0.0034 |
| ARNT | 91 | 9783 | 60.2 | 9813.8 | 1.516 | 0.0071 |
| ETS1 | 64 | 9810 | 40.95 | 9833.05 | 1.567 | 0.0157 |
| ATF4 | 67 | 9807 | 44.35 | 9829.65 | 1.514 | 0.0181 |
| CNTN2 | 43 | 9831 | 27.6 | 9846.4 | 1.56 | 0.0480 |
Semantic similarity between JASPAR HG genes and two recombination related terms
| Genes | Similarity scores |
|---|---|
| PRDM9 | 0.327 |
| SPZ1 | 0.319 |
| CTCF | 0.249 |
| PAX5 | 0.182 |
| GABPA | 0.180 |
| EGR1 | 0.175 |
| ESRRB | 0.166 |
| MYC | 0.166 |
| KLF4 | 0.163 |
| RXRA | 0.157 |
| ZFX | 0.145 |
| T | 0.141 |
| MYCN | 0.126 |
GO terms enriched in JASPAR TFs with high odd ratio scores (with top 15 gap scores)
| Rank | GO terms | GO term descriptions | |
|---|---|---|---|
| 1 | GO:0016571 | histone methylation | 0.282 |
| 2 | GO:0018022 | peptidyl-lysine methylation | 0.203 |
| 3 | GO:0051573 | negative regulation of histone H3-K9 methylation | 0.201 |
| 4 | GO:0031060 | regulation of histone methylation | 0.184 |
| 5 | GO:0051574 | positive regulation of histone H3-K9 methylation | 0.168 |
| 6 | GO:0016568 | chromatin modification | 0.165 |
| 7 | GO:0051571 | positive regulation of histone H3-K4 methylation | 0.165 |
| 8 | GO:0006338 | chromatin remodeling | 0.157 |
| 9 | GO:0035065 | regulation of histone acetylation | 0.148 |
| 10 | GO:0006306 | DNA methylation | 0.146 |
| 11 | GO:0010216 | maintenance of DNA methylation | 0.143 |
| 12 | GO:0016584 | nucleosome positioning | 0.135 |
| 13 | GO:0016485 | protein processing | 0.121 |
| 14 | GO:0031065 | positive regulation of histone deacetylation | 0.119 |
| 15 | GO:0018108 | peptidyl-tyrosine phosphorylation | 0.117 |
GO terms enriched in TRANSFAC TFs with high odd ratio scores (with top 15 gap scores)
| Rank | GO terms | GO term descriptions | |
|---|---|---|---|
| 1 | GO:0016571 | histone methylation | 0.105 |
| 2 | GO:0051574 | positive regulation of histone H3-K9 methylation | 0.0927 |
| 3 | GO:0031060 | regulation of histone methylation | 0.0925 |
| 4 | GO:0051571 | positive regulation of histone H3-K4 methylation | 0.0916 |
| 5 | GO:0051573 | negative regulation of histone H3-K9 methylation | 0.0865 |
| 6 | GO:0000432 | positive regulation of transcription from RNA polymerase II promoter by glucose | 0.0839 |
| 7 | GO:0043619 | regulation of transcription from RNA polymerase II promoter in response to oxidative stress | 0.0826 |
| 8 | GO:0006357 | regulation of transcription from RNA polymerase II promoter | 0.0791 |
| 9 | GO:0035065 | regulation of histone acetylation | 0.0786 |
| 10 | GO:0031065 | positive regulation of histone deacetylation | 0.0766 |
| 11 | GO:0018022 | peptidyl-lysine methylation | 0.0764 |
| 12 | GO:0006355 | regulation of transcription, DNA-dependent | 0.0763 |
| 13 | GO:0000122 | negative regulation of transcription from RNA polymerase II promoter | 0.0746 |
| 14 | GO:0046016 | positive regulation of transcription by glucose | 0.0738 |
| 15 | GO:0045944 | positive regulation of transcription from RNA polymerase II promoter | 0.0732 |
Hotspot coverage of JASPAR TFs with high odd ratio scores (JASPAR HG genes). The second column (HM) shows the number of hotspots covered by the JASPAR HG gene in the first column. The third column (| ∩ HS(PRDM9) |) shows the number of common hotspots covered by PRDM9 and the JASPAR HG gene
| Genes in | | ∩ | |
|---|---|---|
| T | 189 | 24 |
| PAX5 | 194 | 47 |
| KLF4 | 886 | 177 |
| GABPA | 110 | 23 |
| RXRA | 1015 | 157 |
| MYCN | 197 | 44 |
| SPZ1 | 322 | 63 |
| CTCF | 1002 | 163 |
| ESRRB | 792 | 110 |
| ZFX | 437 | 112 |
| MYC | 129 | 29 |
| EGR1 | 85 | 21 |
Figure 4Hotspot coverage graph. Figure 4 shows the hotspot coverage graph where each node is a gene with high odds ratio score and edges between nodes represent their hotspot coverage similarities. Here each color stands for a gene cluster.
Figure 5Venn diagram of the hotspot coverage. Figure 5 shows the hotspot coverage of the above 4 clusters in Figure 4.