| Literature DB >> 20875111 |
Mun-Kit Choy1, Mehregan Movassagh, Hock-Guan Goh, Martin R Bennett, Thomas A Down, Roger S Y Foo.
Abstract
BACKGROUND: DNA methylation can regulate gene expression by modulating the interaction between DNA and proteins or protein complexes. Conserved consensus motifs exist across the human genome ("predicted transcription factor binding sites": "predicted TFBS") but the large majority of these are proven by chromatin immunoprecipitation and high throughput sequencing (ChIP-seq) not to be biological transcription factor binding sites ("empirical TFBS"). We hypothesize that DNA methylation at conserved consensus motifs prevents promiscuous or disorderly transcription factor binding.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20875111 PMCID: PMC2997012 DOI: 10.1186/1471-2164-11-519
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Datasets of genomic locations. Set 1: Predicted TFBS based on conserved consensus motifs for 106 transcription factor families (N = 3,749,417). Set 2: Random genomic locations (N = 20,982). Set 3: Predicted TFBS for 17 transcription factor families (N = 771,221). Set 4: Biologically proven TFBS (empirical TFBS by ChIP-seq) with conserved consensus motifs (N = 40,876).
Figure 2Genome-wide conserved consensus motifs (predicted TFBS) are hyper-methylated in human hearts and sperm. Methylation scores were determined across the genome for hearts and sperm using BATMAN [21]. The profiles of these scores (A: hearts, B: sperm) were plotted against locations of conserved consensus motifs for 106 transcription factor families centred on the predicted TF binding site (based on co-ordinates obtained from the UCSC genome web browser: TFBS Conserved track; "set 1", N = 3,749,417 locations). Methylation profile at random genomic locations was analyzed as a negative control and reflects a "neutral" methylation pattern at these locations ("set 2", N = 20,982).
List of 106 transcription factor families (from UCSC genome web browser, Conserved TFBS track) and their detailed methylation profiles in hearts and sperm.
| SPERM | |||
|---|---|---|---|
| Hyper-methylated | Hypo-methylated | ||
| AHR-ARNT, AML1, AP, AREB6, ARP1, ATF, BACH, BRACH, CDP, CEBP, CHOP, COMP1, COUP, CP2, CREB, EN1, ER, FAC1, GATA, GCNF, GFI1, GR, HEN1, HMX1, HOX, HSF, HTF, IK, ISRE, LMO2COM, LUN1, LYF1, MEIS1, MIF1, MRF2, MSX1, MYB, MYCMAX, MYOD, MYOGNF1, MZF1, NCX, NF1, NFE2, NFKB, NRSF, OLF1, EP300, P53, PAX, PBX1, PPAR, RFX1, ROAZ, RORA, RP58, RREB1, SEF1, SPZ1, SREBP, SRF, STAT, TAL1-E47, TCF, TGIF, USF, XBP1, YY1, ZIC, ZID | - | ||
| CETS1P54, E2F, E4BP4, EGR, ELK1, MAZR, NFY, NRF1, SRY | CHX10, FOX, FREAC, LHX3, MEF2, POU, RSRFC4, S8, HFH, SOX, SP1, TATA, TBP | ||
| BRN2, CREL, HLF, IRF, NFAT, NGFIC, NMYC, TST1 CDC5, OCT | CART1, NKX, EVI1, HNF | ||
List of 17 transcription factor families from ENCODE (UCSC genome web browser) and other published sources, and their detailed methylation profiles in hearts and sperm.
| Motif family | Heart | Sperm |
|---|---|---|
| E2F1 | Hypo-methylated | Hypo-methylated |
| NFY1 | Hypo-methylated | Hypo-methylated |
| YY11 | Hypo-methylated | Hypo-methylated |
| MYCMAX1 | Hypo-methylated | Hyper-methylated |
| NFKB1 | Hypo-methylated | Hyper-methylated |
| AP1 | Hypo-methylated | Hyper-methylated |
| NRSF2 | Hypo-methylated | Neutral |
| SREBP3 | Hypo-methylated | Neutral |
| SRF2 | Hypo-methylated | Neutral |
| STAT4 | Hypo-methylated | Neutral |
| TCF1 | Hypo-methylated | Neutral |
| GATA1 | Neutral | Hyper-methylated |
| NFE21 | Neutral | Neutral |
| OCT5 | Neutral | Neutral |
| SOX5 | Neutral | Neutral |
| EP3005 | Neutral | Neutral |
| TP536 | Neutral | Neutral |
1Yale/UCD/Harvard (ENCODE)
2HudsonAlpha Institute
3Yale/UCD/Harvard/Duke UNC/UT, Genome Institute of Singapore (GIS)
4Yale/UCD/Harvard, GIS
5Lister et al, Nature 2009
6GIS
Figure 3Predicted TFBS with conserved consensus motifs are hyper-methylated unless they are also biologically proven TFBS (empirical TFBS). Aggregate methylation profiles for predicted TFBS with conserved consensus motifs for a subset of 17 transcription factor families are hyper-methylated in hearts (A) and sperm (B) (N = 771,221); but hypo-methylated if they are also biologically proven TFBS (empirical TFBS by ChIP-seq; "set 4", N = 40,876) (C: hearts and D: sperm).
Figure 4Predicted TFBS with conserved consensus motifs residing outside of CGI are hyper-methylated. Aggregate methylation profiles for predicted TFBS with conserved consensus motifs ("set 3") that reside in CGI (A: hearts, B: sperm), and outside of CGI (C: hearts, D: sperm) showing that conserved consensus motifs are hypo-methylated when within CGI but hyper-methylated when outside of CGI.
Figure 5Empirical TFBS with conserved consensus motifs are hypo-methylated in CGI but have a neutral methylation pattern when outside of CGI in the heart. Methylation profiles for empirical TFBS with conserved consensus motifs ("set 4") that resided in CGI (A: hearts, B: sperm), and outside CGI (C: hearts, D: sperm).