| Literature DB >> 19004867 |
Jonathan D Turner1, Laetitia P L Pelascini, Joana A Macedo, Claude P Muller.
Abstract
The transcription start sites (TSS) and promoters of many genes are located in upstream CpG islands. Methylation within such islands is known for both imprinted and oncogenes, although poorly studied for other genes, especially those with complex CpG islands containing multiple first exons and promoters. The glucocorticoid receptor (GR) CpG island contains seven alternative first exons and their promoters. Here we show for the five GR promoters activated in PBMCs that methylation patterns are highly variable between individuals. The majority of positions were methylated at levels >25% in at least one donor affecting each promoter and TSS. We also examined the evolutionarily conserved transcription factor binding sites (TFBS) using an improved in silico phylogenetic footprinting technique. The majority of these contain methylatable CpG sites, suggesting that methylation may orchestrates alternative first exon usage, silencing and controlling tissue-specific expression. The heterogeneity observed may reflect epigenetic mechanisms of GR fine tuning, programmed by early life environment and events. With 78% of evolutionarily conserved alternative first exons falling into such complex CpG islands, their internal structure and epigenetic modifications are bound to be biologically important, and may be a common transcriptional control mechanism used throughout many phyla.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19004867 PMCID: PMC2602793 DOI: 10.1093/nar/gkn897
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
PCR primers and reaction conditions for bisulphite sequencing
| Primer sequence | Amplified region | Length (bp) | [MgCl2] (mM) | [Primers] (μM) | ||
|---|---|---|---|---|---|---|
| Promoter 1D | ||||||
| DA | Fwd: 5′-ATATTAGATAATGTATAGGGAATYGTTTAT-3′ Rev: 5′-CCRCCRCCATCTTAATAAAATACAATATC-3′ | -4724 to -4584 | 140 | 49 | 2 | 0.5 |
| DB | Fwd: 5′-TGTTAAGATGGTGGTYGYGGGGAYGG-3′ Rev: 5′-AACTACCRCAACTCCACCTAATCC-3′ | -4644 to -4438 | 206 | 59 | 2 | 0.1 |
| DBII | Fwd: 5′-GGTYGYGGGGAYGGGTTGGYGATATTGT-3′ Rev: 5′-CCTACTCRAACRCTCRACCACAACC-3′ | -4632 to -4460 | 172 | 59 | 1 | 0.1 |
| Promoter 1E | ||||||
| E | Fwd: 5′-GGGAGTTGAAYGTTGGTATTTTAAAGTTG-3′ Rev: 5′-CTCRAAAAAAATTACACRCCAAATAC-3′ | -4129 to -3781 | 348 | 54 | 2 | 0.1 |
| EII | Fwd: 5′-GTATTTTGTTTTATTTGTAGGGGTAGG-3′ Rev: 5′-TACAAAACCTCCAACRAACTAAATT-3′ | -4097 to -3804 | 293 | 55 | 1 | 0.1 |
| Promoter 1F | ||||||
| FA | Fwd: 5′-TGAAGATTYGGTYGTTTAGATGAT-3′ Rev: 5′-ACCRAATTACRTAAAATATATCACTTCRAA-3′ | -3605 to -3378 | 227 | 50 | 2 | 1 |
| FAII | Fwd: 5′-TGGTGGGGGATTTGTYGGTAYGYGA-3′ Rev: 5′-TCACTTCRAAAAAAACTACRAAATTACA-3′ | -3577 to -3398 | 179 | 52 | 2 | 0.5 |
| FB | Fwd: 5′-GGTYGAGAYGTTGYGGTATYGTTTTYGTG-3′ Rev: 5′-CCTTAACRACAAACRCCRCCAATAC-3′ | -3452 to -3268 | 184 | 50 | 4 | 1 |
| FBII | Fwd: 5′-GTTGYGGTATYGTTTTYGTGTAATTT-3′ Rev: 5′-ACRAATAACAACRAACRAACCACAA-3′ | -3443 to -3297 | 146 | 53 | 2 | 0.1 |
| FC | Fwd: 5′-TTGTGGTTYGTTYGTTGTTATTYGTAGG-3′ Rev: 5′-CACCRAATTTCTCCAATTTCTTTTCTC-3′ | -3321 to -3177 | 144 | 54 | 2 | 1 |
| FCII | Fwd: 5′-TTGTGGTTYGTTYGTTGTTATTYGTAGG-3′ Rev: 5′-CAATTTCTTTTCTCRCTACCTCCTTCC-3′ | -3321 to -3190 | 131 | 52 | 2 | 0.5 |
| Promoter 1H | ||||||
| HA | Fwd: 5′-TTYGGTTGYGGYGGGAATTGYGGAYGGTG-3′ Rev: 5′-AAACTAATAAAAATTTATAAACTCC-3′ | -2422 to -2258 | 164 | 56 | 2 | 0.5 |
| HAII | Fwd: 5′-GGTGGYGGGYGAGYGGTTTTTTTGTTAGAG-3′ Rev: 5′-ACTCCCRCRACRACCCCCRAATTATCTC-3′ | -2397 to -2276 | 121 | 58 | 1 | 0.1 |
| HB | Fwd: 5′-GGGYGYGTTYGTTTTTTYGAGGTGTYGTTG-3′ Rev: 5′-CTCCCCCTCRACCCRACCAAA-3′ | -2341 to -2135 | 206 | 55 | 1 | 0.1 |
| HBII | Fwd: 5′-GAGATAATTYGGGGGTYGTYGYGGGAG-3′ Rev: 5′-ACCCRACCAAAAAACRCCTAC-3′ | -2305 to -2145 | 160 | 56 | 1 | 0.1 |
| HC | Fwd: 5′-TTTYGTAGGYGTTTTTTGGTYGGGTYGAG-3′ Rev: 5′-AATTCAAACRCRACTTAACRTTCACCACRAA-3′ | -2169 to -1967 | 202 | 51 | 2 | 1 |
| HCII | Fwd: 5′-TTTYGTAGGYGTTTTTTGGTYGGGTYGAG-3′ Rev: 5′-CCAAAATTCCCRCRAAAAATAAAAAACTC-3′ | -2169 to -2055 | 114 | 55 | 3 | 0.1 |
| HD | Fwd: 5′-GGTYGTTYGATATTYGTTTTYGTGGTG-3′ Rev: 5′-CCCRCTTATACACCCTCAC-3′ | -2015 to -1844 | 171 | 52 | 1 | 0.1 |
| HDII | Fwd: 5′-GTGGTGAAYGTTAAGTYGYGTTTG-3′ Rev: 5′-CCRCACRCCCTCCTCAAACCA-3′ | -1994 to -1868 | 126 | 53 | 2 | 0.1 |
| HDIII | Fwd: 5′-GGTYGTTYGATATTYGTTTTYGTGGTG-3′ Rev: 5′-CCRCACRCCCTCCTCAAACCA-3′ | -2015 to -1868 | 147 | 52 | 1 | 1 |
aFwd, forward or sense primer; Rev, reverse or antisense primer. Degenerate nucleotides following IUPAC codes.
bLocations given with respect to the ATG start codon.
cTm, annealing temperature in PCR.
Regulatory element of the GR promoter region can be predicted by in silico phylogenetic footprinting
| Promoter | Transcription factor | Identification technique | Prediction | |
|---|---|---|---|---|
| 1B | SP1 (x3) | RG, FP, E | 5,4,5 | ✓ |
| 1B–D | YY-1 (x3) | FP, D, E | 5 | ✓ |
| 1D | n/d | |||
| 1E | n/d | |||
| 1F | NGFI-A | ChIP | 5 | ✓ |
| 1G | n/d | |||
| 1H | n/d |
n/d: Not experimentally determined.
aYY-1-binding site initially identified as part of promoter 1B, subsequent identification of alternate first exons would suggest that it is part of promoter 1D.
bThis YY-1-binding site is outside the CpG island and not considered further.
Figure 1Schematic representation of the Glucocorticoid receptor CpG island internal structure, the bisulphite sequencing strategy and primer locations. Known TFBS are shown as arrows and numbered from the ATG translation start codon in exon 2. § from (32), † from (53), ‡ homologous to the known rat site (25,27), ¶ the ephemeric exon 1G has only been detected in the rat as its homologue 18.
Figure 2.In silico phylogenetic footprinting and bisulphite sequencing of promoter 1D. (A) In silico phylogenetic footprinting covers the start of the CpG island 340 nucleotides upstream of exon 1D through to exon 1D. (B) Percentage methylation was measured by direct electropherogram reading after bisulphite sequencing of 26 donors, covering CpG 6–27. Methylation levels are expressed by colour, levels from the scale at the panel top. (C) CpG identifiers and the unsequenced region of promoter 1D (grey background). (D) All TRANSFAC predicted human TFBS in promoter 1D are significant ISPF predictions.
Figure 3.In silico phylogenetic footprinting and bisulphite sequencing of promoters 1J and 1E. (A) In silico phylogenetic footprinting covers the end of exon 1D through promoter 1J, exon 1J, to exon 1E. (B) Percentage methylation was measured by direct electropherogram reading after bisulphite sequencing of 26 donors, covering CpG 16–27. Methylation levels are expressed by colour, levels from the scale at the panel top. (C) CpG identifiers and the unsequenced region of promoter 1J (grey background). Exon 1J is underlined. (D) TRANSFAC prediction for promoter 1J and E. Significant ISPF predictions are numbered corresponding to (A). Predictions in bold are high-quality ISPF predictions containing CpG dinucleotides.
Figure 4.In silico phylogenetic footprinting and bisulphite sequencing of promoter 1F. (A) In silico phylogenetic footprinting covers the end of exon 1B through to exon 1F. (B) Percentage methylation was measured by direct electropherogram reading (CpG), or sequencing of 10–12 clones per donor (CpG) after bisulphite sequencing of 26 donors, covering CpG 1–42. Methylation levels are expressed by colour, levels from the scale at the panel top. Blank squares indicate positions for which no data was obtainable. (C) CpG identifiers within the sequence of promoter 1F. (D) TRANSFAC prediction for promoter 1F. Significant ISPF predictions are numbered corresponding to (A). Predictions in bold are high-quality ISPF predictions containing CpG dinucleotides.
Figure 5.In silico phylogenetic footprinting and bisulphite sequencing of promoter 1H. (A) In silico phylogenetic footprinting covers promoter 1H and part of exon 1H. (B) Percentage methylation was measured by direct electropherogram reading after bisulphite sequencing of 26 donors, covering CpG 1–42. Methylation levels are expressed by colour, levels from the scale at the panel top. Blank squares indicate positions for which no data was obtainable. (C) CpG identifiers within the sequence of promoter 1H and the unsequenced region (grey background). (D) TRANSFAC prediction for promoter 1H. Significant ISPF predictions are numbered corresponding to (A). Predictions in bold are high-quality ISPF predictions containing CpG dinucleotides.