| Literature DB >> 19084536 |
Miriana Petrovich1, Dmitry B Veprintsev.
Abstract
Methylation of DNA is one of the mechanisms controlling the expression landscape of the genome. Its pattern is altered in cancer and often results in the hypermethylation of the promoter regions and abnormal expression of tumour suppressor genes. Methylation of CpG dinucleotides located in the binding sites of transcription factors may contribute to the development of cancers by preventing their binding or altering their specificity. We studied the effects of CpG methylation on DNA recognition by the tumour suppressor p53, a transcription factor involved in the response to carcinogenic stress. p53 recognises a large number of DNA sequences, many of which contain CpG dinucleotides. We systematically substituted a CpG dinucleotide at each position in the consensus p53 DNA binding sequence and identified substitutions tolerated by p53. We compared the binding affinities of methylated versus non-methylated sequences by fluorescence anisotropy titration. We found that binding of p53 was not affected by cytosine methylation in a majority of cases. However, for a few sequences containing multiple CpG dinucleotides, such as sites in the RB and Met genes, methylation resulted in a four- to sixfold increase in binding of p53. This approach can be used to quantify the effects of CpG methylation on the DNA recognition by other DNA-binding proteins.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19084536 PMCID: PMC2666794 DOI: 10.1016/j.jmb.2008.11.054
Source DB: PubMed Journal: J Mol Biol ISSN: 0022-2836 Impact factor: 5.469
Fig. 1To access the whole variety of DNA sequences recognised by p53, we systematically introduced CpG or 5-methyl-CpG at every possible position in the sequence.
Sequences of DNA used in the study
| 300 | CGCGGACATGTCCGGACATGTCCCGC |
| 301 | CGC |
| m301 | CGC |
| 302 | CGCG |
| m302 | CGCG |
| 303 | CGCGG |
| m303 | CGCGG |
| 304 | CGCGGA |
| m304 | CGCGGA |
| 305 | CGCGGAC |
| m305 | CGCGGAC |
| 306 | CGCGGACA |
| m306 | CGCGGACA |
| 307 | CGCGGACAT |
| m307 | CGCGGACAT |
| 308 | CGCGGACATG |
| m308 | CGCGGACATG |
| 309 | CGCGGACATGT |
| m309 | CGCGGACATGT |
| 310 | CGCGGACATGTC |
| m310 | CGCGGACATGTC |
| 322 | CGCGGA |
| m322 | CGCGGA |
| 323 | CGCGGA |
| m323 | CGCGGA |
| 324 | CGCGGA |
| m324 | CGCGGA |
C⁎ represents 5-methylcytosine. CpG dinucleotides are underlined. The sequences were generated by systematically substituting the specific dinucleotides from positions 1 to 10 in a half-site (i.e., 4–13 including 3-bp flanking region) with CpG or 5-methyl-CpG. Only one strand is shown. The complementary strand was also methylated at the CpG dinucleotide. Because of the symmetry of the sequence only the first half of the site needed to be analysed.
Fig. 2Fluorescence anisotropy R reflects the tumbling rate of molecules in solution. It is ideal for studying strong protein–DNA interactions, as the complex formed is larger and tumbles more slowly than does the unbound oligonucleotide. A displacement of the Alexa488-labelled reporter oligonucleotide from the complex by the unlabelled competitor oligonucleotide allows accurate measurement of the difference in the Kd between two sequences. For clarity, the data shown represent a single titration using reference sequence number 300 from Table 1. Reporter DNA sequence (20 nM) was mixed with 125 nM full-length p53 protein. Competitor DNA was added to the sample in 25 steps. Such a titration was done for every DNA sequence studied. We multiplexed our measurements to perform 96 such titrations in parallel, allowing simultaneous analysis of a number of different sequences (see Materials and Methods for details).
Fig. 3(a) Effects of CpG methylation on affinity of p53 for DNA. CpG (black) or 5-methyl-CpG (red) dinucleotide was systematically substituted into the reference sequence at positions 1–10 of the first half-site. The CpG position refers to the position of the C in the first half of the p53 binding sequence GGACATGTCC. The difference in affinity relative to the unmethylated reference sequence is shown (Δlog Kd = log Kd(i) − log Kd(ref)). The log Kd of the reference sequence was − 7.55. (b) The difference in the affinity that can be attributed specifically to methylation is the difference in height of the bars at the same position, ΔΔlog Kd. = Δlog Kd(met) − Δlog Kd(non-met). Introduction of CpG dinucleotide at positions 4 and 6 has small overall impact on the affinity of p53 and results in the biggest methylation-specific response. Error bars represent one standard deviation based on at least three individual titrations.
Fig. 4Effects of single-strand methylation on binding of p53 to DNA. (a) Schematic diagram representing a consensus sequence of a p53 half-site. (b) A change in the affinity caused by a specific methylated cytosine or a combination of cytosines in a CpG dinucleotide. Methylation of CpG dinucleotides at positions 4 and 6 has the largest effect on binding of p53. These modifications affect invariant nucleotides at positions 4 and 6′, respectively. When methylated cytosine is present on only one strand, the effects of methylation at positions 5 and 6 had a similar effect as double methylation. 4, methylation on the top strand; 4′, methylation on the bottom strand; 4 + 4′, methylation on both strands. Likewise for position 6. The log Kd of the reference sequence was − 7.55.
Fig. 5Effects of multiple methylated CpG on the affinity of p53 for DNA. (a) Multiple CpG substitution results in progressively weaker binding because of the increasingly suboptimal DNA sequence. However, if the CpG dinucleotides are methylated, the overall increase in the log Kd is smaller. (b) The difference between the affinity of the methylated and non-methylated sequence increased proportionally to the number of methylated CpG dinucleotides. The log Kd of the reference sequence was − 7.55.
A selection of p53 binding sites containing CpG dinucleotides from a data set of experimentally confirmed p53 binding sites as summarised
| Target gene and binding site | DNA sequence | Predicted log | Start position | Length | Positions of CpG | Δlog |
|---|---|---|---|---|---|---|
| Cell cycle | ||||||
| Cyclin G | AGACCTGCC | − 6.83 | 1 | 20 | 10 | 0 |
| Cyclin G,C | AGGCTTGCC | − 6.83 | 1 | 20 | 10 | 0 |
| BTG2 | AGTC | − 4.94 | 1 | 20 | 5, 4 | 0 |
| GDF PTGF-b, SBS01 in Ref. | AGCCATGCC | − 6.70 | 1 | 20 | 10 | 0 |
| RB | GGG | − 6.40 | 1 | 23 | 4, 2, 4, 6, 10 | − 0.42 |
| Growth | ||||||
| TGFA | AGCCAAGTCTTGGCAAG | − 6.30 | 1 | 20 | 2 | − 0.06 |
| Death receptor | ||||||
| TNFRSF Killer/DRS in Ref. | GGGCATGTC | − 7.09 | 1 | 20 | 1,10 | 0.04 |
| PIDD PIDD in Ref. | AGGCCTGCCTG | − 6.75 | 1 | 28 | Linker | 0 |
| DNA repair | ||||||
| PCNA | ACATATGCC | − 6.83 | 1 | 20 | 10 | 0 |
| Pcna | GAACAAGTC | − 6.83 | 1 | 20 | 10 | 0 |
| Apoptosis | ||||||
| BAX-B,A | AGACAAGCCTGGG | − 6.47 | 1 | 20 | 6 | − 0.12 |
| BAX-mouse | AGGCAAGCTTTGAACTTG | − 6.67 | 1 | 21 | 2 | − 0.06 |
| BAX-human | GGGCAGGCC | − 6.99 | 1 | 20 | 1, 10 | 0.04 |
| IRDD cathepsin Dsite 1 | AAGCTGGGC | − 5.56⁎ | 1 | 20 | 10 | 0 |
| ei24/PIG8 | TGGCAGGC | − 6.56 | 1 | 20 | 9 | 0 |
| MCG10, RE-1 (PCBB4) | GAACTTAAGAC | − 6.11 | 3 | 28 | 10 | 0 |
| NOXA | AGGCTTGCCC | − 6.87 | 1 | 20 | 9 | 0 |
| p53aip1 | TCTCTTGCC | − 6.73 | 1 | 20 | 1, 10 | 0.04 |
| PERP, 2097 | G | − 6.33 | 1 | 24 | 2 | − 0.06 |
| cFOS,O | AGGCTTGCCC | − 6.87 | 1 | 20 | 9 | 0 |
| Positive regulation | ||||||
| mdm2 (13) | GGTCAAGTTGGGACA | − 6.66 | 1 | 20 | 4 | − 0.12 |
| S100A2 | GGGCATGTGTGGGCA | − 6.93 | 1 | 20 | 4 | − 0.12 |
| Negative regulation | ||||||
| cFOS,O | GGACTTGTCTGAG | − 6.44 | 1 | 20 | 4, 6 | − 0.24 |
| Met | GGA | − 6.20 | 8 | 27 | 4, 6, 8 | − 0.34 |
| Cytokeratin 8 | C | − 5.85⁎ | 1 | 22 | 2 | − 0.06 |
| EEf1A1 EF-1 a E4 in Ref. | GGGCAGACC | − 6.61 | 1 | 22 | 10 | 0 |
| EEf1A1 EF−1 a, E2 in Ref. | GGACA | − 6.75 | 1 | 24 | 6, 10 | − 0.12 |
| EGFR | GAGCTAGA | − 5.81⁎ | 1 | 23 | 9, 10 | 0 |
| Sgk | AACTCAGGCTGCCTCCTG | − 6.26 | 2 | 27 | 9 | 0 |
| TAP1 | GGGCTTGGCCCTGC | − 6.67 | 1 | 26 | Linker | 0 |
| TIMP-3 | GGGCTTGCTTGA | − 6.22 | 1 | 28 | Linker | 0 |
| Human repressor | ||||||
| ODC1 | GGGCT | − 6.16 | 1 | 30 | 6, 6, 10 | − 0.24 |
| ODC1 | GGACCAGTTCCAGG | − 6.15 | 1 | 25 | 6, 10 | − 0.12 |
| slc38 | AACCATGCTGTTACA | − 6.58 | 11 | 20 | 6 | − 0.12 |
| p22/PRG1, IER3 in Ref. | CCACATGCCT | − 6.93 | 1 | 20 | 9 | 0 |
| scd | GGGC | − 5.92⁎ | 1 | 21 | 5 | 0.1 |
| Hspa8 | GCACTAGTTCTGGACCT | − 6.22 | 1 | 28 | 6,8,10 | − 0.22 |
| NOS3 | GAGCCTCCCAGC | − 6.19 | 1 | 23 | 10 | 0 |
CpG dinucleotides are underlined. Sequences are presented in the same order. Please note that the exact arrangement of the binding site elements (i.e., location of the quarter-sites) is different from those presented for binding elements where the start position value is greater than one, as this arrangement was predicted to result in stronger binding. The position of CpG dinucleotide refers to the respective position in the half-site.
Effects of individual CpG dinucleotides cancel each other.