| Literature DB >> 18234719 |
Dmitry B Veprintsev1, Alan R Fersht.
Abstract
The tumour suppressor p53 is a transcription factor that binds DNA in the vicinity of the genes it controls. The affinity of p53 for specific binding sites relative to other DNA sequences is an inherent driving force for specificity, all other things being equal. We measured the binding affinities of systematically mutated consensus p53 DNA-binding sequences using automated fluorescence anisotropy titrations. Based on measurements of the effects of every possible single base-pair substitution of a consensus sequence, we defined the DNA sequence with the highest affinity for full-length p53 and quantified the effects of deviation from it on the strength of protein-DNA interaction. The contributions of individual nucleotides were to a first approximation independent and additive. But, in some cases we observed significant deviations from additivity. Based on affinity data, we constructed a binding predictor that mirrored the existing p53 consensus sequence definition. We used it to search for high-affinity binding sites in the genome and to predict the effects of single-nucleotide polymorphisms in these sites. Although there was some correlation between the K(d) and biological function, the spread of the K(d)s by itself was not sufficient to explain the activation of different pathways by changes in p53 concentration alone.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18234719 PMCID: PMC2275157 DOI: 10.1093/nar/gkm1040
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 3.Quantitative sequence logo of the p53 DNA-binding preferences. The height of the bars represents the number of times by which the amount of protein bound decreases, due to the base substitution. It shows the biggest possible effect caused by three alternative substitutions. The height of individual letters is proportional to the amount of transcription factor bound to individual sequence variants under identical conditions. The sequence of the second half-site is identical to the first one when read on the non-coding strand of DNA in the 5′–3′ direction. The sequence logo takes complementarity of DNA into account. The sequence with the highest affinity for p53 is GG(A/G)CATGCCCGGGCATG(T/C)CC.
Figure 1.Fluorescence anisotropy R reflects the tumbling rate of molecules in solution. It is ideal for studying strong protein–DNA interactions as the complex formed is larger and tumbles more slowly than the unbound oligonucleotide. A displacement of the reference labelled oligonucleotide from the complex by the unlabelled oligonucleotide allows accurate measurement of the difference in the Kd between two sequences.
Oligonucleotide sequences used to construct binding predictor
| Sequence No. | Sequence | Substitution |
|---|---|---|
| 600 | CGC GGACATGTCC GGACATGTCC CGC | Ref sequence |
| 601 | CGC | 1A |
| 602 | CGC | 1T |
| 603 | CGC | 1C |
| 604 | CGC G | 2A |
| 605 | CGC G | 2T |
| 606 | CGC G | 2C |
| 607 | CGC GG | 3T |
| 608 | CGC GG | 3G |
| 609 | CGC GG | 3C |
| 610 | CGC GGA | 4A |
| 611 | CGC GGA | 4T |
| 612 | CGC GGA | 4G |
| 613 | CGC GGAC | 5T |
| 614 | CGC GGAC | 5G |
| 615 | CGC GGAC | 5C |
| 616 | CGC GGACA | 6A |
| 617 | CGC GGACA | 6G |
| 618 | CGC GGACA | 6C |
| 619 | CGC GGACAT | 7A |
| 620 | CGC GGACAT | 7T |
| 621 | CGC GGACAT | 7C |
| 622 | CGC GGACATG | 8A |
| 623 | CGC GGACATG | 8G |
| 624 | CGC GGACATG | 8C |
| 625 | CGC GGACATGT | 9A |
| 626 | CGC GGACATGT | 9T |
| 627 | CGC GGACATGT | 9G |
| 628 | CGC GGACATGTC | 10A |
| 629 | CGC GGACATGTC | 10T |
| 630 | CGC GGACATGTC | 10G |
The sequence for the coding strand is shown. Oligonucleotides were annealed with corresponding complementary oligonucleotides to form double-stranded oligonucleotides.
Figure 2.The effects of individual substitutions in the reference sequence on the affinity of p53 for its response element. Positive bars correspond to weaker interactions. Only first 10 bp of 20 bp site are shown because of the mirror symmetry of the response element. See Table 1 for description of individual sequences. Reference sequence bound p53 with the logKd of −7.51 (30 nM).
Binding predictor matrix definition
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Highest affinity sequence | G | G | A/G | C | A | T | G | C | C | C | G | G | G | C | A | T | G | T/A | C | C |
| A | 0.05 | 0.03 | 0.00 | 0.59 | 0.00 | 0.16 | 0.55 | 0.25 | 0.15 | 0.12 | 0.03 | 0.10 | 0.05 | 0.62 | 0.00 | 0.21 | 0.31 | 0.15 | 0.10 | 0.07 |
| T | 0.07 | 0.10 | 0.15 | 0.31 | 0.21 | 0.00 | 0.62 | 0.05 | 0.10 | 0.03 | 0.12 | 0.15 | 0.25 | 0.55 | 0.16 | 0.00 | 0.59 | 0.00 | 0.03 | 0.05 |
| G | 0.00 | 0.00 | 0.00 | 0.55 | 0.28 | 0.35 | 0.00 | 0.47 | 0.29 | 0.13 | 0.00 | 0.00 | 0.00 | 0.61 | 0.32 | 0.47 | 0.00 | 0.40 | 0.18 | 0.11 |
| C | 0.11 | 0.18 | 0.40 | 0.00 | 0.47 | 0.32 | 0.61 | 0.00 | 0.00 | 0.00 | 0.13 | 0.29 | 0.47 | 0.00 | 0.35 | 0.28 | 0.55 | 0.00 | 0.00 | 0.00 |
Each cell contains a difference ΔlogKd between the oligonucleotide containing a single substitution indicated in a row header and the tightest binding sequence. The highest affinity sequence shown in the top row had a logKd of −7.61.
Figure 4.Correspondence of the measured and predicted affinities of naturally occurring response elements. Black line represents ‘perfect’ prediction.
Oligonucleotide sequences used to test predicted logKds
| Name of the promoter | Sequence( | log | log |
|---|---|---|---|
| Cyclin G | CGCAGACCTGCCCGGGCAAGCCTCGC | −7.02 | −6.83 |
| 14-3-3s | CGCAGGCATGTGCCACCATGCCCCGC | −6.85 | −6.52 |
| CDK1NA (p21) 3′ site | CGCGAAGAAGACTGGGCATGTCTCGC | −6.9 | −6.54 |
| p53R2 | CGCTGACATGCCCAGGCATGTCTCGC | −7.35 | −7.46 |
| PCNA | CGCGAACAAGTCCGGGCATATGTCGC | −6.82 | −6.83 |
| Maspin | CGCGAACATGTTGGAGGCCTTTTGCGC | −6.89 | −6.19 |
| PUMA BS2 | CGCCTGCAAGTCCTGACTTGTCCCGC | −6.88 | −6.86 |
| Noxa | CGCAGGCTTGCCCCGGCAAGTTGCGC | −6.93 | −6.87 |
| p53AIP1 | CGCTCTCTTGCCCGGGCTTGTCGCGC | −6.92 | −6.73 |
| Bax A | CGCTCACAAGTTAGAGACAAGCCTCGC | −6.67 | −6.59 |
| IGF-BP3 | CGCAAACAAGCCACCAACATGCTTCGC | −6.77 | −6.89 |
| p53DINP1 | CGCGAACTTGGGGGAACATGTTTCGC | −6.78 | −6.25 |
| PUMA | CGCCTCCTTGCCTTGGGCTAGGCCCGC | −6.8 | −5.99 |
| rad51 | AAACTCGCGCAGGATCAAGCTCT | −5.96 | −6.12 |
| PA26 | GGACAAGTCTCAACAAGTTC | −6.47 | −6.85 |
| Bax B | AGACAAGCCTGGGCGTGGGC | −6.24 | −6.47 |
| MMP2 | AGACAAGCCTGAACTTGTCT | −6.39 | −7.01 |
| CDK1NA (p21) 5′ site | CAACATGTTGGGACATGTTC | −7.14 | −7.11 |
| MDM2_RE1 | GGTCAAGTTGGGACACGTCC | −6.47 | −6.69 |
| MDM2_RE2 | GAGCTAAGTCCTGACATGTCT | −6.41 | −6.42 |
Figure 5.A plot of p53 affinity for DNA as a function of bp position. A 50 bp region around the 5′ p53-binding site, 2209 bp upsream of the CDK1NA (p21) gene is shown (Chr6: 36752204:36752223 in NCBI 36.2 genome release). The circle marks the known p53-binding site.
Figure 6.Number of putative p53-binding sites of the length of 20 or 21 bp identified in the human genome (release 36.2) grows exponentially with increasing cutoff value.